#计算机科学#Python 数据科学学习笔记:深度学习 (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, 大数据 (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python 核心, AWS, Linux命令
新一代分布式任务调度与计算框架,支持CRON、API、固定频率、固定延迟等调度策略,提供工作流来编排任务解决依赖关系
Python clone of Spark, a MapReduce alike framework in Python
翻译 - Spark的Python克隆,Python中的MapReduce相似框架
#计算机科学# MapReduce, Spark, Java, and Scala for Data Algorithms Book
C# and F# language binding and extensions to Apache Spark
distributed_computing include mapreduce kvstore etc.
An open source framework for building data analytic applications.
🐎 A serverless MapReduce framework written for AWS Lambda
Uniffle is a high performance, general purpose Remote Shuffle Service.
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Compass is a task diagnosis platform for bigdata
Dynamic execution framework for your Redis data
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉