#Computer Science# A flexible and powerful parameter server for large-scale machine learning
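A Spark-based movie recommendation system, comprising a web crawler project, a website, a back-office administration system, and a Spark recommendation engine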
#Computer Science# .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Wormhole is a SPaaS (Stream Processing as a Service) Platform
C# and F# language binding and extensions to Apache Spark
An open source framework for building data analytic applications.
Real Time Analytics and Data Pipelines based on Spark Streaming
Scala examples for learning to use Spark
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience to help with the creation, editing, and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Spark, Spark Streaming and Spark SQL unit testing strategies
Databricks framework to validate the data quality of PySpark DataFrames
#Computer Science# A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apach...
A prototype big data platform project, containing the source code for the book Big Data Platform Architecture and Prototype