#

apachespark

https://static.github-zh.com/github_avatars/DataExpert-io?size=40
Jupyter Notebook 37.76 k
6 小时前
https://static.github-zh.com/github_avatars/holdenk?size=40
Scala 102
1 年前
https://static.github-zh.com/github_avatars/martandsingh?size=40

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

Python 102
1 年前
https://static.github-zh.com/github_avatars/funkyminds?size=40

type-class based data cleansing library for Apache Spark SQL

Scala 78
6 年前
https://static.github-zh.com/github_avatars/josephmachado?size=40

Code for blog at: https://www.startdataengineering.com/post/docker-for-de/

C 39
1 年前
https://static.github-zh.com/github_avatars/propelledanalytics?size=40

SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.

Julia 25
2 年前
https://static.github-zh.com/github_avatars/tspannhw?size=40

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...

21
5 天前
https://static.github-zh.com/github_avatars/SmartDataAnalytics?size=40
Jupyter Notebook 10
3 年前
https://static.github-zh.com/github_avatars/SandeepAswathnarayana?size=40

This repository contains all the projects and labs I worked on while pursuing professional certificate programs, specializations, and bootcamp. [Areas: Deep Learning, Machine Learning, Applied Data Sc...

Jupyter Notebook 9
5 年前
https://static.github-zh.com/github_avatars/datumbrain?size=40

Trigger spark-submit in Golang. A Go implementation of famous SparkLauncher.java.

Go 7
5 年前
https://static.github-zh.com/github_avatars/sfrechette?size=40
Scala 7
9 年前
https://static.github-zh.com/github_avatars/CarolinaNicasio?size=40

PySpark es una biblioteca de procesamiento de datos distribuidos en Python que permite procesar grandes volúmenes de datos en clústeres utilizando el framework Apache Spark, ofreciendo un alto rendim...

7
2 年前
https://static.github-zh.com/github_avatars/lensesio?size=40
Java 6
7 年前
https://static.github-zh.com/github_avatars/funkyminds?size=40
Scala 5
6 年前
https://static.github-zh.com/github_avatars/sahith?size=40

Link Prediction is about predicting the future connections in a graph. In this project, Link Prediction is about predicting whether two authors will be collaborating for their future paper or not give...

Scala 5
6 年前
https://static.github-zh.com/github_avatars/DigitalPebble?size=40

Enrichment pipeline for CUR / FOCUS reports which adds energy and carbon data allowing to report and reduce the impact of the your cloud usage.

Java 5
2 个月前
https://static.github-zh.com/github_avatars/ashkrit?size=40

Microservices for Spark application

Java 5
2 年前
https://static.github-zh.com/github_avatars/divithraju?size=40

A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)

Jupyter Notebook 3
3 年前
loading...
Website
Wikipedia