Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Apache Paimon Rust The rust implementation of Apache Paimon.
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflo...
Apache Paimon Python The Python implementation of Apache Paimon.
Streaming Data Lake with RSocket
Deploy a reddit streaming datalake on AWS.
A demo of DynamoDB CDC into data lake with AWS CDK v2
Dynamo to Lake Streaming
API to the GPT4All Datalake
Russia / Ukraine 2022 conflict related IOCs from CERT Orange Cyberdefense Threat Intelligence Datalake
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/sdk/
nginx rtmp扩展,让nginx支持rtmp流媒体服务
The streaming solution to end all streaming problems
⚡️ Streaming torrent client for the web
navidrome 是一个网络音乐收藏、在线播放web系统,类似于你的个人Spotify、QQ音乐
Data lake implementation demo, include iceberg on flink, iceberg on spark, hudi on flink, hudi on spark
Streaming torrent client for node.js
NATS Streaming System Server
#安卓#NewPipe 是一个第三方 Youtube Android 客户端。无广告,无需登录
Some demos of using Spark to write MySQL and Kafka data to data lake,such as Delta,Hudi,Iceberg
MongoDB → PostgreSQL streaming replication
NATS Streaming System