GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

deltalake

Website
Wikipedia
https://static.github-zh.com/github_avatars/paradedb?size=40
paradedb / pg_analytics

DuckDB-powered data lake analytics from Postgres

analyticsarrowcolumnardatafusionlakehouseparquetPostgreSQLduckdbolapbig-data数据库datalakedeltalakeicebergobject-storageSQLlakehouse-platform
Rust 522
3 个月前
https://static.github-zh.com/github_avatars/databrickslabs?size=40
databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...

pysparkPythondata-generationfakerApache Sparkspark-streamingdeltalakedatabrickssynthetic-data
Python 411
4 天前
https://static.github-zh.com/github_avatars/delta-io?size=40
delta-io / kafka-delta-ingest

A highly efficient daemon for streaming data from Kafka into Delta Lake

deltalakedeltaRustkafka
Rust 407
2 个月前
https://static.github-zh.com/github_avatars/MrPowers?size=40
MrPowers / mack

Delta Lake helper methods in PySpark

deltalakepyspark
Python 323
10 个月前
https://static.github-zh.com/github_avatars/japila-books?size=40
japila-books / delta-lake-internals

The Internals of Delta Lake

deltalakebookinternalsdelta-lakebooksdatalake
184
6 个月前
https://static.github-zh.com/github_avatars/smart-data-lake?size=40
smart-data-lake / smart-data-lake

Smart Automation Tool for building modern Data Lakes and Data Pipelines

data-lakeScalaApache Sparkhadoophivedeltalaketransform-datadata-pipelines
Scala 124
3 天前
https://static.github-zh.com/github_avatars/uname-n?size=40
uname-n / deltabase

a lightweight, comprehensive solution for managing delta tables built on polars and deltalake

数据库deltalakepolarsSQL
Python 119
6 个月前
https://static.github-zh.com/github_avatars/izhangzhihao?size=40
izhangzhihao / Real-time-Data-Warehouse

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

flinkdata-warehousedata-warehousingflink-sqldebeziumkafkaelasticsearchdelta-lakecdcchange-data-capturehudiicebergSQLdatalakedeltadeltalakeApache Sparkspark-sql
Dockerfile 114
2 年前
https://static.github-zh.com/github_avatars/WeBankFinTech?size=40
WeBankFinTech / Streamis

Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.

flinklinkisstreaminghudiicebergdatalakekafkadeltalake
Java 107
2 个月前
https://static.github-zh.com/github_avatars/martandsingh?size=40
martandsingh / ApacheSpark

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

apachespark数据分析data-engineering数据库databricksdatalakedeltalakeetl-pipelinehadoophiveApache Sparkspark-sqlspark-streamingtimetraveletlpysparkSQL
Python 100
1 年前
https://static.github-zh.com/github_avatars/anneglienke?size=40
anneglienke / 101_upsert-delta

This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of the transaction log for the table.

deltadelta-lakedeltalake
Python 98
3 年前
https://static.github-zh.com/github_avatars/flintml?size=40
flintml / flintml

#计算机科学#One-click ML infrastructure for teams that just want to get sh*t done.

deltalakeJupyter Notebook机器学习mlopspolars数据科学
Python 94
4 天前
https://static.github-zh.com/github_avatars/dacort?size=40
dacort / faker-cli

Command-line interface to quickly generate fake CSV and JSON data

Amazon Web ServicesCSVJSONdeltalakeparquet
Python 73
1 年前
https://static.github-zh.com/github_avatars/bhavink?size=40
bhavink / databricks

Databricks Platform - Architecture, Security, Automation and much more!!

databricksdeltalake安全
Jupyter Notebook 51
2 个月前
https://static.github-zh.com/github_avatars/buoyant-data?size=40
buoyant-data / oxbow

Collection of AWS Lambdas for creating and managing Delta tables

deltalakeparquetdatalakelambdaRust
Rust 38
4 天前
https://static.github-zh.com/github_avatars/sankamuk?size=40
sankamuk / PysparkCheatsheet

PySpark Cheatsheet

Apache SparkPythondeltalake
Python 36
2 年前
https://static.github-zh.com/github_avatars/DataTech-Solutions?size=40
DataTech-Solutions / Threat-Detection-and-Visualization

#计算机科学#Threat Detection and Visualization

APIdatalakedefenderdeltalakePostmanpowerbisccmsiemSQLactive-directory机器学习
TSQL 32
2 年前
https://static.github-zh.com/github_avatars/mrjsj?size=40
mrjsj / delta-lake-explorer

Azuredeltalakeduckdbsql-client
Python 30
1 年前
https://static.github-zh.com/github_avatars/newfront?size=40
newfront / hitchhikers_guide_to_deltalake_streaming

Don't Panic. This guide will help you when it feels like the end of the world.

apacheApache Sparkdeltalake
Jupyter Notebook 25
19 天前
https://static.github-zh.com/github_avatars/leehuwuj?size=40
leehuwuj / olh

Open source stack lakehouse

bigdatalakehouseKubernetesApache Sparkdeltalake
Python 25
1 年前
loading...