”hudi“ 的搜索结果

hudi

Apache软件基金会@apache

Upserts, Deletes And Incremental Processing on Big Data.

翻译 - 大数据的更新，删除和增量处理。

hudi apachehudi datalake bigdata apachespark

Java5.47 k

2 小时前

Google Bing GitHub

apachespark bigdata apacheflink datalake hudi stream-processing apachehudi incremental-processing data-integration

hudi-resources

@leesf

汇总Apache Hudi相关资料

539

5 天前

hudi-demos

@leesf

汇总Apache Hudi中的一些Demo，便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)

Java72

4 年前

hudi-rs

Apache软件基金会@apache

A native Rust library for Apache Hudi, with bindings into Python

Rust155

2 天前

emr-hudi-example

@yhyyz

emr-hudi-example

Scala26

2 年前

hudi-demo

@dongkelun

Apache Hudi Demo

Java21

5 个月前

Real-time-Data-Warehouse

@izhangzhihao

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

Dockerfile109

1 年前

hudi-doc-zh

ApacheCN@apachecn

hudi 中文文档

Python37

5 年前

EMR-Hudi-Workshop

@nmukerje

EMR Hudi Workshop content

HTML12

3 年前

Hudi_Demo_Notebook

@vasveena

Hudi Demo Notebook

Jupyter Notebook11

9 个月前

spark-hudi

@XDSZJ

spark hudi demo

Scala2

4 年前

spark-hudi-example

@liangriyu

spark-hudi-example

Scala2

2 年前

datalake-example

@bihaiyang

Data lake implementation demo, include iceberg on flink, iceberg on spark, hudi on flink, hudi on spark

Java4

1 年前

hudi-spark-plus

@AirToSupply

A library based on Hudi for Spark.

Java9

3 年前

Build-Glue-Spark-Streaming-pipeline-for-clicksstreams-and-power-data-lake-with-Apache-Hudi-and-Quer

@soumilshah1995

Build Glue(Spark) Streaming pipeline for clicksstreams and power data lake with Apache Hudi and Query Real time with Athena

Python5

2 年前

dataLake-template

@Chenzhiling

Some demos of using Spark to write MySQL and Kafka data to data lake,such as Delta,Hudi,Iceberg

Scala2

2 年前

lst-bench

Microsoft@microsoft

LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.

Java69

8 天前

aws-emr-best-practices

Amazon Web Services@aws

A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational excellence, reliability and application specific best practices ...

Shell65

1 年前

bigdata_learning

@Mrkuhuo

大数据组件学习代码

Java42

7 个月前

wagtail

@BirdFlock

本项目计划打造基于国产化平台，包括飞腾和鲲鹏 CPU平台，麒麟和 UOS 操作系统的大数据生态组件管理工具以及组件安装包。规划适配Ambari,HDFS,Yarn,ZooKeeper,MapReduce,Hive,Tez,Spark,Pig,Storm,Flink,Sqoop,Flume,Datax,FlinkX,Filebeat,Canal,Debezium,Presto,Drui...

3 年前

编程语音