lakeFS - Data version control for your data lake | Git for data
翻译 - 一个开源平台,可为基于对象存储的数据湖提供弹性和可管理性
Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, search...
Enterprise-grade, production-hardened, serverless data lake on AWS
翻译 - AWS上的企业级,经过生产强化,无服务器的数据湖
Samples and Docs for Azure Data Lake Store and Analytics
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
翻译 - 与数据工程相关的项目很少,包括数据建模,云上的基础架构设置,数据仓库和数据湖开发。
Resources for video demonstrations and blog posts related to DataOps on AWS
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
翻译 - 用于构建数据湖,数据仓库和分析平台的端到端GoodReads数据管道。
Making data lake work for time series
Apache Airavata Data Lake
Exporting data from Dynamics 365 Business Central to Azure data lake storage
Microsoft Azure Data Lake Store Filesystem Library for Python
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Microsoft Azure Data Lake Store Filesystem Library for Java
A highly efficient daemon for streaming data from Kafka into Delta Lake
#计算机科学#A self-hostable CDN for databases. Spice provides a unified SQL query interface and portable runtime to locally materialize, accelerate, and query datasets across databases, data warehouses, and data ...
翻译 - 面向开发人员的时间序列 AI
Delta Lake examples
Procedural Hydrology / River / Lake Simulation
Mind Lake SDK in TypeScript
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
The Internals of Delta Lake