An end-to-end GoodReads data pipeline for building a data lake, data warehouse, and analytics platform.
Samples and Docs for Azure Data Lake Store and Analytics
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
This workshop is meant to give customers hands-on experience with the mentioned AWS services. The Serverless Data Lake workshop helps customers build a cloud-native and future-proof serverless data lake ar...
Sample .NET client library for Data Lake Analytics and Data Lake Store, built upon the Data Lake .NET SDKs.
A sample Python solution showing how to authenticate against Azure Active Directory (AAD) before using the Azure Data Lake Analytics (ADLA) Python SDKs.
Making data lakes work for time series
A reasonably secure data lake for healthcare analytics
lakeFS - Data version control for your data lake | Git for data
An open-source platform that provides resilience and manageability for object-storage-based data lakes
Apache Airavata Data Lake
Interactive data analytics
Enterprise-grade, production-hardened, serverless data lake on AWS
Lightweight analytics reporting and publishing tool for Digital Analytics Program's Google Analytics 360 data.
A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
Google Data Analytics Professional Certificate
oneAPI Data Analytics Library (oneDAL)
Exporting data from Dynamics 365 Business Central to Azure data lake storage
Microsoft Azure Data Lake Store Filesystem Library for Python
High-performance runtime for data analytics applications
Scalable Time Series Data Analytics
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
ClickHouse is a high-performance column-oriented database suited to real-time OLAP analytics, with SQL support
An encrypted data analytics platform