A list of learning resources for data engineers
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Open standard for metadata. A single place to discover, collaborate on, and get your data right.
Compare tables within or across databases
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Scalable fuzzy matching for data mastering, deduplication, and entity resolution.
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool and registered trademark of dbt Labs)
#Computer Science#This repository provides various demos/examples of using Snowpark for Python.
An open source development framework to help you build data workflows and modern data architecture on AWS.
#Interview#Roadmap for Data Engineering
Code and data for the Modern Polars book
End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck, and Evidence
#Data Warehouse#Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in ...
Everything I have ever been asked in interviews, plus other useful knowledge in a concise format
A Data Platform built for AWS, powered by Kubernetes.
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data E...
Simple stream processing pipeline
Recohut - Learn data engineering, data science
#Computer Science#Resources about data science, machine learning, deep learning, data engineering, and SQL.