免费数据工程师视频课程,共9周课时
Doris 是百度开源的支持对海量大数据进行快速分析的MPP数据库。
#计算机科学#🧙 Build, run, and manage data pipelines for integrating and transforming data.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
翻译 - 元数据开放标准。发现、协作和正确获取数据的单一场所。
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Self-serve BI to 10x your data team ⚡️
Compare tables within or across databases
Efficient data transformation and modeling framework that is backwards compatible with dbt.
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
re_data - fix data issues before your users & CEO would discover them 😊
#Awesome#A curated list of awesome dbt resources
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
#计算机科学#do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
#网络爬虫#Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as ...
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.