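#Computer Science#Apache Airflow is a platform for scheduling, orchestrating, and monitoring workflows
A distributed, easily extensible, visual DAG workflow task scheduling system. It aims to untangle the intricate dependencies in data processing pipelines so that the scheduling system works out of the box for data processing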
#Computer Science#PipelineAI Kubeflow Distribution
#Editor#Build data pipelines, the easy way 🛠️ Orchest is a tool for creating data science pipelines.
#Awesome#Curated list of resources about Apache Airflow
DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
#Computer Science#Elyra extends JupyterLab with an AI-centric approach.
A series of DAGs/Workflows to help maintain the operation of Airflow
A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
#Interview#More than 2,000 data engineer interview questions.
Dynamically generate Apache Airflow DAGs from YAML configuration files
#Web Crawler#Example end-to-end data engineering project.
#Computer Science#A Data Engineering & Machine Learning Knowledge Hub
Personal Data Engineering Projects
🌀 The Full Stack 7-Steps MLOps Framework | Learn MLE & MLOps for free by designing, building and deploying an end-to-end ML batch system ~ source c...
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.