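Apache Airflow is a platform for scheduling, orchestrating, and monitoring workflows.
Airbyte is an open-source EL(T) platform that helps users sync data from applications, APIs, and databases into data warehouses.
Doris is an MPP database, open-sourced by Baidu, that supports fast analysis of massive datasets.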
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
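SeaTunnel (formerly Waterdrop) is an easy-to-use, high-performance distributed data integration platform that supports real-time synchronization of massive data volumes and can stably sync tens of billions of records per day.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A high-performance ELT framework, powered by Apache Arrow.
Flink CDC Connector is a set of source connectors for Apache Flink.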
Privacy and Security focused Segment-alternative, in Golang and React
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Scalable and efficient data transformation framework - backwards compatible with dbt.
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
A system for agentic LLM-powered data processing and ETL
Dataform is a framework for managing SQL-based data operations in BigQuery
Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres,...
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We set out to bring state-of-the-art data engineering tools you love, such as ...
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.