DataFusion 是一个可扩展的查询执行框架,用Rust 编写,使用Apache Arrow 作为其内存格式
the portable Python dataframe library
#数据仓库#Create full-fledged APIs for slowly moving datasets without writing a single line of code.
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
翻译 - 使用Apache Arrow内存模型在Rust中实现的分布式计算平台。
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.
DuckDB-powered data lake analytics from Postgres
Analytical database for data-driven Web applications 🪶
#区块链#Next-generation decentralized data lakehouse and a multi-party stream processing network
Rust implementation of Apache Iceberg with integration for Datafusion
Batteries included CLI, TUI, and server implementations for DataFusion.
Query and transform data with PRQL
A lightweight Logging and Tracing observability solution for Rust, built with Apache Arrow, Apache Parquet and Apache DataFusion.
etl engine 轻量级 跨平台 流批一体ETL引擎 数据抽取-转换-装载 ETL engine lightweight cross platform batch flow integration ETL engine data extraction transformation loading
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between the two. Uses Apache Arrow in-memory format and respective quer...