Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
翻译 - 用于针对 JSON、CSV、Excel、Parquet 等运行 SQL 查询的命令行工具。
#数据仓库#Create full-fledged APIs for slowly moving datasets without writing a single line of code.
#数据仓库#Blazing-fast Data-Wrangling toolkit
#计算机科学#Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
A large-scale entity and relation database supporting aggregation of properties
Quilt is a data mesh for connecting people with actionable data
Postgres read replica optimized for analytics
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Postgres-native Data Warehouse
A portable embedded database using Arrow.
Simple Windows desktop application for viewing & querying Apache Parquet files
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML...
A Python library for fast, interactive geospatial vector data visualization in Jupyter.