一个非常快的 DataFrame 库,支持 Rust、Python、Node.js
A light-weight, flexible, and expressive statistical data testing library
The Universal Storage Engine
#数据仓库#In-memory tabular data in Julia
#计算机科学#DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
Easy pipelines for pandas DataFrames.
#计算机科学#Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
#数据仓库#Metaprogramming tools for DataFrames
Immutable and statically-typeable DataFrames with runtime type and data validation
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.
翻译 - 为期 3 小时的大熊猫介绍性研讨会,附有笔记本和练习以供后续学习。
64bit multithreaded python data analytics tools for numpy arrays and datasets
#计算机科学#⛈️ RumbleDB 1.23.0 "Mountain Ash" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to downl...
翻译 - ⛈️ RumbleDB 1.17.0 "Cacao tree" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
#算法刷题#O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Genomic interval operations on Pandas DataFrames
#计算机科学#Woodwork is a Python library that provides robust methods for managing and communicating data typing information.