一个非常快的 DataFrame 库,支持 Rust、Python、Node.js
A light-weight, flexible, and expressive statistical data testing library
The Universal Storage Engine
#数据仓库#In-memory tabular data in Julia
#计算机科学#DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
Easy pipelines for pandas DataFrames.
#计算机科学#Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
#数据仓库#Metaprogramming tools for DataFrames
Immutable and statically-typeable DataFrames with runtime type and data validation
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
64bit multithreaded python data analytics tools for numpy arrays and datasets
An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.
翻译 - 为期 3 小时的大熊猫介绍性研讨会,附有笔记本和练习以供后续学习。
#计算机科学#⛈️ RumbleDB 1.22.0 "Pyrenean oak" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to downl...
翻译 - ⛈️ RumbleDB 1.17.0 "Cacao tree" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
#算法刷题#O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Genomic interval operations on Pandas DataFrames
#计算机科学#Woodwork is a Python library that provides robust methods for managing and communicating data typing information.