Parallel computing with task scheduling
翻译 - 任务调度的并行计算
STUMPY is a powerful and scalable Python library for modern time series analysis
翻译 - STUMPY是一个功能强大且可扩展的Python库,可用于各种时间序列数据挖掘任务
#计算机科学#Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
翻译 - 火星是一个基于张量的统一框架,用于大规模数据计算,可扩展Numpy,Pandas和Scikit-learn。
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
#计算机科学#A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
A distributed task scheduler for Dask
#计算机科学#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Eliot: the logging system that tells you *why* it happened
Python package for earth-observing satellite data processing
翻译 - Python软件包,用于对地观测卫星数据处理
#计算机科学#Scalable machine 🤖 learning for time series forecasting.
Fast data store for Pandas time-series data
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Distributed SQL Engine in Python using Dask
Geospatial image resampling in Python
Library of derived climate variables, ie climate indicators, based on xarray.
A full pipeline AutoML tool for tabular data