Parallel computing with task scheduling
翻译 - 任务调度的并行计算
STUMPY is a powerful and scalable Python library for modern time series analysis
翻译 - STUMPY是一个功能强大且可扩展的Python库,可用于各种时间序列数据挖掘任务
Koalas: pandas API on Apache Spark
翻译 - 考拉:Apache Spark上的pandas API
Extract data from a wide range of Internet sources into a pandas DataFrame.
翻译 - 从各种Internet来源中提取数据到pandas DataFrame中。
A distributed task scheduler for Dask
Clean APIs for data cleaning. Python implementation of R package Janitor
A clean, three-column Sphinx theme with Bootstrap for the PyData community
PyData, The Complete Works of
RFC document, tooling and other content related to the array API standard
#计算机科学#High-Performance Python Compute Engine for Data and AI
#自然语言处理#Notebooks for the Seattle PyData 2017 talk on Scattertext
Social network analysis code examples for PyCon 2019 talk
Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics
#计算机科学#Machine learning with scikit-learn tutorial at PyData Chicago 2016
#计算机科学#Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020