pydata · GitHub Topics

dask / dask

Parallel computing with task scheduling

dask Python pydata NumPy pandas scikit-learn SciPy

Python 13.12 k

5 days ago

rapidsai / cudf

cuDF - GPU DataFrame Library

gpu rapids cudf arrow CUDA pandas dataframe dask data-analysis data-science pydata C++ Python

C++ 8.85 k

1 day ago

TDAmeritrade / stumpy

STUMPY is a powerful and scalable Python library for modern time series analysis

data-science time-series-analysis dask numba Python anomaly-detection pattern-matching pydata matrix-profile motif-discovery

Python 3.89 k

5 days ago

databricks / koalas

Koalas: pandas API on Apache Spark

Apache Spark pandas pydata dataframe mlflow big-data data-science

Python 3.35 k

1 year ago

pydata / pandas-datareader

Extract data from a wide range of Internet sources into a pandas DataFrame.

HTML data-analysis data dataset stock-data finance financial-data Python pydata pandas economic-data

Python 3.02 k

10 days ago

dask / distributed

A distributed task scheduler for Dask

pydata dask distributed-computing Python Hacktoberfest

Python 1.62 k

1 day ago

pyjanitor-devs / pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor

pandas dataframe data data-engineering pydata Hacktoberfest

Python 1.41 k

2 days ago

pydata / pydata-sphinx-theme

A clean, three-column Sphinx theme with Bootstrap for the PyData community

sphinx sphinx-doc sphinx-theme Python pydata

Python 684

4 days ago

DataTau / datascience-anthology-pydata

PyData, The Complete Works of

data-science pydata Video Python Hacktoberfest

299

8 years ago

sgkit-dev / sgkit

Scalable genetics toolkit

pydata

Python 253

6 days ago

bodo-ai / Bodo

#Computer Science# High-Performance Python Compute Engine for Data and AI

artificial-intelligence big-data data-science distributed machine-learning NumPy optimization pandas parallel-computing pydata Python scikit-learn SQL

Python 246

2 days ago

data-apis / array-api

RFC document, tooling and other content related to the array API standard

pydata standard spec

Python 232

10 days ago

stringfestdata / advancing-into-analytics-book

Resources for Advancing into Analytics: From Excel to R and Python by George Mount (O'Reilly Media, 2021)

rstats pydata Python book excel data analytics statistics

Jupyter Notebook 208

1 year ago

JDASoftwareGroup / kartothek

A consistent table management library in python

Python pydata dask arrow parquet

Python 159

2 years ago

JasonKessler / Scattertext-PyData

#Natural Language Processing# Notebooks for the Seattle PyData 2017 talk on Scattertext

pydata natural-language-processing visualization text-visualization word2vec computational-social-science

HTML 142

7 years ago

dimgold / pycon_social_networkx

Social network analysis code examples for PyCon 2019 talk

Python social-network social-networks information-flow networkx pandas pydata Network graph-theory Jupyter Notebook

Jupyter Notebook 139

3 years ago

python-graphblas / python-graphblas

Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics

graphblas suitesparse Python numba graph-algorithms graph-theory complex-networks graph-analysis linear-algebra pydata sparse sparse-data sparse-matrix

Jupyter Notebook 132

6 days ago

rasbt / pydata-chicago2016-ml-tutorial

#Computer Science# Machine learning with scikit-learn tutorial at PyData Chicago 2016

scikit-learn tutorial pydata machine-learning

Jupyter Notebook 128

8 years ago

sktime / sktime-tutorial-pydata-amsterdam-2020

#Computer Science# Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020

machine-learning time-series scikit-learn tutorial pydata

Jupyter Notebook 122

3 years ago

WinVector / pyvtreat

#Computer Science# vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

pydata machine-learning data-science Python

Python 121

3 months ago