GitHub Chinese Community


Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.


Related topics

data-analysis machine-learning data-visualization

microsoft / ML-For-Beginners

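#Getting Started# Microsoft's machine learning course for beginners, organized as 12 weeks and 25 lessons.

machine-learning data-science Python machinelearning-python scikit-learn scikit-learn-python R education microsoft-for-beginners

HTML 74.69 k

1 month ago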

apache / superset

Apache Superset is an enterprise-grade platform for data visualization and data analysis.

superset apache apache-superset data-visualization data-viz analytics business-intelligence data-science data-engineering asf bi business-analytics data-analytics data-analysis Python React sql-editor Flask

TypeScript 67.12 k

1 hour ago

keras-team / keras

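#Computer Science# Keras is a Python deep learning library. Older releases ran on top of TensorFlow, Microsoft Cognitive Toolkit, Theano, or PlaidML; Keras 3 runs on JAX, TensorFlow, and PyTorch.

deep-learning Tensorflow neural-networks machine-learning data-science Python jax PyTorch

Python 63.21 k

1 day ago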

scikit-learn / scikit-learn

#Computer Science# scikit-learn is a Python machine learning framework built on SciPy, NumPy, and matplotlib.

machine-learning Python statistics data-science data-analysis

Python 62.65 k

1 day ago

pandas-dev / pandas

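pandas is a flexible, powerful Python library for data manipulation and data analysis. It provides labeled data structures similar to R's data.frame objects, statistical functions, and more.

data-analysis pandas flexible alignment Python data-science

Python 45.99 k

1 day ago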

apache / airflow

#Computer Science# Apache Airflow is a platform for scheduling, orchestrating, and monitoring workflows.

airflow apache apache-airflow Python scheduler workflow automation dag data-engineering data-integration data-orchestrator data-pipelines data-science elt etl machine-learning mlops orchestration workflow-engine workflow-orchestration

Python 41.01 k

1 hour ago

GokuMohandas / Made-With-ML

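#Natural Language Processing# Learn how to design, develop, deploy, and iterate on production-grade machine learning applications.

machine-learning deep-learning PyTorch natural-language-processing data-science Python mlops data-engineering data-quality llm ray distributed-training

Jupyter Notebook 40.91 k

1 year ago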

streamlit / streamlit

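#Computer Science# Streamlit is a tool that turns Python scripts into interactive, visual web pages. It is well suited to data analysts.

Python machine-learning data-science deep-learning data-visualization Streamlit data-analysis developer-tools

Python 40.42 k

6 hours ago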

gradio-app / gradio

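#Computer Science# Gradio is an open-source Python library for building machine learning and data science demos and web applications. With Gradio, you can quickly create an attractive user interface around your machine learning model or data science workflow. Users can try it out by dragging and dropping their own images, pasting text, or recording their own voice, and interact with your demo through the browser.

machine-learning models ui ui-components interface Python data-science data-visualization deep-learning data-analysis gradio gradio-interface python-notebook deploy Hacktoberfest

Python 39.02 k

12 hours ago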

ray-project / ray

#Large Language Models# Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI libraries for accelerating ML workloads.

ray distributed parallel machine-learning reinforcement-learning deep-learning Python rllib hyperparameter-search optimization data-science hyperparameter-optimization serving deployment PyTorch Tensorflow llm-serving large-language-models llm llm-inference

Python 38.01 k

5 hours ago

explosion / spaCy

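#Natural Language Processing# Industrial-strength natural language processing (NLP) library written in Python and Cython.

natural-language-processing data-science machine-learning Python cython artificial-intelligence spaCy nlp-library neural-network neural-networks deep-learning named-entity-recognition Entity resolution text-classification tokenization

Python 31.97 k

2 months ago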

AMAI-GmbH / AI-Expert-Roadmap

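#Learning and Skill Development# A 2022 learning roadmap for becoming an artificial intelligence expert.

deep-learning artificial-intelligence roadmap ai-roadmap machine-learning study-plan data-science data-analysis neural-network

JavaScript 30.07 k

2 years ago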

microsoft / Data-Science-For-Beginners

A data science course for everyone: 10 weeks, 20 lessons.

data-science Python data-visualization data-analysis pandas microsoft-for-beginners

Jupyter Notebook 29.94 k

1 month ago

Lightning-AI / pytorch-lightning

#Computer Science# Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python deep-learning artificial-intelligence PyTorch data-science machine-learning

Python 29.79 k

1 day ago

donnemartin / data-science-ipython-notebooks

#Computer Science# Python data science study notebooks: deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, core Python, AWS, and Linux commands.

Python machine-learning deep-learning data-science big-data Amazon Web Services Tensorflow theano caffe scikit-learn kaggle Apache Spark mapreduce hadoop matplotlib pandas NumPy SciPy Keras

Python 28.36 k

1 year ago

eugeneyan / applied-ml

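#Natural Language Processing# Curated papers, technical blog posts, and other resources in which large companies share how they apply data science and machine learning in production.

applied-machine-learning production applied-data-science machine-learning data-science reinforcement-learning data-engineering recsys search deep-learning data-quality data-discovery computer-vision natural-language-processing

28.11 k

1 year ago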

CamDavidsonPilon / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

bayesian-methods pymc mathematical-analysis Jupyter Notebook data-science statistics

Jupyter Notebook 27.6 k

1 year ago

academic / awesome-datascience

#Awesome# A curated collection of data science resources.

data-science machine-learning data-visualization science data-mining Awesome Lists deep-learning analytics data-scientists Hacktoberfest

26.87 k

17 days ago

eriklindernoren / ML-From-Scratch

#Computer Science# Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...

machine-learning deep-learning deep-reinforcement-learning machine-learning-from-scratch data-science data-mining genetic-algorithm

Python 26.62 k

2 years ago

d2l-ai / d2l-en

#Natural Language Processing# Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

deep-learning machine-learning book notebook computer-vision natural-language-processing Python kaggle data-science mxnet PyTorch Tensorflow Keras gaussian-processes hyperparameter-optimization recommender-system reinforcement-learning jax

Python 26.31 k

1 year ago
