#Natural Language Processing#🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
#Data Warehouse#awesome-public-datasets - A collection of open data across many categories, including but not limited to economics, agriculture, biology, public welfare, meteorology, and data security
#Data Warehouse#TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
#Data Warehouse#Machine learning datasets used in tutorials on MachineLearningMastery.com
Datasets used in Plotly examples and documentation
A public repo of datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
A list of all public EEG-datasets
Video datasets
Procedural reasoning datasets
#Computer Science#A repository of pretty cool datasets that I collected for network science and machine learning research.
#Data Warehouse#🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Toolkit for linearizing PDFs for LLM datasets/training
Datasets resource
#Data Warehouse#Papers and datasets about point clouds.
#Face Recognition#Face-related datasets
#Data Warehouse#Large datasets for conversational AI