#Natural Language Processing# Search all Chinese NLP datasets, with commonly used English NLP datasets included
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
#Web Crawler# Crawl BookCorpus
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Generative AI for Math: MathPile