#

corpus

https://static.github-zh.com/github_avatars/dariusk?size=40

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

JavaScript 5.03 k
8 天前
https://static.github-zh.com/github_avatars/wainshine?size=40

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

4.2 k
2 年前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4.19 k
1 个月前
https://static.github-zh.com/github_avatars/lucasjinreal?size=40

#网络爬虫#Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator

Python 2.32 k
6 年前
https://static.github-zh.com/github_avatars/fendouai?size=40

Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:

Python 2.13 k
1 年前
https://static.github-zh.com/github_avatars/candlewill?size=40

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Python 2.05 k
5 年前
https://static.github-zh.com/github_avatars/gunthercox?size=40
Python 1.41 k
3 个月前
https://static.github-zh.com/github_avatars/NiuTrans?size=40

非常全的文言文(古文)-现代文平行语料

Python 1.38 k
1 年前
https://static.github-zh.com/github_avatars/wainshine?size=40

公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。

1.28 k
2 年前
https://static.github-zh.com/github_avatars/PlexPt?size=40

ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型

921
1 年前
https://static.github-zh.com/github_avatars/OYE93?size=40
Python 911
5 年前
https://static.github-zh.com/github_avatars/quanteda?size=40

#自然语言处理#An R package for the Quantitative Analysis of Textual Data

R 862
3 个月前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
Python 815
5 年前
loading...
Website
Wikipedia