📚中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能实验室
MEV Data Corpus
近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
📚中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室
ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型
#自然语言处理#WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Event StoryLine Corpus - annotated data, baselines and evaluation scripts, evaluation data.
A multilingual dialog corpus
The Abstraction and Reasoning Corpus
翻译 - 抽象和推理语料库
A bespoke NLP Chatbot trained using a corpus of Reddit data.
Collections of Chinese NLP corpus
ACE 2005 Corpus Preprocessing
Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:
自然语言处理,知识图谱相关语料。按照Task细分,欢迎PR。
Yet another CSS toolkit. Basically the stuff I use for most projects.
翻译 - 另一个CSS工具箱。基本上,我用于大多数项目的东西。
Simulation data from VCTK Corpus (version 0.92) for direction of arrival (DoA) estimation, and detailed data simulation process.
Indonesian-English Bilingual Corpus
It uses machine learning models (Multinomial NB & SVM) to predict whether the email is spam or ligitimate on two corpus namely Ling-spam corpus and Euron-spam corpus.
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
word2vec/glove/swivel binary file on chinese corpus
Corpus for github.com/dvyukov/go-fuzz examples
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 con...
ACE 2005 corpus preprocessing for Event Extraction task
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.