#自然语言处理#Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
#计算机科学#Large-scale pretraining for dialogue
翻译 - 对话的大规模预培训
#计算机科学#Large-scale pretrained models for goal-directed dialog
#自然语言处理#Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
翻译 - 将最好的TF集成到PyTorch中,用于机器学习,自然语言处理和文本生成
#自然语言处理#Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
#自然语言处理#Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation
#自然语言处理#Cleans Reddit Text Data 📜 🧹
Tools to uniformly read in text data including semi-structured transcripts
#自然语言处理#Question Classification for the dataset CogComp QC Dataset - [ http://cogcomp.org/Data/QA/QC/ ].
#自然语言处理#A Python library that enables smooth keyword extraction from any text using the RAKE(Rapid Automatic Keyword Extraction) algorithm.
Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).
#网络爬虫#Scrape EDGAR filings from https://www.sec.gov/
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are...
#自然语言处理#How Will Your Tweet Be Received? Predicting theSentiment Polarity of Tweet Replies
#自然语言处理#A dataset which contains 30k+ so called "self-help" tweets from 100+ authors.
#自然语言处理#A machine learning model that predicts tags for a given question and body.
Directional Co-clustering with a Conscience (DCC)