#自然语言处理#NeuSpell: A Neural Spelling Correction Toolkit
#自然语言处理#Modern spell checking library - accurate, fast, multi-language
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
翻译 - 用于非商业研究用途和预处理脚本的Covid-19 Twitter数据集-正在积极开发中
#大语言模型#Next-token prediction in JavaScript — build fast language and diffusion models.
A C++ library providing fast language model queries in compressed space.
#自然语言处理#Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...
#自然语言处理#NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
#自然语言处理#Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
#自然语言处理#This Repository Contains Solution to the Assignments of the Natural Language Processing Specialization from Deeplearning.ai on Coursera Taught by Younes Bensouda Mourri, Łukasz Kaiser, Eddy Shyu
A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
#自然语言处理#Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
#自然语言处理#Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
#自然语言处理#🦜 NLP for Tibetan, in Python.
#自然语言处理#Rust library providing fast language model queries in compressed space
#计算机科学#Detecting Malware in PE files
#自然语言处理#NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.