ngrams · GitHub Topics

neuspell / neuspell

#自然语言处理#NeuSpell: A Neural Spelling Correction Toolkit

spelling-correction spellcheck neural-models spell-checker 自然语言处理 dataset ngrams

Python 691

2 年前

bakwc / JamSpell

#自然语言处理#Modern spell checking library - accurate, fast, multi-language

spellcheck spellchecker ngrams 自然语言处理 C++Python spelling-correction Java Ruby C#

C++ 634

7 个月前

thepanacealab / covid19_twitter

Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development

翻译 - 用于非商业研究用途和预处理脚本的Covid-19 Twitter数据集-正在积极开发中

tweets dataset retweets tweets-acquired frequent-terms twitter-stream dissemination ngrams

Jupyter Notebook 475

2 年前

bennyschmidt / next-token-prediction

#大语言模型#Next-token prediction in JavaScript — build fast language and diffusion models.

人工智能 autocomplete autocompletion diffusion-models language-models 大语言模型 markov-chain ngrams embeddings

JavaScript 143

7 个月前

jermp / tongrams

A C++ library providing fast language model queries in compressed space.

trie ngrams language-model

C++ 129

2 年前

proycon / colibri-core

#自然语言处理#Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...

C++Python 自然语言处理 ngrams skipgram ngram corpus Library text-processing computational-linguistics pattern-recognition

C++ 126

4 个月前

winkjs / wink-nlp-utils

#自然语言处理#NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.

tokenize STEM ngrams bag-of-words 自然语言处理

JavaScript 126

1 年前

landrok / language-detector

A fast and reliable PHP library for detecting languages

ngrams

PHP 124

1 年前

joshualoehr / ngram-language-model

#自然语言处理#Python implementation of an N-gram language model with Laplace smoothing and sentence generation.

ngram perplexity 自然语言处理 language-model Python ngrams language-models

Python 83

7 年前

shantanu1109 / Coursera-DeepLearning.AI-Natural-Language-Processing-Specialization

#自然语言处理#This Repository Contains Solution to the Assignments of the Natural Language Processing Specialization from Deeplearning.ai on Coursera Taught by Younes Bensouda Mourri, Łukasz Kaiser, Eddy Shyu

Coursera 自然语言处理 hashing logistic-regression naive-bayes pca bag-of-words cbow markov-chain ngrams pos-tagging tokenization

Jupyter Notebook 73

2 年前

postmodern / raingrams

A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.

Ruby ngrams

Ruby 69

4 年前

orgtre / google-books-ngram-frequency

Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code

Google language-learning ngrams wordlist

Python 64

2 年前

anfederico / poesy

#自然语言处理#Poetry generation via natural language markov models

poetry 自然语言处理 modeling ngrams

Python 55

2 个月前

starlordvk / Typing-Assistant

#自然语言处理#Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

自然语言处理 JavaScript Python ngrams autocompletion prediction corpus keyboard

CSS 54

7 年前