A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
#自然语言处理#A sentence segmenter that actually works!
#自然语言处理#Punctuation restoration and spell correction experiments.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
Text normalization library for Python
Text and Punctuation correction with Deep Learning
Pre-process arabic text (remove diacritics, punctuations and repeating characters)
#自然语言处理#Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to...
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.
#自然语言处理#Apache OpenNLP wrapper for Nodejs
#自然语言处理#A small seq2seq punctuator tool based on DistilBERT
#自然语言处理#Нейронная сеть для восстановления пунктуации на русском языке.
#自然语言处理##Sentimental Analytics
#自然语言处理#Sequence to sequence model for Arabic punctuation prediction.
Regular expression for matching punctuation characters.
A blazingly fast tool for converting to English punctuations
Regular Expressions for finding wrong punctuation before publishing.
a ConTeXt LMTX module to support Chinese punctuation