Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
Context-sensitive word embeddings with subwords. In Rust.
Properly handle position-dependent phones in a subword lexicon FST
#Natural Language Processing# Morfessor EM+Prune
Semantic role labeling with subwords (character, character-ngram and morphology)
#Natural Language Processing# A Python package that builds a corpus vocabulary using the byte pair methodology, plus a tokenizer that tokenizes input texts based on the built vocabulary.
#Natural Language Processing# A tool for generating subword-level (phone or grapheme) embeddings from an HTK-style MLF ASR corpus
#Natural Language Processing# Classified sentences as Slovak, Czech, or English. Implemented relevant preprocessing steps, addressed the class imbalance in the training set by employing the learned theory of Naive Bayes Mode...
This repository contains the code to learn subword embeddings from the arXiv dataset of 1.7M+ scholarly papers.
#Natural Language Processing# 🕵️ RNN-based language model for generating Sherlock Holmes stories.