An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
翻译 - 具有多语言支持的集成语料库工具,用于语言,文学和翻译研究
#网络爬虫#Bitextor generates translation memories from multilingual websites
#自然语言处理# Python scripts preprocessing Penn Treebank and Chinese Treebank
#自然语言处理#OpusFilter - Parallel corpus processing toolkit
Utilities for Processing the Switchboard Dialogue Act Corpus
#自然语言处理#A Serverless Text Annotation Tool for Corpus Development
A parser for annotated MuseScore 3 files.
#自然语言处理#Reading the data from OPIEC - an Open Information Extraction corpus
Utilities for Processing the Meeting Recorder Dialogue Act Corpus
A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry
#自然语言处理#Korpuslinguistik war noch nie so einfach...
A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.
Hard-Forked from JuliaText/TextAnalysis.jl
#自然语言处理#ALvisNLP corpus processing engine
#自然语言处理#Measure the similarity of text corpora for 74 languages
Scripts for building a geo-located web corpus using Common Crawl data
A set of corpus-based sampling & analysis M4L devices
Script that sets up and configures an entire CQPweb server installation
#自然语言处理#Plotly-Dash NLP project. Document similarity measure using Latent Dirichlet Allocation, principal component analysis and finally follow with KMeans clustering. Project is completed with dynamic visual...