⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Diff Match Patch is a high-performance library, available in multiple languages, for manipulating plain text.
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.
Intuitive find & replace CLI (sed alternative)
#Natural Language Processing# fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
#Natural Language Processing# 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Python library for creating PEG parsers
#Computer Science# Text Classification Algorithms: A Survey
#Natural Language Processing# Persian NLP Toolkit
#Natural Language Processing# The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Program to convert lines of text into a tree structure.
A fast implementation of Aho-Corasick in Rust.
#Natural Language Processing# Thai natural language processing in Python
A fast and convenient fuzzy matcher library for Rust
A simple Python module for parsing human names into their individual components
#Natural Language Processing# Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...
#Natural Language Processing# Natural language detection library for Go
#Natural Language Processing# Open Korean Text Processor - An Open-source Korean Text Processor