#

text-processing

https://static.github-zh.com/github_avatars/learnbyexample?size=40

⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨

Shell 10.19 k
1 年前
pymupdf/PyMuPDF
https://static.github-zh.com/github_avatars/pymupdf?size=40

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8.23 k
2 天前
https://static.github-zh.com/github_avatars/google?size=40

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

Python 7.93 k
1 年前
https://static.github-zh.com/github_avatars/chmln?size=40

Intuitive find & replace CLI (sed alternative)

Rust 6.63 k
6 个月前
https://static.github-zh.com/github_avatars/fastnlp?size=40
Python 3.14 k
2 年前
chonkie-ai/chonkie
https://static.github-zh.com/github_avatars/chonkie-ai?size=40
Python 2.87 k
7 个月前
https://static.github-zh.com/github_avatars/pyparsing?size=40
Python 2.4 k
15 天前
https://static.github-zh.com/github_avatars/helix-editor?size=40

A fast and convenient fuzzy matcher library for rust

Rust 1.24 k
5 个月前
https://static.github-zh.com/github_avatars/birchb1024?size=40

Program to convert lines of text into a tree structure.

Go 1.2 k
2 年前
https://static.github-zh.com/github_avatars/BurntSushi?size=40

A fast implementation of Aho-Corasick in Rust.

Rust 1.14 k
1 年前
https://static.github-zh.com/github_avatars/sstadick?size=40
Rust 720
4 个月前
https://static.github-zh.com/github_avatars/derek73?size=40

A simple Python module for parsing human names into their individual components

Python 688
1 年前
https://static.github-zh.com/github_avatars/wenet-e2e?size=40

Text Normalization & Inverse Text Normalization

Python 677
8 天前
https://static.github-zh.com/github_avatars/cbaziotis?size=40

#自然语言处理#Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...

Python 671
4 个月前
loading...
Website
Wikipedia