ocr-correction · GitHub Topics

#Large Language Models# Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 2.61k

1 month ago

Python 3 library for processing historical English

Python 65

8 months ago

#Computer Science# Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"

Jupyter Notebook 36

1 year ago

This project corrects errors in OCR text for languages whose sentences are written without spaces

Python 6

6 years ago

🐀 Clean up that ratty OCR

Python 3

2 years ago