An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex...
#大语言模型#Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
A tiny frontend for OCRing PDF files via the web.
Code for my medium article: ["Faster Notes with Python and Deep Learning"](https://medium.com/p/b713bbb3c186/edit)
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
Adds text to PDF files using the cuneiform OCR software
翻译 - 使用cuneiform OCR软件将文本添加到PDF文件
PDF OCR Application, adds an OCR text layer to scanned PDF files, allowing them to be copied and searched.
PDF OCR using Pure Javascript by tesseract.js api
给ocr过的pdf加书签
PDF text data extraction web app with OCR for scanned documents
AppleScript for Hazel to send PDF scans to Acrobat for OCR
Convert a PDF via OCR to a TXT file in UTF-8 encoding
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with full coordinate as well as searchable PDF
Sane command-line scan-to-pdf script on Linux with OCR and deskew support
Extract tables from scanned documents pdf into csv file using ocr and image processing
Text is extracted from scanned PDF document using OCR in python
一个简单的在线PDF工具箱,目前支持pdf压缩以及OCR
yolo3+ocr
翻译 - yolo3 + ocr