OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
#大语言模型#Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSO...
#自然语言处理#Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
#自然语言处理#OCR, Archive, Index and Search: Implementation agnostic OCR framework.
#计算机科学#A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in th...
Lightweight & fast OCR models for license plate text recognition.
A FLOSS software for Persian Optical Character Recognition
PDF text data extraction web app with OCR for scanned documents
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SOTA。
#自然语言处理#OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
Turn any OCR models into online inference API endpoint 🚀 🌖
Custom C++ implementation of deep learning based OCR
MyLittleOCR 是一个统一的 OCR 库包装器,提供一致的 API,便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.
Fully-Featured Automated License Plate Recognition Database for Blue Iris + CodeProject AI Server 🚘
Optical Character Recognition in Python.