”pdf-ocr“ 的搜索结果

OCRmyPDF

@ocrmypdf

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

翻译 - OCRmyPDF将OCR文本层添加到扫描的PDF文件中，从而可以对其进行搜索

Python ocr pdf 图像处理 tesseract

Python14.24 k

2 天前

Google Bing GitHub

image-processing ai-assist trainticket yolo3 tesseract chinese-ocr darknet-text-detect llama2 ocr-correction ocr

OCRmyPDF

@fritz-hh

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

260

9 年前

pypdfocr

@virantha

Python script to do PDF OCR conversion using Tesseract

Python373

1 年前

swift-ocr-llm-powered-pdf-to-markdown

@yigitkonur

An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex...

Python709

2 个月前

llm_aided_ocr

@Dicklesworthstone

#大语言模型#Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

ai-assist llama2 llm ocr tesseract

Python2.21 k

3 个月前

OCRmyPDF-web

@sseemayer

A tiny frontend for OCRing PDF files via the web.

JavaScript46

5 年前

ocr_for_transcribing_pdf_slides

@EnkrateiaLucca

Code for my medium article: ["Faster Notes with Python and Deep Learning"](https://medium.com/p/b713bbb3c186/edit)

Jupyter Notebook139

4 年前

pdftabextract

@WZBSocialScienceCenter

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Python2.22 k

2 年前

obsidian-omnisearch

@scambier

A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.

TypeScript1.25 k

20 天前

pdfocr

Geza Kovacs@gkovacs

Adds text to PDF files using the cuneiform OCR software

翻译 - 使用cuneiform OCR软件将文本添加到PDF文件

Ruby325

4 年前

OCRmyPDF-Desktop

@FanQinFred

PDF OCR Application, adds an OCR text layer to scanned PDF files, allowing them to be copied and searched.

1 年前

pdf-to-text

@maiaPhilippe

PDF OCR using Pure Javascript by tesseract.js api

HTML18

7 年前

autobookmark

@zwxbest

给ocr过的pdf加书签

Java11

2 年前

PDFtoTXT

@lucab85

Python code to read text from a PDF file (OCR).

Python64

5 年前

pdf-text-data-extractor

@nainiayoub

PDF text data extraction web app with OCR for scanned documents

Python80

6 个月前

Hazel-Acrobat-OCR-AppleScript

@macdrifter

AppleScript for Hazel to send PDF scans to Acrobat for OCR

15 年前

ocr2text

@writecrow

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Python101

2 年前

ocr-python

@NanoNets

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

Jupyter Notebook26

2 年前

java-ocr-api

@Asprise

Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with full coordinate as well as searchable PDF

Java132

9 年前