Get your documents ready for gen AI
#Natural Language Processing# Open-source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Knowledge Agents and Management in the Cloud
PDF Verse is a powerful web-based PDF editor with tools for editing, converting, and manipulating PDFs. Merge, compress, add or remove pages, or extract text using OCR technology. Convert PDF to DOC, ...
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Parse bank and credit card statements
Account statements of the Vietnam Fatherland Front (Mặt Trận Tổ Quốc Việt Nam, MTTQ) on support for people affected by Typhoon Yagi
#Large Language Models# Build a RAG preprocessing pipeline
Python client library for Graphlit Platform
Node.js library to convert JSON to PDF or vice versa
A cute PDF parser that gives the position of elements for inspection purposes.
ByteScout PDF Extractor SDK source code samples
A quick way to convert files (PDF, DOCX, HTML, PPTX, images) to Markdown, JSON, or YAML using Docling and Streamlit
This project converts books from PDF into well-formed JSON objects by separating titles from content. The resulting JSON file can then be inserted into a database easily.
🛠️ ipuresult-cli is a tool for creating JSON files from PDF result files 📚 of GGSIPU results
Convert Maybank statements delivered by email to CSV or JSON format
TypeScript client for Graphlit Platform