#Natural Language Processing#Transforms PDF, Documents and Images into Enriched Structured Data
#Computer Science#Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Extract internal monitoring data from application logs for collection in a timeseries database
A library for audio and music analysis
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
The PropertyAccess component provides functions to read and write from/to an object or array using a simple string notation
Visual Novels resource browser
Extract files from any kind of container format
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
#Natural Language Processing#Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
#Large Language Models#🦜⛏️ Did you say you like data?
#Natural Language Processing#Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
A C++ static library offering a clean and simple interface to the 7-zip shared libraries.
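#Natural Language Processing#Stanford Open Information Extraction made simple!
#Natural Language Processing#A survey of the information extraction field by the natural language processing research team at the Big Data Advanced Innovation Center of Beihang University. It covers subtasks such as entity recognition, relation extraction, and attribute extraction, and surveys both academia and industry for each subtask.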
File Injector is a script that allows you to store any file in an image using steganography
PHP URI Template (RFC 6570) supports both URI expansion & extraction
DataTool is a program that lets you extract models, maps, and files from Overwatch.