#网络爬虫#news-please - an integrated web crawler and information extractor for news that just works
⛓ Extract web links information: title, description, images, videos, etc. [via OpenGraph], runs on mobiles and node.
#自然语言处理#A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Receipt scanner extracts information from your PDF or image receipts - built in NodeJS
python implementation of jordansissel's grok regular expression library
#自然语言处理#Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...
Pluck text in a fast and intuitive way 🐓
Extract Information from web corpus using Open Information Extraction.
From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.
#自然语言处理#An open information extraction system that provides compact extractions
#自然语言处理# simple rule based named entity recognition
Morphological Building Index, extract Buildings from a high-resolution top view image.
This program can be used to parse the NCBI GenBank file to create a tabulated csv file.
#大语言模型#Template for an AI application that extracts the job information from a job description using openAI functions and langchain
#自然语言处理#Natural Language Processing is process in which computer understand human language. This library provides a set of tools to understand and extract information from unstructured text in Slovak language...
#计算机科学#🏆 An applicant tracking system (ATS) is a software application that enables the electronic handling of recruitment and hiring needs. Corporate recruiters or hiring managers can then search and...
Github Action to extract info from the webhook payload object using jq filters.