#Natural Language Processing#Extract keywords from sentences or replace keywords in sentences.
#Web Crawler#🕷️ An undetectable, powerful, flexible, high-performance Python library that makes web scraping simple and easy again!
Converts a PDF file into a text file while keeping the layout of the original PDF. Useful, for instance, for extracting the content of a table in a PDF file. This is a subclass of PDFTextStripper class (f...
#Computer Science#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
#Web Crawler#Lightweight library for scraping websites with LLMs
#Web Crawler#📰 Let ChatGPT Summarize Hacker News for You
A beginner-friendly yet powerful Python toolkit for financial analysis and automation, built to make modern investing accessible to everyone
🚜 Parse text and tables from PDF files.
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Undetected Web-Scraping & Seamless HTML Parsing in Python!
A tool for scraping emails, social media accounts, and much more information from websites using Google Search Results.
A Python client for the Sypht API
This repository provides usage examples for the Python module Newspaper3k.
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Structured HTML table data extraction from URLs in Go, with almost no external dependencies
#Large Language Model#Accurate, private and configurable document retrieval LLM