"Your own personal internet archive" (website archiving / crawler), a self-hosted time machine for websites
#Downloader#Google Drive Public File Downloader when Curl/Wget Fails
#Web crawler#Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...
#Web crawler#Bitextor generates translation memories from multilingual websites
Create an EPUB from a list of URLs. Standing on the shoulders of Wget, Readability and Pandoc.
A high-performance, high-stability, cross-platform HTTP client.
Download sequencing data and metadata from GSA, SRA, ENA, and DDBJ databases.
A multipurpose WhatsApp bot built on Node.js
A simple command line utility to download a remote file, similar to wget. This is not intended to be a full-featured wget replacement but a simple tool to test a few Rust crates.
🎯 A command line download/upload tool with resume support.
Install Ubuntu in Termux without a rooted device
#Web crawler#A Bash script to spider a site, follow links, and fetch URLs (with built-in filtering) into a generated text file.