wget · GitHub Topics

"Your own personal internet archive" (网站存档 / 爬虫)，一个自托管的网站时光机

pocket wget browser-bookmarks pinboard Chromium Firefox backups RSS web-archiving Python wayback-machine youtube-dl 自托管 headless-browser digipres warc

Python 23.62 k

23 天前

wkentaro / gdown

#下载器#Google Drive Public File Downloader when Curl/Wget Fails

Google Drive wget cURL download 下载器 Python

Python 4.59 k

8 个月前

circulosmeos / gdown.pl

Google Drive direct download of big files

Google Drive wget Dockerfile

Perl 942

2 年前

benibela / xidel

#网络爬虫#Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...

xquery XML HTML JSON xpath 命令行界面 HTTP Web REST API css-selector wget cURL httpie webscraper webscraping scraper datascraping data-processing

Pascal 721

2 个月前

bitextor / bitextor

#网络爬虫#Bitextor generates translation memories from multilingual websites

dictionaries 爬虫 wget Parsing warc corpus-tools corpus-processing machine-translation neural-machine-translation statistical-machine-translation

Python 292

5 个月前

icy / google-group-crawler

#网络爬虫#[Deprecated] Get (almost) original messages from google group archives. Your data is yours.

Google cookie wget Bash 爬虫 ownership cURL

Shell 215

3 年前

teracow / googliser

#下载器#a fast BASH multiple-image downloader

Bash Google Image download Script Shell gallery Linux wget cURL fast imagemagick montage Debian macOS manjaro

Shell 208

4 年前

Yezz123-Archive / Phisher

Perform various social engineering attacks using PHP, Apache, Ngrok 🦥

phishing PHP ngrok wget Linux apache

HTML 205

4 年前

georgjaehnig / webpages-to-ebook

Create an EPUB from a list of URLs. Standing on the shoulders of Wget, Readability and Pandoc.

epub-generation epub wget pandoc ebook JavaScript npm

JavaScript 201

10 个月前

jiejieTop / http-client

A high-performance, high-stability, cross-platform HTTP client.

HTTP http-client rtos Linux tcpip http-parser cross-platform TLS (Transport Layer Security)request wget cURL

C 197

2 年前

BioOmics / iSeq

Download sequencing data and metadata from GSA, SRA, ENA, and DDBJ databases.

Bioinformatics metadata ngs wget

Shell 184

11 天前

yiliyassh / authority-data

#网络爬虫#官方权威数据：统计年签，统计公报，互联网行业报告，工信部数据，ICT报告等 Official authoritative data (Chinese)

Shell wget awk data 免费爬虫统计 Python

Python 168

6 个月前

fdciabdul / InsideHeartz-WhatsApp-Bot

A multipurpose whatsapp bot buillt on node.js

下载器 WhatsApp whatsapp-api YouTube nhentai Hacktoberfest hacktoberfest2020 voice mp3 wget

JavaScript 164

2 年前

mihaigalos / aim

🎯 A command line download/upload tool with resume.

Rust 下载器 wget resume cURL 命令行界面 command-line-tool

Rust 135

1 天前

otavio / rsget

A simple command line utility to download a remote file, similar to wget. This is not intended to be a full feature wget replacement but a simple tool to test few Rust crates.

wget Rust simple

Rust 135

6 个月前