#Web crawler# Crawlee - a web scraping and browser automation library for Node.js
SingleFile is a browser extension that saves a complete web page as a single HTML file, with images preserved. It supports Chrome, Firefox, Edge, Vivaldi, Brave, Waterfox, Yandex, Opera, and other browsers.
Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
Run Chrome in Docker for headless browser automation tasks.
Proxy server to bypass Cloudflare protection
An AI web browsing framework focused on simplicity and extensibility.
A developer-friendly, Docker-powered stateless API for converting numerous document formats, including HTML, Markdown, and Office documents, into PDF files, and more!
Lightpanda: the headless browser designed for AI and automation
Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.
💯 Teach puppeteer new tricks through plugins.
Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on ...
A Headless Chrome rendering solution
#Web crawler# Turn any webpage into structured data using LLMs
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
#Web crawler# Analysis of bot protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?
Scan your entire site with Google Lighthouse in 2 minutes (on average). Open source, fully configurable with minimal setup.
Headless chrome/chromium automation library (unofficial port of puppeteer)