crawling
Here are 118 public repositories matching this topic...
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
-
Updated
May 9, 2025 - JavaScript
a reliable high-level web crawling & scraping framework for Node.js.
-
Updated
Mar 12, 2025 - JavaScript
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
-
Updated
May 19, 2020 - JavaScript
Sasori is a dynamic web crawler powered by Puppeteer, designed for lightning-fast endpoint discovery.
-
Updated
Jul 23, 2024 - JavaScript
-
Updated
Mar 16, 2024 - JavaScript
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
-
Updated
Aug 20, 2025 - JavaScript
⛏ A versatile Web scraper for Node.js
-
Updated
Oct 7, 2025 - JavaScript
A web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
-
Updated
Aug 24, 2023 - JavaScript
ProxyCrawl Node library for scraping and crawling
-
Updated
Jul 3, 2023 - JavaScript
A web page content extractor
-
Updated
Aug 13, 2024 - JavaScript
b̶̡̪̬͒l̸̰̗̝̀ỏ̷̡̩g̴͇̑g̶̲̱̽͐i̵̹͗n̶̤̥͂̅̆g̴̮̾̅͜ ̷̧͎͆i̷̛͒͜͠n̸̥̺͒ ̶͚͚͊̿͜t̸̺͙̭̆̊̈́ḧ̶̟́̐e̸̱͔̟̓̓͝ ̶̨͔̾͛̑d̵̥̣̏ȧ̷̼̊r̷̰̝̥̅̌͝k̵̟̥̞̉̍͛
-
Updated
Nov 11, 2018 - JavaScript
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
-
Updated
Jul 11, 2025 - JavaScript
A Node.js XML DOM, Parser & Stringifier.
-
Updated
Apr 19, 2022 - JavaScript
2023.11) velog statistics dashboard fullstack
-
Updated
Nov 23, 2023 - JavaScript
Improve this page
Add a description, image, and links to the crawling topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the crawling topic, visit your repo's landing page and select "manage topics."