Get clean data from tricky documents, powered by vision-language models ⚡
-
Updated
Oct 18, 2025 - Python
Get clean data from tricky documents, powered by vision-language models ⚡
source for Open States scrapers
A framework for creating semi-automatic web content extractors
Python scripts for building 'Short Jokes' dataset, featured on Kaggle
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Scraper for Google Maps "Popular Times" for place entries
A Elegant Fast Multi Threaded Bypass Bot for Bigger Deeds. Try Now !!
A Python scraper for Tokopedia that supports filtered product search, detailed product information, and customer reviews with accurate mobile pricing and Jupyter Notebook compatibility.
A basic python 3 based web scraper for extracting reviews from Amazon. Built using Selectorlib and requests
CLI OSINT app that can fetch data from Instagram's Web API without authentication.
Library for scraping websites or apis at any scale
Using Apache Airflow to schedule web scrapers
Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis
In this guide on how to web scrape with Selenium, we will be using Python 3. The code should work with any version of Python above 3.6
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Board game data scraper
This Python code scrapes Google search results then applies sentiment analysis, generates text summaries, and ranks keywords.
Add a description, image, and links to the scrapers topic page so that developers can more easily learn about it.
To associate your repository with the scrapers topic, visit your repo's landing page and select "manage topics."