Rip data from the net, leaving no trace. Welcome to the future of web scraping.
CyberScraper 2077 is not just another web scraping tool – it's a glimpse into the future of data extraction. Born from the neon-lit streets of a cyberpunk world, this AI-powered scraper uses OpenAI to slice through the web's defenses, extracting the data you need with unparalleled precision and style.
Whether you're a corpo data analyst, a street-smart netrunner, or just someone looking to pull information from the digital realm, CyberScraper 2077 has got you covered.
- 🤖 AI-Powered Extraction: Utilizes cutting-edge AI models to understand and parse web content intelligently.
- 💻 Sleek Streamlit Interface: User-friendly GUI that even a chrome-armed street samurai could navigate.
- 🔄 Multi-Format Support: Export your data in JSON, CSV, HTML, SQL or Excel – whatever fits your cyberdeck.
- 🌐 Stealth Mode: Built-in proxy support to keep you ghosting through the net. (Currently under development)
- 🚀 Async Operations: Lightning-fast scraping that would make a Trauma Team jealous.
- 🧠 Smart Parsing: Structures scraped content as if it was extracted straight from the engram of a master netrunner.
- 🛡️ Ethical Scraping: Respects robots.txt and site policies. We may be in 2077, but we still have standards.
- 🛡️ Navigate through the Pages: Navigate through the webpage and scrap the data from different pages. (Coming Soon)
Check out our YouTube video for a full walkthrough of CyberScraper 2077's capabilities.
-
Clone this repository:
git clone https://github.com/itsOwen/CyberScraper-2077.git cd CyberScraper-2077
-
Create and activate a virtual environment:
virtualenv even source venv/bin/activate # Optional
-
Install the required packages:
pip install -r requirements.txt
-
Install the playwright:
playwright install
-
Fire up the Streamlit app:
streamlit run main.py
-
Open your browser and navigate to
http://localhost:8501
. -
Enter the URL of the site you want to scrape or ask a question about the data you need.
-
Ask the chatbot to extract the data in any format, Select whatever data you want to export or even everything from the webpage.
-
Watch as CyberScraper 2077 tears through the net, extracting your data faster than you can say "flatline"!
We welcome all cyberpunks, netrunners, and code samurais to contribute to CyberScraper 2077!
Ran into a glitch in the matrix? Let me know by adding the issue to this repo so that we can fix it together.
Q: Is CyberScraper 2077 legal to use? A: CyberScraper 2077 is designed for ethical web scraping. Always ensure you have the right to scrape a website and respect their robots.txt file.
Q: Can I use this for commercial purposes? A: Yes, under the terms of the MIT License. But remember, in Night City, there's always a price to pay, Just kidding!
This project is licensed under the MIT License - see the LICENSE file for details. Use it, mod it, sell it – just don't blame us if you end up flatlined.
Got questions? Need support? Want to hire me for a gig?
- 📧 Email: owensingh72@gmail.com
- 🐦 Twitter: @_owensingh
- 💬 Website: Portfolio
CyberScraper 2077 – Because in 2077, what makes someone a criminal? Getting caught.
Built with ❤️ and chrome by the streets of Night City | © 2077 Owen Singh