Web Scrapers built using:
- Node.js
- Express
- Cheerio
- Axios
- Dotenv
- File System (fs)
- Path
- PM2 Process Management (daemon process manager)
- Node-Fetch
- Unirest
- Got-Scraping
- Node-Cron
- Crawlee
- Puppeteer
- Playwright
- EJS
- yt-trending-scraper
- PDFKit
- Json2Csv
- CsvToJson
- Docker
- Scrapes the Formula One website for the latest news, results,standings, converts the scraped data into a PDF file, and saves it to a local folder.
- Scrapes the website for the latest books, converts the scraped data into a CSV file & saves it to a local folder.
- Version 1: Scrapes the website for the latest news.
- Version 2: The CheerioCrawler version using Crawlee is similar, but since Crawlee "simulates" the actions of a real user, the browser settings are defaulted to
"headless: false", so the designated browser opens & the whole program runs as automated. Also, any & all Datasets are stored in a storage folder in the root directory, & containerized using Docker.
- Version 1: Scrapes a website for a specific product & takes a screenshot of the webpage. Code is currently set for mintmobile.com.
- Version 2: The PlaywrightCrawler version using Crawlee is similar, but since Crawlee "simulates" the actions of a real user, the browser settings are defaulted to
"headless: false", so the designated browser opens & the whole program runs as automated. Also, any & all Datasets are stored in a storage folder in the root directory, & containerized using Docker.
- Version 1: Scrapes Amazon for a specific product & takes a screenshot of the webpage.
- Version 2: The Playwright version is similar, but since Playwright "simulates" the actions of a real user, the browser settings are defaulted to
"headless: false", so the designated browser opens & the whole program runs as automated.
- Scrapes Yelp for the latest restaurants, their corresponding information & saves it in a local folder.
- Scrapes Google for the latest search results.
- Running as a background app via PM2 (Process Management), Job scrapers scrapes Google for the latest jobs in an specific area, converts the scraped data into a PDF file, saves to a local folder, & uploaded/sent as an email via custom-made Email Sender App.
- Scrapes Google for the latest images in an area, and downloads them to a local folder.
- Scrapes a website for all of its images, and downloads them to a local folder.
- Scrapes YouTube for the latest trending videos by country & category.
- Scrapes multiple websites for images, texts, can perform operations such as button clicking, form submission, as well as saves the scraped data to a local folder. Can also be automated using Node-Cron.