Skip to content

pranayahirwar/WebScraping-Python-Docker-MongoDBAtlas

Repository files navigation

Scraping Quotes and Saving it MongoDB Atlas

Document contain all the detailed steps which I took for this assignment. So let's go according to the instructions.

For this I have used Azure Virtual Machine but not from AWS, this doesn't mean I don't know AWS. I started my cloud journey with AWS but because my free tier is over I don't want to incur anymore charges.

  1. Analyze Given Python Code
  2. Containerize the Python script
  3. Setup A Selenium Grid Environment
  4. Scale The Scraper Using Docker And AWS & Setting up Atlas DB
  5. IaC Using Terraform

Results

5 concurrent.futures & 5 Selenium-chrome-nodes

This Screenshot is taken when Python Script was configured to launch 5 concurrent web-driver requests to selenium-hub and there are 5 selenium-chrome-nodes are running.

Pasted image 20230711171635

Screenshot 2023-07-11 163839


10 Concurrent.futures & 10 Selenium-chrome-nodes

Pasted image 20230711182724 Pasted image 20230711182726


MongoDB Atlas Image of Records

BEFORE Screenshot 2023-07-11 182050 AFTER

Screenshot 2023-07-11 182834

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published