BenSnyder90/Mission-to-Mars-web-scraping-challenge

web-scraping-challenge

CWRU Data Analytics Bootcamp - HW 12

Objectives

  • Use Splinter and Beautiful Soup to parse the HTML of several websites
  • Store the scraped text, tables, and image sources in MongoDB
  • Display the stored information on a single HTML page
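
As a rough illustration of the parsing objective, the sketch below runs Beautiful Soup over an HTML string. The markup, class names, and base URL are hypothetical stand-ins, not taken from the sites this project scrapes; in the project the HTML would come from a Splinter browser session rather than a hard-coded string.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a page fetched with Splinter.
# The class names here are illustrative only.
SAMPLE_HTML = """
<div class="article">
  <div class="title">Example headline</div>
  <div class="summary">Example teaser paragraph.</div>
  <img class="featured" src="/images/example.jpg">
</div>
"""


def parse_article(html, base_url="https://example.com"):
    """Pull a headline, a teaser, and an image URL out of an HTML string."""
    # "html.parser" is the parser bundled with Python, so no extra install is needed.
    soup = BeautifulSoup(html, "html.parser")
    article = soup.find("div", class_="article")
    image = article.find("img", class_="featured")
    return {
        "title": article.find("div", class_="title").get_text(strip=True),
        "summary": article.find("div", class_="summary").get_text(strip=True),
        # Image sources are often relative, so prepend the site's base URL.
        "image_url": base_url + image["src"],
    }


result = parse_article(SAMPLE_HTML)
print(result)
```

Returning a plain dictionary keeps the parsed values in a shape that can be handed to MongoDB without further conversion.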

Contents

/Mission_To_Mars/

  • mission_to_mars.ipynb - Jupyter Notebook that performs the initial scraping of each site. Its Markdown cells document the steps taken to locate the desired information in each page's HTML
  • scrape_mars.py - Python script that reuses the parsing methods from the Jupyter Notebook and returns a dictionary to be inserted into MongoDB
  • app.py - Python Flask app that renders Index.html as a template, runs the scrape_mars.py parsing script, and loads the parsed info from MongoDB for the template to use
  • chromedriver.exe - ChromeDriver executable that Splinter uses to drive the Chrome browser and retrieve each page's HTML

    /templates/

  • Index.html - HTML page that displays the information parsed from the various Mars websites as a dashboard. Styled with Bootstrap
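
The way these files fit together can be sketched roughly as follows. This is not the repository's actual app.py: the route paths, the `mars` template variable, and the helper names `store_scrape` and `create_app` are assumptions made for illustration. Only the Index.html template name comes from the listing above.

```python
def store_scrape(collection, mars_data):
    """Keep a single document holding the latest scrape results."""
    # An empty filter matches any existing document; upsert=True inserts
    # one when the collection is still empty.
    collection.update_one({}, {"$set": mars_data}, upsert=True)


def create_app(collection, scrape):
    """Build a Flask app around a MongoDB collection and a scrape function."""
    # Imported here so store_scrape can be used without Flask installed.
    from flask import Flask, redirect, render_template

    app = Flask(__name__)

    @app.route("/")
    def index():
        # Load the stored document and hand it to the template.
        mars = collection.find_one()
        return render_template("Index.html", mars=mars)

    @app.route("/scrape")  # hypothetical route name
    def run_scrape():
        # Run the scraper, store its dictionary, then show the dashboard.
        store_scrape(collection, scrape())
        return redirect("/")

    return app


# Possible wiring, assuming a local MongoDB server, pymongo installed, and a
# scrape() function in scrape_mars.py (database, collection, and function
# names are all hypothetical):
# from pymongo import MongoClient
# import scrape_mars
# collection = MongoClient("mongodb://localhost:27017").mars_db.mars
# create_app(collection, scrape_mars.scrape).run(debug=True)
```

Passing the collection and the scrape function in as arguments keeps the storage step easy to exercise on its own, without a running database or browser.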

Results

[Screenshots: Page1, Page2]
