WebScrape_MarsNews

Scrapes data from Mars news websites, loads it into MongoDB, and displays the information on a single HTML page.

Mission to Mars

Mars Latest News

Mars Hemispheres

  1. Technology Stack - HTML, CSS, Bootstrap, Jupyter, Python

  2. Python Libraries - Pandas, Beautiful Soup, Splinter, PyMongo

  3. Database - MongoDB

  4. App Server - Flask

Steps:

  1. Scrape several websites for data related to the Mission to Mars and display the output in a Jupyter Notebook [Scraping_mission_to_mars.ipynb].

  2. Create a Python script [scrape_mars.py] that runs all of the scraping code and returns a single Python dictionary containing all of the scraped data.

  3. Create a Flask app [app.py] with two routes, index and scrape. The root route / queries the MongoDB database and passes the Mars data into an HTML template for display.

  4. Create an HTML file [index.html] that takes the Mars data dictionary and displays all of the data in the appropriate HTML elements.

  5. Create the MongoDB database and collection that store the scraped data. PyMongo is used to set up the MongoDB connection and to define the database and collection.
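
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the function names (`scrape_all`, `save_mars_data`, `load_mars_data`, `get_collection`, `create_app`), the database and collection names (`mars_db`, `mars`), the local MongoDB URI, and the behaviour of the `/scrape` route (scrape, store, redirect to `/`) are all assumptions. Flask and PyMongo are imported inside the functions that need them, so the storage helpers can be tried without either package installed.

```python
def scrape_all(scrapers):
    """Run each scraper callable and merge the results into one dictionary.

    `scrapers` maps a result key (hypothetical, e.g. "news_title") to a
    zero-argument function that returns the scraped value.
    """
    data = {}
    for key, scraper in scrapers.items():
        data[key] = scraper()
    return data


def save_mars_data(collection, data):
    """Upsert the scraped dictionary so the collection holds one document."""
    collection.update_one({}, {"$set": data}, upsert=True)


def load_mars_data(collection):
    """Return the stored document, or an empty dict if nothing is stored."""
    return collection.find_one() or {}


def get_collection(uri="mongodb://localhost:27017",
                   db_name="mars_db", coll_name="mars"):
    """Connect with PyMongo; URI, database and collection names are assumed."""
    from pymongo import MongoClient
    return MongoClient(uri)[db_name][coll_name]


def create_app(collection, scrape_fn):
    """Build a Flask app with an index route and a scrape route."""
    from flask import Flask, redirect, render_template

    app = Flask(__name__)

    @app.route("/")
    def index():
        # Query MongoDB and pass the Mars data into the HTML template.
        return render_template("index.html", mars=load_mars_data(collection))

    @app.route("/scrape")
    def scrape():
        # Assumed behaviour: re-run the scrape, store it, return to the index.
        save_mars_data(collection, scrape_fn())
        return redirect("/")

    return app
```

With Flask and PyMongo installed and a MongoDB server running locally, `create_app(get_collection(), my_scrape_function).run()` would start the app, where `my_scrape_function` stands in for whatever `scrape_mars.py` exposes. Passing the collection in as an argument keeps the storage helpers easy to exercise with a stand-in object.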