Skip to content

Crithane/dynamicWebScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dynamic Web Scraper

Scrapes a Website that is taken from an inputted emails domain.

This project uses ECMAScript 6(2015)

This is a NodeJS project that is designed to get as much data as possible from an email address. It will grab the domain from an email address and visit the corresponding website. From the website it will scrape phone numbers and email addresses with a fantastic success rate. Then it will use GoogleMaps API to find out much more about the company that owns the website. It grabs the address, the company name, company GMaps rating and whether or not the company is currently open. All scraped data is saved in a json file, which is also passed to a HTML5 table, within a webapp, hosted on localhost:8000, to make the data much more readable.

Screenshots

Webapp view

CLI view

Installation

Windows & OS X & Linux:

git pull https://github.com/Crithane/dynamicWebScraper.git
npm install

Usage example

node app.js johndoe@example.com

Features

  • Dynamically scrapes any domain's website from an inputted email address
  • Grabs all phone numbers from website
  • Grabs all email addresses from website
  • Utilizes GMaps API to grab companies address
  • Utilizes GMaps API to grab companies name
  • Utilizes GMaps API to grab companies rating
  • Utilizes GMaps API to see if company is open at the time of the request
  • Saves all scraped data to /domain/domain.json
  • Displays data in a table on a webapp hosted on localhost:8000

Dependencies

Authors

  • Shane Critchley-Kenyon

About

Javascript webscraper that scrapes the domain of a given email

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published