MySQL Error Google Scrapper
This nodeJs script scrappes mysql errors found on google using puppeteer.
All error will be save on a file .txt
It's look like THIS
Installation
mkdir megs
cd megs
git clone https://github.com/YR72dpi/mysql_error_google_scrapper.git
# unzip
npm updateUsage
npm run megs- config.js
const config = []
config["watcher"] = false /* true if you want to watch all error found in real time */
config["headless"] = false /* true if you want to hidde the google chrome copy */
config["file"] = "error.csv" /* csv file with all error */
config['print_excl_w'] = false /* true if you want to print the url even if there is excluded word */
/**
* These string will be put on regex expression
*/
const exclude = [
"stackoverflow\.com",
"openclassrooms\.com",
"php\.net",
"dev",
"forum(s)?"
]
const regexFlag = "gmi"- error.csv
Don't edit the fisrt line
ERROR
inurl:id= & intext:"Warning: mysql_fetch_assoc()"
inurl:id= & intext:"Warning: query()"
inurl:id= & intext:"Warning: mysql_fetch_array()"
inurl:id= & intext:"Warning: mysql_num_rows()"
inurl:id= & intext:"Warning: mysql_result()"
inurl:id= & intext:"Warning: pg_exec()"
inurl:id= & intext:"Warning: mysql_result()"
inurl:id= & intext:"Warning: mysql_num_rows()"
inurl:id= & intext:"Warning: mysql_query()"
This script work only for french google.
Watch in pupperteer.js. It's because of css selectors
Solution : [X]
This script is caught as a robot, often, after ~35 query.
Solution :
- connect the script to a vpn and change of vpn each 25 query
- connect the script to a google account and manage the reCaptcha (which already trust you)
- Install an extension which solve the recaptcha like Buster
As well as solve these two issues, it's could be good to :
- Make possible to save information in another format than .txt
- Make a file with words to exclude instead of config file
- Make one command line system to catch MySQL Error with one command line
- Make a GUI interface
