This Python script lets you extract news article data from the www.moneycontrol.com website for each company year-wise.
This requires Python 3.8 or higher package dependencies.
Run script:
python article_link_extractor.py
When you run this script you will need to provide the input starting to the current year e.g (2011 to 2024) and company name to the script and then script will generate the CSV file containing all article links.
and the run below script:
python news_article_extractor.py
When you run this news_article_extractor.py script you will need to provide the path of the CSV file which is generated by the above script then, it will generate the CSV file which contains all important information about the article.