A Python tool to automatically track, filter, and categorize research publications from Google Scholar for a specific list of authors.
main.py: The entry point. It loadsconfig.json, runs the scraper, and saves the final text report.scraper.py: The core logic engine. It handles API requests, filters by year, and categorizes papers based on your keywords.config.conf: Your settings file (API keys, Author IDs, and keyword lists).requirements.txt: Contains the necessary Python libraries for the project.
- Sign up at SerpApi.com.
- Copy your API Key from your private dashboard.
- Go to Google Scholar.
- Search for an author and click their profile.
- Look at the URL. The ID is the string of characters after
user=.- Example:
https://scholar.google.com/citations?user=h1AbC2_AAAAJ→ The ID ish1AbC2_AAAAJ.
- Example:
Open your terminal in this folder and install the required library:
pip install -r requirements.txtOpen config.conf and fill in your specific details:
api_key: Your SerpApi key.
author_ids: A list of IDs (e.g., ["ID1", "ID2"]).
start_year / current_year: The range of publications to fetch.
Execute the program from your terminal:
python main.pyNote: Every page of articles and every DOI search uses 1 SerpApi credit. Monitor your usage at SerpApi.com.