This work is an Exploratory Data Analysis (EDA) project that provides an automated pipeline to fetch, process, and visualize the academic trajectories of Computer Science researchers using the DBLP Search API. The analysis focuses on three database pioneers: Michael Stonebraker, Hector Garcia-Molina, and Jennifer Widom.
The data acquisition pipeline uses DBLP Method 2 for fetching complete bibliographies in JSON format, which includes the implementation of a fetching loop with pagination at 200 results per page and a safety limit of 10,000 results. To ensure the stability of the server and API compliance, a delay of one second is introduced between consecutive requests. The fetched JSON results are then converted into structured dictionary format for the exact tracking of publication volumes and categories for a given year, including journals, conferences, and books.
Annual Publication Volume Across Authors
Career Peaks and Longitudinal Activity
| Michael Stonebraker | Hector Garcia-Molina | Jennifer Widom |
|---|---|---|
![]() |
![]() |
![]() |


