DBLP Publication Explorer

This exploratory data analysis (EDA) project provides an automated pipeline that fetches, processes, and visualizes the publication records of computer science researchers using the DBLP Search API. The analysis focuses on three database pioneers: Michael Stonebraker, Hector Garcia-Molina, and Jennifer Widom.

Technical Implementation

The data acquisition pipeline uses DBLP Method 2 to fetch each author's complete bibliography in JSON format. A fetching loop paginates through the results at 200 per page, with a safety limit of 10,000 results. To avoid overloading the server and to comply with API usage expectations, the loop waits one second between consecutive requests. The fetched JSON is then converted into structured dictionaries that track publication counts per year and per category (journals, conferences, and books).
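The loop described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the endpoint and the `q`, `format`, `h`, and `f` query parameters reflect the public DBLP publication search API as commonly documented, while the function names (`fetch_page`, `fetch_all`, `tally_by_year`) and the injectable `get_page` and `sleep` arguments are hypothetical and exist only to keep the example self-contained.

```python
import json
import time
import urllib.parse
import urllib.request
from collections import defaultdict

API_URL = "https://dblp.org/search/publ/api"  # assumed endpoint
PAGE_SIZE = 200       # results per page, as stated above
MAX_RESULTS = 10_000  # safety limit, as stated above
DELAY_SECONDS = 1.0   # pause between consecutive requests


def fetch_page(query, first, size=PAGE_SIZE):
    """Fetch one page of hits from the DBLP search API (hypothetical helper)."""
    params = urllib.parse.urlencode(
        {"q": query, "format": "json", "h": size, "f": first}
    )
    with urllib.request.urlopen(f"{API_URL}?{params}", timeout=30) as resp:
        payload = json.load(resp)
    # An empty result page may omit the "hit" key entirely.
    return payload["result"]["hits"].get("hit", [])


def fetch_all(query, get_page=fetch_page, sleep=time.sleep):
    """Page through results until a short page or the safety limit is hit."""
    records = []
    first = 0
    while first < MAX_RESULTS:
        hits = get_page(query, first, PAGE_SIZE)
        records.extend(hit.get("info", {}) for hit in hits)
        if len(hits) < PAGE_SIZE:
            break  # a short page means there is nothing left to fetch
        first += PAGE_SIZE
        sleep(DELAY_SECONDS)  # wait before issuing the next request
    return records[:MAX_RESULTS]


def tally_by_year(records):
    """Count publications per year and per DBLP type label."""
    counts = defaultdict(lambda: defaultdict(int))
    for info in records:
        year = info.get("year")
        if year is None:
            continue  # skip records without a year
        counts[int(year)][info.get("type", "Unknown")] += 1
    return {year: dict(types) for year, types in sorted(counts.items())}
```

Under these assumptions, `tally_by_year(fetch_all("Jennifer Widom"))` would return a dictionary keyed by year, with each value mapping a publication type label to its count. The query string here is illustrative only; the query form the project actually sends is not specified above.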

Interactive Visualizations (Plotly)

Annual Publication Volume Across Authors

Career Peaks and Longitudinal Activity

Publication Type Distribution

One chart per author: Michael Stonebraker, Hector Garcia-Molina, Jennifer Widom