Dataset: COVID-19 Global Data (Full Version)
Source: Our World in Data GitHub Repository – Dataset Link
Description:
This dataset contains country-level daily COVID-19 statistics, including cases, deaths, vaccinations, testing, and demographic/economic indicators. Key columns include:
location: Country or region namedate: Date of observationtotal_cases: Cumulative confirmed COVID-19 casestotal_deaths: Cumulative deathspeople_vaccinated: Number of people vaccinated
Size: Approximately 430,000 rows and 67 columns.
Suitability & Relevance:
- Real-world data with numeric, categorical, and date variables.
- Contains missing values, outliers, and inconsistencies -> ideal for demonstrating data cleaning techniques.
- Large enough to perform meaningful analysis but manageable for a Jupyter Notebook.
- Relevant for public health, statistics, and data analysis assignments, making it easy to justify insights or visualizations.
⚠ Note: This GitHub version is no longer updated as of August 19, 2024. For the latest data, OWID provides updated CSVs through their data catalog.