Skip to content

akhtarrkhan/covid-data-exploration-sql

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

COVID-19 Data Exploration in SQL

Overview

This project explores global COVID-19 deaths and vaccination data using SQL.

The analysis focuses on identifying infection rates, death counts, death percentages and vaccination progress across countries and continents. It also demonstrates practical SQL concepts commonly used in data analysis projects.

Objectives

The main questions explored in this project include:

  • How did total cases compare with total deaths?
  • What percentage of the population was infected?
  • Which countries had the highest infection rates?
  • Which countries and continents had the highest death counts?
  • How did vaccination totals grow over time?

Dataset

This project uses public COVID-19 deaths and vaccination data.

Tables used:

  • 'CovidDeaths'
  • 'CovidVaccinations'

Typical fields used include:

  • location
  • date
  • population
  • total_cases
  • new_cases
  • total_deaths
  • new_deaths
  • new_vaccinations
  • continent

Tools Used

  • SQL
  • Joins
  • Common Table Expressions (CTEs)
  • Temporary Tables
  • Window Functions
  • Aggregate Functions
  • Views

SQL Skills Demonstrated

This project includes:

  • Filtering and sorting data
  • Aggregate analysis
  • Percentage calculations
  • Grouping by country and continent
  • Joining multiple tables
  • Running totals with window functions
  • CTEs
  • Temporary tables
  • View creation

Analysis Performed

The SQL queries cover:

  1. Initial exploration of COVID deaths and vaccination data
  2. Total cases vs total deaths
  3. Total cases vs population
  4. Countries with the highest infection rate
  5. Countries with the highest death count
  6. Continents with the highest death count
  7. Global summary numbers
  8. Rolling vaccination totals by location
  9. Percentage of population vaccinated

Files

  • 'covid_data_exploration.sql' - SQL queries for the full analysis

Key Takeaways

  • Infection rates varied significantly across countries.
  • Total deaths were concentrated in a smaller set of countries and regions.
  • Vaccination progress can be tracked effectively using joins and window functions.
  • SQL can be used to turn public health data into clear analytical insights.

Author

Akhtar R. Khan

About

Exploratory SQL analysis of global COVID-19 deaths and vaccination data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors