This project examines age-adjusted prevalences of obesity, diabetes and low physical activity in the United States. Data was taken from the CDC Chronic Disease Indicators dataset and includes the years 2019 to 2022. SQL queries for exploring and processing the original dataset were executed locally using DuckDB, and visualizations of processed data were made using Tableau. The final dashboard aims to show geographic variation across states, disparities between sexes both at the state and national levels, and correlations between the indicators.
The interactive dashboard can be viewed:
- Online: Tableau Public
- Locally: Open
tableau/cdi.twbxin Tableau Desktop
cdc-chronic-disease-analysis/
│
├── data/
│ ├── raw/
│ │ └── cdi.csv
│ └── processed/
│ ├── state_obesity_diabetes.csv
│ ├── state_ob_di_by_sex.csv
│ ├── state_no_pa.csv
│ └── natl_indicators_by_sex.csv
│
├── sql/
│ ├── exploratory_queries.sql
│ └── analysis_queries.sql
│
├── tableau/
│ ├── cdi.twbx
│ └── cdi_dashboard.png
│
└── README.md
cdi.csvis the original dataset.- All data sets in the
data/processed/directory result from running the queries inanalysis_queries.sql. exploratory_queries.sqlserves as a notebook and does not export any data.
Data processing can be done locally with DuckDB by running the command:
duckdb -init sql/analysis_queries.sqlEnsure that the working directory in which the command is run contains data/raw/, data/processed/, and sql/ subdirectories and that data/raw/cdi.csv and sql/analysis_queries.sql exist.
- The maps for prevalence of obesity and diabetes by state show a pattern of higher prevalence for both in the southern region.
- Mirrored bar charts and scatterplots indicate a strong positive relationship between obesity and diabetes prevalence across states.
- Despite correlation between obesity and diabetes, sex disparity maps show that higher obesity prevalence in women is more common, while higher diabetes prevalence in men is more common.
- This may reflect behavioral, biological, or socioeconomic factors.
- Could also be influenced by differences in physical activity as shown in the national-level bar chart.
- Prevalence of low physical activity is positively correlated with both obesity and diabetes prevalence.
- SQL (DuckDB)
- Tableau
- Git / GitHub
- CDC Chronic Disease Indicators dataset
- Compare prevalence by race/ethnicity
- Include race/ethnicity demographic data for states to explore its relation to obesity/diabetes prevalence
- Incorporate regional groupings
