Predicts county-level U.S. air quality risk for 2026 using EPA AQI data, clusters counties into pollution archetypes, and visualizes results with an interactive map.
What’s included:
- datathon_aqi_pipeline.py: Builds the dataset, trains the model, predicts 2026 unhealthy days, clusters counties, and outputs CSV tables
- app.py: Streamlit dashboard and interactive U.S. map
- Data/: EPA AQI CSVs and county FIPS lookup
- aqi_outputs/: Generated tables (auto-created)
Run:
- python datathon_aqi_pipeline.py to generate predictions and clusters
- streamlit run app.py to launch the interactive dashboard
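
To give a feel for the "predicts 2026 unhealthy days" step, here is a minimal sketch that extrapolates a per-county linear trend to 2026. It is an illustration under stated assumptions, not the model used in datathon_aqi_pipeline.py: the function names, the input layout (county FIPS mapped to year/count pairs), and the toy numbers are all hypothetical, and the real pipeline may use different features and a different model.

```python
# Illustrative sketch only. The README does not say which model the pipeline
# trains, so a plain least-squares trend line is swapped in here.

def fit_linear_trend(years, values):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    if sxx == 0:
        # Only one distinct year: fall back to a flat line at the mean.
        return 0.0, mean_y
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_unhealthy_days(history, target_year=2026):
    """history: dict mapping county FIPS -> list of (year, unhealthy_days).

    Hypothetical input layout; the real dataset's columns may differ.
    """
    predictions = {}
    for fips, rows in history.items():
        years = [year for year, _ in rows]
        days = [count for _, count in rows]
        slope, intercept = fit_linear_trend(years, days)
        raw = slope * target_year + intercept
        # A day count cannot be negative or exceed the days in 2026.
        predictions[fips] = min(365.0, max(0.0, raw))
    return predictions


# Made-up toy values, not real EPA data.
toy_history = {
    "06037": [(2021, 10), (2022, 12), (2023, 14)],
    "36061": [(2021, 5), (2022, 5), (2023, 5)],
}
toy_predictions = predict_unhealthy_days(toy_history)
for fips, days in sorted(toy_predictions.items()):
    print(fips, round(days, 1))
```

A trend line is only a baseline: it ignores weather, wildfire seasons, and neighboring counties, so treat it as a sanity check against whatever the pipeline actually produces.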