Skip to content

USFOneHealthCodeathon2021/Team6

Repository files navigation

Project Name: Spatial-Temporal Prediction Models for COVID-19

SPTM-COVID Logo


Team Leaders: Awtum Brashear, Jing Lin, Ming Ji

Team Members: Michelle Grundahl, Santiago Hernandez, Weiliang Cen, Deepika Kolli, Siva Allam, Chang Li, Charley Wang

GVN/USF mentors:

Introduction

Following the first cases of Coronavirus disease 2019 (COVID-19) in Wuhan-China, the Severe Acute Respiratory Coronavirus 2 (SARS-CoV-2) has spread worldwide to over 190 countries or territories. This extremely contagious virus has been responsible for the ongoing pandemic that has claimed over 2.5 million lives around the world (WHO). As of February 24th 2021, the US reported 28,297,193 confirmed cases and 503,777 deaths. As the COVID-19 pandemic continues, it is imperative to create bioinformatic tools that provide timely forecast of the number of cases at local, regional and national level as a decision-making tool for public health professionals and state authorities.

Incorporating spatial as well as temporal data has been shown to improve the capability of models to forecast epidemiological data in other viruses (Myer and Johnston, 2019). Spatio-temporal models have also shown to allow quick response to change in patterns such as outbreaks. Live prediction of COVID-19 cases based on spatio-temporal data could vastly improve our ability to foresee and respond to changes in case numbers.

Objectives

The aim of this study is to propose a spatiotemporal epidemiological predictive model of COVID-19 infections at local, regional and national level to generate a forecast of COVID-19 daily cases and deaths in the United States of America.

Flowchart of objectives

Methods and Implementation

Data was extracted from 1point3acres.com website which is a public database that uses epidemiologic and spatial data from the John Hopkins University coronavirus Resource Center. The R-package ‘caret’ was used to create a temporal and spatio-temporal prediction model of new cases and death on a county level. Regression was performed using the machine learning algorithm ‘Spatial Random Forest’ to create spatio-temporal prediction models.

Additionally, Diffusion Convolutional Recurrent Neural Network (DCRNN, https://github.com/liyaguang/DCRNN) was explored using similar database. The model can effectively utilize spatial-temporal data by accounting for spatial dependencies using diffusion convolutional layers and by accounting for temporal dependecies using recurrent layers. Adjacency matrix was calculated by each county's representative geo-coordinates, i.e. longitude and latitude and their pair-wise distance. Tensorflow v1.4 was used to apply the model to train and evaluate COVID-19 data.

Results

DCRNN performance

Using a DCRNN network we were able to train a model to closely predict cases in high-case counties

Hillsborough County

Miami Dade Predictions

We used this output to generate maps which show potential cases

February 24 March 3
Current-day data Prediction map

Future Directions

We intend to further our development of this project to improve its utility in case prediction. We plan to:

  1. Finish implementation of other spatio-temporal model types. We have created multiple models in the course of this project which can be tested separately for ideal applications.
  2. Propose a final model which can be added to the CDC forecast website and ensemble model to influence decisions related to COVID-19.
  3. Build a user-friendly GUI to display our predictions at the county and state level.
  4. Automate the pulling of data, updating of models and display of data.

About

Team Leaders: Awtum Brashear, Jing Lin, Ming Ji

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 7

Languages