Skip to content

d-cogswell/covid19-forecast-accuracy

Repository files navigation

Compare COVID19 forcasts against truth data

A collection of Bash and Python scripts to download, process, plot, and compare COVID19 data and forecasts. Each separate data set has its own unique organziation, and the goal of this project is to provide a standard way of manipulating data from different sources. Each data set is loaded as a Pandas dataframe with the following standardized fields:

  • date : calendar date for each record, stored as a datetime object
  • cases : new number cases reported each day
  • cumCases : the cumulative number of cases that have been reported
  • hosptialAdmissions : new hospital admissions reported each day
  • hospitalizedCurrently : number of people currently hospitalized at each date
  • deaths : new number of deaths reported each day
  • cumDeaths : the cumulative number of deaths

Truth data sources

Not yet implemented

Forecasts available:

Installation

Downloading data requires a Bash shell scripting envoronment. Windows users are encouraged to use the Windows Subsystem for Linux (WSL). The following prerequisites are required to run the Bash scripts: wget, curl, unzip, git

Data can be downloaded by running the following commands from the root directory. Beware that the forecasting data requires >20GB of storage space, and the initial download takes significant time. Running the commands again will fetch and process any new data.

Download/update forecast data:

  ./update-forecasts.sh

Download/udpate truth data:

  ./update-truth.sh

About

Compare covid19 forecasts with truth data.

Resources

Stars

Watchers

Forks