| Literature DB >> 33199721 |
Douglas Arneson1, Matthew Elliott1, Arman Mosenia1,2, Boris Oskotsky1, Samuel Solodar1, Rohit Vashisht1, Travis Zack1,3, Paul Bleicher4, Atul J Butte1,5,6, Vivek A Rudrapatna7,8.
Abstract
Management of the COVID-19 pandemic has proven to be a significant challenge to policy makers. This is in large part due to uneven reporting and the absence of open-access visualization tools to present local trends and infer healthcare needs. Here we report the development of CovidCounties.org, an interactive web application that depicts daily disease trends at the level of US counties using time series plots and maps. This application is accompanied by a manually curated dataset that catalogs all major public policy actions made at the state-level, as well as technical validation of the primary data. Finally, the underlying code for the site is also provided as open source, enabling others to validate and learn from this work.Entities:
Mesh:
Year: 2020 PMID: 33199721 PMCID: PMC7669883 DOI: 10.1038/s41597-020-00731-8
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Technical validation of the datasets. (a–c) Comparison of estimated ICU bed occupancy (n = 2,321) (a), daily cases (n = 2,348) (b), and daily deaths (n = 2,348) (c) reported by CovidCounties against corresponding data reported by the California Department of Public Health. Each point corresponds to a measurement from a given California county on a particular date where both datasets report counts. Data is from 6/28/2020 - 8/8/2020. (d–f) Comparison of the estimated hospital bed occupancy (n = 248) (d), daily cases (n = 240) (e), and daily deaths (n = 240) (f) reported by CovidCounties against corresponding data reported by the Connecticut Department of Public Health. Each point corresponds to a measurement from a given Connecticut county on a particular date where both datasets report counts. Data is from 6/28/2020 - 8/8/2020. (g,h) Comparison of the estimated daily cases (n = 121,944) (g) and daily deaths (n = 121,944) (h) reported by CovidCounties against corresponding data reported by the website Corona Data Scraper. Data is from 6/28/2020 – 8/8/2020. Each point corresponds to a measurement from any US county in the dataset at a particular time where both datasets report counts. (i–k) Comparison of the estimated hospital bed occupancy (n = 287) (i), daily cases (n = 287) (j), and daily deaths (n = 287) (k) reported by CovidCounties against corresponding data reported by 7 different state Departments of Public Health. Data is from 6/28/2020 – 8/8/2020; curated state data is available in the data file accompanying this manuscript. R2 and p-values are derived from the Pearson correlation coefficient.
Descriptive statistics on the New York Times data set.
| 99.4% | ||
| 40 states tied at 100%* | ||
| Hawaii (80.0%-4/5) | Alaska (89.7%-26/29) | |
| Rhode Island (8.3%; 1,738; 1,644) | Wisconsin (6.2%; 4,534; 780) | |
| Trousdale, Tennessee (144,297) | Lafayette, Florida (129.295) | |
| Dawes, Nebraska (1.57 days) | Sanders, Montana (2.46 days) | |
| Madera, California (200%-20/10) | Rosebud, Montana (184%-1.84/1) | |
Data reported as of 8/20/2020. States with highest % of unknown cases shows the percent of cases from unknown counties as a fraction of total cases in the state, the absolute number of cases from unknown counties in the state, and the cases per million from unknown counties in the state. States with lowest % of counties reporting shows the percentage of counties reporting, the number of counties reporting, and total counties in the state.
†In the NY Times data all counties that reported cases also reported deaths (or were assumed to be 0).
*AL, AZ, AR, CA, CT, DE, DC, FL, GA, ID, IL, IN, IA, KY LA, ME, MD, MA, MI, MN, MS, MO, NH, NJ, NY, NC, ND, OH, OK, PA, RI, SC, SD, TN, VT, VA, WA, WV, WI, WY.
Fig. 2Effect of shelter in place orders on doubling time. (a) States within the United States are color-coded by percentile of date to implement state mandated shelter in place. White indicates earlier dates (among states) while dark orange indicates later dates or no state mandate. (b) States within the United States are color-coded by percentile of case doubling time on April 15, 2020. Dark orange indicates a fast doubling time (among states), white indicates a slow doubling time.
Fig. 3Overview of CovidCounties.org. (a) The primary view of CovidCounties.org is the line plot view, depicting time-series trends by individual county. Depicted counties may be selected by single or double clicking the counties displayed in the legend. They may also be selected by typing in counties (including from outside of a given state) at the bottom. (b) User-selected individual states are color coded according to the variable of interest (e.g. cumulative cases). Dark orange corresponds to the highest percentiles within the state, white indicates the lowest percentile. Hovering functionality displays statistics corresponding to a given county. (c) Line plot views can be extensively customized, with features to enable axis re-centering/scaling, count normalization, depiction of doubling time, and predicted percentage ICU bed utilization. Individual state and United States plots update to reflect selected parameters where appropriate. (d) States within the United States are color-coded by percentile according to the variable of interest (e.g. cumulative cases). Dark orange indicates relatively high percentile (among states), white indicates low percentile. Hovering functionality displays statistics corresponding to a given state. The dropdown menu below allows the user to change the view to depict timing that various social distancing policies were implemented: white indicates relatively early adoption (by percentile), dark orange indicates late or no current adoption.
Descriptive statistics on the curated policy data set.
| 60.8% | ||
| Washington (2/29/2020) | California (3/4/2020) | |
| Kentucky & Ohio (3/12/2020) | Delaware, Virginia & W Virginia (3/13/2020) | |
| Arizona (3/11/2020) | California (3/19/2020) | |
| Ohio (3/15/2020) | 12 states** on (3/16/2020) | |
Counties with highest estimated ICU needs shows the ICU needs as a percentage based on the estimated number of ICU beds needed and KHN reported number of ICU beds.
**CA, CT, DE, DC, KY, LA, MI, NJ, NY, PA, RI, WA.
Fig. 4Database schematic. Source data was obtained from The New York Times, US Census Bureau, Kaiser Health News, and from a manual curation of state governmental websites and news outlets as described in Methods. Data was processed to reflect case and death counts at the level of states and counties. Functions were written to perform x- and y-axis rescaling, normalization by population, doubling time estimation, and ICU bed utilization. Results were depicted using interactive line plots and maps.
Descriptive statistics on the Kaiser Health News data set.
| 45.0% | ||
| Los Angeles, California (2,126) | Cook, Illinois (1,606) | |
| Otero, Colorado (27,452) | Montour, Pennsylvania (2,303) | |
| Wright, Minnesota (22) | Clinton, Michigan (25.2) | |
Data reported as of 8/20/2020. Counties with highest estimated ICU needs shows the ICU needs as a percentage based on the estimated number of ICU beds needed and KHN reported number of ICU beds.