| Literature DB >> 35446873 |
Pete Riley1, Allison Riley1, James Turtle1, Michal Ben-Nun1.
Abstract
More than a year since the appearance of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), many questions about the disease COVID-19 have been answered; however, many more remain poorly understood. Although the situation continues to evolve, it is crucial to understand what factors may be driving transmission through different populations, both for potential future waves, as well as the implications for future pandemics. In this report, we compiled a database of more than 28 potentially explanatory variables for each of the 50 U.S. states through early May 2020. Using a combination of traditional statistical and modern machine learning approaches, we identified those variables that were the most statistically significant, and, those that were the most important. These variables were chosen to be fiduciaries of a range of possible drivers for COVID-19 deaths in the USA. We found that population-weighted population density (PWPD), some "stay at home" metrics, monthly temperature and precipitation, race/ethnicity, and chronic low-respiratory death rate, were all statistically significant. Of these, PWPD and mobility metrics dominated. This suggests that the biggest impact on COVID-19 deaths was, at least initially, a function of where you lived, and not what you did. However, clearly, increasing social distancing has the net effect of (at least temporarily) reducing the effective PWPD. Our results strongly support the idea that the loosening of "lock-down" orders should be tailored to the local PWPD. In contrast to these variables, while still statistically significant, race/ethnicity, health, and climate effects could only account for a few percent of the variability in deaths. Where associations were anticipated but were not found, we discuss how limitations in the parameters chosen may mask a contribution that might otherwise be present.Entities:
Mesh:
Year: 2022 PMID: 35446873 PMCID: PMC9022803 DOI: 10.1371/journal.pone.0266330
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
List of parameters used in multiple-regression analysis, together with their basic statistical properties.
See text for detailed explanation of each parameter.
| Statistic | N | Mean | St. Dev. | Min | Pctl(25) | Pctl(75) | Max |
|---|---|---|---|---|---|---|---|
| Retail | 50 | −46.580 | 6.081 | −66 | −50 | −42.2 | −36 |
| Grocery | 50 | −17.380 | 7.982 | −34 | −23.8 | −11 | 1 |
| parks | 50 | 16.400 | 42.524 | −70 | −6 | 29.8 | 134 |
| Transit | 50 | −48.520 | 13.167 | −76 | −58 | −37.5 | −21 |
| workplaces | 50 | −37.760 | 5.336 | −55 | −41.8 | −35 | −27 |
| Residential | 50 | 11.380 | 2.147 | 8 | 10 | 12 | 16 |
| Age | 50 | 38.616 | 2.328 | 31.000 | 37.325 | 39.600 | 44.900 |
| Low indust. toxins | 50 | 25.500 | 14.577 | 1 | 13.2 | 37.8 | 50 |
| Low poll. health risk | 50 | 25.500 | 14.577 | 1 | 13.2 | 37.8 | 50 |
| Chron. low resp. death rate | 50 | 42.790 | 10.520 | 20 | 34.9 | 48.7 | 64 |
| ≥65 years old | 50 | 16.506 | 1.926 | 11 | 15.7 | 17.5 | 21 |
| Race param. 1 | 50 | 11.954 | 9.686 | 0.900 | 4.850 | 15.975 | 38.900 |
| Race param. 2 | 50 | 5.580 | 8.046 | 1.100 | 2.325 | 5.775 | 56.800 |
| Race param. 3 | 50 | 12.070 | 10.469 | 1.400 | 5.125 | 13.975 | 49.100 |
| Race param. 4 | 50 | 76.164 | 12.881 | 24.300 | 67.825 | 85.000 | 94.300 |
| Obesity rates | 50 | 27.740 | 3.305 | 19.000 | 25.600 | 29.600 | 35.200 |
| Av. rel. humidity | 50 | 67.102 | 8.346 | 38.300 | 65.950 | 71.475 | 77.100 |
| Av. dew point | 50 | 41.710 | 9.224 | 26.500 | 34.925 | 46.750 | 65.200 |
| Av. ann. temp. (C) | 50 | 11.074 | 4.830 | −3.000 | 7.325 | 14.800 | 21.500 |
| Av. ann. precip. (mm) | 50 | 941.820 | 371.752 | 241 | 622.5 | 1,216.2 | 1,618 |
| State of emerg. dec. | 50 | −3.100 | 3.666 | −13 | −4.8 | 0 | 3 |
| Av. spring temp. | 50 | 10.550 | 4.941 | −4.100 | 6.650 | 14.050 | 21.100 |
| Av. spring precip. | 50 | 82.840 | 34.067 | 20 | 56.8 | 104.5 | 151 |
| Rel. hum. (morning) | 50 | 76.280 | 8.593 | 43 | 73.2 | 81 | 90 |
| rel. hum. (afternoon) | 50 | 48.440 | 9.004 | 17 | 47 | 53.8 | 61 |
| UV Index | 50 | 7.780 | 2.376 | 1 | 7 | 9 | 12 |
| Deaths/100k ( | 50 | 20.769 | 52.440 | 0.092 | 1.313 | 18.250 | 349.854 |
| PWPD | 50 | 3,298.370 | 4,113.740 | 694.900 | 1,356.450 | 3,561.875 | 28,161.500 |
| Date of first death | 50 | 18,341.040 | 6.465 | 18,331 | 18,337 | 18,346 | 18,365 |
Fig 1The variation of (Left) Cumulative number of confirmed cases per 100,000 for each state (colored arbitrarily to better separate each curve). (Right) Cumulative number of deaths per 100,000 for each state. Data runs from 2020–03-06 through 2020–05-10.
Fig 2The relationship between the number of deaths per 100,000 and PWPD.
Each state is identified with a dot and the variability and smooth profile are shown with the dark grey region and blue line, respectively.
Fig 3Relative importance of explanatory variables using the DALEX method.
See text for more details.