Literature DB >> 32526559

Examining the effect of social distancing on the compound growth rate of COVID-19 at the county level (United States) using statistical analyses and a random forest machine learning model.

J S Cobb1, M A Seale2.   

Abstract

OBJECTIVES: The goal of the present work is to investigate trends among US counties and coronavirus disease 2019 (COVID-19) growth rates in relation to the existence of shelter-in-place (SIP) orders in that county. STUDY
DESIGN: This is a prospective cohort study.
METHODS: Compound growth rates were calculated using cumulative confirmed COVID-19 cases from January 21, 2020, to March 31, 2020, in all 3139 US counties. Compound growth was chosen as it gives a single number that can be used in machine learning to represent the speed of virus spread during defined time intervals. Statistical analyses and a random forest machine learning model were used to analyze the data for differences in counties with and without SIP orders.
RESULTS: Statistical analyses revealed that the March 16 presidential recommendation (limiting gatherings to ≤10 people) lowered the compound growth rate of COVID-19 for all counties in the US by 6.6%, and the counties that implemented SIP after March 16 had a further reduction of 7.8% compared with the counties that did not implement SIP after March 16. A random forest machine learning model was built to predict compound growth rate after a SIP order and was found to have an accuracy of 92.3%. The random forest found that population, longitude, and population per square mile were the most important features when predicting the effect of SIP.
CONCLUSIONS: SIP orders were found to be effective at reducing the growth rate of COVID-19 cases in the US. Counties with a large population or a high population density were found to benefit the most from a SIP order.
Copyright © 2020 The Royal Society for Public Health. All rights reserved.

Entities:  

Keywords:  COVID-19; Machine learning; SARS-CoV-2; Shelter-in-place; Social distancing; Statistics

Mesh:

Year:  2020        PMID: 32526559      PMCID: PMC7186211          DOI: 10.1016/j.puhe.2020.04.016

Source DB:  PubMed          Journal:  Public Health        ISSN: 0033-3506            Impact factor:   2.427


Novel coronavirus (severe acute respiratory syndrome coronavirus 2 [SARS-CoV-2] or coronavirus disease 2019 [COVID-19]) originated in the province of Wuhan, China, in December 2019, after which it spread rapidly across the globe owing to infected persons exhibiting little to no symptoms within the first five days of contracting the virus. The devastation and infection rate triggered by the virus caused the World Health Organization to declare it a global pandemic. There are currently more than 170 countries infected with COVID-19, and all 50 states in the United States (US) have confirmed cases according to the Centers for Disease Control and Prevention. In the US, community transmission has become the prominent mode of transmission of the virus. It has therefore become imperative that the effectiveness of the primary forms of limiting social contact used by local and national governments be evaluated. Enough data at the county level are now available to provide a fair assessment of the efficacy of the presidential guidelines issued on March 16, 2020, instituting a form of ‘social distancing’ by limiting gatherings to 10 or fewer people. Data are additionally sufficient to assess the efficacy of county-level shelter-in-place (SIP) orders versus counties that did not issue any SIP orders after the guidelines issued on March 16. County metrics were obtained from the US Census Bureau, USA Counties (2011) data sets from the 2010 census. The data included in this study are latitude, longitude, population, median age, number of physicians, median income, population per square mile, and water use per capita. Counties were placed into one of the two bins: (1) counties that had confirmed cases of COVID-19 before issue of guidelines on March 16 and experienced a SIP order on or after March 19 (186 counties, referred to as wSIP); and (2) counties that had confirmed cases before March 16 and experienced no SIP order (60 counties, referred to as noSIP). A Student t-test was used to compare two groups for significance. Analysis of variance with the Tukey post hoc test was used to compare multiple groups. Significance was defined as P <0.05. All data are reported as mean ± 95% confidence interval. There were no statistically significant differences in the US census data between the wSIP and noSIP groups apart from latitude (P < 0.0001) and the number of physicians (P = 0.04). The wSIP group had a latitude of 39.47 ± 0.75, which places it in the northern US, compared with the noSIP group with a latitude of 34.6 ± 1.16°, placing it further south. The difference in number of physicians is a function of latitude with a lower mean number of physicians in the south (1697 ± 500) compared with the north (2677 ± 538). The number of confirmed COVID-19 cases in each county was collected from local health department data and county/state press releases from January 21 to March 25. Confirmed cases from March 26 to 31 were obtained from The New York Times coronavirus data repository. The two data sets were compared to ensure consistency between the collected values. Data collection stopped on March 31 because the mean number of days with confirmed cases was approximately the same as that before issue of guidelines on March 16, after March 16 but before the institution of a SIP order, and after March 16 with a SIP order. This allowed for comparison of these time intervals with an equal number of days (7.62 ± 0.35). Compound growth was calculated using the following equation: (final confirmed cases/first confirmed cases)ˆ(1/number of days). Fig. 1 shows the compound growth rate for the wSIP (1.39 ± 0.044) and noSIP (1.30 ± 0.059) groups before the issue of presidential guidelines on March 16, the compound growth rate after March 16 for the wSIP (1.30 ± 0.023) and noSIP (1.21 ± 0.016) groups, and the compound growth rate for the wSIP group (1.19 ± 0.011) after the SIP orders went into effect. The lower compound growth rates seen in noSIP data are due to the difference in latitude between the two data sets and suggest that southern states experienced a slower spread of the virus at the onset. This makes sense, given that before March 16, the hot spots for COVID-19 were northern states such as Washington, New York, and Illinois. The noSIP compound growth rates were normalized to the compound growth rate of wSIP data before March 16 to account for geographical differences. The normalized compound growth rates after March 16 were shown to be statistically similar between the wSIP and noSIP groups (P > 0.05, Fig. 1). This indicates that the presidential guidelines had the same magnitude of effect on reducing the compound growth rate by 6.6 ± 1.4% between the wSIP and noSIP groups before the wSIP group instituted a SIP. After instituting a SIP order, compound growth rate of the wSIP group decreased an additional 7.8%, for a total decrease of 14.4 ± 1.6% from the compound growth rates before March 16. This indicates that the effects from the presidential guidelines and SIP orders were additive in the US. This is reasonable, considering the virus is thought to spread by virus-containing airborne droplets and orders for social distancing limit the interaction of people who could potentially be infected. A study modeling the effect of social distancing from China indicated a strong association with both a decrease in the rate the virus spreads and the implementation of social distancing.
Fig. 1

Mean compound growth rate of counties that had confirmed cases (1) before the presidential guidelines issued on March 16, (2) from March 16 to issuance of SIP, and (3) from issuance of SIP to March 31 (black bars) compared with counties with confirmed cases (1) before March 16 and (2) after March 16 and noSIP (white bars). Statistical analyses indicated no differences in summary data for the two groups except for latitude, indicating that counties with a SIP order were further north and those without a SIP order were predominately located in the south. The counties without SIP were normalized to those with SIP before March 16 to account for this difference (gray bars). SIP = shelter-in-place; wSIP, with SIP; noSIP, no SIP.

Mean compound growth rate of counties that had confirmed cases (1) before the presidential guidelines issued on March 16, (2) from March 16 to issuance of SIP, and (3) from issuance of SIP to March 31 (black bars) compared with counties with confirmed cases (1) before March 16 and (2) after March 16 and noSIP (white bars). Statistical analyses indicated no differences in summary data for the two groups except for latitude, indicating that counties with a SIP order were further north and those without a SIP order were predominately located in the south. The counties without SIP were normalized to those with SIP before March 16 to account for this difference (gray bars). SIP = shelter-in-place; wSIP, with SIP; noSIP, no SIP. A random forest machine learning model was trained to predict the compound growth rate after a SIP order was given in a county. The random forest was chosen as it has been shown to have the highest accuracy in disease prediction. The model achieved a mean absolute percentage error of 92.3% in the test data set. The three most important features were population, longitude, and population per square mile in predicting the compound growth rate after a SIP order was instituted in a county. The data for these features were split into four equal groups to explain how these features matter for the model in predicting the compound growth rate after a SIP order was issued. Counties that instituted a SIP order with a longitude between −79.7102° and −97.2363° had the largest decrease in compound growth rate at 10.4% compared with 8.2% for counties outside of that longitudinal range. Counties with the highest populations between 143,962 and 984,8011 saw the largest percent reduction after instituting a SIP order at 10.5%, compared to counties with a lower population between 7457 and 142,151 at 8.2%. Similar to population, population per square mile showed the largest reduction in compound growth rate in counties with a population per square mile between 405.8 and 1755.5 at 11.6% compared with counties with a lower population density of 2.1–405.6 at 8.9%. In conclusion, the data suggest that at a county level, in the US, the SIP order is effective at decreasing the compound growth rate of COVID-19 (Fig. 1). The counties that have the largest impact from a SIP order are ones with a large population or a high population density, as indicated by the random forest feature importance.

Author statements

Acknowledgments

The authors wish to thank William Glenn Bond and Meredith B. Cobb for their input and edits of the manuscript.

Ethical approval

Ethical approval was not required as this study made use of publicly available data.

Funding

None.

Competing interests

None declared.
  13 in total

1.  Review on COVID-19 diagnosis models based on machine learning and deep learning approaches.

Authors:  Zaid Abdi Alkareem Alyasseri; Mohammed Azmi Al-Betar; Iyad Abu Doush; Mohammed A Awadallah; Ammar Kamal Abasi; Sharif Naser Makhadmeh; Osama Ahmad Alomari; Karrar Hameed Abdulkareem; Afzan Adam; Robertas Damasevicius; Mazin Abed Mohammed; Raed Abu Zitar
Journal:  Expert Syst       Date:  2021-07-28       Impact factor: 2.812

2.  Monitoring social distancing under various low light conditions with deep learning and a single motionless time of flight camera.

Authors:  Adina Rahim; Ayesha Maqbool; Tauseef Rana
Journal:  PLoS One       Date:  2021-02-25       Impact factor: 3.240

3.  Underreporting of Death by COVID-19 in Brazil's Second Most Populous State.

Authors:  Thiago Henrique Evangelista Alves; Tafarel Andrade de Souza; Samyla de Almeida Silva; Nayani Alves Ramos; Stefan Vilges de Oliveira
Journal:  Front Public Health       Date:  2020-12-15

4.  Disinfection chain: A novel method for cheap reusable and chemical free disinfection of public places from SARS-CoV-2.

Authors:  Sushanta Debnath; Mohiul Islam
Journal:  ISA Trans       Date:  2021-03-29       Impact factor: 5.911

5.  Large Gathering Attendance is Associated with Increased Odds of Contracting COVID-19: A Survey Based Study.

Authors:  Mary E Schroeder; Michelle Tb Manderski; Chris Amro; Sneha Swaminathan; Akshat Parekh; Sho Yoshitake; Jason Yang; Paul Romeo; Daniel Reyes; Rachel Choron; Michael Rodricks
Journal:  J Prev (2022)       Date:  2022-01-06

6.  Effectiveness of non-pharmaceutical public health interventions against COVID-19: A systematic review and meta-analysis.

Authors:  Shabnam Iezadi; Kamal Gholipour; Saber Azami-Aghdash; Akbar Ghiasi; Aziz Rezapour; Hamid Pourasghari; Fariba Pashazadeh
Journal:  PLoS One       Date:  2021-11-23       Impact factor: 3.240

7.  Problems with evidence assessment in COVID-19 health policy impact evaluation: a systematic review of study design and evidence strength.

Authors:  Noah A Haber; Emma Clarke-Deelder; Avi Feller; Emily R Smith; Joshua A Salomon; Benjamin MacCormack-Gelles; Elizabeth M Stone; Clara Bolster-Foucault; Jamie R Daw; Laura Anne Hatfield; Carrie E Fry; Christopher B Boyer; Eli Ben-Michael; Caroline M Joyce; Beth S Linas; Ian Schmid; Eric H Au; Sarah E Wieten; Brooke Jarrett; Cathrine Axfors; Van Thu Nguyen; Beth Ann Griffin; Alyssa Bilinski; Elizabeth A Stuart
Journal:  BMJ Open       Date:  2022-01-11       Impact factor: 2.692

8.  Impact of excluding nursing home COVID-19 cases when assessing the relationship between county-level social distancing behavior and COVID-19 cases across the US during the early phase of the pandemic, February 2020-May 2020.

Authors:  Phoebe Tran; Lam Tran; Liem Tran
Journal:  PLoS One       Date:  2021-11-30       Impact factor: 3.240

9.  Diversity Forests: Using Split Sampling to Enable Innovative Complex Split Procedures in Random Forests.

Authors:  Roman Hornung
Journal:  SN Comput Sci       Date:  2021-10-21

10.  Influence of state reopening policies in COVID-19 mortality.

Authors:  Ka-Ming Tam; Nicholas Walker; Juana Moreno
Journal:  Sci Rep       Date:  2022-01-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.