| Literature DB >> 35692599 |
Mohammad Moosazadeh1, Pouya Ifaei1, Amir Saman Tayerani Charmchi1, Somayeh Asadi2, ChangKyoo Yoo1.
Abstract
A mature and hybrid machine-learning model is verified by mature empirical analysis to measure county-level COVID-19 vulnerability and track the impact of the imposition of pandemic control policies in the U.S. A total of 30 county-level social, economic, and medical variables and a timeline of the imposed policies constitutes a COVID-19 database. A hybrid feature-selection model composed of four machine-learning algorithms is developed to emphasize the regional impact of community features on the case fatality rate (CFR). A COVID-19 vulnerability index (COVULin) is proposed to measure the county's vulnerability, the effects of model's parameters on mortality, and the efficiency of control policies. The results showed that the dense counties in which minority groups represent more than 45% of the population and those with poverty rates greater than 24% were the most vulnerable counties during the first and the last pandemic peaks, respectively. Highly-correlated CFR and COVULin scores indicated a close agreement between the model outcomes and COVID-19 impacts. Counties with higher poverty and uninsured rates were the most resistant to government intervention. It is anticipated that the proposed model can play an essential role in identifying vulnerable communities and help reduce damages during long-term alike disasters.Entities:
Keywords: COVID-19 control policies; Data analysis; Machine learning; Social vulnerability; The U.S. cities
Year: 2022 PMID: 35692599 PMCID: PMC9167466 DOI: 10.1016/j.scs.2022.103990
Source DB: PubMed Journal: Sustain Cities Soc ISSN: 2210-6707 Impact factor: 10.696
Fig. 1County-level COVID-19 mortality and morbidity in the U.S. (a) Confirmed cases during the first peak. (b) Confirmed cases during the last peak. (c) Mortality during the first peak. (d) Mortality during the last peak. (e) Confirmed and mortality rates during the first and second pandemic peaks.
Fig. 2A representation of employed methods and outcomes. (a) A flow diagram of the process and methods from data collection to the comparison of outcomes for 2 different peaks. (b) A graphical view of the expected outcomes at each step of the research.
Explanatory variables and policies used in this study together with definitions and sources.
| Category | Variable name | Source |
|---|---|---|
Socioeconomic status | Percentage of population below a poverty level Percentage of the unemployed population age 16 and older Estimated income per capita Percentage of persons age 25+ with no high school diploma | |
Household composition/disability | Percentage of persons aged 65 and older Percentage of persons aged 17 and younger Percentage of civilian noninstitutionalized population age 5 and older with a disability Percentage of single-parent households with children under 18 | |
Minority status/language | Estimated minority percentage of total population (all persons except white, non-Hispanic) Percentage of persons age 5 and older who speak English "less than well" | |
Housing/transportation | Percentage of housing in structures with 10 or more units Percentage of mobile homes Percentage of occupied housing units with more people than rooms Percentage of households with no vehicle available Percentage of persons in institutionalized group quarter | |
Health-care system factors | Percentage of the population uninsured Percentage of households without access to indoor plumbing Intensive care unit beds per 100,000 people Hospital beds per 100,000 people Agency for Healthcare Research and Quality – Prevention Quality Indicator Overall Composite: admission rates for preventable conditions (via good outpatient care) adjusted per population Emergency services per 100,000 people (includes emergency and relief services and freestanding ambulatory surgical and emergency centers) Epidemiologists per 100,000 people Health labs per 100,000 people Health spending per capita Total Public Health Emergency Preparedness funding per capita Long-term care (nursing homes, assisted living, and care homes) residents per 100,000 | |
High-risk environment | Percentage of the population employed in a high-risk industry (includes employees in farming, manufacturing, printing, and related support activities and textile North American Industry Classification System subsectors) Prisons population per 100,000 | |
Population | County level population density (person/km2) County-level population | |
Pandemic control policies | Including state-level stay-at-home orders, travel restrictions, etc. Mask-mandate order (state- and county-level mask mandates |
Fig. 3A hybrid feature-selection method to determine the variables most related to COVID-19 CFRs.
Fig. 4Mask mandate timeline.in four states.
Fig. 5(a) The most important features related to COVID-19 CFR in the first peak, and (b) the most important features related to CFR in the last peak. (c) A comparison of the top 10% of counties with the highest value of each feature among 200 counties with the highest mortality in each peak.
Fig. 6Clustering results in (a) a GIS map of counties and (b) a significance diagram of clustered features.
Fig. 7Machine-learning clustering and classification results. (a) Comparing clusters considering the impact of COVID-19 on counties during the two different peaks. (b) Classification validation using a ROC-AUC technique for test data.
Fig. 8(a) COVULin map for the first peak, and (b) COVULin map during the last peak. (c) Distribution of most-least vulnerable counties in four clusters.
Fig. 9Comparing counties in two pandemic peaks, considering the policies regarding (a) the number of counties reporting an increase or decrease in indicators (deaths, confirmed cases, CFR, and COVULin score) in the last peak compared with the first, and (b) four important variables in counties that imposed restrictive policies and reported an increase (red) or decrease (blue) in COVULin scores during the last peak (compare to the first peak).