| Literature DB >> 35121784 |
Ali Cheshmehzangi1,2,3, Yujian Li4, Haoran Li5, Shuyue Zhang5, Xiangliang Huang5, Xu Chen5, Zhaohui Su6, Maycon Sedrez5,7, Ayotunde Dawodu5,7.
Abstract
With evidence-based measures, COVID-19 can be effectively controlled by advanced data analysis and prediction. However, while valuable insights are available, there is a shortage of robust and rigorous research on what factors shape COVID-19 transmissions at the city cluster level. Therefore, to bridge the research gap, we adopted a data-driven hierarchical modeling approach to identify the most influential factors in shaping COVID-19 transmissions across different Chinese cities and clusters. The data used in this study are from Chinese officials, and hierarchical modeling conclusions drawn from the analysis are systematic, multifaceted, and comprehensive. To further improve research rigor, the study utilizes SPSS, Python and RStudio to conduct multiple linear regression and polynomial best subset regression (PBSR) analysis for the hierarchical modeling. The regression model utilizes the magnitude of various relative factors in nine Chinese city clusters, including 45 cities at a different level of clusters, to examine these aspects from the city cluster scale, exploring the correlation between various factors of the cities. These initial 12 factors are comprised of 'Urban population ratio', 'Retail sales of consumer goods', 'Number of tourists', 'Tourism Income', 'Ratio of the elderly population (> 60 year old) in this city', 'population density', 'Mobility scale (move in/inbound) during the spring festival', 'Ratio of Population and Health facilities', 'Jobless rate (%)', 'The straight-line distance from original epicenter Wuhan to this city', 'urban per capita GDP', and 'the prevalence of the COVID-19'. The study's results provide rigorously-tested and evidence-based insights on most instrumental factors that shape COVID-19 transmissions across cities and regions in China. Overall, the study findings found that per capita GDP and population mobility rates were the most affected factors in the prevalence of COVID-19 in a city, which could inform health experts and government officials to design and develop evidence-based and effective public health policies that could curb the spread of the COVID-19 pandemic.Entities:
Mesh:
Year: 2022 PMID: 35121784 PMCID: PMC8817036 DOI: 10.1038/s41598-022-05859-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Nine representative city clusters highlighted in China’s City Cluster Plan (Source: The map of China comes from the National Platform for Common Geospatial Information Services in China. The map is generated from this available database and relevant clusters operation is completed on Powerpoint).
The urbanization hierarchical structure of nine selected city clusters in China, three per level (high, medium, and low), including five cities per city cluster.
Figure 2Overview of methodology and model analysis.
Figure 3Flow chart of multiple linear regression analysis.
Figure 4Flow chart for best subset regression analysis by R.
Figure 5Prevalence rates in 45 cities of nine clusters and different tier level orders.
Figure 6The Ridge paths of parameters.
Comparison of cases among different cities and city clusters.
| Prevalence | High level clusters | Middle level clusters | Low level clusters | Average |
|---|---|---|---|---|
| Tier 1st | 32.25 | 13.28 | 11.58 | 19.04 |
| Tier 2nd | 13.95 | 16.46 | 8.32 | 12.91 |
| Tier 3rd | 13.10 | 4.15 | 4.04 | 7.10 |
| Tier 4th | 4.84 | 6.96 | 3.47 | 5.09 |
| Tier 5th | 11.65 | 6.88 | 2.76 | 7.10 |
| Average | 15.16 | 9.55 | 6.04 | N/A |
The Pearson correlation analyses between the economy, social mobility, population structure, and geographical location variables and the city level prevalence of COVID-19.
Dark color** Correlation is significant at the 0.01 level.
Moderate color* Correlation is significant at the 0.05 level.
Light color indicates ‘No strong correlation’.
Mallow’s and parameter selections of 11 variables.
| Cluster | Selected p | Mallow’s | |
|---|---|---|---|
| High-level | 5 | ‘LTN’, ‘JOR’, ‘PHF’, ‘PGDP’ ‘DRW’. LMS | 0.488 |
| Middle-level | 4 | ‘LTI’, ‘JOR’, ‘PHF’ ‘DRW’ | − 0.1436 |
| Low-level | 6 | ‘LTN’, ‘LTI’, ‘LPD’ ‘JOR’, ‘PGDP’, ‘LDW’ | 4.3701 |
The parameter numbers () indicates the maximal parameter number in the mathematical model.
AIC and effective parameters.
| Cluster | Parameters (except prevalence) | |
|---|---|---|
| High-level | − 34.99 | LTN, JOR, PGDP |
| Middle-level | − 44.82 | LRS, LTN, LMS, JOR, PHF, DRW |
| Low-level | − 52.61 | LRS, LTN, LTI, LMS, UPR, JOR, PHF, ROA, PGDP, DRW |