| Literature DB >> 33001396 |
Yaowen Luo1,2, Jianguo Yan3, Stephen McClure4.
Abstract
The COVID-19 outbreak has become a global pandemic. The spatial variation in the environmental, health, socioeconomic, and demographic risk factors of COVID-19 death rate is not well understood. Global models and local linear models were used to estimate the impact of risk factors of the COVID-19, but these do not account for the nonlinear relationships between the risk factors and the COVID-19 death rate at various geographical locations. We proposed a local nonlinear nonparametric regression model named geographically weighted random forest (GW-RF) to estimate the nonlinear relationship between COVID-19 death rate and 47 risk factors derived from the US Environmental Protection Agency, National Center for Environmental Information, Centers for Disease Control and the US census. The COVID-19 data were employed to a global regression model random forest (RF) and a local model GW-RF. The adjusted R2 of the RF is 0.69. The adjusted R2 of the proposed GW-RF is 0.78. The result of GW-RF showed that the risk factors (i.e. going to work by walking, airborne benzene concentration, householder with a mortgage, unemployment, airborne PM2.5 concentration and per cent of the black or African American) have a high correlation with the spatial distribution of the COVID-19 death rate, and these key factors driven from the GW-RF were mapped, which could provide useful implications for controlling the spread of the COVID-19 pandemic.Entities:
Keywords: COVID-19 death rate; Environment; Health; Local nonlinear model; Socioeconomic; Spatial variation
Mesh:
Year: 2020 PMID: 33001396 PMCID: PMC7527667 DOI: 10.1007/s11356-020-10962-2
Source DB: PubMed Journal: Environ Sci Pollut Res Int ISSN: 0944-1344 Impact factor: 5.190
Definitions of indicators and sources
| Theme | Indicators | Indicator meaning | Source |
|---|---|---|---|
| Atmosphere | Airborne PM2.5 concentration | Annual average ambient concentrations of PM2.5 in micrograms per cubic metre | US Environmental Protection Agency ( |
| Airborne benzene concentration | Annual average concentration of benzene estimates in microgram per cubic metre | ||
| Airborne formaldehyde concentration | Annual average air concentration of formaldehyde estimates in microgram per cubic metre | ||
| Airborne acetaldehyde concentration | Annual average air concentration of acetaldehyde estimates in microgram per cubic metre | ||
| Airborne carbon tetrachloride concentration | Annual average air concentration of carbon tetrachloride estimates in microgram per cubic metre | ||
| Climate | Air temperature | Average daily max air temperature (°F) | National Center for Environmental Information ( |
| Precipitation | Average daily precipitation (mm) | ||
| Sunlight exposure | Annual average sunlight exposure measured by solar irradiance (kJ/m2) | Centers for Diseases Control and Prevention ( | |
| UV radiation exposure | Annual average daily dose of UV irradiance (J/m2) | ||
| Land cover | Land cover with water | Per cent of land covered by water | |
| Land cover with forest | Per cent of land covered by forest | ||
| Disaster | Drought | Number of weeks of moderate drought or worse per year | |
| Flood | Percentage of people within FEMA-designated flood hazard area | ||
| Health status | Disability | Percentage of population aged 5 years and over with a disability | |
| Asthma | Per cent of adults diagnosed with asthma | ||
| Obese | Percentage of adults aged 18 years and over who were obese | ||
| Overweight | Percentage of adults aged 18 years and over who were overweight | ||
| Cancer | Number of people with lung and bronchus cancer per 1,000,000 population | ||
| Commuting to work | Go to work by private transportation | Percentage of workers 16 years and over who drove alone (car, truck or van) | US Census Bureau ( |
| Go to work by public transportation | Percentage of workers 16 years and over who go to work by public transportation (excluding taxicab) | ||
| Go to work by walking | Percentage of workers 16 years and over who go to work by walking | ||
| Work at home | Percentage of workers 16 years and over who worked at home | ||
| Mean travel time to work | Mean travel time to work (min) of the workers 16 years and over | ||
| Socioeconomic | Health insurance | Percentage of population without health insurance | |
| Householder with a mortgage | Percentage of household with a mortgage | ||
| Poverty | Percentage of population whose income is below the poverty level | ||
| Service occupations | Percentage of employed population 16 years and over with service occupations | ||
| Unemployment | Percentage of population 16 years and over unemployed | ||
| Hospital | Number of hospitals | Centers for Diseases Control and Prevention ( | |
| Hospital beds | Number of hospital beds per 10,000 population | ||
| People living in group quarter | Percentage of population living in group quarter | US Census Bureau ( | |
| People living near a park | Percentage of population living within a half mile of a park | ||
| Householder with no internet access | Percentage of households with no internet access | ||
| Median household income | |||
| Mean household retirement income | |||
| Mean household cash public assistance income | |||
| Mean household supplemental security income | |||
| Demographic | Per cent of males | ||
| Median age | |||
| Per cent of people under 18 years | |||
| Per cent of people 65 years and over | |||
| Per cent of the white race | |||
| Per cent of the black or African American | |||
| Per cent of American Indian and Alaska Native | |||
| Per cent of Asian | |||
| Per cent of native Hawaiian and other Pacific islander | |||
| Per cent of Hispanic or Latino |
Fig. 1The variable importance of the independent variables of the RF model in modelling COVID-19 death rate
The statistic of local R2 of the GW-RF in modelling COVID-19 death rate; we calculated the average value of local R2 and the percentage of counties in five local R2 range (≤ 0.2, (0.2, 04], (0.4, 06], (0.6, 08], > 0.8)
| The value of local | GW-RF |
|---|---|
| Average value | 0.59 |
| ≤ 0.2 | 1.1% |
| (0.2, 04] | 9.5% |
| (0.4, 06] | 38.9% |
| (0.6, 08] | 44.8% |
| > 0.8 | 5.7% |
Fig. 2The distribution of local R2 of the GW-RF
Fig. 3The average local variable importance of 47 potential risk factors on COVID-19 death rate in the GW-RF model
The proportion of counties with local primary risk factor (the risk factor with the highest value of local variable importance) on COVID-19 death rate at county level in the GW-RF
| Local primary risk factor | Proportion of counties |
|---|---|
| Go to work by walking | 35% |
| Airborne benzene concentration | 25% |
| Householder with a mortgage | 13% |
| Unemployment | 12% |
| Other risk factors | 16% |
Fig. 4The spatial distribution of the local variable importance of a going to work by walking and b airborne benzene concentration on COVID-19 death rate in GW-RF model
Fig. 5The spatial distribution of the local variable importance of a householder with a mortgage and b unemployment on COVID-19 death rate in GW-RF model
Fig. 6The spatial distribution of the local variable importance of a airborne PM2.5 concentration and b per cent of the black or African American on COVID-19 death rate in GW-RF model