| Literature DB >> 32534259 |
Abstract
Several recent studies have explored the association between environmental factors, such as temperature, humidity, and air pollution, and the severity of the COVID-19 outbreak by analyzing the statistical association at the district level. However, we argue that the modifiable areal unit problem (MAUP) arises when aggregating disease and environmental data into districts, leading to bias in such studies. Therefore, in this study, we analyzed the association between environmental factors and the number of COVID-19 death cases under different aggregation strategies to illustrate the presence of MAUP. We used real-world COVID-19 outbreak data from the Hubei and Henan Provinces and studied their association with atmospheric NO2 levels. By fitting linear regression models with penalized splines on NO2, we found that the association between COVID-19 mortality and NO2 varies when data were aggregated (1) at the city level, (2) under two different aggregation strategies, and (3) at the provincial level, indicating the presence of MAUP. Therefore, this study reminds researchers of the presence of MAUP and the necessity to minimize this problem while exploring the environmental determinants of the COVID-19 outbreak.Entities:
Keywords: COVID-19; Modifiable areal unit problem; NO(2); Spatial analysis; Statistical bias
Mesh:
Year: 2020 PMID: 32534259 PMCID: PMC7274979 DOI: 10.1016/j.scitotenv.2020.139984
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Fig. 1An example of MAUP.
Note: a denotes the home addresses of several residents who are either cases or non-cases and their temperature level. b indicates the association between disease and temperature at the individual level. We aggregated individual-level disease data into districts and calculated the prevalence at the district level and the average temperature (area-weighted average). We found that the association between district-level prevalence and average temperature can be positive, null, or negative.
Fig. 2Relationship between daily NO2 (μg/m3) and COVID-19 death cases under different scenarios in Hubei Province.
a shows different aggregation strategies. b shows the corresponding NO2-death relationship under each scenario. The unit for NO2 on the x-axes is μg/m3 (b).
At the city level, each city was presented individually (denoted with different colors); for the two aggregation strategies, adjacent cities were aggregated into districts (denoted with different colors); at the provincial level, all city-level data were aggregated. The COVID-19 data were available from the “nCov2019” package in R, and the environmental data were obtained from the China National Environmental Monitoring Centre (National real time air quality data platform of the China National Environmental Monitoring Centre). We used data from January 27, 2020 to March 10, 2020, when COVID-19 surged in China. During aggregation, deaths across the city were summed and the daily NO2 level on the same day was averaged.
Under aggregation strategy 1, cities were aggregated into three districts in the following manner: District 1: Enshi, Yichang, Xiangyang, and Shiyan; District 2: Jingmen, Jingzhou, Suizhou, Xiaogan, and Wuhan; District 3: Hunaggang, Xianning, Huangshi, and Ezhou. Under aggregation strategy 2, District 1: Xiangyang, Shiyan, Suizhou, and Xiaogan; District 2: Enshi, Jingmen, and Yichang; District 3: Jingzhou, Wuhan, and Huanggang; District 4: Ezhou, Huangshi, and Xianning. Four cities were excluded because of missing environmental data or zero COVID-19 mortality.
We fit regression models with (1) the linear term of NO2 to estimate the beta coefficient and (2) a penalized spline on NO2 to estimate the dose-response curves with COVID-19 death cases under different aggregation strategies. For city- and district-level data, we put a random effect on the city or aggregated district. The dose-response curves in b were placed in the same order as those in a. The corresponding beta coefficients for the four scenarios were − 0.052 (city level), −0.75 (strategy 1), −0.83 (strategy 2), and − 1.30 (provincial level).
Fig. 3Relationship between daily NO2 (μg/m3) and COVID-19 Death Cases under Different scenarios in Henan Province.
a shows the different aggregation strategies. b shows the corresponding NO2-death relationship under different aggregation strategies. The x-axes unit for NO2 is μg/m3 (b).
Similar to Hubei Province, data from January 27, 2020 to March 10, 2020 was used because mortality subsequently fell to zero.
Under aggregation strategy 1, cities were aggregated into three districts in the following manner: District 1: Sanmenxia, Nanyang, and Xinyang; District 2: Luoyang, Pingdingshan, Xinxiang, and Jiaozuo; District 3: Zhengzhou, Xuchang, Zhoukou, and Shangqiu. Under aggregation strategy 2, District 1: Xinyang, Nanyang, and Pingdingshan; District 2: Shangqiu, Zhoukou, and Xuchang; District 3: Xinxiang, Jiaozuo, Luoyang, Sanmenxia, and Zhengzhou. Seven cities were excluded because of no COVID-19 mortality.
We fit regression models with (1) the linear term of NO2 to estimate the beta coefficient and (2) a penalized spline on NO2 to estimate the dose-response curves with COVID-19 death cases under different aggregation strategies. For city- and district-level data, we put a random effect on the city or aggregated district. The dose-response curves in b were placed in the same order as those in a. The corresponding beta coefficients for the four scenarios were 1.9 ∗ 10−3 (city level),−2.9 ∗ 10−3 (strategy 1), 1.2 ∗ 10−2 (strategy 2), and 4.7 ∗ 10−5 (provincial level).