Literature DB >> 33148655

Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis.

X Wu1, R C Nethery1, M B Sabath1, D Braun1,2, F Dominici3.   

Abstract

Assessing whether long-term exposure to air pollution increases the severity of COVID-19 health outcomes, including death, is an important public health objective. Limitations in COVID-19 data availability and quality remain obstacles to conducting conclusive studies on this topic. At present, publicly available COVID-19 outcome data for representative populations are available only as area-level counts. Therefore, studies of long-term exposure to air pollution and COVID-19 outcomes using these data must use an ecological regression analysis, which precludes controlling for individual-level COVID-19 risk factors. We describe these challenges in the context of one of the first preliminary investigations of this question in the United States, where we found that higher historical PM2.5 exposures are positively associated with higher county-level COVID-19 mortality rates after accounting for many area-level confounders. Motivated by this study, we lay the groundwork for future research on this important topic, describe the challenges, and outline promising directions and opportunities.
Copyright © 2020 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

Entities:  

Mesh:

Year:  2020        PMID: 33148655      PMCID: PMC7673673          DOI: 10.1126/sciadv.abd4049

Source DB:  PubMed          Journal:  Sci Adv        ISSN: 2375-2548            Impact factor:   14.136


INTRODUCTION

The suddenness and global scope of the coronavirus disease 2019 (COVID-19) pandemic have raised urgent questions that require coordinated investigation to slow the disease’s devastation. A critically important public health objective is to identify key modifiable environmental factors that may contribute to the severity of health outcomes [e.g., intensive care unit (ICU) hospitalization and death] among individuals with COVID-19. Numerous scientific studies reviewed by the U.S. Environmental Protection Agency (EPA) have linked fine particles (PM2.5; particles with diameter, ≤ 2.5 μm) to a variety of adverse health events () including death (). It has been hypothesized that because long-term exposure to PM2.5 adversely affects the respiratory and cardiovascular systems and increases mortality risk (–), it may also exacerbate the severity of COVID-19 symptoms and worsen the prognosis of this disease (). Epidemiological studies to estimate the association between long-term exposure to air pollution and COVID-19 hospitalization and death is a rapidly expanding area of research that is attracting attention around the world. Two studies have been published using data from European countries (, ), and many more are available as preprints. However, because of the unprecedented nature of the pandemic, researchers face serious challenges when conducting these studies. One key challenge is that, to our knowledge, individual-level data on COVID-19 health outcomes for large, representative populations are not publicly available or accessible to the scientific community. Therefore, the only way to generate preliminary evidence on the link between PM2.5 and COVID-19 severity and outcomes using these aggregate data is to use an ecological regression analysis. With this study design, publicly available area-level COVID-19 mortality rates are regressed against area-level air pollution concentrations while accounting for area-level potential confounding factors. Here, we discuss the strengths and limitations of conducting ecological regression analyses of air pollution and COVID-19 health outcomes and describe additional challenges related to evolving data quality, statistical modeling, and control of measured and unmeasured confounding, paving the way for future research on this topic. We discuss these challenges and illustrate them in the context of a specific study, in which we investigated the impact of long-term PM2.5 exposure on COVID-19 mortality rates in 3089 counties in the United States, covering 98% of the population.

Illustration of an ecological regression analysis of historical exposure to PM2.5 and COVID-19 mortality rate

We begin by describing how to conduct an ecological regression analysis in this setting. COVID-19 death counts (a total of 116,747 deaths) were obtained from the Johns Hopkins University Coronavirus Resource Center and were cumulative up to 18 June 2020. We used data from 3089 counties, of which 1244 (40.3%) had reported zero COVID-19 deaths at the time of our analysis. Daily PM2.5 concentrations were estimated across the United States on a 0.01° × 0.01° grid for the period 2000–2016 using well-validated atmospheric chemistry and machine learning models (). We used zonal statistics to aggregate PM2.5 concentration estimates to the county level and then averaged across the period 2000–2016 to perform health outcome analyses. Figure 1 illustrates the spatial variation in 2000–2016 average (hereafter referred to as “long-term average”) PM2.5 concentrations and COVID-19 mortality rates (per 1 million population) by county.
Fig. 1

National maps of historical PM2.5 concentrations and COVID-19 deaths.

Maps show (A) county-level 17-year long-term average of PM2.5 concentrations (2000–2016) in the United States in μg/m3 and (B) county-level number of COVID-19 deaths per 1 million population in the United States up to and including 18 June 2020.

National maps of historical PM2.5 concentrations and COVID-19 deaths.

Maps show (A) county-level 17-year long-term average of PM2.5 concentrations (2000–2016) in the United States in μg/m3 and (B) county-level number of COVID-19 deaths per 1 million population in the United States up to and including 18 June 2020. We fit a negative binomial mixed model using COVID-19 mortality rates as the outcome and long-term average PM2.5 as the exposure of interest, adjusting for 20 county-level covariates. We conducted more than 80 sensitivity analyses to assess the robustness of the findings to various modeling assumptions. We found that an increase of 1 μg/m3 in the long-term average PM2.5 is associated with a statistically significant 11% (95% CI, 6 to 17%) increase in the county’s COVID-19 mortality rate (see Table 1); this association continues to be stable as more data accumulate (fig. S3). We also found that population density, days since the first COVID-19 case was reported, median household income, percent of owner-occupied housing, percent of the adult population with less than high school education, age distribution, and percent of Black residents are important predictors of the COVID-19 mortality rate in the model. We found a 49% (95% CI, 38 and 61%) increase in COVID-19 mortality rate associated with a 1-SD (per 14.1%) increase in percent Black residents of the county. Details on the data sources, statistical methods, and analyses are summarized in the Supplementary Materials. All data sources used in the analyses, along with fully reproducible code, are publicly available at .
Table 1

Mortality rate ratios (MRR), 95% confidence intervals (CI), and P values for all variables in the main analysis.

Details of the statistical models are available in section S2. Q, quintile.

MRR95% CIP value
PM2.51.11(1.06–1.17)0.00
Population density(Q2)0.91(0.71–1.15)0.42
Population density(Q3)0.91(0.71–1.16)0.45
Population density(Q4)0.74(0.57–0.95)0.02
Population density(Q5)0.92(0.69–1.23)0.56
% In poverty1.04(0.96–1.12)0.31
Log(median housevalue)1.13(0.99–1.29)0.07
Log(medianhousehold income)1.19(1.04–1.35)0.01
% Owner-occupiedhousing1.12(1.04–1.20)0.00
% Less than highschool education1.20(1.10–1.32)0.00
% Black1.49(1.38–1.61)0.00
% Hispanic1.06(0.97–1.16)0.23
% ≥ 65 years of age1.04(0.93–1.17)0.46
% 45–64 years ofage0.77(0.67–0.90)0.00
% 15–44 yearsof age0.76(0.68–0.85)0.00
Days sincestay-at-home order1.18(0.92–1.52)0.20
Days since first case2.40(2.05–2.80)0.00
Rate of hospitalbeds1.00(0.93–1.08)0.95
% Obese0.96(0.90–1.03)0.32
% Smokers1.13(1.00–1.28)0.05
Average summertemperature (°F)1.11(0.95–1.30)0.20
Average wintertemperature (°F)0.86(0.69–1.07)0.19
Averagesummer relativehumidity (%)0.93(0.80–1.09)0.38
Averagewinter relativehumidity (%)0.97(0.87–1.07)0.52

Mortality rate ratios (MRR), 95% confidence intervals (CI), and P values for all variables in the main analysis.

Details of the statistical models are available in section S2. Q, quintile.

Strengths and limitations of an ecological regression analysis

Ecological regression analysis provides a simple and cost-effective approach for studying potential associations between historical exposure to air pollution and increased vulnerability to COVID-19 in large representative populations, as illustrated in our study in the previous section. This approach is regularly applied in many areas of research (). Using our study as an example, we summarize in Table 2 the strengths, limitations, and opportunities considering (i) study design, (ii) COVID-19 health outcome data, (iii) historical exposure to air pollution, and (iv) measured and unmeasured confounders, with the goal of paving the way for future research.
Table 2

Strengths and limitations of ecological regression analyses applied to research on air pollution and COVID-19 and opportunities for future research.

StrengthsLimitationsFuture research
Study design: ecological regressionFeasible, timely, and cost-effectiveCannot be used to make inferenceabout individual-levelassociations, doing so leads toecological fallacyAugment county-level data withindividual-level data to adjust forecological bias (12)
Data are representative of the entireU.S. populationCannot adjust for individual-level riskfactors such as age, gender, andrace (1921)Conduct studies of individual-levelhealth records using traditionalregression and causal inferencemethods as in Wu et al. (2)
Allows inference at the area level,which can be useful forpolicy-making (11)Results are sensitive to theassumptions of the statisticalmodel (11)
Computationally efficient and can beconducted daily to allow for thedynamic nature of the data andobserve temporal trends; see fig. S3
Facilitates comparison of resultsacross countries
Outcome: COVID-19 deathsaggregated at the county levelPublicly available data updatedalmost dailyPotential for outcome misclassification(22), particularly differentialmisclassification over time andspace, which could bias resultsAccess to nationwide registry data withthe validated cause of death (14)
Analyses using county excess deathsas the outcome (23)
Exposure: 2000–2016 averageexposure to PM2.5 at the countylevelUse of well-validated atmosphericchemistry models and machinelearning models (9, 24)Aggregation assumes that everyonein a county experiences the sameexposures, leading to exposuremisclassification, especially for thelargest countiesIndividual-level data on COVID-19deaths with geocoded addressesto link to air pollution data at theplace of residence
PM2.5 exposure estimated at finegrids, which can be aggregated tothe county level to assess exposureeven in unmonitored areas (24)Can be used to assess historicalexposures to air pollution but notreal-time exposuresAdditional statistical methods toaccount for the propagation ofexposure error into the ecologicalregression model (13)
As opposed to using monitor data,aggregation of modeled estimatesensures that county PM2.5exposure estimates represent thedistribution across the entire area
Measured confoundersMore than 20 area-level variablescapture age distribution, racedistribution, socioeconomicstatus, population density,behavioral risk factors, epidemicstage, and stay-at-home orders(see tables S1 and S2)County average features may notrepresent the features ofCOVID-19 patients, leading toinadequate adjustmentCausal inference approaches toadjust for measured confoundingbias, producing results that areless sensitive to statisticalmodeling assumptions
These overlap with the confoundersets used in much of the previousliterature on air pollution andhealth (25, 26)Difficult to formalize the notion of“epidemic stage,” which may bean important confounder
The threat of unmeasuredconfounding bias still presentCausal inference approaches toassess covariate balance (2)
Sensitive to the form of the statisticalmodel specified (i.e., assumptions oflinearity and no effect modification)Individual-level data on keymeasured confounders such assmoking and body mass index
Unmeasured confoundersLeverage existing approaches, suchas the calculation of the E-value(27), to assess how strong theeffect of an unmeasuredconfounder would need to be toexplain away the associationsdetected (see section S3)The most important threat to thevalidity of any observational studyNatural experiment designs andinstrumental variables can beused to reduce the threat ofunmeasured confounding but areless common
Even measures like the E-valuecannot inform us about thelikelihood that a strongunmeasured confounder exists;this must be evaluated on thebasis of subject matter knowledge
Among the key limitations, by design, ecological regression analyses are unable to adjust for individual-level risk factors (e.g., age, race, and smoking status); when individual-level data are unavailable, this approach leaves us unable to make conclusions regarding individual-level associations. In the context of COVID-19 health outcomes, this is a severe limitation, as individual-level risk factors are known to affect COVID-19 health outcomes. It is important to note that confusion between ecological associations and individual associations may present an ecological fallacy. In extreme cases, this fallacy can lead to associations detected in ecological regression that do not exist or are in the opposite direction of true associations at the individual level. However, ecological regression analyses still allow us to make conclusions at the area level, which can be useful for policy-making (). For the association between COVID-19 health outcomes and PM2.5 exposure, we argue that area-level conclusions are valuable, as they can inform important immediate policy actions that will benefit public health, such as (i) prioritization of precautionary measures [e.g., personal protective equipment (PPE) allocations and hospital beds] to areas with historical higher air pollution and (ii) further strengthening the scientific argument for lowering the U.S. National Ambient Air Quality Standards for PM2.5 and other pollutants. To completely avoid potential ecological bias, a representative sample of individual-level data is necessary. While this may not be feasible in the near future, as some COVID-19 outcome data become available at the individual level, existing approaches that augment county-level data with individual-level data () could be used to correct for ecological bias. Furthermore, air pollution exposure misclassification, due to between-area mobility and within-area variation, is another potential source of bias that could affect the ecological regression results described in our example study. Methods to account for the propagation of exposure error into the ecological regression model () could be applied to help mitigate the impact of measurement error. Outcome misclassification is another limitation that can be partially overcome by accessing nationwide registry data with the validated cause of death (). As in all observational studies, adjustment for measured and unmeasured confounding presents another key challenge in ecological regression analyses, which may be exacerbated when dealing with dynamic pandemic data, as in our study. Conducting studies using both traditional regressions and methods for causal inference as in Wu et al. () is necessary to assess the robustness of the findings. Increasing the scientific rigor of research in this area requires access to representative, individual-level data on COVID-19 health outcomes, including information about patients’ residential address, demographics, and individual-level confounders. This is an enormous challenge that will require consideration of many privacy, legal, and ethical trade-offs (). Future areas of research also include the application of statistical methods to quantify and correct for ecological bias and measurement error, reproducible methods for causal inference, and sensitivity analysis of measured and unmeasured confounding bias as suggested above. These strengths and limitations are illustrated further in the context of our own study (see the Supplementary Materials).

DISCUSSION

Ecological regression analyses are crucial to stimulate innovations in a rapidly evolving area of research. Ongoing research has already focused on overcoming some aspects of these limitations (, ). For example, ecological regression analysis of air pollution and COVID-19, using data with finer geographic resolution, is being conducted for different countries and regions around the world. Cole et al. () published an ecological regression analysis using data in Dutch municipalities and found results consistent with our own investigation; the California Air Resources Board (CARB) is planning to conduct a similar study at the census tract level (). Although an ecological regression analysis cannot provide insight into the mechanisms underlying the relationship between PM2.5 exposure and COVID-19 mortality, studies are starting to shed light on the potential biological mechanisms that may explain the relationship between air pollution and viral infection outcomes (). For example, it has been hypothesized that chronic exposure to PM2.5 causes alveolar angiotensin-converting enzyme 2 (ACE-2) receptor overexpression and impairs host defenses (). This could cause a more severe form of COVID-19 in ACE-2–depleted lungs, increasing the likelihood of poor outcomes, including death (). The associations detected in ecological regression analyses provide strong justification for follow-up investigations as more and higher-quality COVID-19 data become available. Such studies would include validation of our findings with other data sources and study types, as well as investigations into mediating factors and effect modifiers, biological mechanisms, impacts of PM2.5 exposure timing, and relationships between PM2.5 and other COVID-19 outcomes such as hospitalization. Research on how modifiable factors may exacerbate COVID-19 symptoms and increase mortality risk is essential to guide policies and behaviors to minimize fatality related to the pandemic. Such research could also provide a strong scientific argument for revision of the U.S. Ambient Air Quality Standards for PM2.5 and other environmental policies in the midst of a pandemic.
  46 in total

Review 1.  Ecological bias, confounding, and effect modification.

Authors:  S Greenland; H Morgenstern
Journal:  Int J Epidemiol       Date:  1989-03       Impact factor: 7.196

2.  Improving ecological inference using individual-level data.

Authors:  Christopher Jackson; Nicky Best; Sylvia Richardson
Journal:  Stat Med       Date:  2006-06-30       Impact factor: 2.373

3.  Short-Term Elevation of Fine Particulate Matter Air Pollution and Acute Lower Respiratory Infection.

Authors:  Benjamin D Horne; Elizabeth A Joy; Michelle G Hofmann; Per H Gesteland; John B Cannon; Jacob S Lefler; Denitza P Blagev; E Kent Korgenski; Natalie Torosyan; Grant I Hansen; David Kartchner; C Arden Pope
Journal:  Am J Respir Crit Care Med       Date:  2018-09-15       Impact factor: 21.405

4.  Impact of Long-Term Exposures to Ambient PM2.5 and Ozone on ARDS Risk for Older Adults in the United States.

Authors:  Jongeun Rhee; Francesca Dominici; Antonella Zanobetti; Joel Schwartz; Yun Wang; Qian Di; John Balmes; David C Christiani
Journal:  Chest       Date:  2019-03-26       Impact factor: 9.410

5.  Regional Estimates of Chemical Composition of Fine Particulate Matter Using a Combined Geoscience-Statistical Method with Information from Satellites, Models, and Monitors.

Authors:  Aaron van Donkelaar; Randall V Martin; Chi Li; Richard T Burnett
Journal:  Environ Sci Technol       Date:  2019-02-12       Impact factor: 9.028

6.  Exposure to urban air particulates alters the macrophage-mediated inflammatory response to respiratory viral infection.

Authors:  S Becker; J M Soukup
Journal:  J Toxicol Environ Health A       Date:  1999-08-13

7.  Evaluating the impact of long-term exposure to fine particulate matter on mortality among the elderly.

Authors:  X Wu; D Braun; J Schwartz; M A Kioumourtzoglou; F Dominici
Journal:  Sci Adv       Date:  2020-07-17       Impact factor: 14.136

8.  Cardiovascular mortality and long-term exposure to particulate air pollution: epidemiological evidence of general pathophysiological pathways of disease.

Authors:  C Arden Pope; Richard T Burnett; George D Thurston; Michael J Thun; Eugenia E Calle; Daniel Krewski; John J Godleski
Journal:  Circulation       Date:  2003-12-15       Impact factor: 29.690

9.  Possible explanations for why some countries were harder hit by the pandemic influenza virus in 2009 - a global mortality impact modeling study.

Authors:  Kathleen F Morales; John Paget; Peter Spreeuwenberg
Journal:  BMC Infect Dis       Date:  2017-09-25       Impact factor: 3.090

10.  Severe air pollution links to higher mortality in COVID-19 patients: The "double-hit" hypothesis.

Authors:  Antonio Frontera; Lorenzo Cianfanelli; Konstantinos Vlachos; Giovanni Landoni; George Cremona
Journal:  J Infect       Date:  2020-05-21       Impact factor: 6.072

View more
  232 in total

1.  Influenza Vaccination and COVID-19 Mortality in the USA: An Ecological Study.

Authors:  Claudio Zanettini; Mohamed Omar; Wikum Dinalankara; Eddie Luidy Imada; Elizabeth Colantuoni; Giovanni Parmigiani; Luigi Marchionni
Journal:  Vaccines (Basel)       Date:  2021-04-24

2.  Effects of country success on COVID-19 cumulative cases and excess deaths in 169 countries.

Authors:  A Kaklauskas; V Milevicius; L Kaklauskiene
Journal:  Ecol Indic       Date:  2022-02-25       Impact factor: 4.958

3.  Challenges of COVID-19 Vaccination Delivery in Pakistan.

Authors:  Shahan Tariq; Muhammad Ammar Bin Hamid
Journal:  Pak J Med Sci       Date:  2021 Nov-Dec       Impact factor: 1.088

Review 4.  COVID-19 false dichotomies and a comprehensive review of the evidence regarding public health, COVID-19 symptomatology, SARS-CoV-2 transmission, mask wearing, and reinfection.

Authors:  Kevin Escandón; Angela L Rasmussen; Isaac I Bogoch; Eleanor J Murray; Karina Escandón; Saskia V Popescu; Jason Kindrachuk
Journal:  BMC Infect Dis       Date:  2021-07-27       Impact factor: 3.090

5.  An Italian individual-level data study investigating on the association between air pollution exposure and Covid-19 severity in primary-care setting.

Authors:  Valeria Pegoraro; Franca Heiman; Antonella Levante; Duccio Urbinati; Ilaria Peduto
Journal:  BMC Public Health       Date:  2021-05-12       Impact factor: 3.295

6.  Characterizing all-cause excess mortality patterns during COVID-19 pandemic in Mexico.

Authors:  Sushma Dahal; Juan M Banda; Ana I Bento; Kenji Mizumoto; Gerardo Chowell
Journal:  BMC Infect Dis       Date:  2021-05-07       Impact factor: 3.090

7.  Toward Urban Planetary Health Solutions to Climate Change and Other Modern Crises.

Authors:  Jonathan A Patz; José G Siri
Journal:  J Urban Health       Date:  2021-06       Impact factor: 3.671

8.  Changes in short-lived climate pollutants during the COVID-19 pandemic in Tehran, Iran.

Authors:  Faezeh Borhani; Majid Shafiepour Motlagh; Andreas Stohl; Yousef Rashidi; Amir Houshang Ehsani
Journal:  Environ Monit Assess       Date:  2021-05-08       Impact factor: 3.307

9.  Commentary: Intersectional perspectives on COVID-19 exposure.

Authors:  Jade S Sasser; Bronwyn Leebaw; Cesunica Ivey; Brandon Brown; Chikako Takeshita; Alexander Nguyen
Journal:  J Expo Sci Environ Epidemiol       Date:  2021-05-08       Impact factor: 5.563

10.  Associations of acute exposure to airborne pollutants with COVID-19 infection: evidence from China.

Authors:  Bingqing Lu; Na Wu; Jiakui Jiang; Xiang Li
Journal:  Environ Sci Pollut Res Int       Date:  2021-05-08       Impact factor: 5.190

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.