
Imputation methods for addressing missing data in short-term monitoring of air pollutants.

Steven J Hadeed, Mary Kay O'Rourke, Jefferey L Burgess, Robin B Harris, Robert A Canales.

Abstract

Monitoring of environmental contaminants is a critical part of exposure sciences research and public health practice. Missing data are often encountered when performing short-term monitoring (<24 h) of air pollutants with real-time monitors, especially in resource-limited areas. Approaches for handling consecutive periods of missing and incomplete data in this context remain unclear. Our aim is to evaluate existing imputation methods for handling missing data for real-time monitors operating for short durations. In a current field study, real-time PM2.5 monitors were placed outside 20 households and ran for 24 hours. Missing data were simulated in these households as consecutive periods at four levels of missingness (20%, 40%, 60%, and 80%). Univariate (Mean, Median, Last Observation Carried Forward, Kalman Filter, Random, Markov) and multivariate time-series (Predictive Mean Matching, Row Mean Method) methods were used to impute missing concentrations, and performance was evaluated using five error metrics (Absolute Bias, Percent Absolute Error in Means, R² Coefficient of Determination, Root Mean Square Error, Mean Absolute Error). The univariate Markov, random, and mean imputation methods performed best, yielding 24-hour mean concentrations with the lowest error and highest R² values across all levels of missingness. When error metrics were evaluated minute by minute, Kalman filter, median, and Markov methods performed well at low levels of missingness (20-40%). At higher levels of missingness (60-80%), however, Markov, random, median, and mean imputation performed best on average. Multivariate methods were the worst-performing imputation methods across all levels of missingness. Imputation using univariate methods may provide a reasonable solution for addressing missing data in short-term monitoring of air pollutants, especially in resource-limited areas. Further efforts are needed to evaluate imputation methods that are generalizable across a diverse range of study environments.
Copyright © 2020 Elsevier B.V. All rights reserved.
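
The evaluation design described in the abstract (blank out a consecutive block of readings, impute it, then score the result against the true values) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the placement of the gap, and the exact metric definitions are assumptions made for illustration, and only the simplest univariate methods (mean, median, Last Observation Carried Forward, random) are shown. Kalman filter, Markov, and the multivariate methods are omitted.

```python
import math
import random
import statistics


def simulate_gap(series, fraction, start):
    """Copy `series` and set one consecutive block (fraction of its length) to None."""
    n_missing = round(len(series) * fraction)
    gapped = list(series)
    for i in range(start, start + n_missing):
        gapped[i] = None
    return gapped


def impute_constant(gapped, stat):
    """Fill gaps with a single statistic (e.g. mean or median) of the observed values."""
    fill = stat([x for x in gapped if x is not None])
    return [fill if x is None else x for x in gapped]


def impute_locf(gapped):
    """Last Observation Carried Forward; a gap at the very start stays None."""
    filled, last = [], None
    for x in gapped:
        if x is not None:
            last = x
        filled.append(last)
    return filled


def impute_random(gapped, rng):
    """Fill each gap with a value drawn at random from the observed values."""
    observed = [x for x in gapped if x is not None]
    return [rng.choice(observed) if x is None else x for x in gapped]


def gap_errors(true, gapped, filled):
    """RMSE and MAE over the imputed positions only (an assumed convention)."""
    diffs = [f - t for t, g, f in zip(true, gapped, filled) if g is None]
    rmse = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    mae = sum(abs(d) for d in diffs) / len(diffs)
    return rmse, mae


def pct_abs_error_in_means(true, filled):
    """Percent absolute error between the full-period means (assumed definition)."""
    true_mean = statistics.mean(true)
    return 100 * abs(statistics.mean(filled) - true_mean) / true_mean


if __name__ == "__main__":
    # Synthetic minute-by-minute series for one 24-hour period (not real PM2.5 data).
    truth = [10 + 5 * math.sin(2 * math.pi * t / 1440) for t in range(1440)]
    gapped = simulate_gap(truth, fraction=0.4, start=300)
    candidates = {
        "mean": impute_constant(gapped, statistics.mean),
        "median": impute_constant(gapped, statistics.median),
        "locf": impute_locf(gapped),
        "random": impute_random(gapped, random.Random(0)),
    }
    for name, filled in candidates.items():
        rmse, mae = gap_errors(truth, gapped, filled)
        print(name, round(rmse, 3), round(mae, 3),
              round(pct_abs_error_in_means(truth, filled), 3))
```

Scoring only the imputed positions corresponds loosely to the minute-by-minute evaluation, while the percent error in means corresponds to the 24-hour mean comparison. The study may define either metric differently.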


Keywords:  Ambient PM2.5; Imputation; Missing data; Real-time monitoring

Year:  2020        PMID: 32402974      PMCID: PMC7745257          DOI: 10.1016/j.scitotenv.2020.139140

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963





