Literature DB >> 31229871

A comparison of linear regression, regularization, and machine learning algorithms to develop Europe-wide spatial models of fine particles and nitrogen dioxide.

Jie Chen1, Kees de Hoogh2, John Gulliver3, Barbara Hoffmann4, Ole Hertel5, Matthias Ketzel6, Mariska Bauwelinck7, Aaron van Donkelaar8, Ulla A Hvidtfeldt9, Klea Katsouyanni10, Nicole A H Janssen11, Randall V Martin12, Evangelia Samoli13, Per E Schwartz14, Massimo Stafoggia15, Tom Bellander16, Maciek Strak17, Kathrin Wolf18, Danielle Vienneau19, Roel Vermeulen20, Bert Brunekreef21, Gerard Hoek22.   

Abstract

Empirical spatial air pollution models have been applied extensively to assess exposure in epidemiological studies with increasingly sophisticated and complex statistical algorithms beyond ordinary linear regression. However, different algorithms have rarely been compared in terms of their predictive ability. This study compared 16 algorithms to predict annual average fine particle (PM2.5) and nitrogen dioxide (NO2) concentrations across Europe. The evaluated algorithms included linear stepwise regression, regularization techniques and machine learning methods. Air pollution models were developed based on the 2010 routine monitoring data from the AIRBASE dataset maintained by the European Environmental Agency (543 sites for PM2.5 and 2399 sites for NO2), using satellite observations, dispersion model estimates and land use variables as predictors. We compared the models by performing five-fold cross-validation (CV) and by external validation (EV) using annual average concentrations measured at 416 (PM2.5) and 1396 sites (NO2) from the ESCAPE study. We further assessed the correlations between predictions by each pair of algorithms at the ESCAPE sites. For PM2.5, the models performed similarly across algorithms with a mean CV R2 of 0.59 and a mean EV R2 of 0.53. Generalized boosted machine, random forest and bagging performed best (CV R2~0.63; EV R2 0.58-0.61), while backward stepwise linear regression, support vector regression and artificial neural network performed less well (CV R2 0.48-0.57; EV R2 0.39-0.46). Most of the PM2.5 model predictions at ESCAPE sites were highly correlated (R2 > 0.85, with the exception of predictions from the artificial neural network). For NO2, the models performed even more similarly across different algorithms, with CV R2s ranging from 0.57 to 0.62, and EV R2s ranging from 0.49 to 0.51. The predicted concentrations from all algorithms at ESCAPE sites were highly correlated (R2 > 0.9). For both pollutants, biases were low for all models except the artificial neural network. Dispersion model estimates and satellite observations were two of the most important predictors for PM2.5 models whilst dispersion model estimates and traffic variables were most important for NO2 models in all algorithms that allow assessment of the importance of variables. Different statistical algorithms performed similarly when modelling spatial variation in annual average air pollution concentrations using a large number of training sites.
Copyright © 2019 The Authors. Published by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Fine particles; Land use regression; Machine learning; Nitrogen dioxide

Mesh:

Substances:

Year:  2019        PMID: 31229871     DOI: 10.1016/j.envint.2019.104934

Source DB:  PubMed          Journal:  Environ Int        ISSN: 0160-4120            Impact factor:   9.621


  13 in total

1.  Exploring Built-Up Indices and Machine Learning Regressions for Multi-Temporal Building Density Monitoring Based on Landsat Series.

Authors:  R Suharyadi; Deha Agus Umarhadi; Disyacitta Awanda; Wirastuti Widyatmanti
Journal:  Sensors (Basel)       Date:  2022-06-22       Impact factor: 3.847

2.  High-Resolution Urban Air Quality Mapping for Multiple Pollutants Based on Dense Monitoring Data and Machine Learning.

Authors:  Rong Guo; Ying Qi; Bu Zhao; Ziyu Pei; Fei Wen; Shun Wu; Qiang Zhang
Journal:  Int J Environ Res Public Health       Date:  2022-06-29       Impact factor: 4.614

3.  Long-term exposure to ambient air pollution and bladder cancer incidence in a pooled European cohort: the ELAPSE project.

Authors:  Ole Raaschou-Nielsen; Gerard Hoek; Jie Chen; Sophia Rodopoulou; Maciej Strak; Kees de Hoogh; Tahir Taj; Aslak Harbo Poulsen; Zorana J Andersen; Tom Bellander; Jørgen Brandt; Emanuel Zitt; Daniela Fecht; Francesco Forastiere; John Gulliver; Ole Hertel; Barbara Hoffmann; Ulla Arthur Hvidtfeldt; W M Monique Verschuren; Jeanette T Jørgensen; Klea Katsouyanni; Matthias Ketzel; Anton Lager; Karin Leander; Shuo Liu; Petter Ljungman; Gianluca Severi; Marie-Christine Boutron-Ruault; Patrik K E Magnusson; Gabriele Nagel; Göran Pershagen; Annette Peters; Debora Rizzuto; Yvonne T van der Schouw; Evangelia Samoli; Mette Sørensen; Massimo Stafoggia; Anne Tjønneland; Gudrun Weinmayr; Kathrin Wolf; Bert Brunekreef
Journal:  Br J Cancer       Date:  2022-02-16       Impact factor: 9.075

4.  Modeling spatial variation of gaseous air pollutants and particulate matters in a Metropolitan area using mobile monitoring data.

Authors:  Jia Xu; Wen Yang; Zhipeng Bai; Renyi Zhang; Jun Zheng; Meng Wang; Tong Zhu
Journal:  Environ Res       Date:  2022-02-08       Impact factor: 8.431

5.  Development of Europe-Wide Models for Particle Elemental Composition Using Supervised Linear Regression and Random Forest.

Authors:  Jie Chen; Kees de Hoogh; John Gulliver; Barbara Hoffmann; Ole Hertel; Matthias Ketzel; Gudrun Weinmayr; Mariska Bauwelinck; Aaron van Donkelaar; Ulla A Hvidtfeldt; Richard Atkinson; Nicole A H Janssen; Randall V Martin; Evangelia Samoli; Zorana J Andersen; Bente M Oftedal; Massimo Stafoggia; Tom Bellander; Maciej Strak; Kathrin Wolf; Danielle Vienneau; Bert Brunekreef; Gerard Hoek
Journal:  Environ Sci Technol       Date:  2020-11-25       Impact factor: 9.028

6.  Long-Term Exposure to Fine Particle Elemental Components and Natural and Cause-Specific Mortality-a Pooled Analysis of Eight European Cohorts within the ELAPSE Project.

Authors:  Jie Chen; Sophia Rodopoulou; Kees de Hoogh; Maciej Strak; Zorana J Andersen; Richard Atkinson; Mariska Bauwelinck; Tom Bellander; Jørgen Brandt; Giulia Cesaroni; Hans Concin; Daniela Fecht; Francesco Forastiere; John Gulliver; Ole Hertel; Barbara Hoffmann; Ulla Arthur Hvidtfeldt; Nicole A H Janssen; Karl-Heinz Jöckel; Jeanette Jørgensen; Klea Katsouyanni; Matthias Ketzel; Jochem O Klompmaker; Anton Lager; Karin Leander; Shuo Liu; Petter Ljungman; Conor J MacDonald; Patrik K E Magnusson; Amar Mehta; Gabriele Nagel; Bente Oftedal; Göran Pershagen; Annette Peters; Ole Raaschou-Nielsen; Matteo Renzi; Debora Rizzuto; Evangelia Samoli; Yvonne T van der Schouw; Sara Schramm; Per Schwarze; Torben Sigsgaard; Mette Sørensen; Massimo Stafoggia; Anne Tjønneland; Danielle Vienneau; Gudrun Weinmayr; Kathrin Wolf; Bert Brunekreef; Gerard Hoek
Journal:  Environ Health Perspect       Date:  2021-04-12       Impact factor: 9.031

7.  NeuroSmog: Determining the Impact of Air Pollution on the Developing Brain: Project Protocol.

Authors:  Iana Markevych; Natasza Orlov; James Grellier; Katarzyna Kaczmarek-Majer; Małgorzata Lipowska; Katarzyna Sitnik-Warchulska; Yarema Mysak; Clemens Baumbach; Maja Wierzba-Łukaszyk; Munawar Hussain Soomro; Mikołaj Compa; Bernadetta Izydorczyk; Krzysztof Skotak; Anna Degórska; Jakub Bratkowski; Bartosz Kossowski; Aleksandra Domagalik; Marcin Szwed
Journal:  Int J Environ Res Public Health       Date:  2021-12-28       Impact factor: 3.390

8.  Assessing the health estimation capacity of air pollution exposure prediction models.

Authors:  Jenna R Krall; Joshua P Keller; Roger D Peng
Journal:  Environ Health       Date:  2022-03-17       Impact factor: 5.984

9.  Deep Ensemble Machine Learning Framework for the Estimation of PM2.5 Concentrations.

Authors:  Wenhua Yu; Shanshan Li; Tingting Ye; Rongbin Xu; Jiangning Song; Yuming Guo
Journal:  Environ Health Perspect       Date:  2022-03-07       Impact factor: 11.035

10.  The impact of measurement error in modeled ambient particles exposures on health effect estimates in multilevel analysis: A simulation study.

Authors:  Evangelia Samoli; Barbara K Butland; Sophia Rodopoulou; Richard W Atkinson; Benjamin Barratt; Sean D Beevers; Andrew Beddows; Konstantina Dimakopoulou; Joel D Schwartz; Mahdieh Danesh Yazdi; Klea Katsouyanni
Journal:  Environ Epidemiol       Date:  2020-05-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.