Meng Wang, Bert Brunekreef, Ulrike Gehring, Adam Szpiro, Gerard Hoek, Rob Beelen. From the (a) Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands; (b) Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA; (c) Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands; and (d) Department of Biostatistics, University of Washington, Seattle, WA.
Abstract
BACKGROUND: Leave-one-out cross-validation that fails to account for variable selection does not properly reflect prediction accuracy when the number of training sites is small. The impact on health effect estimates has rarely been studied. The objective of this study was to develop an improved validation procedure for land-use regression models with variable selection and to investigate health effect estimates in relation to land-use regression model performance.
METHODS: We randomly generated 10 training and test sets for nitrogen dioxide and particulate matter. For each training set, we developed models and evaluated them using a cross-holdout validation approach. Cross-holdout validation develops a new model, including variable selection, for each evaluation, whereas standard leave-one-out cross-validation only refits the model without repeating variable selection. We also implemented holdout validation, which evaluates model predictions using independent test sets. We evaluated the relationship between the cross-holdout validation and holdout validation R² values and estimates of the association between air pollution and forced vital capacity in the Dutch birth cohort.
RESULTS: Cross-holdout validation R² values were generally identical to holdout validation R² values, but were notably smaller than leave-one-out cross-validation R² values. Decreases in forced vital capacity in relation to air pollution exposure were larger for land-use regression models that had larger holdout validation and cross-holdout validation R² values, rather than larger leave-one-out cross-validation R² values.
CONCLUSION: Cross-holdout validation accurately reflects the predictive ability of land-use regression models and is a useful validation approach for small datasets. In a case study, land-use regression predictive ability in terms of holdout validation and cross-holdout validation, rather than leave-one-out cross-validation, was associated with the magnitude of health effect estimates.
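The contrast between the two validation schemes described in METHODS can be illustrated with a minimal sketch. It is not the authors' procedure. It assumes that one site is held out at a time, and it replaces the land-use regression variable selection with a deliberately simple stand-in rule (pick the single predictor with the highest in-sample R²). The function names, the synthetic data, and the site and predictor counts are all hypothetical.

```python
import random

def fit_line(x, y):
    """Least-squares intercept and slope for a single predictor."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def r2(obs, pred):
    """1 - SS_res / SS_tot; can be negative for out-of-sample predictions."""
    mean = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean) ** 2 for o in obs)
    return 1 - ss_res / ss_tot

def select_predictor(X, y):
    """Stand-in variable selection: column index with the highest in-sample R2."""
    def in_sample_r2(j):
        col = [row[j] for row in X]
        a, b = fit_line(col, y)
        return r2(y, [a + b * v for v in col])
    return max(range(len(X[0])), key=in_sample_r2)

def loocv_r2(X, y, j):
    """Standard LOOCV: predictor j stays fixed, only the coefficients are refit."""
    preds = []
    for i in range(len(y)):
        x_train = [row[j] for k, row in enumerate(X) if k != i]
        y_train = [v for k, v in enumerate(y) if k != i]
        a, b = fit_line(x_train, y_train)
        preds.append(a + b * X[i][j])
    return r2(y, preds)

def cross_holdout_r2(X, y):
    """Cross-holdout: variable selection is repeated without the held-out site."""
    preds = []
    for i in range(len(y)):
        X_train = [row for k, row in enumerate(X) if k != i]
        y_train = [v for k, v in enumerate(y) if k != i]
        j = select_predictor(X_train, y_train)
        a, b = fit_line([row[j] for row in X_train], y_train)
        preds.append(a + b * X[i][j])
    return r2(y, preds)

# Synthetic example (assumed sizes): few sites, many candidate predictors.
random.seed(0)
n_sites, n_predictors = 20, 30
X = [[random.gauss(0, 1) for _ in range(n_predictors)] for _ in range(n_sites)]
y = [0.5 * row[0] + random.gauss(0, 1) for row in X]

j = select_predictor(X, y)   # selection on all sites, as in standard practice
loo = loocv_r2(X, y, j)      # selection is not repeated inside the loop
chv = cross_holdout_r2(X, y) # selection is repeated for every held-out site
print(f"selected predictor {j}: LOOCV R2 = {loo:.2f}, cross-holdout R2 = {chv:.2f}")
```

In the LOOCV function the held-out site has already influenced which predictor was chosen, which is the source of optimism the abstract describes. In the cross-holdout function the held-out site plays no part in either selection or fitting.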