Literature DB >> 29554737

Machine learning methods as a tool to analyse incomplete or irregularly sampled radon time series data.

M Janik1, P Bossew2, O Kurihara3.   

Abstract

Machine learning is a class of statistical techniques which has proven to be a powerful tool for modelling the behaviour of complex systems, in which response quantities depend on assumed controls or predictors in a complicated way. In this paper, as our first purpose, we propose the application of machine learning to reconstruct incomplete or irregularly sampled data of time series indoor radon (222Rn). The physical assumption underlying the modelling is that Rn concentration in the air is controlled by environmental variables such as air temperature and pressure. The algorithms "learn" from complete sections of multivariate series, derive a dependence model and apply it to sections where the controls are available, but not the response (Rn), and in this way complete the Rn series. Three machine learning techniques are applied in this study, namely random forest, its extension called the gradient boosting machine and deep learning. For a comparison, we apply the classical multiple regression in a generalized linear model version. Performance of the models is evaluated through different metrics. The performance of the gradient boosting machine is found to be superior to that of the other techniques. By applying learning machines, we show, as our second purpose, that missing data or periods of Rn series data can be reconstructed and resampled on a regular grid reasonably, if data of appropriate physical controls are available. The techniques also identify to which degree the assumed controls contribute to imputing missing Rn values. Our third purpose, though no less important from the viewpoint of physics, is identifying to which degree physical, in this case environmental variables, are relevant as Rn predictors, or in other words, which predictors explain most of the temporal variability of Rn. We show that variables which contribute most to the Rn series reconstruction, are temperature, relative humidity and day of the year. The first two are physical predictors, while "day of the year" is a statistical proxy or surrogate for missing or unknown predictors.
Copyright © 2018 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Environment; Linear regression; Machine learning; Neural network; Radon; Sensitivity analysis

Year:  2018        PMID: 29554737     DOI: 10.1016/j.scitotenv.2018.02.233

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963


  4 in total

1.  Automated classification platform for the identification of otitis media using optical coherence tomography.

Authors:  Guillermo L Monroy; Jungeun Won; Roshan Dsouza; Paritosh Pande; Malcolm C Hill; Ryan G Porter; Michael A Novak; Darold R Spillman; Stephen A Boppart
Journal:  NPJ Digit Med       Date:  2019-03-28

2.  Radiological Assessment of Indoor Radon and Thoron Concentrations and Indoor Radon Map of Dwellings in Mashhad, Iran.

Authors:  Mohammademad Adelikhah; Amin Shahrokhi; Morteza Imani; Stanislaw Chalupnik; Tibor Kovács
Journal:  Int J Environ Res Public Health       Date:  2020-12-28       Impact factor: 3.390

3.  Imputation by feature importance (IBFI): A methodology to envelop machine learning method for imputing missing patterns in time series data.

Authors:  Adil Aslam Mir; Kimberlee Jane Kearfott; Fatih Vehbi Çelebi; Muhammad Rafique
Journal:  PLoS One       Date:  2022-01-13       Impact factor: 3.240

Review 4.  Development of a Geogenic Radon Hazard Index-Concept, History, Experiences.

Authors:  Peter Bossew; Giorgia Cinelli; Giancarlo Ciotoli; Quentin G Crowley; Marc De Cort; Javier Elío Medina; Valeria Gruber; Eric Petermann; Tore Tollefsen
Journal:  Int J Environ Res Public Health       Date:  2020-06-10       Impact factor: 3.390

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.