| Literature DB >> 32080258 |
Muhammad Rafique1, Aleem Dad Khan Tareen2, Adil Aslim Mir3, Malik Sajjad Ahmed Nadeem3, Khawaja M Asim4,5, Kimberlee Jane Kearfott6.
Abstract
We propose a new method based on the idea of delegating regressors for predicting the soil radon gas concentration (SRGC) and anomalies in radon or any other time series data. The proposed method is compared to different traditional boosting e.g., Extreme Gradient Boosting (EGB) and simple regression methods e.g., support vector regressors with linear kernel and radial kernel in terms of accurate predictions. R language has been used for the statistical analysis of radon time series (RTS) data. The results obtained show that the proposed methodology predicts SRGC more accurately when compared to different traditional boosting and regression methods. The best correlation is found between the actual and predicted radon concentration for window size of 2 i.e., two days before and after the start of seismic activities. RTS data was collected from 05 February 2017 to 16 February 2018, including 7 seismic events recorded during the study period. Findings of study show that the proposed methodology predicts the SRGC with more precision, for all the window sizes, by overlapping predicted with the actual radon time series concentrations.Entities:
Year: 2020 PMID: 32080258 PMCID: PMC7033208 DOI: 10.1038/s41598-020-59881-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Simulation Plan for radon anomaly detection using different machine learning methods.
Figure 2Basic representation of Support Vector Machine for regression[60].
Figure 3Support Vector Regressor with slacked variable[60].
Figure 4Number of measurements in seismic and non-seismic dataset with respect to window sizes ranging from 1 to 12.
Figure 5Mean correlation of actual and predicted radon concentration using different regression methods.
Root mean squared error (RMSE) of different regression methods for prediction of radon concentration in soil with respect to different window sizes.
| Win Size | Extreme Gradient Boosting (XGBoost) | Support Vector Machine Linear (SVML) | Support Vector Machine Radial (SVMR) | Delegated Regressor Method (DRM) |
|---|---|---|---|---|
| 1 | 2505.088 | 6108.028 | 4826.279 | 1809.784 |
| 2 | 2473.006 | 6149.988 | 4977.869 | 1806 |
| 3 | 2416.071 | 6225.425 | 5318.782 | 2017.899 |
| 4 | 2518.619 | 6365.508 | 6527.553 | 1927.861 |
| 5 | 2593.956 | 6531.951 | 6699.957 | 1731.9 |
| 6 | 2670.358 | 6745.238 | 7109.096 | 2479.699 |
| 7 | 2761.739 | 6945.34 | 7704.349 | 2200.264 |
| 8 | 3033.962 | 7150.594 | 9223.404 | 1991.526 |
| 9 | 3135.076 | 7223.843 | 9424.501 | 2269.731 |
| 10 | 3138.239 | 7396.725 | 9659.676 | 2291.885 |
| 11 | 3596.169 | 7568.502 | 10229.27 | 2300.619 |
| 12 | 3757.667 | 7618.878 | 10881.65 | 2667.437 |
Figure 6(a–l) Represents radon concentration for 1 through 12 days before and after earthquake with red lines actually showing the earthquake with its magnitude.
Figure 7(a–l) Actual and predicted radon concentration using delegated and other regression methods using window size of one through 12 days.