| Literature DB >> 28755002 |
Qiquan Li1, Changquan Wang2, Tianfei Dai1,3, Wenjiao Shi4,5, Xin Zhang1, Yi Xiao1, Weiping Song6, Bing Li1, Yongdong Wang1.
Abstract
A suitable method and appropriate environmental variables are important for accurately predicting heavy metal distribution in soils. However, the classical methods (e.g., ordinary kriging (OK)) have a smoothing effect that results in a tendency to neglect local variability, and the commonly used environmental variables (e.g., terrain factors) are ineffective for improving predictions across plains. Here, variables were derived from the obvious factors affecting soil cadmium (Cd), such as road traffic, and were used as auxiliary variables for a combined method (HASM_RBFNN) that was developed using high accuracy surface modelling (HASM) and radial basis function neural network (RBFNN) model. This combined method was then used to predict soil Cd distribution in a typical area of Chengdu Plain in China, considering the spatial non-stationarity of the relationships between soil Cd and the derived variables based on 339 surface soil samples. The results showed that HASM_RBFNN had lower prediction errors than OK, regression kriging (RK) and HASM_RBFNNs, which didn't consider the spatial non-stationarity of the soil Cd-derived variables relationships. Furthermore, HASM_RBFNN provided improved detail on local variations. The better performance suggested that the derived environmental variables were effective and HASM_RBFNN was appropriate for improving the prediction of soil Cd distribution across plains.Entities:
Year: 2017 PMID: 28755002 PMCID: PMC5533786 DOI: 10.1038/s41598-017-07690-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Relationships between soil Cd content and distance to the Minjiang River (a), MODIS NDVI (b), the distances to primary (c) and secondary (d) grade roads, and the density of primary (e) and secondary (f) grade roads.
Results of regression analysis using different factors as independent variables.
| Factors | Regression equation |
|
|
|
|---|---|---|---|---|
| Distance to the Minjiang River | Y = −0.039 ln (X) + 0.306 | 0.220 | 94.908 | <0.01 |
| MODIS NDVI | Y = −0.423X + 0.451 | 0.128 | 49.354 | <0.01 |
| Density of primary roads | Y = 0.009X + 0.211 | 0.014 | 4.931 | 0.027 |
| Density of secondary roads | Y = 0.005X + 0.202 | 0.057 | 20.277 | <0.01 |
Relationships between soil Cd content and the factors in different subareas.
| Subareas | Sample number | Distance to the Minjiang River | MODIS NDVI | Density of primary roads | Density of secondary roads |
|---|---|---|---|---|---|
| Within 10 km of the Minjiang River | 84 | −0.44** | −0.64** | 0.25* | 0.43** |
| Beyond 10 km of the Minjiang River | 189 | −0.02 | −0.12* | 0.14** | −0.01 |
Prediction errors of the different methods for the independent validation points. MAE, mean absolute error; RMSE, root mean square error; MRE, mean relative error; OK, ordinary kriging; RK, regression kriging; HASM_RBFNN, the combined method (HASM_RBFNN) developed using high-accuracy surface modelling (HASM) and radial basis function neural network (RBFNN) modelling, taking into account the spatial non-stationarity of the relationships between soil Cd and the auxiliary variables; HASM_ RBFNNs, the combined method (HASM_RBFNN), without taking into account the spatial non-stationarity of the relationships.
| Methods | Sample number | MAE | RMSE | MRE |
|---|---|---|---|---|
| HASM_RBFNN | 66 | 0.034 | 0.042 | 15.715 |
| HASM_RBFNNs | 66 | 0.036 | 0.045 | 16.622 |
| RK | 66 | 0.037 | 0.046 | 17.746 |
| OK | 66 | 0.040 | 0.051 | 18.083 |
Figure 2The spatial distribution maps by HASM_RBFNN (a), HASM_RBFNNs (b), RK (c) and OK (d). (OK, ordinary kriging; RK, regression kriging; HASM_RBFNN, the combined method (HASM_RBFNN) developed using high-accuracy surface modelling (HASM) and radial basis function neural network (RBFNN) modelling, taking into account the spatial non-stationarity of the relationships between soil Cd and the auxiliary variables; HASM_RBFNNs, the combined method (HASM_RBFNN), without taking into account the spatial non-stationarity of the relationships.). All the maps were generated in ArcGIS10.1, http://www.esrichina-bj.cn/softwareproduct/ArcGIS/.
Semivariogram parameters of soil Cd and the regression residuals.
| Variables | Models | Nugget (C0) | Sill (C0 + C) | Nugget/sillC0/(C0 + C) | Range (km) | R2 |
|---|---|---|---|---|---|---|
| Soil Cd | Exponential | 0.038 | 0.087 | 0.437 | 25.2 | 0.968 |
| Regression residuals | Gaussian | 0.0075 | 0.017 | 0.441 | 20.7 | 0.913 |
Figure 3The location of the study area in Sichuan Province (a), the digital elevation model (DEM) and the soil sample distribution in the study area (b), and the spatial distribution of roads and the Minjiang River (c). All the maps were generated in ArcGIS10.1, http://www.esrichina-bj.cn/softwareproduct/ArcGIS/.