| Literature DB >> 31613923 |
Wei Wu1, Hong-Bin Liu2.
Abstract
Soil pH is a critical soil quality index and controls soil microbial activities, soil nutrient availability, and plant roots growth and development. The current study aims to evaluate various pedotransfer functions for predicting soil pH using different geochemical indices (CaO, ratios of Al2O3, Fe2O3, TiO2, SiO2, MgO, and K2O to CaO) in forest soils. Various models including empirical functions (quadratic, cubic, sigmoid, logarithmic) and artificial neural network with these geochemical indices were assessed by independent testing set. Mean bias error (MBE), root mean square error (RMSE), mean absolute percentage error (MAPE), mean absolute error (MAE), coefficient of determination (R2), t-statistics (t-stat), and Akaike's Information Criterion (AIC) were applied to evaluate the model performances. Additionally, a new indicator (global performance indictor, GPI) was originally introduced in this study and was used to rank these models. According to GPI, the sigmoid functions and ANNs performed better than others. On average, they could explain above 70% of the variability in soil pH. Both model structure and dataset shape impact on model performance. The best input was CaO for ANNs, sigmoid, and logarithmic functions. The ratios of K2O to CaO and Al2O3 to CaO were the best inputs for quadratic and cubic equations, respectively.Entities:
Year: 2019 PMID: 31613923 PMCID: PMC6793886 DOI: 10.1371/journal.pone.0223764
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Maps of study area location and sample sites.
Empirical models used in the current study.
| Name | Ab. | Equation |
|---|---|---|
| Q | y = | |
| C | y = | |
| Sig | ||
| Log | y = |
ak and p are the minimum and range of the response, respectively.
Fig 2ANN structure.
Descriptive statistics of soil pH and geochemical indices (N = 1163).
| Min | Max | Median | Mean | Std. Dev | CV% | |
|---|---|---|---|---|---|---|
| 4.34 | 8.7 | 7.46 | 7.16 | 1.09 | 15.22 | |
| 0.08 | 29.98 | 1.10 | 2.63 | 4.05 | 153.77 | |
| 24.62 | 99.54 | 93.09 | 87.55 | 13.59 | 15.52 | |
| 10.09 | 98.53 | 82.67 | 75.48 | 19.05 | 25.24 | |
| 1.35 | 91.00 | 39.71 | 39.57 | 21.32 | 53.86 | |
| 48.65 | 99.87 | 98.31 | 95.71 | 6.93 | 7.24 | |
| 4.46 | 92.26 | 61.57 | 55.87 | 17.23 | 30.84 | |
| 5.73 | 95.99 | 70.33 | 63.4 | 21.35 | 33.67 |
Fig 3Histogram plots for the geochemical elements.
Pearson’s correlation coefficients between soil pH and geochemical indices (p<0.01).
| CaO (%) | AlCa (%) | FeCa (%) | TiCa (%) | SiCa (%) | MgCa (%) | KCa (%) |
|---|---|---|---|---|---|---|
| 0.5 | -0.61 | -0.68 | -0.83 | -0.49 | -0.71 | -0.76 |
Fig 4Relationships between soil pH and the geochemical indices.
Differences in soil pH and geochemical indices between calibration and validation sets (N = 877 and 286 for calibration (Cal) and validation (Val) sets, respectively.).
| Item | Min | Max | Median | Mean | Std.Dev | F | p value | |
|---|---|---|---|---|---|---|---|---|
| Cal | 4.52 | 8.6 | 7.41 | 7.14 | 1.09 | 0.954 | 0.329 | |
| Val | 4.34 | 8.7 | 7.57 | 7.21 | 1.08 | |||
| Cal | 0.11 | 29.98 | 1.10 | 2.62 | 4.04 | 0.014 | 0.904 | |
| Val | 0.08 | 24.74 | 1.11 | 2.66 | 4.08 | |||
| Cal | 24.62 | 99.03 | 93.18 | 87.57 | 13.67 | 0.008 | 0.929 | |
| Val | 31.14 | 99.54 | 93.07 | 87.48 | 13.38 | |||
| Cal | 10.09 | 97.46 | 82.67 | 75.55 | 19.12 | 0.048 | 0.827 | |
| Val | 13.79 | 98.53 | 82.67 | 75.26 | 18.85 | |||
| Cal | 1.35 | 85.70 | 39.71 | 39.65 | 21.25 | 0.050 | 0.823 | |
| Val | 1.82 | 91.00 | 39.7 | 39.33 | 21.54 | |||
| Cal | 48.65 | 99.87 | 98.31 | 95.72 | 6.90 | 0.015 | 0.902 | |
| Val | 57.80 | 99.87 | 98.3 | 95.66 | 7.00 | |||
| Cal | 4.46 | 91.72 | 61.73 | 56.03 | 0.58 | 0.284 | 0.594 | |
| Val | 6.83 | 92.26 | 61.07 | 55.4 | 1.05 | |||
| Cal | 5.73 | 94.31 | 70.2 | 63.52 | 0.72 | 0.105 | 0.746 | |
| Val | 7.08 | 95.99 | 70.66 | 63.04 | 1.27 |
Model calibration (N = 877, p<0.01).
| Input | Function | b0 | b1 | b2 | b3 | R2 |
|---|---|---|---|---|---|---|
| Quadratic | 6.4103 | 0.4148 | -0.0156 | 0.43 | ||
| Cubic | 6.0427 | 0.8099 | -0.0684 | 0.0016 | 0.56 | |
| Sigmoid | 0.6823 | 1.3914 | 0.74 | |||
| Logarithmic | 6.9224 | 0.7888 | 0.64 | |||
| ANN | 0.76 | |||||
| Quadratic | 1.281 | 0.2493 | -0.002 | 0.59 | ||
| Cubic | 24.0871 | -0.8736 | 0.0151 | -0.00008 | 0.71 | |
| Sigmoid | 94.9733 | 31.04 | 0.73 | |||
| Logarithmic | 20.2979 | -2.9535 | 0.3 | |||
| ANN | 0.77 | |||||
| Quadratic | 6.3399 | 0.1023 | -0.0011 | 0.65 | ||
| Cubic | 9.7379 | -0.1244 | 0.0031 | -0.00002 | 0.69 | |
| Sigmoid | 87.0113 | 10.6616 | 0.67 | |||
| Logarithmic | 14.689 | -1.7664 | 0.34 | |||
| ANN | 0.7 | |||||
| Quadratic | 8.6033 | -0.0276 | -0.00018 | 0.69 | ||
| Cubic | 8.2332 | 0.02226 | -0.00167 | 0.000012 | 0.7 | |
| Sigmoid | 51.1643 | 2.465 | 0.7 | |||
| Logarithmic | 10.4478 | -0.9595 | 0.51 | |||
| ANN | 0.72 | |||||
| Quadratic | -22.1396 | 0.8114 | -0.0053 | 0.42 | ||
| Cubic | -2.4129 | 0 | 0.0055 | -0.000046 | 0.45 | |
| Sigmoid | 98.8379 | 120.5882 | 0.73 | |||
| Logarithmic | 34.5359 | -6.0105 | 0.21 | |||
| ANN | 0.77 | |||||
| Quadratic | 7.6946 | 0.0516 | -0.001 | 0.6 | ||
| Cubic | 7.4604 | 0.0718 | -0.0015 | 3.352E-6 | 0.6 | |
| Sigmoid | 67.2264 | 5.7029 | 0.6 | |||
| Logarithmic | 13.2347 | -1.5416 | 0.37 | |||
| ANN | 0.61 | |||||
| Quadratic | 7.4643 | 0.0595 | -0.0009 | 0.73 | ||
| Cubic | 8.1355 | 0.0045 | 0.0003 | -7.469E-6 | 0.73 | |
| Sigmoid | 77.0693 | 7.1629 | 0.73 | |||
| Logarithmic | 12.8597 | -1.4093 | 0.41 | |||
| ANN | 0.76 |
aBox in grey denoted the highest value of R2.
Fig 5Root mean square error (RMSE) and coefficient of determination (R2) for ANNs with different numbers of hidden nodes (The black box indicates the lowest value of RMSE or highest value of R2).
Model performance (N = 286).
| Fun. | Input | MBE | RMSE | MAPE | MAE | R2 | AIC | t-stat | GPI | Rank |
|---|---|---|---|---|---|---|---|---|---|---|
| AlCa | 0.014 | 0.514 | 0.054 | 0.371 | 0.78 | -0.446 | 0.472 | 1.01 | 2 | |
| FeCa | 0.007 | 0.587 | 0.064 | 0.439 | 0.71 | -0.226 | 0.207 | 2.84 | 5 | |
| SiCa | 0.024 | 0.508 | 0.054 | 0.365 | 0.78 | -0.455 | 0.794 | 1.21 | 3 | |
| TiCa | 0.039 | 0.565 | 0.061 | 0.414 | 0.73 | -0.195 | 1.165 | 3.3 | 6 | |
| MgCa | 0.034 | 0.679 | 0.075 | 0.507 | 0.61 | -0.362 | 0.847 | 5.52 | 7 | |
| KCa | 0.069 | 0.534 | 0.057 | 0.393 | 0.76 | -0.925 | 2.218 | 2.6 | 4 | |
| CaO | 0.014 | 0.512 | 0.054 | 0.367 | 0.78 | -0.557 | 0.476 | 0.81 | 1 | |
| AlCa | 0.047 | 0.698 | 0.085 | 0.589 | 0.59 | -0.692 | 1.132 | 3.08 | 4 | |
| FeCa | 0.04 | 0.638 | 0.074 | 0.514 | 0.65 | -0.87 | 1.059 | 1.55 | 2 | |
| SiCa | 0.068 | 0.825 | 0.104 | 0.71 | 0.42 | -0.356 | 1.389 | 6.48 | 7 | |
| TiCa | 0.061 | 0.591 | 0.065 | 0.446 | 0.71 | -1.024 | 1.743 | 2.33 | 3 | |
| MgCa | 0.058 | 0.694 | 0.077 | 0.527 | 0.5 | -0.702 | 1.412 | 3.72 | 5 | |
| KCa | 0.052 | 0.556 | 0.061 | 0.42 | 0.74 | -1.147 | 1.579 | 1.19 | 1 | |
| CaO | 0.066 | 0.815 | 0.102 | 0.7 | 0.44 | -0.389 | 1.377 | 6.19 | 6 | |
| AlCa | 0.036 | 0.579 | 0.067 | 0.465 | 0.72 | -1.056 | 1.043 | 1.21 | 1 | |
| FeCa | 0.046 | 0.595 | 0.065 | 0.447 | 0.7 | -1.004 | 1.308 | 2.23 | 3 | |
| SiCa | 0.051 | 0.721 | 0.089 | 0.61 | 0.56 | -0.619 | 1.206 | 6 | 7 | |
| TiCa | 0.051 | 0.584 | 0.063 | 0.431 | 0.71 | -1.04 | 1.484 | 2.42 | 4 | |
| MgCa | 0.056 | 0.694 | 0.077 | 0.529 | 0.59 | -0.695 | 1.37 | 5.26 | 5 | |
| KCa | 0.054 | 0.543 | 0.058 | 0.403 | 0.75 | -1.185 | 1.689 | 1.9 | 2 | |
| CaO | 0.048 | 0.71 | 0.088 | 0.601 | 0.57 | -0.658 | 1.151 | 5.49 | 6 | |
| AlCa | 0.072 | 0.919 | 0.117 | 0.797 | 0.28 | -0.148 | 1.326 | 5.88 | 6 | |
| FeCa | 0.07 | 0.885 | 0.112 | 0.765 | 0.34 | -0.223 | 1.343 | 5.33 | 5 | |
| SiCa | 0.068 | 0.976 | 0.125 | 0.85 | 0.19 | -0.027 | 1.179 | 6.32 | 7 | |
| TiCa | 0.063 | 0.769 | 0.095 | 0.654 | 0.5 | -0.505 | 1.4 | 3.43 | 3 | |
| MgCa | 0.046 | 0.861 | 0.108 | 0.736 | 0.61 | -0.278 | 0.903 | 2.75 | 2 | |
| KCa | 0.059 | 0.845 | 0.106 | 0.726 | 0.4 | -0.317 | 1.175 | 4.01 | 4 | |
| CaO | 0.056 | 0.642 | 0.076 | 0.53 | 0.65 | -0.873 | 1.486 | 1.38 | 1 | |
| AlCa | -0.065 | 0.557 | 0.063 | 0.435 | 0.75 | -1.135 | 1.988 | 2.68 | 4 | |
| FeCa | -0.043 | 0.625 | 0.072 | 0.494 | 0.68 | -0.904 | 1.161 | 4.06 | 6 | |
| SiCa | -0.072 | 0.55 | 0.062 | 0.429 | 0.76 | -1.159 | 2.239 | 2.69 | 5 | |
| TiCa | 0.003 | 0.593 | 0.066 | 0.453 | 0.7 | -1.012 | 0.085 | 1.89 | 3 | |
| MgCa | -0.005 | 0.701 | 0.079 | 0.541 | 0.59 | -0.677 | 0.125 | 5.05 | 7 | |
| KCa | -0.027 | 0.564 | 0.062 | 0.429 | 0.74 | -1.11 | 0.8 | 1.67 | 2 | |
| CaO | 0.033 | 0.533 | 0.057 | 0.397 | 0.76 | -1.231 | 1.045 | 0.88 | 1 | |
| Q | 0.056 | 0.688 | 0.081 | 0.558 | 0.58 | -0.740 | 1.384 | 4.16 | 4 | |
| Cubic | 0.049 | 0.632 | 0.072 | 0.498 | 0.66 | -0.894 | 1.322 | 2.79 | 3 | |
| Log | 0.062 | 0.842 | 0.106 | 0.723 | 0.42 | -0.339 | 1.259 | 6.75 | 5 | |
| Sig | -0.025 | 0.589 | 0.066 | 0.454 | 0.71 | -1.033 | 1.063 | 0.82 | 1 | |
| ANN | 0.029 | 0.557 | 0.060 | 0.408 | 0.73 | -0.452 | 0.883 | 0.93 | 2 | |
| Overall mean | 0.034 | 0.629 | 0.074 | 0.507 | 0.59 | -0.566 | 1.184 | |||
Box in grey presented the best performance suggested by the corresponding error indicator.
Fig 6Scatter plot of the observed and predicted soil pH by (a) artificial neural network with CaO and (b) sigmoid with CaO.
The red dash line is the 1:1 line.
Statistics of validation results (N = 286).
| pH | Min | Max | Median | Mean | Std.Dev | F | p value |
|---|---|---|---|---|---|---|---|
| 4.34 | 8.7 | 7.57 | 7.21 | 1.08 | |||
| 4.87 | 8.36 | 7.35 | 7.2 | 0.98 | 0.028 | 0.868 | |
| 4.72 | 8.57 | 7.22 | 7.18 | 1 | 0.143 | 0.706 |