| Literature DB >> 31430307 |
Tobias Rentschler1,2, Philipp Gries1, Thorsten Behrens1, Helge Bruelheide3,4, Peter Kühn1, Steffen Seitz1, Xuezheng Shi5, Stefan Trogisch3,4, Thomas Scholten1, Karsten Schmidt1,2.
Abstract
As limited resources, soils are the largest terrestrial sinks of organic carbon. In this respect, 3D modelling of soil organic carbon (SOC) offers substantial improvements in the understanding and assessment of the spatial distribution of SOC stocks. Previous three-dimensional SOC modelling approaches usually averaged each depth increment for multi-layer two-dimensional predictions. Therefore, these models are limited in their vertical resolution and thus in the interpretability of the soil as a volume as well as in the accuracy of the SOC stock predictions. So far, only few approaches used spatially modelled depth functions for SOC predictions. This study implemented and evaluated an approach that compared polynomial, logarithmic and exponential depth functions using non-linear machine learning techniques, i.e. multivariate adaptive regression splines, random forests and support vector machines to quantify SOC stocks spatially and depth-related in the context of biodiversity and ecosystem functioning research. The legacy datasets used for modelling include profile data for SOC and bulk density (BD), sampled at five depth increments (0-5, 5-10, 10-20, 20-30, 30-50 cm). The samples were taken in an experimental forest in the Chinese subtropics as part of the biodiversity and ecosystem functioning (BEF) China experiment. Here we compared the depth functions by means of the results of the different machine learning approaches obtained based on multi-layer 2D models as well as 3D models. The main findings were (i) that 3rd degree polynomials provided the best results for SOC and BD (R2 = 0.99 and R2 = 0.98; RMSE = 0.36% and 0.07 g cm-3). However, they did not adequately describe the general asymptotic trend of SOC and BD. In this respect the exponential (SOC: R2 = 0.94; RMSE = 0.56%) and logarithmic (BD: R2 = 84; RMSE = 0.21 g cm-3) functions provided more reliable estimates. (ii) random forests with the exponential function for SOC correlated better with the corresponding 2.5D predictions (R2: 0.96 to 0.75), compared to the 3rd degree polynomials (R2: 0.89 to 0.15) which support vector machines fitted best. We recommend not to use polynomial functions with sparsely sampled profiles, as they have many turning points and tend to overfit the data on a given profile. This may limit the spatial prediction capacities. Instead, less adaptive functions with a higher degree of generalisation such as exponential and logarithmic functions should be used to spatially map sparse vertical soil profile datasets. We conclude that spatial prediction of SOC using exponential depth functions, in conjunction with random forests is well suited for 3D SOC stock modelling, and provides much finer vertical resolutions compared to 2.5D approaches.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31430307 PMCID: PMC6701766 DOI: 10.1371/journal.pone.0220881
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study area in mainland China with BEF-China plot scheme and indication of sampled plots.
Upper right panel with permission by R. Hijmans; https://gadm.org/.
Fig 2Datasets for SOC and BD used in this study summarized in boxplots.
The boxplots show the variation of the SOC and BD values for each depth increment. SOC and BD samples were taken in five depth increments and 9 cores per plot were bulked (Note that depth increments do not increase linearly). The grey lines show model depth functions (3rd degree polynomial for SOC and natural logarithmic function for BD; see subsection “3D mapping with soil depth functions”).
Fig 3Empirical cumulative distribution functions (ECDF) for SOC and BD datasets.
The ECDFs show the locations of the sampling sites in the state space of the elevation (DEM) in metres above sea level (m a.s.l.). The aim is to show the coverage of the DEM feature space by the samples. It can be seen that most samples are located in the mid-range of the elevation values. Therefore, predictions at grid locations which are only sparsely covered by the samples (i.e. locations close to the minimum and maximum values of the DEM) may be less accurate. The minimum, median and maximum values of both datasets (DEM and sampling locations) are shown with vertical lines (dashed grey: DEM, dashed black: sampling locations) to compare the full range of the respective feature spaces.
Terrain attributes used for SOC and bulk density modelling.
| Covariates | Method | Author(s) | |
|---|---|---|---|
| Local | Slope and aspect | Fitted 2nd degree polynomial | [ |
| Fitted 3rd degree polynomial | [ | ||
| Least squares fitted plane | [ | ||
| Maximum triangle slope | [ | ||
| Fitted 2nd degree polynomial | [ | ||
| Plan, profile, longitudinal, tangential and flowline curvature | Fitted 2nd degree polynomial | [ | |
| Fitted 3rd degree polynomial | [ | ||
| Fitted 2nd degree polynomial | [ | ||
| Vertical distance to channel network | [ | ||
| Sky visibility, sky view factor, direct and diffusive insolation | [ | ||
| Regional | Catchment area | Top-down | [ |
| Recursive | |||
| Combined | Topographic Wetness Index (TWI) | Any combination of slope and catchment area | [ |
| Slope length and steepness factor (LS-Factor) | Any combination of slope and catchment area | [ |
66] and select the most informative covariates without expert knowledge. Further, we omitted feature reduction.
Fig 4Flow chart summarizing the methodology steps of the 3D mapping and the used datasets at each step.
Performance of 10-fold cross-validation for MARS, RF and SVM applied on the sampled standard depths of SOC and BD.
| R2 | RMSE | ||||||
|---|---|---|---|---|---|---|---|
| depth (cm) | MARS | RF | SVM | MARS | RF | SVM | |
| SOC (%) | 0–5 | 0.28 | 0.41 | 0.37 | 0.59 | 0.48 | 0.51 |
| 0–10 | 0.25 | 0.41 | 0.42 | 0.46 | 0.4 | 0.4 | |
| 10–20 | 0.31 | 0.31 | 0.26 | 0.37 | 0.32 | 0.34 | |
| 20–30 | 0.46 | 0.47 | 0.46 | 0.3 | 0.28 | 0.29 | |
| 30–50 | 0.38 | 0.45 | 0.43 | 0.24 | 0.2 | 0.21 | |
| 0.34 | 0.41 | 0.39 | 0.39 | 0.34 | 0.35 | ||
| BD (g cm-³) | 0–5 | 0.51 | 0.53 | 0.61 | 0.07 | 0.06 | 0.06 |
| 0–10 | 0.5 | 0.52 | 0.49 | 0.07 | 0.06 | 0.06 | |
| 10–20 | 0.31 | 0.26 | 0.24 | 0.11 | 0.11 | 0.11 | |
| 20–30 | 0.41 | 0.35 | 0.33 | 0.1 | 0.1 | 0.1 | |
| 30–50 | 0.42 | 0.31 | 0.3 | 0.09 | 0.09 | 0.09 | |
| 0.43 | 0.39 | 0.39 | 0.09 | 0.08 | 0.08 | ||
Performance of a 10-fold cross-validation for MARS, RF and SVM applied on function coefficients of a 3rd degree polynomial (f1 for SOC and BD with four coefficients) and natural logarithmic function (f3 for BD with two coefficients).
| R2 | RMSE | nRMSE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MARS | RF | SVM | MARS | RF | SVM | MARS | RF | SVM | ||
| SOC (f1) | c0 | 0.29 | 0.28 | 0.26 | 0.83 | 0.75 | 0.79 | 0.20 | 0.18 | 0.19 |
| c1 | 0.36 | 0.43 | 0.46 | 0.15 | 0.13 | 0.14 | 0.22 | 0.19 | 0.20 | |
| c2 | 0.29 | 0.28 | 0.24 | 0.008 | 0.007 | 0.007 | 0.2 | 0.18 | 0.18 | |
| c3 | 0.3 | 0.21 | 0.31 | 0.0001 | 0.0001 | 0.0001 | 0.14 | 0.14 | 0.14 | |
| 0.31 | 0.3 | 0.32 | - | - | - | 0.19 | 0.17 | 0.18 | ||
| BD (f1) | c0 | 0.56 | 0.45 | 0.38 | 0.09 | 0.09 | 0.09 | 0.23 | 0.2 | 0.20 |
| c1 | 0.38 | 0.34 | 0.218 | 0.02 | 0.02 | 0.02 | 0.14 | 0.14 | 0.14 | |
| c2 | 0.38 | 0.17 | 0.26 | 0.001 | 0.001 | 0.001 | 0.2 | 0.2 | 0.2 | |
| c3 | 0.25 | 0.27 | 0.31 | 0.00002 | 0.00002 | 0.00002 | 1.3×105 | 1.2×105 | 1.2×105 | |
| 0.39 | 0.31 | 0.28 | - | - | - | 3.2×104 | 3×104 | 3×104 | ||
| BD (f3) | c1 | 0.56 | 0.48 | 0.53 | 0.09 | 0.09 | 0.09 | 0.2 | 0.18 | 0.18 |
| c2 | 0.34 | 0.24 | 0.2 | 0.03 | 0.04 | 0.04 | 0.14 | 0.19 | 0.14 | |
| 0.45 | 0.36 | 0.36 | - | - | - | 0.17 | 0.19 | 0.16 | ||
Note that coefficients dimensions are different and specifying a mean of the RMSE is not reasonable.
Coefficient of correlation (R2), Lin’s concordance correlation coefficient (ρc) and RMSE of 2.5D reference predictions and correspondent depths of 3D predictions with polynomial (f1), logarithmic (f3) and exponential (f4) depth function.
| R2 | ρc | RMSE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MARS | RF | SVM | MARS | RF | SVM | MARS | RF | SVM | ||
| SOC (%; f1) | 2.5 cm | 0.02 | 0.92 | 0.89 | -0.03 | 0.95 | 0.79 | 4.13 | 0.09 | 0.12 |
| 7.5 cm | 0 | 0.69 | 0.89 | -0.01 | 0.81 | 0.85 | 3.84 | 0.16 | 0.1 | |
| 15 cm | 0.02 | 0.41 | 0.72 | -0.03 | 0.43 | 0.47 | 2.42 | 0.37 | 0.22 | |
| 25 cm | 0 | 0.17 | 0.45 | 0 | 0.17 | 0.66 | 9.09 | 0.68 | 0.15 | |
| 40 cm | 0 | 0.07 | 0.15 | 0 | 0.04 | 0.31 | 46.96 | 1.73 | 0.33 | |
| 0.01 | 0.45 | 0.62 | -0.01 | 0.48 | 0.62 | 13.29 | 0.61 | 0.18 | ||
| SOC (%; f4) | 2.5 cm | 0 | 0.96 | 0.93 | 0 | 0.93 | 0.79 | 19.22 | 0.11 | 0.14 |
| 7.5 cm | 0.1 | 0.84 | 0.67 | 0 | 0.39 | 0.29 | 26.25 | 0.38 | 0.35 | |
| 15 cm | 0 | 0.89 | 0.88 | 0 | 0.94 | 0.93 | 29.42 | 0.06 | 0.05 | |
| 25 cm | 0.06 | 0.85 | 0.93 | 0 | 0.55 | 0.79 | 31.28 | 0.21 | 0.1 | |
| 40 cm | 0.02 | 0.88 | 0.75 | 0 | 0.31 | 0.26 | 32.71 | 0.31 | 0.3 | |
| 0.04 | 0.88 | 0.83 | 0 | 0.62 | 0.61 | 27.78 | 0.21 | 0.19 | ||
| BD (g cm-3; f3) | 2.5 cm | 0.02 | 0.94 | 0.87 | -0.05 | 0.39 | 0.53 | 0.48 | 0.09 | 0.07 |
| 7.5 cm | 0 | 0.8 | 0.71 | 0 | 0.29 | 0.24 | 0.44 | 0.09 | 0.1 | |
| 15 cm | 0.01 | 0.66 | 0.5 | -0.05 | 0.48 | 0.31 | 0.46 | 0.05 | 0.07 | |
| 25 cm | 0.01 | 0.44 | 0.57 | 0.02 | 0.59 | 0.53 | 0.43 | 0.04 | 0.04 | |
| 40 cm | 0.02 | 0.76 | 0.43 | -0.12 | 0.17 | 0.08 | 0.64 | 0.1 | 0.11 | |
| 0.01 | 0.72 | 0.62 | -0.04 | 0.38 | 0.34 | 0.49 | 0.07 | 0.08 | ||
Fig 53D predictions of sampled depth increments plotted against corresponding 2.5D predictions.
3D prediction of SOC was calculated with 3rd degree polynomials (upper row) and exponential function (middle row). The 3D prediction for BD with logarithmic function (lower row).
Internal validation results of the final 3D models with the exponential function for SOC and the logarithmic function for BD.
| SOC (%) | BD (g cm-3) | |||
|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | |
| 2.5 cm | 0.88 | 0.32 | 0.87 | 0.29 |
| 7.5 cm | 0.74 | 0.47 | 0.85 | 0.07 |
| 15 cm | 0.77 | 0.24 | 0.72 | 0.12 |
| 25 cm | 0.76 | 0.29 | 0.74 | 0.08 |
| 40 cm | 0.8 | 0.31 | 0.66 | 0.12 |
| 0.79 | 0.33 | 0.77 | 0.14 | |
Fig 6Three-dimensional prediction of SOC stocks for the whole catchment.
The final 3D SOC stock model is shown in vertical slices 150 m apart to display the vertical variability, which is larger than the spatial variability.