| Literature DB >> 35408363 |
Sebastian Semella1, Christopher Hutengs1,2,3, Michael Seidel1,2, Mathias Ulrich1, Birgit Schneider4, Malte Ortner5, Sören Thiele-Bruhn5, Bernard Ludwig6, Michael Vohland1,2,3.
Abstract
Soil spectroscopy in the visible-to-near infrared (VNIR) and mid-infrared (MIR) is a cost-effective method to determine the soil organic carbon content (SOC) based on predictive spectral models calibrated to analytical-determined SOC reference data. The degree to which uncertainty in reference data and spectral measurements contributes to the estimated accuracy of VNIR and MIR predictions, however, is rarely addressed and remains unclear, in particular for current handheld MIR spectrometers. We thus evaluated the reproducibility of both the spectral reflectance measurements with portable VNIR and MIR spectrometers and the analytical dry combustion SOC reference method, with the aim to assess how varying spectral inputs and reference values impact the calibration and validation of predictive VNIR and MIR models. Soil reflectance spectra and SOC were measured in triplicate, the latter by different laboratories, for a set of 75 finely ground soil samples covering a wide range of parent materials and SOC contents. Predictive partial least-squares regression (PLSR) models were evaluated in a repeated, nested cross-validation approach with systematically varied spectral inputs and reference data, respectively. We found that SOC predictions from both VNIR and MIR spectra were equally highly reproducible on average and similar to the dry combustion method, but MIR spectra were more robust to calibration sample variation. The contributions of spectral variation (ΔRMSE < 0.4 g·kg-1) and reference SOC uncertainty (ΔRMSE < 0.3 g·kg-1) to spectral modeling errors were small compared to the difference between the VNIR and MIR spectral ranges (ΔRMSE ~1.4 g·kg-1 in favor of MIR). For reference SOC, uncertainty was limited to the case of biased reference data appearing in either the calibration or validation. Given better predictive accuracy, comparable spectral reproducibility and greater robustness against calibration sample selection, the portable MIR spectrometer was considered overall superior to the VNIR instrument for SOC analysis. Our results further indicate that random errors in SOC reference values are effectively compensated for during model calibration, while biased SOC calibration data propagates errors into model predictions. Reference data uncertainty is thus more likely to negatively impact the estimated validation accuracy in soil spectroscopy studies where archived data, e.g., from soil spectral libraries, are used for model building, but it should be negligible otherwise.Entities:
Keywords: Monte Carlo cross-validation; dry combustion; mid-infrared; partial least-squares regression; portable; ring trial; soil organic carbon; spectroscopy; uncertainty; visible-to-near infrared
Mesh:
Substances:
Year: 2022 PMID: 35408363 PMCID: PMC9003508 DOI: 10.3390/s22072749
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Soil sampling sites in western Rhineland-Palatinate, Germany, stratified for different geological origins, with individual sampling locations (n = 75) illustrated in panels A–C.
Measurement and instrument parameters for collected VNIR and MIR series.
| Set | Instrument | Co-Added Scans | Spectral Resolution | Sampling Interval |
|---|---|---|---|---|
| VNIR1 | ASD FieldSpec 4 | 2 × 75 | 3 nm at 700 nm | 1.4 nm (350–1000 nm) |
| VNIR2 | ||||
| VNIR3 | ||||
| MIR1 | Agilent 4300 | 2 × 64 | 4 cm−1 | 1.86 cm−1 (4000–650 cm−1) |
| MIR2 | ||||
| MIR3 |
Figure 2Processing scheme of the repeated cross-validation (CV) approach to study uncertainties from (a) spectral measurements and (b) reference SOC data.
Summary statistics of SOC measurements (g kg−1) from three different laboratories and their average (n = 75); Q1 = first Quartile, Q3 = third quartile, SD = standard deviation.
| Minimum | Q1 | Median | Q3 | Maximum | Mean | SD | Skewness | |
|---|---|---|---|---|---|---|---|---|
|
| 6.16 | 11.16 | 14.50 | 23.03 | 35.06 | 17.01 | 7.73 | 0.52 |
|
| 6.00 | 10.88 | 14.38 | 22.79 | 35.28 | 16.91 | 7.84 | 0.54 |
|
| 6.37 | 11.37 | 14.88 | 24.27 | 36.26 | 17.60 | 8.06 | 0.52 |
|
| 6.18 | 11.12 | 14.59 | 23.36 | 35.54 | 17.17 | 7.87 | 0.53 |
Comparison of agreement between SOC measurements (g·kg−1) from three different laboratories and their average (n = 75); bias refers to the mean difference between row and column measurement series.
| RMSE | Bias | R2 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Lab1 | Lab2 | Lab3 | LabAVG | Lab1 | Lab2 | Lab3 | LabAVG | Lab1 | Lab2 | Lab3 | LabAVG | |
|
| – | 0.36 | 0.78 | 0.30 | – | 0.10 | −0.59 | −0.16 | – | 0.998 | 0.997 | 0.999 |
|
| – | 0.80 | 0.32 | – | −0.69 | −0.26 | – | 0.998 | 0.999 | |||
|
| – | 0.52 | – | 0.43 | – | 0.999 |
Figure 3VNIR and MIR spectra of the collected soil samples (n = 75) in light gray showing the total spectral variation across the sampled soil data; the mean spectrum (black) and its associated ±2 standard deviation (SD) envelope (dark gray) illustrate prominent absorption features in the spectra. Right-hand panel shows the corresponding location of the spectra in the principal component feature space, color-coded by underlying geologic strata.
Summary of spectral repeatability (S) analysis for replicate VNIR and MIR measurement series 1–3; Q1 = first quartile, Q3 = third quartile, SD = standard deviation, CV% = coefficient of variation.
| VNIR | MIR | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Q1 | Mean | Q3 | SD | CV% | Q1 | Mean | Q3 | SD | CV% | |
|
| 7499 | 13,523 | 17,485 | 8855 | 65.5 | 568 | 1350 | 1751 | 932 | 69.0 |
|
| 9201 | 13,944 | 17,525 | 6907 | 49.5 | 983 | 1766 | 2404 | 895 | 50.7 |
|
| 7533 | 14,797 | 19,355 | 8695 | 58.8 | 632 | 1165 | 1542 | 671 | 57.6 |
|
| 18,023 | 32,187 | 39,730 | 22,100 | 68.7 | 1357 | 2729 | 3782 | 1775 | 65.0 |
|
| 17,967 | 32,351 | 44,361 | 19,811 | 61.2 | 1363 | 2211 | 2941 | 1097 | 49.6 |
|
| 14,440 | 27,615 | 39,002 | 17,805 | 64.5 | 1573 | 2308 | 2849 | 1116 | 48.4 |
Validation accuracy of VNIR and MIR SOC models for the ‘best-case’ scenario with averaged spectral data from all measurement series (VNIRAVG and MIRAVG) and averaged SOC reference data from all three laboratories. Statistics represent average values of 100 randomized runs of the nested cross-validation approach with two standard deviations of their respective distributions given in parentheses. RPD (ratio of performance to deviation) and RPIQ (ratio of performance to interquartile range) scores represent the ratios of reference SOC standard deviation and interquartile range, respectively, to the RMSE of the predicted SOC values.
| RMSE (g·kg−1) | R2 | Bias (g·kg−1) | RPD | RPIQ | |
|---|---|---|---|---|---|
|
| 2.57 (±0.50) | 0.89 (±0.04) | 0.12 (±0.39) | 3.04 (±0.59) | 4.67 (±0.91) |
|
| 1.12 (±0.16) | 0.98 (±0.01) | 0.25 (±0.14) | 7.01 (±0.99) | 10.65 (±1.50) |
Figure 4Pooled validation results of VNIR and MIR SOC estimates for the ‘best-case’ scenario with averaged spectral data from all measurement series (VNIRAVG and MIRAVG) and averaged SOC reference data from all three laboratories. Red points correspond to the runs with average RMSE values to illustrate typical errors of VNIR- or MIR-predicted SOC values. Transparent gray points represent predictions from all 100 cross-validation runs, indicating the predictive uncertainty associated with each data point.
Validation accuracy (RMSE in g·kg−1) of VNIR and MIR SOC models for different combinations of calibration and validation (prediction) spectra using the laboratory-average SOC (LabAVG) as reference data. Statistics represent average RMSE values of 100 randomized runs of the nested cross-validation approach with two standard deviations of their respective distributions given in parentheses.
| Calibration Spectra * | Validation Spectra | |||||||
|---|---|---|---|---|---|---|---|---|
| VNIR1 | VNIR2 | VNIR3 | VNIRAVG | MIR1 | MIR2 | MIR3 | MIRAVG | |
|
| 2.91 (±0.44) | 2.77 (±0.48) | 2.83 (±0.44) | 2.75 (±0.45) | 1.36 (±0.19) | 1.39 (±0.16) | 1.34 (±0.13) | 1.20 (±0.15) |
|
| 2.73 (±0.49) | 2.65 (±0.51) | 2.71 (±0.47) | 2.58 (±0.50) | 1.43 (±0.21) | 1.48 (±0.21) | 1.30 (±0.21) | 1.25 (±0.20) |
|
| 2.91 (±0.45) | 2.84 (±0.47) | 2.86 (±0.49) | 2.78 (±0.47) | 1.47 (±0.18) | 1.38 (±0.19) | 1.45 (±0.17) | 1.30 (±0.16) |
|
| 2.77 (±0.49) | 2.63 (±0.49) | 2.70 (±0.50) | 2.57 (±0.50) | 1.37 (±0.19) | 1.30 (±0.17) | 1.31 (±0.16) | 1.12 (±0.16) |
* SPEC refers to VNIR and MIR, respectively.
Validation accuracy (RMSE in g·kg−1) of VNIR and MIR SOC models for different combinations of calibration and validation laboratory SOC reference data using the averaged spectral data (VNIRAVG, MIRAVG) for model calibration and validation (prediction). Statistics represent average RMSE values of 100 randomized runs of the nested cross-validation approach with two standard deviations of their respective distributions given in parentheses.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
|
| 2.56 (±0.50) | 2.56 (±0.50) | 2.68 (±0.47) | 2.56 (±0.49) | 1.13 (±0.16) | 1.13 (±0.16) | 1.37 (±0.16) | 1.15 (±0.16) |
|
| 2.59 (±0.49) | 2.59 (±0.49) | 2.73 (±0.47) | 2.60 (±0.48) | 1.12 (±0.18) | 1.12 (±0.18) | 1.41 (±0.17) | 1.16 (±0.17) |
|
| 2.71 (±0.53) | 2.71 (±0.53) | 2.65 (±0.51) | 2.64 (±0.52) | 1.37 (±0.19) | 1.37 (±0.19) | 1.24 (±0.17) | 1.25 (±0.18) |
|
| 2.59 (±0.51) | 2.59 (±0.51) | 2.65 (±0.49) | 2.57 (±0.50) | 1.15 (±0.16) | 1.15 (±0.16) | 1.28 (±0.16) | 1.12 (±0.16) |