| Literature DB >> 29039810 |
Elena Tamburini1, Fabio Vincenzi2, Stefania Costa3, Paolo Mantovi4, Paola Pedrini5, Giuseppe Castaldelli6.
Abstract
Near-Infrared Spectroscopy is a cost-effective and environmentally friendly technique that could represent an alternative to conventional soil analysis methods, including total organic carbon (TOC). Soil fertility and quality are usually measured by traditional methods that involve the use of hazardous and strong chemicals. The effects of physical soil characteristics, such as moisture content and particle size, on spectral signals could be of great interest in order to understand and optimize prediction capability and set up a robust and reliable calibration model, with the future perspective of being applied in the field. Spectra of 46 soil samples were collected. Soil samples were divided into three data sets: unprocessed, only dried and dried, ground and sieved, in order to evaluate the effects of moisture and particle size on spectral signals. Both separate and combined normalization methods including standard normal variate (SNV), multiplicative scatter correction (MSC) and normalization by closure (NCL), as well as smoothing using first and second derivatives (DV1 and DV2), were applied to a total of seven cases. Pretreatments for model optimization were designed and compared for each data set. The best combination of pretreatments was achieved by applying SNV and DV2 on partial least squares (PLS) modelling. There were no significant differences between the predictions using the three different data sets (p < 0.05). Finally, a unique database including all three data sets was built to include all the sources of sample variability that were tested and used for final prediction. External validation of TOC was carried out on 16 unknown soil samples to evaluate the predictive ability of the final combined calibration model. Hence, we demonstrate that sample preprocessing has minor influence on the quality of near infrared spectroscopy (NIR) predictions, laying the ground for a direct and fast in situ application of the method. Data can be acquired outside the laboratory since the method is simple and does not need more than a simple band ratio of the spectra.Entities:
Keywords: moisture effect; near infrared spectroscopy; particle size effect; soil; spectra pretreatments; total organic carbon
Year: 2017 PMID: 29039810 PMCID: PMC5677457 DOI: 10.3390/s17102366
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Sampling site area and sampling positioning (scale 1:200).
Figure 2NIR absorbance spectra of (a) all soils sample sets; (b) the wet sample (WS) set highlighted in red; (c) the dried sample (DS) set highlighted in red; (d) the ground and sieved sample (GSS) set highlighted in red.
Figure 3Principal component 2 (PC2) vs. PC1 plot for classification of soils samples based on Fourier transform-(FT) NIR spectra: WS (red); DS (green); GSS (blue).
Figure 4Pre-processing treatment on (a) average NIR spectra calculated separately from the WS (blue spectrum), DS (red spectrum) and GSS (green spectrum): effects of (b) normalization by closure (NCL); (c) multiplicative scatter correction (MSC); (d) standard normal variate (SNV).
Figure 5Pre-processing treatment on average NIR spectra calculated separately from the WS (blue spectrum), DS (red spectrum) and GSS (green spectrum): effects of (a) first derivative; (b) second derivative.
The optimal partial least squares (PLS) model prediction results for total organic carbon (TOC) and the corresponding statistical parameters of the various single and combined pretreatments for the DS set (NCL = normalization by closure; MSC = full-multiplicative scatter correction, SNV = standard normal variate; DV1 = 1st derivative; DV2 = 2nd derivative).
| Pre-Treatment Applied | Outliers | F | R2CAL | SEC | R2CV | SECV | RPDCAL | RPDCV | DW | Q-Value |
|---|---|---|---|---|---|---|---|---|---|---|
| NCL | 1 | 5 | 0.91 | 0.13 | 0.88 | 0.15 | 2.50 | 2.15 | 1.6 | 0.76 |
| MSC | 1 | 4 | 0.91 | 0.14 | 0.91 | 0.14 | 2.43 | 2.31 | 1.4 | 0.69 |
| SNV | 1 | 6 | 0.92 | 0.13 | 0.92 | 0.15 | 2.64 | 2.23 | 1.5 | 0.68 |
| DV1 | 1 | 6 | 0.82 | 0.19 | 0.76 | 0.16 | 0.34 | 2.04 | 1.9 | 0.52 |
| DV2 | 1 | 3 | 0.92 | 0.15 | 0.92 | 0.18 | 2.25 | 1.89 | 1.8 | 0.88 |
| SNV + DV1 | 1 | 4 | 0.91 | 0.11 | 0.91 | 0.13 | 3.04 | 2.48 | 1.6 | 0.71 |
| SNV + DV2 | 1 | 3 | 0.96 | 0.06 | 0.93 | 0.09 | 5.31 | 3.52 | 2.0 | 0.89 |
The optimal PLS model prediction results for TOC and the corresponding statistical parameters of the various single and combined pretreatments for the GSS set. (NCL = normalization by closure; MSC = full-multiplicative scatter correction, SNV = standard normal variate; DV1 = 1st derivative; DV2 = 2nd derivative).
| Pre-Treatment Applied | Outliers | F | R2CAL | SEC | R2CV | SECV | RPDCAL | RPDCV | DW | Q-Value |
|---|---|---|---|---|---|---|---|---|---|---|
| NCL | 1 | 7 | 0.89 | 0.12 | 0.74 | 0.16 | 2.81 | 2.12 | 0.7 | 2.32 |
| MSC | 1 | 2 | 0.90 | 0.11 | 0.89 | 0.14 | 3.16 | 2.34 | 0.5 | 2.45 |
| SNV | 1 | 3 | 0.95 | 0.11 | 0.93 | 0.10 | 3.13 | 3.28 | 0.7 | 1.91 |
| DV1 | 1 | 4 | 0.90 | 0.10 | 0.87 | 0.15 | 3.38 | 2.21 | 0.7 | 1.87 |
| DV2 | 1 | 2 | 0.96 | 0.09 | 0.92 | 0.10 | 3.45 | 3.25 | 0.8 | 2.54 |
| SNV + DV1 | 1 | 2 | 0.92 | 0.10 | 0.90 | 0.11 | 3.28 | 3.01 | 0.7 | 2.18 |
| SNV + DV2 | 1 | 2 | 0.99 | 0.09 | 0.93 | 0.10 | 3.52 | 3.31 | 0.8 | 2.64 |
The optimal PLS model prediction results for TOC and the corresponding statistical parameters of the various single and combined pretreatments for the WS set. (NCL = normalization by closure; MSC = full-multiplicative scatter correction, SNV = standard normal variate; DV1 = 1st derivative; DV2 = 2nd derivative).
| Pre-Treatment Applied | Outliers | F | R2CAL | SEC | R2CV | SECV | RPDCAL | RPDCV | DW | Q-Value |
|---|---|---|---|---|---|---|---|---|---|---|
| NCL | 2 | 3 | 0.79 | 0.16 | 0.73 | 0.23 | 2.13 | 1.53 | 0.5 | 1.21 |
| MSC | 2 | 5 | 0.74 | 0.19 | 0.69 | 0.17 | 1.70 | 1.93 | 0.5 | 1.31 |
| SNV | 2 | 3 | 0.80 | 0.16 | 0.78 | 0.21 | 2.07 | 1.57 | 0.5 | 1.62 |
| DV1 | 1 | 4 | 0.74 | 0.14 | 0.74 | 0.17 | 2.31 | 1.93 | 0.6 | 1.65 |
| DV2 | 1 | 4 | 0.89 | 0.13 | 0.85 | 0.18 | 2.50 | 1.90 | 0.6 | 1.58 |
| SNV + DV1 | 2 | 4 | 0.93 | 0.08 | 0.91 | 0.10 | 4.03 | 3.64 | 0.6 | 1.83 |
| SNV + DV2 | 1 | 3 | 0.96 | 0.11 | 0.92 | 0.12 | 3.07 | 2.81 | 0.7 | 2.01 |
Figure 6Calibration curves for parameter TOC in soils of original vs. predicted values from the WS dataset (a); DS dataset (b); GSS dataset (c).
External validation of the NIR calibration model for the prediction of TOC in unknown samples of WS, DS and GSS sets. Range and SD refer to NIR predicted values.
| Sample Set | Range | SD | MAE ± SDMAE | Outliers | R2EX.VAL. | RMSEP | Bias | Slope |
|---|---|---|---|---|---|---|---|---|
| WS | 1.21%–2.05% | 0.32% | 0.11 ± 0.08 | 0 | 0.76 | 0.25 | 0.38 | 0.72 |
| DS | 1.16%–1.99% | 0.33% | 0.10 ± 0.09 | 0 | 0.79 | 0.27 | 0.21 | 0.87 |
| GSS | 1.00%–2.16% | 0.27% | 0.13 ± 0.08 | 0 | 0.83 | 0.26 | 0.20 | 0.88 |
Figure 7Calibration curve for parameter TOC in soils of original vs. predicted values grouping together all three data sets.
Figure 8External validation of the NIR calibration model for the prediction of TOC content in unknown samples of soils.