| Literature DB >> 35591149 |
Lalit M Kandpal1, Muhammad A Munnaf1, Cristina Cruz2, Abdul M Mouazen1.
Abstract
Previous works indicate that data fusion, compared to single data modelling can improve the assessment of soil attributes using spectroscopy. In this work, two different kinds of proximal soil sensing techniques i.e., mid-infrared (MIR) and X-ray fluorescence (XRF) spectroscopy were evaluated, for assessment of seven fertility attributes. These soil attributes include pH, organic carbon (OC), phosphorous (P), potassium (K), magnesium (Mg), calcium (Ca) and moisture contents (MC). Three kinds of spectra fusion (SF) (spectra concatenation) approaches of MIR and XRF spectra were compared, namely, spectra fusion-Partial least square (SF-PLS), spectra fusion-Sequential Orthogonalized Partial least square (SF-SOPLS) and spectra fusion-Variable Importance Projection-Sequential Orthogonalized Partial least square (SF-VIP-SOPLS). Furthermore, the performance of SF models was compared with the developed single sensor model (based on individual spectra of MIR and XRF). Compared with the results obtained from single sensor model, SF models showed improvement in the prediction performance for all studied attributes, except for OC, Mg, and K prediction. More specifically, the highest improvement was observed with SF-SOPLS model for pH [R2p = 0.90, root mean square error prediction (RMSEP) = 0.15, residual prediction deviation (RPD) = 3.30, and ratio of performance inter-quantile (RPIQ) = 3.59], successively followed by P (R2p = 0.91, RMSEP = 4.45 mg/100 g, RPD = 3.53, and RPIQ = 4.90), Ca (R2p = 0.92, RMSEP = 177.11 mg/100 g, RPD = 3.66, and RPIQ = 3.22) and MC (R2p = 0.80, RMSEP = 1.91%, RPD = 2.31, RPIQ = 2.62). Overall the study concluded that SF approach with SOPLS attained better performance over the traditional model developed with the single sensor spectra, hence, SF is recommended as the best SF method for improving the prediction accuracy of studied soil attributes. Moreover, the multi-sensor spectra fusion approach is not limited for only MIR and XRF data but in general can be extended for complementary information fusion in order to improve the model performance in precision agriculture (PA) applications.Entities:
Keywords: multi-sensor; precision agriculture (PA); sequential orthogonalized partial least square (SOPLS); soil fertility; spectra fusion (SF)
Mesh:
Substances:
Year: 2022 PMID: 35591149 PMCID: PMC9099966 DOI: 10.3390/s22093459
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Characteristics of the study fields in Spain and Belgium.
| Field | Period | Area (ha) | Crop Type | N | Soil Texture | Average MC (%) | Average OC (%) |
|---|---|---|---|---|---|---|---|
| SP1, Spain | 2019 | 50 | Opium, Garlic | 100 | Clay loam | 13.18 | 1.48 |
| Keerkestraat, Belgium | 2020 | 1.2 | Maize | 10 | Loam | 21.63 | 1.26 |
| Krokey, Belgium | 2020 | 13 | Oil seed rape | 4 | Loam | 19.31 | 1.66 |
| Kattestraat, Belgium | 2020 | 5 | Potatoes | 9 | Loam | 18.00 | 1.27 |
| VDD Tegen ti hof, Belgium | 2020 | 5 | Potatoes | 20 | Loam | 16.61 | 1.50 |
| Langs de route, Belgium | 2020 | 6 | Potatoes | 18 | Polder | 17.78 | 1.12 |
| Bijna vrij, Belgium | 2020 | 7 | Sprout | 35 | Polder | 22.43 | 1.08 |
N = number of samples; MC = moisture content; OC = organic carbon.
Figure 1Location of seven experiment fields with sampling points in Belgium and Spain (described in Table 1).
Figure 2Experiment process for soil measurement. (a) mid infrared (MIR) scanning of soil samples, (b) X-ray fluorescence (XRF) scanning of soil samples.
Descriptive statistics of laboratory measured soil attributes for selected sample sets used for building training and test sets.
| Soil Indicators | N | Sample Set | Range | Mean ± SD |
|---|---|---|---|---|
| pH | 156 | Training set | 6.50–8.65 | 8.02 ± 0.50 |
| 40 | Test set | 6.60–8.76 | 8.19 ± 0.51 | |
| OC (%) | 156 | Training set | 0.73–2.47 | 1.34 ± 0.30 |
| 40 | Test set | 0.79–1.84 | 1.43 ± 0.28 | |
| P (mg/100 g) | 156 | Training set | 0.33–69 | 19.21 ± 18.77 |
| 40 | Test set | 0.51–58 | 10.31 ± 15.59 | |
| K (mg/100 g) | 156 | Training set | 9.00–122.91 | 41.56 ± 22.46 |
| 40 | Test set | 10.00–110.28 | 48.81 ± 20.72 | |
| Mg (mg/100 g) | 156 | Training set | 17.00–175.29 | 62.25 ± 23.78 |
| 40 | Test set | 18.00–102.91 | 62.48 ± 23.12 | |
| Ca (mg/100 g) | 156 | Training set | 196.00–3880 | 1380 ± 956.93 |
| 40 | Test set | 212.00–2900 | 929.23 ± 652.40 | |
| MC (%) | 156 | Training set | 9.01–26.01 | 16.75 ± 4.33 |
| 40 | Test set | 7.02–23.05 | 14.83 ± 4.31 |
N = number of samples; OC = organic carbon; P = Phosphorous; K = potassium; Mg = Magnesium; Ca = Calcium; MC = Moisture content; SD = standard deviation.
Best preprocessing steps considered for correction of mid infrared (MIR) and X-ray fluorescence (XRF) spectral data.
| Data | Spectral Pretreatment | Soil Quality Indicators |
|---|---|---|
| MIR | Moving average → Normalization | pH, OC, Mg, MC |
| MIR | Moving average → SNV | P |
| MIR | Moving average | K |
| MIR | Moving average → MSC | Ca |
| XRF | Baseline correction → Compton normalization → Moving average → Normalization | pH, OC, Mg, MC |
| XRF | Baseline correction → Compton normalization → Moving average → SNV | P |
| XRF | Baseline correction → Compton normalization → Moving average | K |
| XRF | Baseline correction → Compton normalization → Moving average → MSC | Ca |
MIR = mid-infrared; XRF = X-ray fluorescence; SNV = Standard normal variate; MSC: Multiplicative scatter correction; OC = organic carbon; P = Phosphorous; K = potassium; Mg = Magnesium; Ca = Calcium; MC = Moisture content; SD = standard deviation.
Figure 3The mid-infrared (MIR) (a) and X-ray florescence (XRF) spectra (b) of samples used during the study.
Prediction results of soil pH, organic carbon (OC), phosphorous (P), potassium (K), magnesium (Mg), calcium (Ca), moisture content (MC) using traditional PLS model (TPLS), and spectra-fusion (SF-PLS, SF-VIP-SOPLS and SF-SOPLS).
| Soil Indicators | Model Type | Training Set | Test Set | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
| RMSEC | RPD |
| RMSEP | RPD | RPIQ | Variables | ||
| pH | MIR-TPLS | 0.90 | 0.15 | 3.19 | 0.89 | 0.16 | 3.03 | 3.51 | 908 |
| XRF-TPLS | 0.89 | 0.16 | 3.07 | 0.88 | 0.17 | 2.95 | 2.78 | 2048 | |
| SF-PLS | 0.89 | 0.16 | 2.66 | 0.88 | 0.17 | 2.95 | 2.76 | 2948 | |
| SF-SOPLS | 0.94 | 0.11 | 4.14 | 0.90 | 0.15 | 3.30 | 3.59 | 2948 | |
| SF-VIP-SOPLS | 0.91 | 0.14 | 3.49 | 0.90 | 0.15 | 3.22 | 3.54 | 406 | |
| OC (%) | MIR-TPLS | 0.76 | 0.14 | 2.05 | 0.63 | 0.17 | 1.63 | 1.66 | 900 |
| XRF-TPLS | 0.59 | 0.19 | 1.57 | 0.30 | 0.24 | 1.18 | 1.01 | 2048 | |
| SF-PLS | 0.56 | 0.19 | 1.50 | 0.35 | 0.22 | 1.24 | 1.11 | 2948 | |
| SF-SOPLS | 0.75 | 0.13 | 2.05 | 0.75 | 0.13 | 2.02 | 2.47 | 2948 | |
| SF-VIP-SOPLS | 0.78 | 0.13 | 2.17 | 0.66 | 0.17 | 1.70 | 2.09 | 524 | |
| P (mg/100 g) | MIR-TPLS | 0.87 | 6.74 | 2.78 | 0.84 | 7.73 | 2.45 | 2.69 | 900 |
| XRF-TPLS | 0.85 | 7.04 | 2.66 | 0.83 | 6.36 | 2.45 | 2.34 | 2048 | |
| SF-PLS | 0.90 | 5.90 | 3.17 | 0.82 | 6.81 | 2.28 | 2.04 | 2948 | |
| SF-SOPLS | 0.95 | 4.11 | 4.56 | 0.91 | 4.45 | 3.53 | 4.90 | 2948 | |
| SF-VIP-SOPLS | 0.92 | 5.27 | 3.56 | 0.88 | 5.20 | 2.99 | 2.90 | 387 | |
| K (mg/100 g) | MIR-TPLS | 0.71 | 12.69 | 1.86 | 0.65 | 14.12 | 1.70 | 1.90 | 900 |
| XRF-TPLS | 0.66 | 13.74 | 1.72 | 0.48 | 15.03 | 1.37 | 1.68 | 2048 | |
| SF-PLS | 0.67 | 12.31 | 1.74 | 0.48 | 14.90 | 1.39 | 1.56 | 2948 | |
| SF-SOPLS | 0.72 | 11.82 | 1.78 | 0.67 | 11.67 | 1.77 | 2.13 | 2948 | |
| SF-VIP-SOPLS | 0.76 | 10.51 | 2.04 | 0.64 | 12.27 | 1.68 | 1.67 | 453 | |
| Mg (mg/100 g) | MIR-TPLS | 0.77 | 11.39 | 2.08 | 0.74 | 11.64 | 1.98 | 1.76 | 900 |
| XRF-TPLS | 0.65 | 13.94 | 1.70 | 0.59 | 15.33 | 1.50 | 0.99 | 2048 | |
| SF-PLS | 0.78 | 10.18 | 2.16 | 0.61 | 13.78 | 1.59 | 1.48 | 2948 | |
| SF-SOPLS | 0.80 | 9.54 | 2.26 | 0.78 | 10.65 | 2.17 | 2.13 | 2948 | |
| SF-VIP-SOPLS | 0.79 | 9.93 | 2.21 | 0.76 | 11.13 | 2.07 | 1.87 | 449 | |
| Ca (mg/100 g) | MIR-TPLS | 0.91 | 274.46 | 3.45 | 0.85 | 261.87 | 2.49 | 2.70 | 900 |
| XRF-TPLS | 0.84 | 372.97 | 2.54 | 0.71 | 466.43 | 1.81 | 2.24 | 2048 | |
| SF-PLS | 0.87 | 331.82 | 2.85 | 0.73 | 419.32 | 1.89 | 2.34 | 2948 | |
| SF-SOPLS | 0.96 | 176.73 | 5.36 | 0.92 | 177.11 | 3.66 | 3.22 | 2948 | |
| SF-VIP-SOPLS | 0.96 | 185.57 | 5.11 | 0.92 | 180.73 | 3.60 | 3.20 | 503 | |
| MC (%) | MIR-TPLS | 0.81 | 1.84 | 2.34 | 0.71 | 2.32 | 1.85 | 2.26 | 900 |
| XRF-TPLS | 0.76 | 2.09 | 2.07 | 0.64 | 2.57 | 1.67 | 1.57 | 2048 | |
| SF-PLS | 0.77 | 2.07 | 2.08 | 0.66 | 2.52 | 1.70 | 2.09 | 2948 | |
| SF-SOPLS | 0.85 | 1.75 | 2.47 | 0.80 | 1.91 | 2.31 | 2.62 | 2948 | |
| SF-VIP-SOPLS | 0.86 | 1.59 | 2.71 | 0.74 | 2.16 | 2.01 | 2.49 | 466 | |
R2cv and R2p = coefficient of determination for cross-validation and prediction; RMSEC and RMSEP = root mean square error of cross-validation and prediction; RPD = Residual prediction deviation; RPIQ = ratio of performance to interquartile distance; MIR-TPLS = mid-infrared-traditional partial least square; XRF-TPLS = X-ray fluorescence-traditional partial least square; SF-PLS = spectra fusion based on partial least square; SF-SOPLS = spectra fusion based on sequential orthogonalized partial least squares; SF-VIP-SOPLS = spectra fusion based on sequential orthogonalized partial least squares with variable importance in projection; OC = organic carbon; P = Phosphorous; K = Potassium; Mg = Magnesium; Ca = Calcium; MC = Moisture content.
Figure 4Predicted vs. measured scatter plots of best spectra fusion method (SF-SOPLS) for predicting seven soil attributes in test-set. Units of RMSEP are the same as units of respective soil properties.
Figure 5Variable impotence in projection (VIP) score plots for pH, organic carbon (OC), phosphorus (P), potassium (K), magnesium (Mg), calcium (Ca) and moisture content (MC) with corresponding scatter plots of measured against predicted values using the SF-VIP-SOPLS model developed based on selected variables. Units of root mean square error of prediction (RMSEP) are the same as units of respective soil properties. In VIP score plots the solid horizontal lines indicate the threshold values (value = 1) used for variable selection, and the vertical highlighted region is the significant range selected for prediction.