| Literature DB >> 32377459 |
Guangfei Wei1,2, Yu Li1, Zhitao Zhang1,2, Yinwen Chen3, Junying Chen1,2, Zhihua Yao1,2, Congcong Lao1,2, Huifang Chen1,2.
Abstract
Soil salinization is a global problem closely related to the sustainable development of social economy. Compared with frequently-used satellite-borne sensors, unmanned aerial vehicles (UAVs) equipped with multispectral sensors provide an opportunity to monitor soil salinization with on-demand high spatial and temporal resolution. This study aims to quantitatively estimate soil salt content (SSC) using UAV-borne multispectral imagery, and explore the deep mining of multispectral data. For this purpose, a total of 60 soil samples (0-20 cm) were collected from Shahaoqu Irrigation Area in Inner Mongolia, China. Meanwhile, from the UAV sensor we obtained the multispectral data, based on which 22 spectral covariates (6 spectral bands and 16 spectral indices) were constructed. The sensitive spectral covariates were selected by means of gray relational analysis (GRA), successive projections algorithm (SPA) and variable importance in projection (VIP), and from these selected covariates estimation models were built using back propagation neural network (BPNN) regression, support vector regression (SVR) and random forest (RF) regression, respectively. The performance of the models was assessed by coefficient of determination (R 2), root mean squared error (RMSE) and ratio of performance to deviation (RPD). The results showed that the estimation accuracy of the models had been improved markedly using three variable selection methods, and VIP outperformed GRA and GRA outperformed SPA. However, the model accuracy with the three machine learning algorithms turned out to be significantly different: RF > SVR > BPNN. All the 12 SSC estimation models could be used to quantitatively estimate SSC (RPD > 1.4) while the VIP-RF model achieved the highest accuracy (R c 2 = 0.835, R P 2 = 0.812, RPD = 2.299). The result of this study proved that UAV-borne multispectral sensor is a feasible instrument for SSC estimation, and provided a reference for further similar research.Entities:
Keywords: Estimation models; Machine learning algorithms; Multispectral sensor; Soil salt content; Unmanned aerial vehicle (UAV); Variable selection methods
Year: 2020 PMID: 32377459 PMCID: PMC7194094 DOI: 10.7717/peerj.9087
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1The distribution of sampling point.
(A–D) The distribution of the four study areas respectively.
Figure 2The distribution of sampling point.
(A–D) The distribution of the four study areas respectively.
Figure 3(A) M600 unmanned aerial vehicle; (B) micro-MCA multispectral sensor.
MCA multispectral sensor parameters.
| Parameter | Size |
|---|---|
| Weight/g | 670 |
| Field angle | 38.26 × 30.97 |
| The highest pixel | 1,280 × 1,024 |
| Band and band width/nm | B1 490 (10–25) |
| B2 550 (10–25) | |
| B3 680 (10–25) | |
| B4 720 (10–25) | |
| B5 800 (10–25) | |
| B6 900 (10–25) |
Reference spectral indices.
B, G, R, NIR1 and NIR2 are spectral reflectance at wavelengths of 490 nm, 550 nm, 680 nm, 800 nm and 900 nm, respectively. B and B represent the reflectance values from random spectral bands available from the multispectral sensor.
| Spectral index | Formula | Full name | References |
|---|---|---|---|
| S1 | Salinity Index 1 | ||
| S2 | Salinity Index 2 | ||
| S3 | Salinity Index 3 | ||
| S4 | Salinity Index 4 | ||
| S5 | Salinity Index 5 | ||
| S6-1 | Salinity Index 6-1 | ||
| S6-2 | Salinity Index 6-2 | ||
| SR-1 | Simple Ratio Index 1 | ||
| SR-2 | Simple Ratio Index 2 | ||
| BI-1 | Brightness Index 1 | ||
| BI-2 | Brightness Index 2 | ||
| NDSI-1 | Normalized Difference Salinity Index 1 | ||
| NDSI-2 | Normalized Difference Salinity Index 2 | ||
| DI | Difference Index | ||
| RI | Ratio Index | ||
| NDI | Normalization Index |
Figure 4Method flow chart.
(A) Data preprocessing; (B) modeling and (C) analysis.
Summary statistics of soil salinity sampling points.
| Data set | None salinization | Mild salinization | Severe salinization | Min/% | Max/% | CV |
|---|---|---|---|---|---|---|
| Entire dataset ( | 20 | 24 | 16 | 0.08 | 0.81 | 0.54 |
| Calibration ( | 14 | 16 | 10 | 0.08 | 0.81 | 0.54 |
| Validation ( | 6 | 8 | 6 | 0.09 | 0.71 | 0.57 |
Figure 5Violin plots showing the statistics of SSC for entire dataset, calibration dataset and validation dataset (%).
S.D.: standard deviation.
The relationship between SSC and spectral indices.
| Spectral index | | | Spectral index | | |
|---|---|---|---|
| S1 | 0.36 | SR-2 | 0.30 |
| S2 | 0.26 | BI-1 | 0.71 |
| S3 | 0.59 | BI-2 | 0.21 |
| S4 | 0.66 | NDSI-1 | 0.19 |
| S5 | 0.43 | NDSI-2 | 0.72 |
| S6-1 | 0.71 | DI | 0.49 |
| S6-2 | 0.27 | RI | 0.59 |
| SR-1 | 0.13 | NDI | 0.55 |
Notes:
p < 0.05.
p < 0.01.
Figure 6The correlation coefficients of the measured SSC data and the three 2D indices for the two random spectral bands.
(A) RI; (B) DI; (C) NDI. The color bar on the right side represents the color of Pearson’s correlation coefficient (PCC) values. Red stands for positive correlation and blue for negative. The darker the color was, the larger the PCC value was.
Figure 7Gray correlation degree between the variables and SSC.
Figure 8The selected variables based on SPA.
Figure 9VIP score of variables for SSC estimation.
The details on model parameters.
| BPNN | SVR | RF | |||
|---|---|---|---|---|---|
| size | ntree | mtry | |||
| Raw-BPNN | 5 | – | – | – | – |
| GRA-BPNN | 3 | – | – | – | – |
| SPA-BPNN | 2 | – | – | – | – |
| VIP-BPNN | 3 | – | – | – | – |
| Raw-SVR | – | 1000 | 0.01 | – | – |
| GRA-SVR | – | 100 | 0.01 | – | – |
| SPA-SVR | – | 100 | 0.001 | – | – |
| VIP-SVR | – | 1000 | 0.01 | – | – |
| Raw-RF | – | – | – | 500 | 3 |
| GRA-RF | – | – | – | 500 | 3 |
| SPA-RF | – | – | 500 | 2 | |
| VIP-RF | – | – | – | 500 | 3 |
Comparisons of different machine learning models based on different selection methods.
| Acronym | RMSEC | RMSEP | RPD | ||
|---|---|---|---|---|---|
| Raw-BPNN | 0.599 | 0.135 | 0.574 | 0.137 | 1.494 |
| GRA-BPNN | 0.661 | 0.116 | 0.677 | 0.116 | 1.764 |
| SPA-BPNN | 0.643 | 0.116 | 0.659 | 0.121 | 1.691 |
| VIP-BPNN | 0.675 | 0.118 | 0.695 | 0.113 | 1.811 |
| Raw-SVR | 0.533 | 0.136 | 0.566 | 0.145 | 1.410 |
| GRA-SVR | 0.645 | 0.120 | 0.625 | 0.131 | 1.562 |
| SPA-SVR | 0.582 | 0.126 | 0.581 | 0.133 | 1.539 |
| VIP-SVR | 0.643 | 0.115 | 0.631 | 0.128 | 1.598 |
| Raw-RF | 0.650 | 0.115 | 0.631 | 0.127 | 1.642 |
| GRA-RF | 0.768 | 0.099 | 0.765 | 0.105 | 1.949 |
| SPA-RF | 0.747 | 0.098 | 0.736 | 0.108 | 1.895 |
| VIP-RF | 0.835 | 0.085 | 0.812 | 0.089 | 2.299 |
Note:
Raw, all variables; Rc2, determination coefficient of calibration; RMSEC, root mean squared error of calibration; RP2, determination coefficient of validation; RMSEP, root mean squared error of validation; RPD, ratio of performance to deviation.
Figure 10Prediction performance of the models.
Figure 11Comparison of the estimation results of the variable selection models.
(A) GRA-BPNN, (B) SPA-BPNN, (C) VIP-BPNN, (D) GRA-SVR, (E) SPA-SVR, (F) VIP-SVR, (G) GRA-RF, (H) SPA-RF and (I) VIP-RF.