| Literature DB >> 29857572 |
Wenwen Kong1, Chu Zhang2,3, Feng Cao4,5, Fei Liu6,7, Shaoming Luo8, Yu Tang9, Yong He10,11.
Abstract
Hyperspectral imaging was explored to detect Sclerotinia stem rot (SSR) on oilseed rape leaves with chemometric methods, and the influences of variable selection, machine learning, and calibration transfer methods on detection performances were evaluated. Three different sample sets containing healthy and infected oilseed rape leaves were acquired under different imaging acquisition parameters. Four discriminant models were built using full spectra, including partial least squares-discriminant analysis (PLS-DA), support vector machine (SVM), soft independent modeling of class analogies (SIMCA), and k-nearest neighbors (KNN). PLS-DA and SVM models were also built with the optimal wavelengths selected by principal component analysis (PCA) loadings, second derivative spectra, competitive adaptive reweighted sampling (CARS), and successive projections algorithm (SPA). The optimal wavelengths selected for each sample set by different methods were different; however, the optimal wavelengths selected by PCA loadings and second derivative spectra showed similarity between different sample sets. Direct standardization (DS) was successfully applied to reduce spectral differences among different sample sets. Overall, the results demonstrated that using hyperspectral imaging with chemometrics for plant disease detection can be efficient and will also help in the selection of optimal variable selection, machine learning, and calibration transfer methods for fast and accurate plant disease detection.Entities:
Keywords: Sclerotinia stem rot; calibration transfer; discriminant methods; hyperspectral imaging; oilseed rape; variable selection
Mesh:
Year: 2018 PMID: 29857572 PMCID: PMC6021932 DOI: 10.3390/s18061764
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The average spectra of healthy and infected leaves of sample sets 1 (a), 2 (b) and 3 (c).
The results of discriminant models using full spectra of three different sample sets.
| Sample | Models | Parameters * | Calibration Accuracy (%) | Prediction Accuracy (%) |
|---|---|---|---|---|
| Set 1 | PLS-DA | 16 | 100.00 | 90.91 |
| SVM | (256, 0.3229) | 79.42 | 81.82 | |
| KNN | 4 | 66.17 | 59.09 | |
| SIMCA | (10, 10) | 80.88 | 95.95 | |
| Set 2 | PLS-DA | 6 | 96.67 | 96.67 |
| SVM | (256, 1.7411) | 100.00 | 100.00 | |
| KNN | 4 | 91.11 | 100.00 | |
| SIMCA | (10, 10) | 90.00 | 90.00 | |
| Set 3 | PLS-DA | 15 | 100.00 | 100.00 |
| SVM | (48.5029, 1.7411) | 98.89 | 100.00 | |
| KNN | 3 | 93.33 | 93.33 | |
| SIMCA | (9, 9) | 95.56 | 100.00 |
* The parameters indicate the parameters of each model; the parameters for partial least squares-discriminant analysis (PLS-DA), support vector machine (SVM), k-nearest neighbors (KNN), and soft independent modeling of class analogies (SIMCA) are the optimal number of latent variables, (C, g), the number of nearest neighbors, and number of principal components (PCs) of each class, respectively.
Figure 2The optimal wavelength selection by the second derivative spectra of the sample sets 1, 2, and 3 ((a,c,e), respectively) and the principal component analysis (PCA) loadings of the three sets ((b,d,f), respectively).
The optimal wavelengths selected by the PCA loadings, second derivative spectra, successive projections algorithm (SPA), and competitive adaptive reweighted sampling (CARS) for the three sample sets.
| Sample | Methods | No. * | Wavelengths (nm) |
|---|---|---|---|
| Set 1 | Second derivative | 7 | 636, 649, 676, 696, 710, 732, 764 |
| PCA loading | 5 | 553, 657, 714, 723, 924 | |
| SPA | 14 | 668, 673, 550, 570, 681, 831, 521, 770, 663, 690, 932, 638, 650, 511 | |
| CARS | 16 | 550, 554, 559, 580, 589, 621, 628, 643, 663, 664, 852, 871, 901, 916, 945, 950 | |
| Set 2 | Second derivative | 12 | 533, 560, 608, 636, 650, 668, 674, 693, 707, 734, 764, 773 |
| PCA loading | 6 | 557, 673, 707, 714, 739, 770 | |
| SPA | 5 | 730, 782, 702, 558, 950 | |
| CARS | 16 | 629, 634, 636, 649, 762, 771, 798, 828, 837, 870, 875, 900, 908, 926, 940, 950 | |
| Set 3 | Second derivative | 13 | 533, 562, 609, 636, 650, 658, 667, 674, 693, 710, 737, 765, 771 |
| PCA loading | 5 | 557, 633, 705, 716, 747 | |
| SPA | 16 | 813, 833, 896, 767, 867, 507, 908, 513, 930, 535, 501, 743, 557, 696, 649, 707 | |
| CARS | 9 | 558, 564, 571, 604, 657, 662, 766, 809, 833 |
* No. refers to the number.
The results of PLS-DA and SVM models using optimal wavelengths of three sample sets.
| Model | Sample | Methods | Parameters | Calibration Accuracy (%) | Prediction Accuracy (%) |
|---|---|---|---|---|---|
| PLS-DA | Set 1 | PCA loadings | 5 | 76.47 | 81.82 |
| Second derivative | 7 | 77.94 | 68.18 | ||
| CARS | 14 | 100.00 | 72.73 | ||
| SPA | 14 | 100.00 | 82.61 | ||
| Set 2 | PCA loadings | 5 | 97.78 | 96.67 | |
| Second derivative | 5 | 93.33 | 96.67 | ||
| CARS | 12 | 100.00 | 100.00 | ||
| SPA | 5 | 93.33 | 96.67 | ||
| Set 3 | PCA loadings | 5 | 97.78 | 100.00 | |
| Second derivative | 5 | 100.00 | 100.00 | ||
| CARS | 9 | 100.00 | 96.67 | ||
| SPA | 12 | 100.00 | 100.00 | ||
| SVM | Set 1 | PCA loadings | (84.4485, 27.8576) | 72.06 | 77.27 |
| Second derivative | (0.5743, 256) | 69.12 | 90.91 | ||
| CARS | (256,84.4485) | 91.18 | 81.82 | ||
| SPA | (147.0334, 256) | 94.12 | 90.91 | ||
| Set 2 | PCA loadings | (147.0334, 256) | 98.89 | 96.67 | |
| Second derivative | (147.0334, 84.4485) | 97.78 | 100.00 | ||
| CARS | (256, 27.8576) | 90.00 | 93.33 | ||
| SPA | (256, 147.0334) | 100.00 | 96.67 | ||
| Set 3 | PCA loadings | (27.8576, 84.4485) | 97.78 | 100.00 | |
| Second derivative | (256, 16) | 100.00 | 100.00 | ||
| CARS | (5.2780, 27.8576) | 91.11 | 83.22 | ||
| SPA | (5.2780, 84.4485) | 98.89 | 100.00 |
The discriminant results of the sample sets using the optimal wavelengths selected by the other sample sets.
| Model | Sample/Wavelength | Methods | Parameters | Calibration Accuracy (%) | Prediction Accuracy (%) |
|---|---|---|---|---|---|
| PLS-DA | Set 1/OPW 2 a | PCA loadings | 4 | 77.94 | 77.27 |
| Second derivative | 11 | 98.53 | 90.91 | ||
| CARS | 14 | 98.53 | 81.82 | ||
| SPA | 5 | 85.29 | 81.82 | ||
| Set 1/OPW 3 b | PCA loadings | 4 | 75.00 | 77.27 | |
| Second derivative | 11 | 98.53 | 86.36 | ||
| CARS | 7 | 95.59 | 77.27 | ||
| SPA | 11 | 94.12 | 86.36 | ||
| Set 2/OPW 1 c | PCA loadings | 2 | 85.56 | 90.00 | |
| Second derivative | 4 | 88.89 | 90.00 | ||
| CARS | 3 | 84.44 | 90.00 | ||
| SPA | 3 | 82.22 | 90.00 | ||
| Set 2/OPW 3 | PCA loadings | 5 | 94.44 | 96.67 | |
| Second derivative | 5 | 92.22 | 96.67 | ||
| CARS | 2 | 82.22 | 80.00 | ||
| SPA | 7 | 98.89 | 96.67 | ||
| Set 3/OPW 1 | PCA loadings | 4 | 95.56 | 90.00 | |
| Second derivative | 4 | 92.22 | 86.67 | ||
| CARS | 11 | 98.89 | 100.00 | ||
| SPA | 9 | 100.00 | 96.67 | ||
| Set 3/OPW 2 | PCA loadings | 5 | 94.44 | 90.00 | |
| Second derivative | 5 | 100.00 | 96.67 | ||
| CARS | 11 | 100.00 | 100.00 | ||
| SPA | 3 | 94.44 | 86.67 | ||
| SVM | Set 1/OPW 2 a | PCA loadings | (1.7411, 84.4485) | 69.12 | 90.91 |
| Second derivative | (256, 147.0334) | 89.71 | 86.36 | ||
| CARS | (256, 48.5029) | 83.82 | 81.82 | ||
| SPA | (256, 147.0334) | 88.24 | 86.362 | ||
| Set 1/OPW 3 b | PCA loadings | (16, 84.4485) | 72.06 | 86.362 | |
| Second derivative | (256, 147.0334) | 89.71 | 81.82 | ||
| CARS | (256, 147.0334) | 80.88 | 68.18 | ||
| SPA | (256, 9.1896) | 79.41 | 81.82 | ||
| Set 2/OPW 1 c | PCA loadings | (256, 9.1896) | 90.00 | 93.33 | |
| Second derivative | (256, 84.4485) | 96.67 | 100.00 | ||
| CARS | (147.0334, 84.4485) | 93.33 | 90.00 | ||
| SPA | (48.5029, 147.0334) | 96.67 | 96.67 | ||
| Set 2/OPW 3 | PCA loadings | (147.0334, 256) | 96.67 | 100.00 | |
| Second derivative | (147.0334, 84.4485) | 96.67 | 96.67 | ||
| CARS | (256, 147.0334) | 92.22 | 96.67 | ||
| SPA | (27.8576, 147.0334) | 96.67 | 96.67 | ||
| Set 3/OPW 1 | PCA loadings | (9.1896, 147.0334) | 97.78 | 90.00 | |
| Second derivative | (147.0334, 27.8576) | 97.78 | 93.33 | ||
| CARS | (147.0334, 256) | 97.78 | 100.00 | ||
| SPA | (147.0334, 27.8576) | 100.00 | 100.00 | ||
| Set 3/OPW 2 | PCA loadings | (84.4885, 48.5029) | 98.89 | 100.00 | |
| Second derivative | (147.0334, 16) | 98.89 | 100.00 | ||
| CARS | (147.0334, 91.1111) | 91.11 | 86.67 | ||
| SPA | (256, 48.5029) | 98.89 | 96.67 |
a OPW 2 means the optimal wavelengths were selected by sample set 2; b OPW 3 means the optimal wavelengths were selected by sample set 3; c OPW 1 means the optimal wavelengths were selected by sample set 1.
The results of the SVM models built by using the full spectra of sample set 3 as the calibration set and the untransferred and transferred full spectra of sample sets 1 and 2 as the prediction sets.
| Pretreatment | Sample | Correctly Classified Samples/Total Healthy Samples | Correctly Classified Samples/Total Infected Samples | Total Prediction Accuracy (%) |
|---|---|---|---|---|
| Untransferred | Set 1 | 8/45 | 35/45 | 47.78 |
| Set 2 | 51/60 | 58/60 | 90.83 | |
| Transferred | Set 1 | 42/45 | 43/45 | 94.44 |
| Set 2 | 59/60 | 55/60 | 95.00 |
Figure 3The discriminant results of the SVM models using a single wavelength band of sample set 1 (a); set 2 (b); and set 3 (c).
The results of the different calibration models using combined sample sets.
| Sample Sets | Models | Parameters | Calibration Accuracy (%) | Prediction Accuracy (%) |
|---|---|---|---|---|
| Combined Sets 1, 2, and 3 | PLS-DA | 10 | 92.74 | 92.68 |
| SVM | (256, 3.0314) | 97.58 | 95.12 | |
| KNN | 3 | 87.10 | 90.24 | |
| SIMCA | (20, 20) | 80.24 | 84.15 |