| Literature DB >> 36015456 |
Mafalda Reis-Pereira1,2, Renan Tosin1,2, Rui Martins1,2, Filipe Neves Dos Santos2, Fernando Tavares1,3,4, Mário Cunha1,2.
Abstract
Pseudomonas syringae pv. actinidiae (Psa) has been responsible for numerous epidemics of bacterial canker of kiwi (BCK), resulting in high losses in kiwi production worldwide. Current diagnostic approaches for this disease usually depend on visible signs of the infection (disease symptoms) to be present. Since these symptoms frequently manifest themselves in the middle to late stages of the infection process, the effectiveness of phytosanitary measures can be compromised. Hyperspectral spectroscopy has the potential to be an effective, non-invasive, rapid, cost-effective, high-throughput approach for improving BCK diagnostics. This study aimed to investigate the potential of hyperspectral UV-VIS reflectance for in-situ, non-destructive discrimination of bacterial canker on kiwi leaves. Spectral reflectance (325-1075 nm) of twenty plants were obtained with a handheld spectroradiometer in two commercial kiwi orchards located in Portugal, for 15 weeks, totaling 504 spectral measurements. Several modeling approaches based on continuous hyperspectral data or specific wavelengths, chosen by different feature selection algorithms, were tested to discriminate BCK on leaves. Spectral separability of asymptomatic and symptomatic leaves was observed in all multi-variate and machine learning models, including the FDA, GLM, PLS, and SVM methods. The combination of a stepwise forward variable selection approach using a support vector machine algorithm with a radial kernel and class weights was selected as the final model. Its overall accuracy was 85%, with a 0.70 kappa score and 0.84 F-measure. These results were coherent with leaves classified as asymptomatic or symptomatic by visual inspection. Overall, the findings herein reported support the implementation of spectral point measurements acquired in situ for crop disease diagnosis.Entities:
Keywords: Pseudomonas syringae; actinidia; feature selection; hyperspectral spectroscopy; in-situ diagnosis; leaf bacterial canker; plant pathology; support vector machine
Year: 2022 PMID: 36015456 PMCID: PMC9414239 DOI: 10.3390/plants11162154
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Typical symptoms of Bacterial Canker of Kiwi (BCK) caused by Pseudomonas syringae pv. actinidiae (Psa) on the adaxial (a) and abaxial (b) sides of leaves in an advanced stage of the disease.
Figure 2Representation of the spectra collected (a), and after its filtering (b) using the MSC log algorithm.
Selected discriminative wavelengths for model development.
| Method | Selected Discriminative Wavelengths (nm) |
|---|---|
|
| 326, 327, 329, 330, 335, 336, 352, 359, 360, 364, 365, 408, 562, 583, 762, 777, 778, 779, 786, 828, 897, 908, 923, 995, 1018, 1031, 1038, 1045, 1057, 1059, 1061, 1067, 1068 |
|
| 388, 401, 406, 414, 415, 419, 443, 446, 510, 515, 556, 671, 724, 754, 759, 781, 794, 807, 969, 970, 981, 983, 1009, 1027, 1031, 1032, 1035, 1045, 1048, 1049, 1050, 1053, 1066, 1068, 1070 |
|
| 424, 464, 549, 716, 753, 759, 935 |
|
| 388, 414, 415, 419, 443, 510, 759, 794, 970, 981, 982, 1001, 1031, 1035, 1045, 1048, 1049, 1050, 1053, 1066 |
|
| 329, 369, 375, 510, 531, 536, 617, 671, 771, 772, 778, 903, 932, 959, 969, 970, 1045, 1048, 1050, 1052, 1061, 1070 |
SFFS + JM sequential forward floating selection using Jeffries–Matusita Distance; SFVS—Stepwise forward variable selection; glmStepAIC—Generalized linear model with stepwise feature selection; LASSO—Lasso regression (glmnet).
Validation results for models classifying bacterial canker of kiwi (BCK) disease.
| Feature | Model | Validation Set | Statistics of Validation Sets | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | BT | CT | Mean | CV | ||||||||||||
| Acc | K | F1 | Acc | K | F1 | Acc | K | F1 | Acc | K | F1 | Acc | K | F1 | ||
|
| PLS | 0.7083 | 0.4047 | 0.6589 | 0.6806 | 0.3329 | 0.7356 | 0.7292 | 0.3536 | 0.5412 | 0.7060 | 0.3637 | 0.6452 | 3.4530 | 10.1605 | 15.1756 |
| N = 751 | SVM-L | 0.8274 | 0.6444 | 0.7883 | 0.8012 | 0.6154 | 0.8313 | 0.8403 | 0.6167 | 0.7262 | 0.8230 | 0.6255 | 0.7819 | 2.4209 | 2.6188 | 6.7574 |
| SVM-LW | 0.8115 | 0.6274 | 0.8104 | 0.7917 | 0.5464 | 0.8421 | 0.8264 | 0.6324 | 0.7685 | 0.8099 | 0.6021 | 0.8070 | 2.1494 | 8.0180 | 4.5747 | |
| SVM-R | 0.7857 | 0.5628 | 0.7500 | 0.7593 | 0.5015 | 0.7969 | 0.8056 | 0.5435 | 0.6818 | 0.7835 | 0.5359 | 0.7429 | 2.9643 | 5.8482 | 7.7908 | |
|
| SVM-RW | 0.8056 | 0.6066 | 0.7822 | 0.7778 | 0.5367 | 0.8154 | 0.8264 | 0.6073 | 0.7368 | 0.8033 | 0.5835 | 0.7781 | 3.0356 | 6.9508 | 5.0708 |
| N = 7 | FDA | 0.7698 | 0.5339 | 0.7411 | 0.7546 | 0.4876 | 0.7969 | 0.7812 | 0.5013 | 0.6631 | 0.7685 | 0.5076 | 0.7337 | 1.7364 | 4.6856 | 9.1599 |
| N = 20 | glmStepAIC | 0.8147 | 0.6243 | 0.8342 | 0.7824 | 0.5471 | 0.7283 | 0.8392 | 0.6318 | 0.8814 | 0.8121 | 0.6011 | 0.8049 | 3.5081 | 7.8006 | 13.4507 |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| GLM | 0.7937 | 0.5806 | 0.7636 | 0.7454 | 0.4754 | 0.7826 | 0.8299 | 0.6121 | 0.7380 | 0.7897 | 0.5560 | 0.7614 | 5.3686 | 12.8742 | 2.9395 |
| N = 35 | PLS | 0.7679 | 0.5249 | 0.7247 | 0.7685 | 0.527 | 0.7984 | 0.7674 | 0.4553 | 0.6215 | 0.7679 | 0.5024 | 0.7149 | 0.0717 | 8.1217 | 12.4302 |
| SVM-L | 0.7619 | 0.5115 | 0.7143 | 0.7454 | 0.4942 | 0.7769 | 0.7708 | 0.4649 | 0.6292 | 0.7609 | 0.4902 | 0.7068 | 1.3715 | 4.8054 | 10.4888 | |
| SVM-R | 0.8512 | 0.6994 | 0.8344 | 0.8426 | 0.6773 | 0.864 | 0.8542 | 0.6667 | 0.7742 | 0.8485 | 0.6811 | 0.8242 | 0.8821 | 2.4494 | 5.5521 | |
| SVM-LW | 0.7897 | 0.583 | 0.7854 | 0.7778 | 0.5153 | 0.8322 | 0.8125 | 0.595 | 0.7404 | 0.7933 | 0.5644 | 0.7860 | 2.2226 | 7.6132 | 5.8401 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| GLM | 0.7202 | 0.4327 | 0.6831 | 0.7222 | 0.4109 | 0.7778 | 0.7500 | 0.4162 | 0.5955 | 0.7308 | 0.4199 | 0.6855 | 2.2794 | 2.7074 | 13.3009 |
| N = 33 | PLS | 0.7242 | 0.4355 | 0.6729 | 0.7407 | 0.4501 | 0.7926 | 0.7257 | 0.3209 | 0.4968 | 0.7302 | 0.4022 | 0.6541 | 1.2495 | 17.5938 | 22.7478 |
| SVM-L | 0.7222 | 0.4253 | 0.6517 | 0.7593 | 0.4894 | 0.8074 | 0.7153 | 0.2849 | 0.4605 | 0.7323 | 0.3999 | 0.6399 | 3.2317 | 26.1576 | 27.1545 | |
| SVM-R | 0.7639 | 0.5117 | 0.7047 | 0.7639 | 0.5184 | 0.7935 | 0.8194 | 0.5618 | 0.6829 | 0.7824 | 0.5306 | 0.6270 | 4.0955 | 5.1256 | 31.9489 | |
| SVM-LW | 0.7381 | 0.4637 | 0.6887 | 0.7639 | 0.4984 | 0.8118 | 0.7188 | 0.2957 | 0.4706 | 0.7403 | 0.4193 | 0.6570 | 3.0567 | 25.8569 | 26.2985 | |
| SVM-RW | 0.8075 | 0.6057 | 0.7707 | 0.7824 | 0.5532 | 0.8127 | 0.8333 | 0.6022 | 0.7176 | 0.8077 | 0.5870 | 0.7670 | 3.1509 | 5.0002 | 6.2135 | |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| GLM | 0.7560 | 0.5056 | 0.7248 | 0.7176 | 0.4021 | 0.7732 | 0.7847 | 0.4973 | 0.6517 | 0.7528 | 0.4683 | 0.7166 | 4.4724 | 12.2796 | 8.5361 |
| N = 22 | PLS | 0.7560 | 0.5028 | 0.7172 | 0.7407 | 0.4501 | 0.7926 | 0.7674 | 0.437 | 0.5939 | 0.7547 | 0.4633 | 0.7012 | 1.7752 | 7.5177 | 14.3045 |
| SVM-L | 0.7599 | 0.5127 | 0.7269 | 0.7361 | 0.4393 | 0.7897 | 0.7778 | 0.4725 | 0.6279 | 0.7579 | 0.4748 | 0.7148 | 2.7601 | 7.7407 | 11.4114 | |
| SVM-R | 0.8353 | 0.6654 | 0.8118 | 0.8009 | 0.5842 | 0.8352 | 0.8611 | 0.6774 | 0.7778 | 0.8324 | 0.6423 | 0.8083 | 3.6282 | 7.8933 | 3.5709 | |
| SVM-LW | 0.7639 | 0.523 | 0.7373 | 0.7269 | 0.4217 | 0.4807 | 0.7917 | 0.5213 | 0.6739 | 0.7608 | 0.4887 | 0.6306 | 4.2728 | 11.8692 | 21.1945 | |
| SVM-RW | 0.8373 | 0.6708 | 0.8178 | 0.8009 | 0.5828 | 0.8365 | 0.8646 | 0.6913 | 0.7914 | 0.8343 | 0.6483 | 0.8152 | 3.8307 | 8.8915 | 2.7795 | |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
CV—Coefficient of Variation; Acc—Accuracy; F1—F-measure; GLM—Generalized linear model; glmStepAIC—Generalized linear model with stepwise feature selection; FDA—Flexible discriminant analysis; K—Kappa; LASSO—Lasso regression (glmnet); PLS—Partial least squares; SFFS + JM—Sequential forward floating selection using Jeffries–Matusita distance; SFVS—Stepwise forward variable selection; SVM—Support vector machine (L—Linear kernel; LW—Linear kernel with class weights; R—Radial kernel; RW—Radial kernel with class weights).
Confusion matrix for the selected model characterized by executing SFVS followed by an SVM algorithm with radial kernel and class weights (stepsvmrw) using the BT, CT, and complete dataset.
| BT ( | CT ( | ALL ( | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Actual value | Actual value | Actual value | |||||||||
| ‘No’ | ‘Yes’ | ‘No’ | ‘Yes’ | ‘No’ | ‘Yes’ | ||||||
| Predicted | ‘No’ | 71 | 15 | Predicted | ‘No’ | 169 | 19 | Predicted | ‘No’ | 240 | 33 |
| ‘Yes’ | 18 | 112 | ‘Yes’ | 23 | 77 | ‘Yes’ | 41 | 190 | |||
‘No’ and ‘Yes’ correspond to asymptomatic and symptomatic leaves, respectively.
Figure 3Percentage of correct classification predictions as ‘asymptomatic’ by date and test site using the SFVS strategy, followed by an SVM algorithm with radial kernel and class weights (stepsvmrw model). Values of BT site are represented with triangles and CT with circles. DOY—Day of the year.
Figure 4(a) Median of the spectra of the 25% observations best classified as ‘asymptomatic’ (green) and ‘symptomatic’ (red) for the selected model combining the SFVS with SVM with radial kernel and class weights (stepsvmrw); (b) Variance of the reflectance data measured by spectral wavelength and class (green line representing the variance in the mean spectra of ‘asymptomatic’ samples, and red line illustrating the variance in the mean data of ‘symptomatic’ leaves).
Number of observations (leaves and plants) per test site and symptomatology.
| Test Site | Sites | Dates | Plants | Asymptomatic Leaves | Symptomatic Leaves | Total Measurements |
|---|---|---|---|---|---|---|
| Briteiros (BT) | 1 | 9 | 8 | 89 | 127 | 216 |
| Caldas das Taipas (CT) | 1 | 8 | 12 | 192 | 96 | 288 |
| Total | 2 | 9 | 20 | 281 | 223 | 504 |
Figure 5Conceptual diagram for the predictive modeling approaches of bacterial canker of kiwi (BCK).