| Literature DB >> 35010134 |
Nader Ekramirad1, Alfadhl Y Khaled1, Lauren E Doyle1, Julia R Loeb1, Kevin D Donohue2, Raul T Villanueva3, Akinbode A Adedeji1.
Abstract
Codling moth (CM) (Cydia pomonella L.), a devastating pest, creates a serious issue for apple production and marketing in apple-producing countries. Therefore, effective nondestructive early detection of external and internal defects in CM-infested apples could remarkably prevent postharvest losses and improve the quality of the final product. In this study, near-infrared (NIR) hyperspectral reflectance imaging in the wavelength range of 900-1700 nm was applied to detect CM infestation at the pixel level for three organic apple cultivars, namely Gala, Fuji and Granny Smith. An effective region of interest (ROI) acquisition procedure along with different machine learning and data processing methods were used to build robust and high accuracy classification models. Optimal wavelength selection was implemented using sequential stepwise selection methods to build multispectral imaging models for fast and effective classification purposes. The results showed that the infested and healthy samples were classified at pixel level with up to 97.4% total accuracy for validation dataset using a gradient tree boosting (GTB) ensemble classifier, among others. The feature selection algorithm obtained a maximum accuracy of 91.6% with only 22 selected wavelengths. These findings indicate the high potential of NIR hyperspectral imaging (HSI) in detecting and classifying latent CM infestation in apples of different cultivars.Entities:
Keywords: apples; codling moth; feature selection; hyperspectral imaging; machine learning; near-infrared
Year: 2021 PMID: 35010134 PMCID: PMC8750721 DOI: 10.3390/foods11010008
Source DB: PubMed Journal: Foods ISSN: 2304-8158
Figure 1Schematic of the hyperspectral imaging system.
Figure 2Flowchart of apple infestation area acquisition around calyx end for building the classification model. HIS: hyperspectral imaging; ROI: region of interest.
Figure 3Mean reflectance spectra of control and CM-infested samples acquired by near-infrared hyperspectral imaging (NIR HIS). CM: codling moth.
Figure 4Principal component analysis of two types of apple sample tissues for Fuji cultivar computed from the mean spectral of the whole fruit.
Results of the PCA-based classification of control and infested samples for training and validation sets based on mean spectra for each sample.
| Sample | Classifier 1 | Training Set (%) | Validation Set (%) | ||||
|---|---|---|---|---|---|---|---|
| Precision | Recall | Total Accuracy | Precision | Recall | Total | ||
| Stem | LDA | 95.00 | 94.00 | 94.70 | 57.00 | 58.00 | 62.50 |
| kNN | 58.00 | 57.00 | 57.90 | 36.00 | 42.00 | 62.50 | |
|
|
|
|
|
|
|
| |
| AdaBoost | 95.00 | 95.00 | 94.70 | 75.00 | 83.00 | 75.00 | |
|
|
|
|
|
|
|
| |
| Calyx | LDA | 90.00 | 90.00 | 90.50 | 78.00 | 78.00 | 77.80 |
| kNN | 63.00 | 61.00 | 61.90 | 68.00 | 68.00 | 67.00 | |
| RF | 100 | 100 | 100 | 88.00 | 83.00 | 83.30 | |
|
|
|
|
|
|
|
| |
| PLS-DA | 90.00 | 90.00 | 90.50 | 78.00 | 78.00 | 78.00 | |
| Side | LDA | 100 | 100 | 100 | 83.00 | 80.00 | 77.80 |
| kNN | 86.00 | 80.00 | 80.00 | 75.00 | 60.00 | 55.60 | |
| RF | 100 | 100 | 100 | 83.00 | 80.00 | 77.80 | |
| AdaBoost | 100 | 100 | 100 | 83.00 | 80.00 | 77.80 | |
|
|
|
|
|
|
|
| |
| All | LDA | 80.00 | 80.00 | 79.00 | 71.00 | 73.00 | 72.00 |
| kNN | 76.00 | 76.00 | 76.30 | 70.00 | 71.00 | 72.00 | |
| RF | 100 | 100 | 100 | 91.00 | 94.00 | 92.00 | |
| AdaBoost | 100 | 100 | 100 | 88.00 | 91.00 | 88.00 | |
|
|
|
|
|
|
|
| |
1 LDA: Linear Discriminant Analysis, kNN: k-Nearest Neighbors, RF: Random Forest, PLS-DA: Partial Least Squares-Discriminant Analysis. Bolded line indicates the best result.
Results of the PCA-based classification of control and infested samples for training and validation data sets based on manually selected ROI.
| Classifier 1 | Training Set (%) | Validation Set (%) | ||||
|---|---|---|---|---|---|---|
| Precision | Recall | Total Accuracy | Precision | Recall | Total Accuracy | |
| LDA | 72.20 | 79.20 | 75.24 | 71.60 | 78.40 | 74.64 |
| kNN | 100 | 99.20 | 99.52 | 99.60 | 98.80 | 99.06 |
| RF | 100 | 100 | 100 | 99.20 | 99.60 | 99.24 |
| AdaBoost | 100 | 100 | 100 | 98.00 | 98.4 | 98.20 |
| PLS-DA | 84.60 | 88.80 | 86.40 | 80.60 | 82.60 | 80.18 |
1 LDA: Linear Discriminant Analysis, kNN: k-Nearest Neighbors, RF: Random Forest, PLS-DA: Partial Least Squares-Discriminant Analysis.
Classification accuracy (%) for validation data set based on automatically selected pixels for three apple cultivars.
| Classifier | Raw Data (No Dimensionality Reduction) | PCA-Based | ||||||
|---|---|---|---|---|---|---|---|---|
| Gala | Granny | Fuji | All | Gala | Granny | Fuji | All | |
| LDA | 65.38 ± 0.62 | 72.24 ± 0.23 | 70.46 ± 0.72 | 69.22 ± 0.10 | 65.38 ± 0.62 | 70.38 ± 0.17 | 66.94 ± 0.33 | 68.70 ± 0.14 |
| SVM | 80.18 ± 0.06 | 76.42 ± 0.17 | 81.40 ± 0.44 | 72.54 ± 0.36 | 82.60 ± 0.70 | 77.20 ± 0.18 | 81.62 ± 0.33 | 73.84 ± 0.39 |
| kNN | 93.72 ± 0.19 | 93.26 ± 0.15 | 95.46 ± 0.32 | 89.12 ± 0.12 | 93.80 ± 0.15 | 93.30 ± 0.07 | 95.69 ± 0.26 | 88.84 ± 0.11 |
| RF | 89.66 ± 0.19 | 89.04 ± 0.18 | 91.52 ± 0.27 | 82.82 ± 0.14 | 94.28 ± 0.31 | 93.22 ± 0.25 | 96.62 ± 0.13 | 89.74 ± 0.13 |
| GTB | 92.32 ± 0.37 | 91.00 ± 0.25 | 94.68 ± 0.39 | 84.66 ± 0.18 | 94.76 ± 0.16 | 93.66 ± 0.18 | 97.36 ± 0.28 | 90.00 ± 0.23 |
| PLS-DA | 62.76 ± 0.66 | 71.64 ± 0.24 | 68.56 ± 0.15 | 69.14 ± 0.15 | 62.76 ± 0.66 | 71.34 ± 0.16 | 66.92 ± 0.35 | 68.72 ± 0.16 |
PCA: principal component analysis, LDA: Linear Discriminant Analysis, SVM: support vector machine, kNN: k-Nearest Neighbors, RF: Random Forest, GTB: Gradient tree boosting, PLS-DA: Partial Least Squares-Discriminant Analysis.
Classification performance of gradient tree boosting for control and infested samples for three apple cultivars based on automatically selected pixels for three apple cultivars.
| Cultivars | Classes | Precision | Recall | F1-Score | Overall Accuracy (%) |
|---|---|---|---|---|---|
| Fuji | Control | 0.98 | 0.96 | 0.97 | 97.36 |
| Infested | 0.97 | 0.98 | 0.97 | ||
| Gala | Control | 0.93 | 0.93 | 0.93 | 94.76 |
| Infested | 0.95 | 0.96 | 0.95 | ||
| Granny | Control | 0.91 | 0.90 | 0.91 | 93.46 |
| Infested | 0.95 | 0.95 | 0.95 |
Figure 5Classification performance (accuracy) as a function of the number of wavelengths.
Classification performance of selected optimal wavelengths.
| No. of Wavelengths | Gala | Granny Smith | Fuji | |||
|---|---|---|---|---|---|---|
| Selected | Classification | Selected | Classification | Selected | Classification | |
| 30 | 900.1, 903.5, 920.3, 970.6, 997.4, 100.7, 1014.1, 1071.0, 1077.7, 1261.4, 1278.1, 1281.4, 1298.1, 1324.7, 1328.1, 1361.4, 1384.7, 1408.0, 1447.9, 1464.5, 1447.8, 1477.8, 1627.1, 1647.0, 1653.7, 1657.0, 1663.6, 1666.9, 1676.8, 1693.4 | 88.5% | 900.1, 916.9, 977.2, 1010.7, 1020.8, 1030.8, 1047.6, 1074.3, 1178.0, 1181.3, 1204.7, 1274.7, 1284.7, 1294.7, 1298.1, 1304.7, 1308.1, 1371.4, 1414.6, 1471.1, 1481.1, 1494.4, 1653.7, 1660.3, 1666.9, 1673.5, 1680.2, 1683.5, 1686.8, 1693.4 | 87.7% | 977.2, 980.6, 1044.2, 1074.3, 1077.7, 1081.0, 1137.9, 1147.9, 1151.2, 1211.3, 1264.7, 1294.7, 1314.7, 1344.7, 1348.0, 1381.3, 1421.3, 1507.7, 1530.9, 1544.2, 1560.8, 1580.7, 1623.8, 1630.5, 1647.0, 1650.3, 1653.7, 1657.0, 1663.6, 1673.5 | 92.4% |
| 22 | 923.6, 973.9, 1000.7, 1067.6, 1081.0, 1084.4, 1127.8, 1268.1, 1281.4, 1308.1, 1351.4, 1401.3, 1411.3, 1461.2, 1491.1, 1607.3, 1643.7, 1663.6, 1670.2, 1676.8, 1690.1, 1693.4 | 87.8% | 903.5, 916.9, 987.3, 1047.6, 1081.0, 1131.2, 1141.2,1181.3, 1204.7, 1274.7, 1288.1, 1304.7, 1371.4, 1467.8, 1471.1, 1481.1, 1643.7, 1673.5, 1680.2, 1683.5, 1686.8, 1693.4 | 87.5% | 977.2, 983.9, 1050.9, 1064.3, 1081.0, 1151.28, 1184.6, 1228.0, 1248.1, 1288.1, 1351.4, 1447.9, 1530.9, 1554.2, 1574.1, 1590.7, 1627.1, 1647.0, 1653.7, 1657.0, 1663.6, 1680.2 | 91.6% |
| 15 | 903.5, 990.6, 997.3, 1071.0, 1084.4, 1281.4, 1294.7, 1371.4, 1384.7, 1447.9, 1477.8, 1663.6, 1673.5, 1680.2, 1690.1 | 86.2% | 1010.7, 1081.0, 1131.2, 1181.3, 1184.6, 1281.4, 1298.1, 1491.1, 1657.0, 1663.6, 1670.2, 1680.2, 1683.5, 1686.8, 1693.4 | 86.3% | 977.2, 983.9, 1050.9, 1074.3, 1081.0, 1311.4, 1381.3, 1401.3, 1447.9, 1507.7, 1627.1, 1637.1, 1647.0, 1653.7, 1673.5 | 91.0% |
| 5 | 997.3, 1084.4, 1281.4, 1663.6, 1693.4 | 81.5% | 1014.1, 1274.7, 1494.4, 1683.5, 1693.4 | 80.7% | 983.9, 1050.9, 1311.4, 1653.7, 1663.6 | 86.2% |