| Literature DB >> 31500333 |
Susu Zhu1,2, Lei Zhou3,4, Pan Gao5, Yidan Bao6,7, Yong He8,9, Lei Feng10,11.
Abstract
Cotton seed purity is a critical factor influencing the cotton yield. In this study, near-infrared hyperspectral imaging was used to identify seven varieties of cotton seeds. Score images formed by pixel-wise principal component analysis (PCA) showed that there were differences among different varieties of cotton seeds. Effective wavelengths were selected according to PCA loadings. A self-design convolution neural network (CNN) and a Residual Network (ResNet) were used to establish classification models. Partial least squares discriminant analysis (PLS-DA), logistic regression (LR) and support vector machine (SVM) were used as direct classifiers based on full spectra and effective wavelengths for comparison. Furthermore, PLS-DA, LR and SVM models were used for cotton seeds classification based on deep features extracted by self-design CNN and ResNet models. LR and PLS-DA models using deep features as input performed slightly better than those using full spectra and effective wavelengths directly. Self-design CNN based models performed slightly better than ResNet based models. Classification models using full spectra performed better than those using effective wavelengths, with classification accuracy of calibration, validation and prediction sets all over 80% for most models. The overall results illustrated that near-infrared hyperspectral imaging with deep learning was feasible to identify cotton seed varieties.Entities:
Keywords: classifier; convolution neural network; cotton seed; near-infrared hyperspectral imaging; residual network
Mesh:
Year: 2019 PMID: 31500333 PMCID: PMC6766998 DOI: 10.3390/molecules24183268
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Average spectra of seven varieties of cotton seeds with standard deviation of four wavelengths (peaks: 1119 and 1308 nm; valleys: 1204 and 1470 nm).
Figure 2Pseudo raw image of the seven varieties of cotton seeds and the PCA score images of the first ten PCs. The letter (a) represent the pseudo raw image (1000, 1200 and 1400 nm); (b–k) represent the PCA score images of PC1–PC10. Numbers in the brackets are percentage of explained total variance.
Figure 3Effective wavelengths selection using the first ten PCs. The letters from (a–j) represent the PCs from PC1 to PC 10.
Effective wavenumbers selected by PCA loadings.
| Methods | No. | Effective Wavelengths (nm) |
|---|---|---|
| PCA loadings | 43 | 1009, 1025, 1032, 1052, 1069, 1082, 1096, 1116, 1119, 1123, 1126, |
Results of classification models using full spectra and effective wavelengths.
| Classifier | Full Spectra (%) | Effective Wavelengths (%) | ||||
|---|---|---|---|---|---|---|
| Calibration | Validation | Prediction | Calibration | Validation | Prediction | |
| CNN-SoftMax a | 91.191 | 89.065 | 88.838 | 87.629 | 84.071 | 82.860 |
| CNN-LR | 94.060 | 88.611 | 87.752 | 90.070 | 83.731 | 83.276 |
| CNN-PLS-DA | 91.112 | 88.082 | 86.644 | 87.088 | 82.709 | 82.027 |
| CNN-SVM | 93.695 | 89.255 | 88.006 | 89.970 | 84.487 | 84.260 |
| ResNet-SoftMax | 95.381 | 85.698 | 86.039 | 92.273 | 79.985 | 79.228 |
| ResNet-LR | 99.585 | 84.335 | 82.324 | 98.238 | 76.040 | 75.952 |
| ResNet-PLS-DA | 95.130 | 85.585 | 85.358 | 91.707 | 78.509 | 77.677 |
| ResNet-SVM | 96.325 | 85.963 | 85.887 | 94.098 | 79.153 | 79.115 |
| LR | 84.156 | 82.406 | 83.012 | 62.736 | 62.429 | 65.305 |
| PLS-DA | 81.764 | 79.947 | 80.401 | 78.870 | 77.261 | 77.147 |
| SVM | 93.557 | 89.217 | 88.422 | 89.441 | 84.147 | 84.033 |
a. CNN-SoftMax means using SoftMax function as classifier for the CNN model.
Figure 4The confusion matrices of calibration (a), validation (b) and prediction (c) for CNN-SVM model using full spectra.
Figure 5The architectures of proposed classification models: (a) The architecture of CNN-SoftMax, CNN-LR, CNN-PLS-DA and CNN-SVM; (b) the architecture of the Convolution Block; (c) the architecture of ResNet-SoftMax, ResNet-LR, ResNet-PLS-DA and ResNet-SVM; (d) the architecture of the Residual Block.