| Literature DB >> 35620676 |
Zheli Wang1,2, Wenqian Huang1, Xi Tian1, Yuan Long1, Lianjie Li1, Shuxiang Fan1.
Abstract
The aged seeds have a significant influence on seed vigor and corn growth. Therefore, it is vital for the planting industry to identify aged seeds. In this study, hyperspectral reflectance imaging (1,000-2,000 nm) was employed for identifying aged maize seeds using seeds harvested in different years. The average spectra of the embryo side, endosperm side, and both sides were extracted. The support vector machine (SVM) algorithm was used to develop classification models based on full spectra to evaluate the potential of hyperspectral imaging for maize seed detection and using the principal component analysis (PCA) and ANOVA to reduce data dimensionality and extract feature wavelengths. The classification models achieved perfect performance using full spectra with an accuracy of 100% for the prediction set. The performance of models established with the first three principal components was similar to full spectrum models, but that of PCA loading models was worse. Compared to other spectra, the two-band ratio (1,987 nm/1,079 nm) selected by ANOVA from embryo-side spectra achieved a better classification accuracy of 95% for the prediction set. The image texture features, including histogram statistics (HS) and gray-level co-occurrence matrix (GLCM), were extracted from the two-band ratio image to establish fusion models. The results demonstrated that the two-band ratio selected from embryo-side spectra combined with image texture features achieved the classification of maize seeds harvested in different years with an accuracy of 97.5% for the prediction set. The overall results indicated that combining the two wavelengths with image texture features could detect aged maize seeds effectively. The proposed method was conducive to the development of multi-spectral detection equipment.Entities:
Keywords: ANOVA; SVM - support vector machine; classification; hyperspectral imaging; maize seeds
Year: 2022 PMID: 35620676 PMCID: PMC9127793 DOI: 10.3389/fpls.2022.849495
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1The original reflectance spectra of different sides of single maize seed. (A) Embryo side, (B) endosperm side, and (C) both sides.
The classification results based on original spectra using SVM algorithm.
| Spectral type | Parameters | Classification accuracy | |
| Calibration set | Prediction set | ||
| Embryo side | c 0.5 g 4 | 100 | 100 |
| Endosperm side | c 128 g 8 | 100 | 100 |
| Both sides | c 0.25 g 64 | 100 | 100 |
Abbreviations: PCs: principal components. c: the penalty coefficient. g: the kernel function parameter.
FIGURE 2The results of PCA analysis of embryo-side original reflectance spectra. (A) Scatter plot of first three principal components, and (B) loading plots for the first three principal components. Abbreviations: PC1: the first principal component. PC2: the second principal component. PC3: the third principal component. FWs: the feature wavelengths.
The results of feature wavelength selection from different spectral types based on loading of PC3.
| Spectral type | Feature wavelengths |
| Embryo side | 1111 nm 1198 nm 1310 nm 1151 nm |
| Endosperm side | 1104 nm 1197 nm 1304 nm 1518 nm |
| Both sides | 1111 nm 1198 nm 1310 nm 1151 nm |
The classification results based on the first three PCs and the loading of PC3 using SVM algorithm.
| Spectral type | Model | Parameters | Classification accuracy (%) | |
| Calibration set | Prediction set | |||
| Embryo side | PCs | c 0.5 g 4 | 100 | 100 |
| Loading | c 1024 g 4 | 85.83 | 85.83 | |
| Endosperm side | PCs | c 512 g 0.5 | 99.17 | 99.17 |
| Loading | c 16 g 64 | 68.75 | 71.67 | |
| Both sides | PCs | c 0.5 g 16 | 100 | 100 |
| Loading | c 512 g 4 | 72.50 | 80.83 | |
Abbreviations: PCs: principal components. c: the penalty coefficient. g: the kernel function parameter.
The above-mentioned results indicated that SVM combined with the first three PCs based on the embryo-side and both-side spectra could establish perfect classifiers to classify maize seeds harvested in different years. However, PCs are a linear combination of the full spectra. In terms of rapid detection equipment development, this method still needs to extract the full spectra to establish a classification model, which cannot effectively reduce the development cost and model complexity. Therefore, it is necessary to find a more effective data dimension reduction method for further study.
FIGURE 3The contour plots of F-value calculated from different waveband ratio combinations. The color change from blue to red represents the F-value increases from low to high. (A) Embryo side, (B) endosperm side, and (C) both sides.
FIGURE 4The distribution of two-band ratio for different samples. (A) Embryo side, (B) endosperm side, and (C) both sides.
The classification results using threshold values based on the two-band ratio values.
| Spectral type | Two-band ratio | Threshold | Classification accuracy (%) | |
| Calibration set | Prediction set | |||
| Embryo side | 1987 nm/1079 nm | t1 0.8046 t2 0.8784 | 95.83 | 95.00 |
| Endosperm side | 1011 nm/1987 nm | t1 1.0140 t2 1.0550 | 76.67 | 72.50 |
| Both sides | 1980 nm/1048 nm | t1 0.8631 t2 0.9174 | 91.67 | 89.17 |
Abbreviations: t1: the first threshold value; t2: the second threshold value.
Compared to the classification results obtained by PCA-SVM models, the number of wavelengths selected by ANOVA was significantly lower. The result provides a more efficient and cost-effective solution for the development of a maize seed classification approach based on hyperspectral imaging technology. However, a two-band ratio alone may not provide sufficient information, and it is necessary to explore more features to improve the classification accuracy.
FIGURE 5Comparison of the images obtained by using the hyperspectral image.
The classification results based on various feature variables using SVM algorithm.
| Spectral type | Two-band ratio | Model | Variable number | Classification accuracy (%) | |
| Calibration set | Prediction set | ||||
| Embryo side | 1987 nm/1079 nm | Two-band ratio | 1 | 96.67 | 95 |
| Image textures | 10 | 65 | 59.17 | ||
| Data fusion | 11 | 98.75 | 97.5 | ||
| Endosperm side | 1011 nm/1987 nm | Two-band ratio | 1 | 75.83 | 73.33 |
| Image textures | 10 | 58.33 | 44.17 | ||
| Data fusion | 11 | 79.17 | 80 | ||
The above-mentioned results showed that the proposed method can be used to classify the maize seeds harvested in different years. However, only the new and aged seeds need to be identified for general production requirements. Therefore, the maize seeds harvested in 2020 were defined as new seeds, and the remaining were classified as aged seeds. Then, the classification model was built according to the proposed method. This model showed better performance with an accuracy of 99.17% in the prediction set. It is also clear from
FIGURE 6The confusion matrix of the data fusion model based on embryo side spectra. (A) is the classification results of maize seed harvested in 2018, 2019 and 2020. (B) is the classification results of new (2020) and aged (2018 and 2019) maize seed.