| Literature DB >> 35658997 |
Christian Nansen1,2, Mohammad S Imtiaz3, Mohsen B Mesgaran4, Hyoseok Lee5.
Abstract
BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge.Entities:
Keywords: Classification models; Classification performance; Machine vision; Optical sensing; Proximal sensing; Seed analysis
Year: 2022 PMID: 35658997 PMCID: PMC9164469 DOI: 10.1186/s13007-022-00912-z
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 5.827
Fig. 1Images and average reflectance profiles of tomato seeds included in this study. Photos of tomato seeds from two varieties, A and B, and five subsamples for each variety (a). Average reflectance profiles of five subsamples of tomato seed variety 1 (b) and 2 (c) included in this study
Fig. 2Average reflectance profiles from germinating and non-germinating seeds. Average profiles of non-germinating and germinating tomato seeds from variety 1 and 2 (a), and relative effects of germination (germination / non-germination) of variety 1 and 2 (b)
Fig. 3Results from experimental performance assessments of classification models. Training data set from tomato variety 1 was manipulated in three different ways, and for each manipulation, we examined the effect on accuracy of linear discriminant (LDA) and support vector machine (SVM) classification models (based on ten-fold cross validation). Object assignment error: effect of individual seeds being assigned to the wrong class (a). Spectral repeatability: effect of introducing known levels of stochastic noise to individual reflectance values (b). Size of training data set: effect of randomly reducing the number of observations in the training data set (c)
Germination data and numbers of tomato seeds included in this study
| Variety and sample | Germination results (%) | Number of seeds | ||
|---|---|---|---|---|
| Company | Our results | Training | Validation | |
| 97 | 97.0, 97.9, 95.8, 92.7 | 496 | 96, 96 | |
| 56 | 56.0, 65.3, 68.8, 58.9 | 1751 | 96, 96 | |
| 1b | 83 | 96, 96 | ||
| 1c | 82 | 96, 96 | ||
| 1d | 66 | 96, 96 | ||
| 97 | 97.0, 94.4, 96.9, 91.7 | 513 | 96, 96 | |
| 95 | 73.0, 74.0, 79.2, 82.7 | 886 | 96, 96 | |
| 2g | 91 | 96, 96 | ||
| 2h | 86 | 96, 96 | ||
| 2i | 73 | 96, 96 | ||
Tomato seeds from five subsamples of each of two varieties (1 and 2) were included in this study (10 samples, see also Fig. 1a). Two subsamples for each variety (variety 1: 1a and e, variety 2: 2f and j) were used as training data, and these are highlighted in bold. For all 10 tomato seed subsamples, we obtained germination results (%) from the seed company, and we performed four replicated germination tests of subsamples used as training data. Hyperspectral images of individual tomato seeds in training and validation data sets were acquired on different days, and two sets of validation data were acquired on separate days
Fig. 4Correlations between observed and predicted seed germination (%) based on linear discriminant (LDA) and support vector machine (SVM) classification models. Validation data (see Table 1) were used to predict tomato seed germination (%) in five seed subsamples from two varieties. We performed validations of both linear discriminant (LDA) and support vector machine (SVM) classification models. Seed germination percentages obtained from the seed company are presented as colored circles and considered “known germination”. Blue circles represent germination percentages of samples, which were used as training data. Red colored circles represent germination percentages of validation samples (not included in training data). Colored squares represent predicted germination percentages of training (blue squares) and validation (red squares) samples. Each colored symbol represents germination percentage based on 96 individual seeds
Root mean square error (RMSE) of validation results
| Function | Variety 1 | Variety 2 |
|---|---|---|
| LDA | 10.56 | 26.15 |
| SVM | 10.44 | 12.58 |
Observed and predicted results from classifications with linear discriminant analysis (LDA) and support vector machine (SVM) functions of validation samples (Fig. 4) from tomato seed varieties 1 and 2