| Literature DB >> 30367590 |
Balaji Veeramani1, John W Raymond2, Pritam Chanda2.
Abstract
BACKGROUND: Maize is a leading crop in the modern agricultural industry that accounts for more than 40% grain production worldwide. THe double haploid technique that uses fewer breeding generations for generating a maize line has accelerated the pace of development of superior commercial seed varieties and has been transforming the agricultural industry. In this technique the chromosomes of the haploid seeds are doubled and taken forward in the process while the diploids marked for elimination. Traditionally, selective visual expression of a molecular marker within the embryo region of a maize seed has been used to manually discriminate diploids from haploids. Large scale production of inbred maize lines within the agricultural industry would benefit from the development of computer vision methods for this discriminatory task. However the variability in the phenotypic expression of the molecular marker system and the heterogeneity arising out of the maize genotypes and image acquisition have been an enduring challenge towards such efforts.Entities:
Keywords: Agriculture; Convolutional neural networks; Corn; Deep learning; Double haploid induction; Molecular markers
Mesh:
Year: 2018 PMID: 30367590 PMCID: PMC6101072 DOI: 10.1186/s12859-018-2267-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Corn seeds expressing the R1-nj marker in (a) Diploid and (b) Haploid seeds. The marker is only expressed in the diploid embryo as a vertical dark purple patch indicated by the arrow in (a). Variability in visual indications of marker expression, seed morphology, color and texture, embryo positioning with respect to camera, and lighting conditions across multiple (c) diploid and (d) haploid seeds
Fig. 2Schematic architecture of DeepSort Convolution network “Arch-1” used for classifying maize seeds. Input maize images are convolved with 16 filter kernels in the first convolutional layer followed by pooling and normalization layers. Outputs of these operations are again convolved with 16 kernels in the second convolutional followed by pooling, normalization and two fully connected layers
Comparison of classification accuracies of DeepSort and other classifiers. Other classifiers were tested with all features described in text (values within brackets), and using only Haralick texture features (values outside brackets). CV indicates 5-fold cross-validation
| DeepSort | Random Forest | SVM | Logistic Reg | |
|---|---|---|---|---|
| CV | 0.961 | 0.840 (0.823) | 0.857 (0.836) | 0.749 (0.777) |
| Train | 1.000 | 1.000 (0.997) | 0.911 (0.994) | 0.751 (0.786) |
| Test | 0.968 | 0.845 (0.824) | 0.876 (0.839) | 0.775 (0.772) |
Confusion Matrix for DeepSort and SVM
| DeepSort | SVM | |||
|---|---|---|---|---|
| Pred-Diploid | Pred-Haploid | Pred-Diploid | Pred-Haploid | |
| True-Diploid | 556 | 11 | 545 | 22 |
| True-Haploid | 12 | 131 | 66 | 77 |
Pred: Predicted label; True: Actual label; Using test data (143 haploids, 567 diploids)
Fig. 3Figure (a, b) shows the activations of all neurons in the convolutional layer 1 (each row corresponds to the activations that share a kernel) across images of 15 random diploid (a) and haploid (b) seeds (each column for a seed shown at the top row) from the test data set. Similar to figures (a, b), figures (c, d) shows the activations of neurons in the convolutional layer 2 across the same set of seeds. Kernels in the convolutional layers 1 and 2 perform various feature extractions and their complex compositions. For example, kernels 3 of first layer segments the seed from the background, and kernel 5 of the second layer provides discriminatory features (for other examples see text). Figure (e) shows visualizations of 16 kernels from the convolutional layer 1
Effect of CNN architecture on classification accuracy (cols. 2,3) and number of parameters per layer (cols. 4-9) in each architecture
| Method | Train | Test | Conv1 | Conv2 | Full1 | Full2 | Output | Total |
|---|---|---|---|---|---|---|---|---|
| Arch-1 | 1.000 | 0.968 | 1216 | 6416 | 786,624 | 18,528 | 194 | 812,978 |
| Arch-2 | 1.000 | 0.968 | 608 | 1608 | 196,704 | 4656 | 98 | 203,674 |
| Arch-3 | 0.992 | 0.941 | 304 | 404 | 49,200 | 1176 | 50 | 51,134 |
| Arch-4 | 0.989 | 0.935 | 304 | - | 98,328 | - | 50 | 98,682 |
Conv[1/2]: [first/second] convolution layers, Full[1/2] : [first/second] fully connected layers, output: final softmax layer