| Literature DB >> 34349161 |
Tao Lu1,2, Baokun Han2, Lipin Chen3, Fanqianhui Yu4, Changhu Xue3,5.
Abstract
A generic intelligent tomato classification system based on DenseNet-201 with transfer learning was proposed and the augmented training sets obtained by data augmentation methods were employed to train the model. The trained model achieved high classification accuracy on the images of different quality, even those containing high levels of noise. Also, the trained model could accurately and efficiently identify and classify a single tomato image with only 29 ms, indicating that the proposed model has great potential value in real-world applications. The feature visualization of the trained models shows their understanding of tomato images, i.e., the learned common and high-level features. The strongest activations of the trained models show that the correct or incorrect target recognition areas by a model during the classification process will affect its final classification accuracy. Based on this, the results obtained in this study could provide guidance and new ideas to improve the development of intelligent agriculture.Entities:
Year: 2021 PMID: 34349161 PMCID: PMC8338978 DOI: 10.1038/s41598-021-95218-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The number of images used in different training sets.
| Tomato type | Training set 1 | Training set 2 | Training set 3 | Training set 4 | Training set 5 |
|---|---|---|---|---|---|
| Tomato 1 | 738 | 246 | 1476 | 2952 | 2952 |
| Tomato 2 | 672 | 225 | 1344 | 2688 | 2688 |
| Tomato 3 | 738 | 246 | 1476 | 2952 | 2952 |
| Tomato 4 | 479 | 160 | 958 | 1916 | 1916 |
| Cherry red | 492 | 164 | 984 | 1968 | 1968 |
| Heart | 684 | 228 | 1368 | 2736 | 2736 |
| Maroon | 367 | 127 | 734 | 1468 | 1468 |
| Tomato not ripened | 474 | 158 | 948 | 1896 | 1896 |
| Yellow | 459 | 153 | 918 | 1836 | 1836 |
| Total | 5103 | 1707 | 10,206 | 20,412 | 20,412 |
For each type of tomato in each training set, 1/5 of the images were used for validation and the remaining 4/5 of the images were used for training.
The number of images used in different testing sets.
| Tomato type | Testing set 1 | Testing set 2 | Testing set 3 ( | … | Testing set 13 ( |
|---|---|---|---|---|---|
| Tomato 1 | 246 | 738 | 246 | … | 246 |
| Tomato 2 | 225 | 672 | 225 | … | 225 |
| Tomato 3 | 246 | 738 | 246 | … | 246 |
| Tomato 4 | 160 | 479 | 160 | … | 160 |
| Cherry red | 164 | 492 | 164 | … | 164 |
| Heart | 228 | 684 | 228 | … | 228 |
| Maroon | 127 | 367 | 127 | … | 127 |
| Tomato not ripened | 158 | 474 | 158 | … | 158 |
| Yellow | 153 | 459 | 153 | … | 153 |
| Total | 1707 | 5103 | 1707 | … | 1707 |
A total of 11 testing sets, from testing set 3 to 13, were added with different levels of noise, and the noise addition was increased from M = 0 to M = 1.0, with an increment of 0.1.
Performance comparison of five CNN-based models on different datasets.
| NasNet-Mobile | Xception | DenseNet-201 | Inception-Resnetv2 | Inception-v3 | |
|---|---|---|---|---|---|
| Accuracy (%) | 98.95 | 99.94 | 100.00 | 99.94 | 99.12 |
| Training time (s) | 24,875 | 49,007 | 28,706 | 55,022 | 12,030 |
| Testing time (s) | 126 | 119 | 53 | 121 | 29 |
| Accuracy (%) | 88.97 | 91.63 | 96.16 | 90.40 | 92.16 |
| Training time (s) | 8126 | 17,704 | 13,340 | 21,347 | 4397 |
| Testing time (s) | 478 | 876 | 157 | 392 | 80 |
Figure 1Tomato images with the addition of different levels of Gaussian white noise.
Figure 2Performance variation of four DenseNet-201-based models (trained on different training sets) on twelve testing sets with different levels of noise.
Figure 3Sample images of nine types of tomatoes and feature visualization of the last fully connected layer of each trained model.
Figure 4Randomly selected tomato images containing different levels of noise from testing sets 1, 9, and 13, and their corresponding strongest activations images generated by each trained model.