| Literature DB >> 34109221 |
Jennifer Hobbs1, Vachik Khachatryan2, Barathwaj S Anandan1,3, Harutyun Hovhannisyan2, David Wilson2.
Abstract
Crop monitoring and yield prediction are central to management decisions for farmers. One key task is counting the number of kernels on an ear of corn to estimate yield in a field. As ears of corn can easily have 400-900 kernels, manual counting is unrealistic; traditionally, growers have approximated the number of kernels on an ear of corn through a mixture of counting and estimation. With the success of deep learning, these human estimates can now be replaced with more accurate machine learning models, many of which are efficient enough to run on a mobile device. Although a conceptually simple task, the counting and localization of hundreds of instances in an image is challenging for many image detection algorithms which struggle when objects are small in size and large in number. We compare different detection-based frameworks, Faster R-CNN, YOLO, and density-estimation approaches for on-ear corn kernel counting and localization. In addition to the YOLOv5 model which is accurate and edge-deployable, our density-estimation approach produces high-quality results, is lightweight enough for edge deployment, and maintains its computational efficiency independent of the number of kernels in the image. Additionally, we seek to standardize and broaden this line of work through the release of a challenging dataset with high-quality, multi-class segmentation masks. This dataset firstly enables quantitative comparison of approaches within the kernel counting application space and secondly promotes further research in transfer learning and domain adaptation, large count segmentation methods, and edge deployment methods.Entities:
Keywords: UNET; YOLO; counting; dataset; density estimation; edge deployment; machine vision application; precision agriculture
Year: 2021 PMID: 34109221 PMCID: PMC8183680 DOI: 10.3389/frobt.2021.627009
Source DB: PubMed Journal: Front Robot AI ISSN: 2296-9144
Figure 1The ultimate goal of this model is count and localize the healthy kernels on ears of corn. (A) Our dataset is densely labeled with per-instance segmentation masks (area shown in faded magenta with centers in blue). (B,C) In addition to healthy (magenta) kernels, we also label barren tips (red), incomplete/underdeveloped kernels (cyan), diseased kernels (green), and kernel areas (beige). (D–L) The dataset contains a diverse range of images with different types of corn, numbers and sizes of ears and kernels, photos and cartoon-imagery, shadowing and occlusion challenges, scales and resolution, as well as kernels which are loose, and therefore should be ignored. (M) The distribution of the number of ears in each image in the full dataset.
Figure 2(Left) Distribution of the number of kernels per image in each of the three datasets. (Right) Distribution of the kernel size (i.e., area) in pixels in each of the three datasets.
Detection model performance.
| Faster R-CNN: ResNet50 | Base | 0.80 | 0.74 | 0.21 | – |
| Faster R-CNN: VGG16 | Base | 0.85 | 0.73 | 0.24 | – |
| YOLOv5: CSP | Base | 0.70 | 0.70 | 0.24 | 0.20 |
| YOLOv5: CSP, no pretraining | Base | 0.74 | 0.64 | 0.25 | 0.29 |
| YOLOv5: CSP | Narrow | 0.63 | 0.63 | 0.17 | 0.17 |
Figure 3Results from YOLOv5 model trained on the base data. Samples from the base test set are shown on the left and middle and two samples from the unseen test narrow set are shown on the far right.
Density estimation model performance.
| EfficientNet-b1 | Base | 0.12 | 0.18 | 0.19 |
| Mobilenet-v2 | Base | 0.18 | 0.19 | 0.18 |
| EfficientNet-b1 | Narrow | 0.16 | 0.18 | 0.18 |
| Mobilenet-v2 | Narrow | 0.16 | 0.15 | 0.15 |
Figure 4Results from our Density Estimation approach using an EfficientNet-b1 model. Other models are visually similar. Even though it was trained on the base dataset (left two columns), it performs quite well on the narrow (third column) and many (far right) dataset as well. Even though the number of instances may be large, the model does not struggle with memory issues and continues to be computationally efficient.