| Literature DB >> 29118821 |
Hao Lu1, Zhiguo Cao1, Yang Xiao1, Bohan Zhuang2, Chunhua Shen2.
Abstract
BACKGROUND: Accurately counting maize tassels is important for monitoring the growth status of maize plants. This tedious task, however, is still mainly done by manual efforts. In the context of modern plant phenotyping, automating this task is required to meet the need of large-scale analysis of genotype and phenotype. In recent years, computer vision technologies have experienced a significant breakthrough due to the emergence of large-scale datasets and increased computational resources. Naturally image-based approaches have also received much attention in plant-related studies. Yet a fact is that most image-based systems for plant phenotyping are deployed under controlled laboratory environment. When transferring the application scenario to unconstrained in-field conditions, intrinsic and extrinsic variations in the wild pose great challenges for accurate counting of maize tassels, which goes beyond the ability of conventional image processing techniques. This calls for further robust computer vision approaches to address in-field variations.Entities:
Keywords: Computer vision; Convolutional neural networks; Deep learning; Maize tassels; Object counting
Year: 2017 PMID: 29118821 PMCID: PMC5664836 DOI: 10.1186/s13007-017-0224-0
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Fig. 1Intrinsic and extrinsic variations in the maize field. These variations pose significant challenges for in-field counting of maize tassels. a Shape and size vary significantly as plants grow over time. b Appearance variations due to different cultivars. c Illumination variations due to different weather conditions. d Pose variations due to wind, imaging and perspective distortions. e Occlusions between leaves and tassels or different tassels. f Cluttered background caused by wires, poles and weeds. g Image degradation due to dust or rain drops on the camera lens. h Texture variations due to different flowering status
Fig. 2The main technical pipeline of in-field counting of maize tassels. Sub-images are first densely sampled from a raw field image. Each sub-image will be fed into our TasselNet to regress a local count associating with the sub-image. After merging and normalizing all local counts, a count map for the field image can be acquired. The raw image count can thus be computed by integrating the count map
Fig. 3Image acquisition devices in the maize field. Our devices are currently set up in four different places
Training set (train), validation set (val) and test set (test) settings of the MTC dataset
| Sequence |
| Cultivar | train | val | test |
|---|---|---|---|---|---|
| Zhengzhou2010 | 37 | Jundan No. 20 |
|
| |
| Zhengzhou2011 | 24 | Jundan No. 20 |
| ||
| Zhengzhou2012 | 22 | Zhengdan No. 958 |
|
| |
| Taian2010_1 | 30 | Wuyue No. 3 |
|
| |
| Taian2010_2 | 32 | Wuyue No. 3 |
| ||
| Taian2011_1 | 21 | Nongda No. 108 |
|
| |
| Taian2011_2 | 19 | Nongda No. 108 |
| ||
| Taian2012_1 | 41 | Zhengdan No. 958 |
|
| |
| Taian2012_2 | 23 | Zhengdan No. 958 |
| ||
| Taian2013_1 | 8 | Zhengdan No. 958 |
|
| |
| Taian2013_2 | 8 | Zhengdan No. 958 |
| ||
| Gucheng2012 | 15 | Jidan No. 32 |
|
| |
| Gucheng2014 | 45 | Zhengdan No. 958 |
| ||
| Jalaid2015_1 | 12 | Tianlong No. 9 |
|
| |
| Jalaid2015_2 | 12 | Tianlong No. 9 |
| ||
| Jalaid2015_3 | 12 | Tianlong No. 9 |
|
Num refers to the number of images in each sequence
Fig. 4Example images in the MTC dataset with dotted annotations. Images are from the a Zhengzhou2010, b Gucheng2012, c Taian2011_1 and d Jalaid2015_1 sequences, respectively
Fig. 5Conceptual differences of different regression targets. The global count regression directly regresses the number of image counts in an image. (Local) density map regression treats the two-dimensional (local) density map as the regression target. Our proposed local count regression regresses the local count computed from the local density map (best viewed in colour)
Fig. 6An example of manually-annotated dot image (left) and its corresponding ground truth density map (right)
Fig. 7Three typical CNN architectures used in TasselNet
Default parameters setting used in our experiments
| Parameter | Remark | Value |
|---|---|---|
| Network architecture | AlexNet-like | |
| Loss function |
| |
|
| Sub-image size | 32 |
|
| Sampling stride during training |
|
|
| Sampling stride during prediction |
|
|
| Gaussian kernel parameter | 8 |
Fig. 8Two example images from the Jalaid2015_2 (a) and Jalaid2015_3 (b) sequences. Images in two sequences exhibit dramatic illumination variations, dazzling visual characteristics, as well as extremely crowded distributions, which renders great difficulties for counting even for a human expert
Fig. 9Training (train) and validation (val) errors in terms of MAE versus the number of epochs on LeNet-like, AlexNet-like and VGG-VD16-like TasselNet architectures
Comparison of different network architectures for maize tassels counting on the test set of MTC dataset
| Network | Sequences | Overall | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Zhengzhou2011 | Taian2010_2 | Taian2011_2 | Taian2012_2 | Taian2013_2 | Gucheng2014 | Jalaid2015_2 | Jalaid2015_3 | |||||||||||
| MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | |
| LeNet | 4.4 | 5.4 | 6.3 | 8.0 | 2.9 | 3.7 | 6.4 | 7.9 | 4.9 | 5.8 |
|
| 16.3 | 17.0 | 28.7 | 33.0 | 7.2 | 11.3 |
| AlexNet | 4.9 | 6.1 |
|
|
|
|
|
|
|
| 5.3 | 6.5 | 16.0 | 16.6 |
| 25.2 |
|
|
| VGG-VD16-Net |
|
| 10.6 | 12.4 | 13.1 | 15.9 | 5.5 | 10.0 | 4.3 | 5.4 | 10.0 | 11.3 | 10.7 | 11.2 | 20.8 |
| 9.3 | 12.4 |
The lowest error is italicised
Counting performance with different number of training samples () on the MTC dataset
|
| MAE | MSE |
|---|---|---|
|
| 9.5 | 14.2 |
|
| 8.5 | 13.4 |
|
| 6.6 |
|
|
|
| 10.8 |
The performance is averaged over 8 test sequences, and the lowest error is italicised
Comparison of different loss functions for maize tassels counting on the MTC dataset
| Loss | MAE | MSE |
|---|---|---|
| Huber ( | 8.5 | 12.2 |
| Huber ( | 7.5 | 10.5 |
| Huber ( | 7.3 | 10.0 |
|
| 7.3 | 10.3 |
|
|
|
|
The performance is averaged over 8 test sequences, and the lowest error is italicised
Comparison of different Gaussian kernel parameter for maize tassels counting on the MTC dataset
|
| MAE | MSE |
|---|---|---|
|
| 7.0 | 11.3 |
|
|
|
|
|
| 7.6 | 10.9 |
The performance is averaged over 8 test sequences, and the lowest error is italicised
Comparison of different sub-image sizes for maize tassels counting on the MTC dataset
|
| MAE | MSE |
|---|---|---|
|
| 9.9 | 13.4 |
|
|
|
|
|
| 6.8 | 10.8 |
|
| 6.9 | 11.5 |
The performance is averaged over 8 test sequences, and the lowest error is italicised
Fig. 10Qualitative results of ground truth density maps overlaid on original images and counting maps predicted by TasselNet. The number shown below each sub-figure denotes the tassel count integrated over the density/count map. The last row shows three unsuccessful predictions
Mean absolute errors (MAE) and mean squared errors (MSE) for maize tassels counting on the test set of MTC dataset
| Method | Sequences | Overall | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Zhengzhou2011 | Taian2010_2 | Taian2011_2 | Taian2012_2 | Taian2013_2 | Gucheng2014 | Jalaid2015_2 | Jalaid2015_3 | |||||||||||
| MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | |
| JointSeg | 20.9 | 23.2 | 46.6 | 47.9 | 16.4 | 19.7 | 25.1 | 29.8 | 6.5 | 8.0 | 7.3 | 10.5 | 27.8 | 29.1 | 53.2 | 61.3 | 24.2 | 31.6 |
| mTASSEL | 9.8 | 14.9 | 18.6 | 22.1 | 11.6 | 12.7 | 5.3 | 7.8 | 13.1 | 16.6 | 31.1 | 35.3 | 16.2 | 18.0 | 46.6 | 51.0 | 19.6 | 26.1 |
| GlobalReg | 19.0 | 21.5 | 23.0 | 24.7 | 14.1 | 16.8 | 13.5 | 15.7 | 19.6 | 25.2 | 19.5 | 21.7 | 11.2 | 13.7 | 42.1 | 45.4 | 19.7 | 23.3 |
| DensityReg | 16.1 | 20.2 | 9.9 | 10.7 | 9.2 | 11.7 | 10.8 | 12.7 | 20.2 | 23.7 | 9.4 | 10.5 |
|
| 23.5 | 26.9 | 11.9 | 14.8 |
| CCNN | 21.3 | 23.3 | 28.9 | 31.6 | 12.4 | 16.0 | 12.6 | 15.3 | 18.9 | 23.7 | 21.6 | 24.1 | 9.6 | 12.4 | 39.5 | 46.4 | 21.0 | 25.5 |
| TasselNet |
|
|
|
|
|
|
|
|
|
|
|
| 16.0 | 16.6 |
|
|
|
|
The lowest error is italicised