| Literature DB >> 33313551 |
Etienne David1,2, Simon Madec1,2, Pouria Sadeghi-Tehran3, Helge Aasen4, Bangyou Zheng5, Shouyang Liu2,6, Norbert Kirchgessner4, Goro Ishikawa7, Koichi Nagasawa8, Minhajul A Badhon9, Curtis Pozniak10, Benoit de Solan1, Andreas Hund4, Scott C Chapman5,11, Frédéric Baret2,6, Ian Stavness9, Wei Guo12.
Abstract
The detection of wheat heads in plant images is an important task for estimating pertinent wheat traits including head population density and head characteristics such as health, size, maturity stage, and the presence of awns. Several studies have developed methods for wheat head detection from high-resolution RGB imagery based on machine learning algorithms. However, these methods have generally been calibrated and validated on limited datasets. High variability in observational conditions, genotypic differences, development stages, and head orientation makes wheat head detection a challenge for computer vision. Further, possible blurring due to motion or wind and overlap between heads for dense populations make this task even more complex. Through a joint international collaborative effort, we have built a large, diverse, and well-labelled dataset of wheat images, called the Global Wheat Head Detection (GWHD) dataset. It contains 4700 high-resolution RGB images and 190000 labelled wheat heads collected from several countries around the world at different growth stages with a wide range of genotypes. Guidelines for image acquisition, associating minimum metadata to respect FAIR principles, and consistent head labelling methods are proposed when developing new head detection datasets. The GWHD dataset is publicly available at http://www.global-wheat.com/and aimed at developing and benchmarking methods for wheat head detection.Entities:
Year: 2020 PMID: 33313551 PMCID: PMC7706323 DOI: 10.34133/2020/3521852
Source DB: PubMed Journal: Plant Phenomics ISSN: 2643-6515
Characteristics of the experiments used to acquire images for GWHD dataset.
| Sub-dataset name | Institution | Country | Lat (°) | Long (°) | Year | No. of dates | Targeted stages | Row spacing (cm) | Sowing density (seeds·m2) | No. of genotypes |
|---|---|---|---|---|---|---|---|---|---|---|
| UTokyo_1 | NARO & UTokyo | Japan | 36.0 N | 140.0 E | 2018 | 3 | Postflowering | 15 | 186 | 66 |
| UTokyo_2 | NARO & UTokyo | Japan | 42.8 N | 143.0 E | 2016 | 6 | Flowering∗ | 12.5 | 200 | 1 |
| Arvalis_1 | Arvalis | France | 43.7 N | 5.8 E | 2017 | 3 | Postflowering-ripening | 17.5 | 300 | 20 |
| Arvalis_2 | Arvalis | France | 43.7 N | 5.8 E | 2019 | 1 | Postflowering | 17.5 | 300 | 20 |
| Arvalis_3 | Arvalis | France | 49.7 N | 3.0 E | 2019 | 3 | Postflowering-ripening | 17.5 | 300 | 4 |
| INRAE_1 | INRAE | France | 43.5 N | 1.5 E | 2019 | 1 | Postflowering | 16 | 300 | 7 |
| USask_1 | University of Saskatchewan | Canada | 52.1 N | 106 W | 2019 | 1 | n.a | 30.5 | 250 | 16 |
| RRes_1 | Rothamsted research | UK | 51.8 N | 0.36 W | 2016 | 1 | n.a | n.a | 350 | 6 |
| ETHZ_1 | ETHZ | Switzerland | 47.4 N | 8.6 E | 2018 | 1 | n.a | 12.5 | 400 | 354 |
| NAU_1 | Nanjing Agric. University | China | 31.6 N | 119.4 E | 2018 | 1 | Flowering∗ | 20 | 300 or 450 | 5 |
| UQ_1 | UQueensland | Australia | 27.5 S | 152.3 E | 2016 | 1 | Flowering -ripening | 22 | 150 | 8 |
∗Images were checked carefully to ensure that heads have fully developed and flowered.
Image characteristics of the sub-datasets comprising the GWHD dataset. All cameras looked vertically downward.
| Sub-dataset name | Vector | Camera | Focal length (mm) | Field of view (°)∗ | Shooting mode | Image size (pixels) | Distance to ground (m) | GSD (mm/px) |
|---|---|---|---|---|---|---|---|---|
| UTokyo_1 | Cart | Canon PowerShot G9 X mark II | 10 | 38.15 | Automatic | 5472 × 3648 | 1.8 | 0.43 |
| UTokyo_2 | Handheld | Olympus | 7/4 | 45.5 | Automatic | 3264 × 2488 & 4608 × 3456 | 1.7 | 0.6 |
| Arvalis_1 | Handheld | Sony alpha ILCE-6000 | 50 & 60 | 7.1 | Automatic | 6000 × 4000 | 2.9 | 0.10-0.16 |
| Arvalis_2 | Handheld | Sony RX0 | 7.7 | 9.99 | Automatic | 800 × 800† | 1.8 | 0.56 |
| Arvalis_3 | Handheld | Sony RX0 | 7.7 | 9.99 | Automatic | 800 × 800† | 1.8 | 0.56 |
| INRAE_1 | Handheld | Sony RX0 | 7.7 | 9.99 | Automatic | 800 × 800† | 1.8 | 0.56 |
| USask_1 | Minivehicle | FLIR Chameleon3 USB3 | 16 | 19.8 | Fixed | 2448 × 2048 | 2 | 0.45 |
| RRes_1 | Gantry | Prosilica GT 3300 Allied Vision | 50 | 12.8 | Automatic | 3296 × 2472 | 3-3.5§ | 0.33-0.385 |
| ETHZ_1 | Gantry | Canon EOS 5D mark II | 35 | 32.2 | Fixed | 5616 × 3744 | 3 | 0.55 |
| NAU_1 | Handheld | Sony RX0 | 24 | 16.9 | Automatic | 4800 × 3200 | 2 | 0.21 |
| UQ_1 | Handheld | Canon 550D | 55 | 17.3 | Automatic | 5184 × 3456 | 2 | 0.2 |
∗The field of view is measured diagonally. The reported measure is the half-angle. †Original images were cropped, and a subimage of size 800 × 800 was extracted from the central area. §The camera was positioned perpendicular to the ground and automatically adjusted to ensure a 2.2 m distance was maintained between the camera and canopy.
Figure 1Overview of the harmonization process conducted. Images were first rescaled using bilinear interpolation up- or downsampling techniques. Then, the rescaled images were split into 1024 × 1024 squared patches.
Figure 2Examples of wheat heads difficult to label. These examples are zoomed-in views from images contained in the dataset, with different zoom factors. It includes overlapping heads (a–c), heads at emergence (d), heads that are partly cut at the border of the image (e), and images with a low illumination (f). Note that image (d) was removed from the dataset because of the ambiguity of heads at emergence. Wheat heads in the image (e) were not labelled because less than 30% of their basal part is visible, as defined in Section 2.4.
Statistics for each component of the Global Wheat Head Detection.
| Sub-dataset name | No. of acquired images | No. of patch per image | Original GSD (mm) | Sampling factor | Used GSD (mm) | No. of labelled images | No. of labelled heads | Average no. of heads/images |
|---|---|---|---|---|---|---|---|---|
| UTokyo_1 | 994 | 1 | 0.43 | 1 | 0.43 | 994 | 29174 | 29 |
| UTokyo_2 | 30 | 4 | 0.6 | 2 | 0.3 | 120 | 3263 | 27 |
| Arvalis_1 | 239 | 6 | 0.23 | 0.5 | 0.46 | 1055∗ | 45716 | 43 |
| Arvalis_2 | 51 | 4 | 0.56 | 2 | 0.28 | 204 | 4179 | 20 |
| Arvalis_3 | 152 | 4 | 0.56 | 2 | 0.28 | 608 | 16665 | 27 |
| INRAE_1 | 44 | 4 | 0.56 | 2 | 0.28 | 176 | 3701 | 21 |
| USask_1 | 100 | 2 | 0.45 | 1 | 0.45 | 200 | 5737 | 29 |
| RRes_1 | 72 | 6 | 0.33 | 1 | 0.33 | 432 | 20236 | 47 |
| ETHZ_1 | 375 | 2 | 0.55 | 1 | 0.55 | 747∗ | 51489 | 69 |
| NAU_1 | 20 | 1 | 0.21 | 1 | 0.21 | 20 | 1250 | 63 |
| UQ_1 | 142 | 1 | 0.2 | 0.5 | 0.4 | 142 | 7035 | 50 |
| Total | 2219 | — | — | — | — | 4698 | 188445 | — |
∗Some labelled images have been removed during the labelling process.
Figure 3Distribution of the number of bounding boxes per image (a) and bounding boxes size∗ (b) in the GWHD dataset. ∗The bounding box size is defined as the square root of the bounding box area in pixel.
Figure 4Example of images from different acquisition sites after cropping and rescaling.
Figure 5A selection of bounding boxes for each sub-dataset. The same size of pixels is used across all the bounding boxes displayed.
Figure 6Comparison of GWHD dataset with other object detection datasets. Both axes are in log-scale.
The minimum metadata that should be associated with images of wheat heads.
| Session level | Image level | |
|---|---|---|
| Experiment metadata | Name of the experiment (PUID)† | Microplot id |
| Name of institution | Row spacing | |
| GPS coordinates (°) | Sowing density | |
| Email address of the contact person | Name of the genotype (or any identifier)† | |
| Date of the session (yyyymmdd) | Presence or not of awns | |
| Wheat species (durum, aestivum …)∗ | ||
| Development stage/ripening stage∗ | ||
|
| ||
| Acquisition metadata |
| Camera aperture |
| Name | Shutter speed | |
| Type (handheld, cart, phenomobile, gantry, UAV) | ISO | |
| Sampling procedure | Distance from camera to canopy (m) | |
| Distance to the ground (m)∗ | Position of the image in the microplot§ | |
|
| ||
| Model | ||
| Focal length of the lens (mm) | ||
| Size of the pixel at the sensor matrix ( | ||
| Sensor dimensions (pixels × pixels) | ||
∗This may be alternatively reported at the image level if it is variable within a session. †Persistent unique identifier (PUID). This may be a DOI as for genetic resources regulated under the on Plant Genetic Resources for Food and Agriculture (https://ssl.fao.org/glis) or any other identifier including the information of the maintainer of the genetic material, ripening stage. ‡The distance between camera and canopy is an essential piece of information to harmonize dataset and calculate the density and should be carefully monitored during an acquisition. §In case of multiple images over the same microplot.