| Literature DB >> 30788393 |
Douglas F Pereira1, Pedro H Bugatti1, Fabricio M Lopes1, André L S M Souza2, Priscila T M Saito1,3.
Abstract
Agribusiness has a great relevance in the world׳s economy. It generates a considerable impact in the gross national product of several nations. Hence, it is the major driver of many national economies. Nowadays, from each new planting to harvesting process it is mandatory and crucial to apply some kind of technology to optimize a given singular process, or even the entire cropping chain. For instance, digital image analysis joined with machine learning methods can be applied to obtain and guarantee a higher quality of the harvest, leading to not only a greater profit for producers, but also better products with lower cost to the final consumers. Thus, to provide this possibility this work describes a visual feature dataset from soybean seed images obtained from the tetrazolium test. This is a test capable to define how healthy a given seed is (e.g. how much the plant will produce, or if it is resistant to inclement weather, among others). To answer these questions we proposed this dataset which is the cornerstone to provide an effective classification of the soybean seed vigor (i.e. an extremely tiresome human visual inspection process). Besides, as one of the most prominent international commodity, the soybean production must follow rigid quality control process to be part of world trade. Hence, small mistakes in the seed vigor definition of a given seed lot can lead to huge losses.Entities:
Keywords: classication; quality control; soybean seed data; tetrazolium test; visual features
Year: 2019 PMID: 30788393 PMCID: PMC6369404 DOI: 10.1016/j.dib.2018.12.090
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Example of a sheet with seeds.
Fig. 2Pipeline adopted for processing the seed sheets.
Fig. 3Seed sheet after preprocessing.
Fig. 4Examples of external and internal portions of seed samples. (a) without damage (perfect). (b) with bug damage. (c) with humidity damage. (d) with mechanical damage.
Description of the extractors, types and number of features.
| Extractor | Description | Type | #Features |
|---|---|---|---|
| BIC | Border/interior classification | Color | 128 |
| GCH | Global color histogram | Color | 66 |
| Haralick | Haralick׳s descriptors | Texture | 5 |
| LBP | Local binary patterns | Texture | 256 |
Description and distribution of samples of each image class obtained in the first acquisition.
| Classes | Description | Samples |
|---|---|---|
| 0PE | External portion w/o damage (perfect) | 502 |
| 0PI | Internal portion w/o damage (perfect) | 529 |
| 2HE | External portion w/ humidity damage - level 2 | 23 |
| 2HI | Internal portion w/ humidity damage - level 2 | 7 |
| 3ME | External portion w/ mechanical damage - level 3 | 36 |
| 3MI | Internal portion w/ mechanical damage - level 3 | 28 |
| 3BE | External portion w/ bug damage - level 3 | 83 |
| 3BI | Internal portion w/ bug damage - level 3 | 40 |
| 3HE | External portion w/ humidity damage - level 3 | 36 |
| 3HI | Internal portion w/ humidity damage - level 3 | 49 |
Description and distribution of samples of each image class obtained in the second acquisition.
| Classes | Description | Samples |
|---|---|---|
| 0PE | External portion w/o damage (perfect) | 306 |
| 0PI | Internal portion w/o damage (perfect) | 374 |
| 3ME | External portion w/ mechanical damage - level 3 | 4 |
| 3MI | Internal portion w/ mechanical damage - level 3 | 5 |
| 3BE | External portion w/ bug damage - level 3 | 18 |
| 3BI | Internal portion w/ bug damage - level 3 | 17 |
| 3HI | Internal portion w/ humidity damage - level 3 | 9 |
| Subject area | Computer Science, Agronomy, Soybean crop |
| More specific subject area | Image Analysis, Soybean Seeds, Tetrazolium Test |
| Type of data | Image features (numerical data) |
| How data were acquired | Visual feature extraction from the seed images obtained through the tetrazolium test. |
| Data format | Floating point n-dimensional vectors for each image |
| Experimental factors | Description and classification of the soybean seed damages. |
| Experimental features | In the tetrazolium test, the seeds are cut in half and the 4 parts of the seed are analyzed (2 internal portions and 2 external portions). These parts were scanned, generating seed sheets that comprise several seed images. Each image was annotated by a seed analyst. |
| Data source location | The seeds were scanned and annotated in the seed analysis laboratory in Tamarana, Paraná, Brazil. The preprocessing and feature extraction phases occurred at the Federal University of Technology - Paraná, in Cornélio Procópio, Paraná, Brazil. |
| Data accessibility | Data is publicly available on github ( |
| Related Research Article | Pereira et al. |