Literature DB >> 31516936

A citrus fruits and leaves dataset for detection and classification of citrus diseases through machine learning.

Hafiz Tayyab Rauf1, Basharat Ali Saleem2, M Ikram Ullah Lali1, Muhammad Attique Khan3, Muhammad Sharif4, Syed Ahmad Chan Bukhari5.   

Abstract

Plants are as vulnerable by diseases as animals. Citrus is a major plant grown mainly in the tropical areas of the world due to its richness in vitamin C and other important nutrients. The production of the citrus fruit has been widely affected by citrus diseases which ultimately degrades the fruit quality and causes financial loss to the growers. During the past decade, image processing and computer vision methods have been broadly adopted for the detection and classification of plant diseases. Early detection of diseases in citrus plants helps in preventing them to spread in the orchards which minimize the financial loss to the farmers. In this article, an image dataset citrus fruits, leaves, and stem is presented. The dataset holds citrus fruits and leaves images of healthy and infected plants with diseases such as Black spot, Canker, Scab, Greening, and Melanose. Most of the images were captured in December from the Orchards in Sargodha region of Pakistan when the fruit was about to ripen and maximum diseases were found on citrus plants. The dataset is hosted by the Department of Computer Science, University of Gujrat and acquired under the mutual cooperation of the University of Gujrat and the Citrus Research Center, Government of Punjab, Pakistan. The dataset would potentially be helpful to researchers who use machine learning and computer vision algorithms to develop computer applications to help farmers in early detection of plant diseases. The dataset is freely available at https://data.mendeley.com/datasets/3f83gxmv57/2.

Entities:  

Keywords:  Feature extraction; Feature selection; Image classification

Year:  2019        PMID: 31516936      PMCID: PMC6731382          DOI: 10.1016/j.dib.2019.104340

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table This dataset provides visual tracking of citrus plant diseases in a hyperspectral sequence of citrus plants images. Therefore, it enables researchers to perform chemometric examinations for early identification of disease in citrus plant trees. The dataset could be useful to test and compare different computer vision and image processing classifiers for the detection of different diseases based on their visual features. Supports numerous feature selector and feature extractor by their textural descriptors and color scheme of different types of citrus diseases. Original citrus plant images are taken from broader view (at leaf level), so could be desirable by the plant physiologist for the depth analysis. The symptoms exerted from the dataset will facilitate comparison of the local citrus plants diseases structure with the other species of citrus plants located in Africa and surroundings. This data is collected in a natural domain with inconsistent weather and light intensity. Hence there could be a challenge for the researchers to identify the disease symptoms with naked eye.

Data

Fruit plants play a significant role in the economic growth of any state. One of the famous species among the fruit plants is a citrus plant, which is full of vitamin C and broadly used in the region of the Middle East and Africa [1]. In the agro-industries, some kind of citrus plants are used as a raw material [2]. Citrus plant diseases is a major cause to reduce the production of citrus fruits and their usages in several industries. The most common disease identified by the domain experts and researchers are Greening, Melanose, Downy, Black spot, Canker, Scab, and Anthracnose. These citrus plants disease can be identified on the basis of their visual symptoms by applying some computer vision and deep learning techniques [3]. This article presents the data set contain 759 images of healthy and unhealthy citrus fruits and leaves that could be used by the researchers to apply different computer vision and image processing algorithm for the detection of citrus plants diseases. The data set is associated with the articles [4], [5]. All images were acquired manually using a DSLR with the help of domain expert of citrus disease and from the Sargodha region, Pakistan. The width and height for all images with the resolution of 72 dpi were 256 pixels and 256 pixels respectively. The infected images were classified into 4 different diseases of citrus fruits and leave separately. The disease we targeted in the data sets were Black spot, Canker, Scab, Greening, and Melanose. Table 1 contains the description of the data set against each of its disease class. The dark spot on the citrus plants likewise shortened as CBS is accessible at the period when the plant is sick and the climate is ideal for the disease. The indications of the citrus leaf and fruits are little, bowed, and under hazardous spots with dark squares. The spectral range for spot diameter is 0.12–0.4 [4]. The canker spot is covered by the wind-driven downpour. On the citrus leaf, the small circles appear and the scope of the lesion is 2–10mm in size. Where the minimum range of citrus fruit lesion is appeared to be 1–10mm in size and these ranges contrast in size from one another [6]. Similarly the fruit species of citrus plants, the scab skin break out is a composite of parasitic and creature tissue. These acnes are not raised and the color shades vary from pink to light brown [6].
Table 1

Description of data set against each of its disease class.

Citrus Leaves
Citrus Fruits
DiseaseNumber of ImagesDiseaseNumber of Images
Black Spot171Black Spot19
Canker163Canker78
Greening204Greening16
Melanose13Scab15
Healthy58Healthy22
Total Images609Total Images150

Bold represents the total number of disease images captured in citrus leaves and fruits.

Description of data set against each of its disease class. Bold represents the total number of disease images captured in citrus leaves and fruits. Melanose is referred to as saprophyte where the weight of disease is characterized by total units of inoculum that make an influence on dead wood in the tree top. The sign of the disease is showing up as a little darker spot which ends up stuffed with a rosy dark colored gum. However, the indications of the disease on fruit species rely upon the age of the organic product at disease time [4]. Fig. 1, Fig. 2, Fig. 3, Fig. 4 showing the self-annotated images of healthy and un-healthy citrus fruit and plants.
Fig. 1

Samples of self-annotated images of healthy citrus fruits and leaves taken from the own collected dataset.

Fig. 2

Samples of self-annotated images of citrus leaves and fruit with Black Spot diseases taken from the own collected dataset.

Fig. 3

Samples of self-annotated images of citrus fruit with a) Canker b) Scab taken from the own collected dataset.

Fig. 4

Samples of self-annotated images of citrus fruit and leaves with a) Greening and b) Melanose diseases taken from the own collected dataset.

Samples of self-annotated images of healthy citrus fruits and leaves taken from the own collected dataset. Samples of self-annotated images of citrus leaves and fruit with Black Spot diseases taken from the own collected dataset. Samples of self-annotated images of citrus fruit with a) Canker b) Scab taken from the own collected dataset. Samples of self-annotated images of citrus fruit and leaves with a) Greening and b) Melanose diseases taken from the own collected dataset.

Experimental design, materials, and methods

Camera specification

The data set were gathered using an advanced DSLR (Canon EOS 1300D) having a sensor with CMOS system and resolution of 5202 × 3465 (Mpix). The sensor size for Canon EOS 1300D is 14.9 × 22.3 (mm). RGB color range is chosen for each of the images in the JPG combination, including 256 shades for each RGB layer and 8 pixels for each shading layer.

Processing

All images are resized to the dimension of 256*256 with 72 dpi resolution. In Ref. [4], the presented data set is used by the authors for the detection and classification of citrus diseases. The entire processes include four major steps to complete which are: (a) enhancing the dataset, (b) lesion segmentation which highlights the infected region, (c) extracting features from the infected region and finally (d) selecting feature visually and perform classification. Firstly, all original input images were filtered by the Top-hat process and then Gaussian function is applied to improve the contrast of the infected region. Secondly, the enhanced images then mapped to the segmented blocks by implementing the weighted segmentation and Saliency map. At this stage, the input data set is segmented with lesion spots identified. Afterward, the segmented images were given to feature extractor and selector algorithms which extract texture, color, and geometric features and select those features using skewness, PCA, and Entropy methods. Finally, the classification is performed to classify each image instance corresponds to each of its disease class. Each process is made out of the composition of steps as displayed in Fig. 5.
Fig. 5

System architecture for identification and classification of infected plants diseases directly taken from Ref. [4].

System architecture for identification and classification of infected plants diseases directly taken from Ref. [4].

Specifications Table

SubjectComputer Science
Specific subject areaImage identification, Image classification, Image processing and computer vision
Type of dataImages
How data were acquiredImages are captured by using a single camera.
Data formatJPG, Raw
Parameters for data collectionHealthy and infected images of citrus fruits and leaves were collected separately. The infected images are further subcategorized into the disease, each citrus plant holds i.e Black spot, Canker, Scab, Greening and Melanose.
Description of data collectionNo such sample pre-treatment was conducted. The data is collected manually with the help of domain expert and Citrus Research Center, Government of Punjab, Pakistan.
Data source locationInstitution: University of Gujrat.
City: Sargodha
Country: Pakistan
Data accessibilityDataset can be accessible at Mendeley data: https://data.mendeley.com/datasets/3f83gxmv57/2
Related research articleArticle 1:
Author's name: Sharif, M., Khan, M. A., Iqbal, Z., Azam, M. F., Lali, M. I. U., & Javed, M. Y.
Title: Detection and classification of citrus diseases in agriculture based on optimized weighted segmentation and feature selection.
Journal:Computers and electronics in agriculture
DOI:https://doi.org/10.1016/j.compag.2018.04.023
Article 2:
Author's name: Ali, H., Lali, M. I., Nawaz, M. Z., Sharif, M., & Saleem, B. A.
Title: Symptom based automated detection of citrus diseases using color histogram and textural descriptors.
Journal:Computers and electronics in agriculture
DOI:https://doi.org/10.1016/j.compag.2017.04.008
Value of the data

This dataset provides visual tracking of citrus plant diseases in a hyperspectral sequence of citrus plants images. Therefore, it enables researchers to perform chemometric examinations for early identification of disease in citrus plant trees.

The dataset could be useful to test and compare different computer vision and image processing classifiers for the detection of different diseases based on their visual features.

Supports numerous feature selector and feature extractor by their textural descriptors and color scheme of different types of citrus diseases.

Original citrus plant images are taken from broader view (at leaf level), so could be desirable by the plant physiologist for the depth analysis.

The symptoms exerted from the dataset will facilitate comparison of the local citrus plants diseases structure with the other species of citrus plants located in Africa and surroundings.

This data is collected in a natural domain with inconsistent weather and light intensity. Hence there could be a challenge for the researchers to identify the disease symptoms with naked eye.

  7 in total

1.  LungNet22: A Fine-Tuned Model for Multiclass Classification and Prediction of Lung Disease Using X-ray Images.

Authors:  F M Javed Mehedi Shamrat; Sami Azam; Asif Karim; Rakibul Islam; Zarrin Tasnim; Pronab Ghosh; Friso De Boer
Journal:  J Pers Med       Date:  2022-04-24

2.  Detection and Localization of Tip-Burn on Large Lettuce Canopies.

Authors:  Benjamin Franchetti; Fiora Pirri
Journal:  Front Plant Sci       Date:  2022-05-12       Impact factor: 6.627

3.  Maize leaf disease identification based on WG-MARNet.

Authors:  Zongchen Li; Guoxiong Zhou; Yaowen Hu; Aibin Chen; Chao Lu; Mingfang He; Yahui Hu; Yanfeng Wang
Journal:  PLoS One       Date:  2022-04-28       Impact factor: 3.240

4.  A comprehensive guava leaves and fruits dataset for guava disease recognition.

Authors:  Aditya Rajbongshi; Sadia Sazzad; Rashiduzzaman Shakil; Bonna Akter; Umme Sara
Journal:  Data Brief       Date:  2022-04-12

5.  AI-driven deep CNN approach for multi-label pathology classification using chest X-Rays.

Authors:  Saleh Albahli; Hafiz Tayyab Rauf; Abdulelah Algosaibi; Valentina Emilia Balas
Journal:  PeerJ Comput Sci       Date:  2021-04-20

6.  MDAM-DRNet: Dual Channel Residual Network With Multi-Directional Attention Mechanism in Strawberry Leaf Diseases Detection.

Authors:  Tingjing Liao; Ruoli Yang; Peirui Zhao; Wenhua Zhou; Mingfang He; Liujun Li
Journal:  Front Plant Sci       Date:  2022-07-08       Impact factor: 6.627

7.  Resources for image-based high-throughput phenotyping in crops and data sharing challenges.

Authors:  Monica F Danilevicz; Philipp E Bayer; Benjamin J Nestor; Mohammed Bennamoun; David Edwards
Journal:  Plant Physiol       Date:  2021-10-05       Impact factor: 8.340

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.