| Literature DB >> 31656834 |
Syed Zakir Hussain Shah1, Hafiz Tayyab Rauf2, Muhammad IkramUllah2, Malik Shahzaib Khalid1, Muhammad Farooq2, Mahroze Fatima3, Syed Ahmad Chan Bukhari4.
Abstract
Fishes are most diverse group of vertebrates with more than 33000 species. These are identified based on several visual characters including their shape, color and head. It is difficult for the common people to directly identify the fish species found in the market. Classifying fish species from images based on visual characteristics using computer vision and machine learning techniques is an interesting problem for the researchers. However, the classifier's performance depends upon quality of image dataset on which it has been trained. An imagery dataset is needed to examine the classification and recognition algorithms. This article exhibits Fish-Pak: an image dataset of 6 different fish species, captured by a single camera from different pools located nearby the Head Qadirabad, Chenab River in Punjab, Pakistan. The dataset Fish-Pak are quite useful to compare various factors of classifiers such as learning rate, momentum and their impact on the overall performance. Convolutional Neural Network (CNN) is one of the most widely used architectures for image classification based on visual features. Six data classes i.e. Ctenopharyngodon idella (Grass carp), Cyprinus carpio (Common carp), Cirrhinus mrigala (Mori), Labeo rohita (Rohu), Hypophthalmichthys molitrix (Silver carp), and Catla (Thala), with a different number of images, have been included in the dataset. Fish species are captured by one camera to ensure the fair environment to all data. Fish-Pak is hosted by the Zoology Lab under the mutual affiliation of the Department of Computer Science and the Department of Zoology, University of Gujrat, Gujrat, Pakistan.Entities:
Keywords: Fish feature extraction; Fish head; Fish scale; Fish species classification; Fish species recognition; Fish species shape
Year: 2019 PMID: 31656834 PMCID: PMC6806455 DOI: 10.1016/j.dib.2019.104565
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Example head images of 6 different fish species with transparent background taken from a particular position.
Fig. 2Example body images of 6 different fish species with transparent background taken from a particular position.
Fig. 3Example scale images of 6 different fish species with transparent background taken from a particular position.
Morphological features of different fish species determined manually from Fish-Pak dataset.
| Characters | ||||||
|---|---|---|---|---|---|---|
| Body shape | Spindle shaped | Elongated, streamlined or laterally compressed | Short and deep, somewhat laterally compressed | Deep and laterally compressed | Elongated, laterally compressed and back arched | Elongated, chubby and torpedo-shaped |
| Color | Blackish on the dorsal side and silvery on the ventro-lateral sides | Grayish or greenish on the back and silvery at the sides and below | Grayish on back and flanks, silvery-white at the below side | Greenish on the back, silvery on the belly | Silvery grey in with yellowish belly | Dark olive, shading to brownish-yellow on the sides with a white belly |
| Head region | ||||||
| – | ||||||
| – | ||||||
| Fin rays | ||||||
| – | – | – | ||||
| – | – | – | – | |||
| – | ||||||
| – | – |
Fig. 4VGGNet feature map of 2nd convolutional layer with 64 maps and 3 × 3 kernel size.
Specifications Table
| Subject area | Computer Science |
| More specific subject area | Image processing, Image identification, Image classification, computer vision |
| Type of data | Images |
| How data was acquired | A digital camera (Canon EOS 1300D) with a sensor type of CMOS bearing the resolution of 5202 × 3465 (Mpix) was used to acquired data. |
| Data format | JPG, Raw |
| Experimental factors | No such sample pre-treatment was conducted. However blue back ground color were considered instead of transparent and the Camera illuminance were ensured constant for each image at the time of capturing images. |
| Experimental features | The attributes of the subjects includes head features (mouth, snout, lips), body features (Elongated, streamlined, compressed), and the scale features (Dorsal fin, Pectoral fin, Pelvic fin, Anal fin, Caudal fin). |
| Data source location | Head Qadirabad, Phalia, Pakistan, University of Gujrat, Department of Computer Science. |
| Data accessibility | Repository name: [Mendeley data repository] |
This is the sole dataset holding exact fish species identification based on six distinct classes i.e. Supports the testing of different classifiers based on the distinct features of fish species, and to compare their performance. Gives information to evaluate algorithm performance under various fish species attributes such as shape and size of the body, Fins color and shape, eye, lateral line, head's size, and shape, scale size and shape. In fisheries management, this dataset can be utilized to segregate species of fishes for the examination of underwater ecology and fish behavior. A complex multiclass image dataset for the researchers to test several feature extractor and record their performance. Data exerted in a controlled domain with a steady white background. |