| Literature DB >> 28155722 |
Elham Yousef Kalafi1, Wooi Boon Tan1, Christopher Town2, Sarinder Kaur Dhillon3,4.
Abstract
BACKGROUND: Monogeneans are flatworms (Platyhelminthes) that are primarily found on gills and skin of fishes. Monogenean parasites have attachment appendages at their haptoral regions that help them to move about the body surface and feed on skin and gill debris. Haptoral attachment organs consist of sclerotized hard parts such as hooks, anchors and marginal hooks. Monogenean species are differentiated based on their haptoral bars, anchors, marginal hooks, reproductive parts' (male and female copulatory organs) morphological characters and soft anatomical parts. The complex structure of these diagnostic organs and also their overlapping in microscopic digital images are impediments for developing fully automated identification system for monogeneans (LNCS 7666:256-263, 2012), (ISDA; 457-462, 2011), (J Zoolog Syst Evol Res 52(2): 95-99. 2013;). In this study images of hard parts of the haptoral organs such as bars and anchors are used to develop a fully automated identification technique for monogenean species identification by implementing image processing techniques and machine learning methods. RESULT: Images of four monogenean species namely Sinodiplectanotrema malayanus, Trianchoratus pahangensis, Metahaliotrema mizellei and Metahaliotrema sp. (undescribed) were used to develop an automated technique for identification. K-nearest neighbour (KNN) was applied to classify the monogenean specimens based on the extracted features. 50% of the dataset was used for training and the other 50% was used as testing for system evaluation. Our approach demonstrated overall classification accuracy of 90%. In this study Leave One Out (LOO) cross validation is used for validation of our system and the accuracy is 91.25%.Entities:
Keywords: Automated image recognition; Digital image processing; K-nearest neighbour; Monogenean
Mesh:
Year: 2016 PMID: 28155722 PMCID: PMC5260054 DOI: 10.1186/s12859-016-1376-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1System flowchart
Fig. 2Process in image pre-processing, edge detection and image segmentation steps for four species of Sinodiplectanotrema malayanus, Trianchoratus pahangensis, Metahaliotrema mizellei and Metahaliotrema sp.
Fig. 33D scatter plot with different features. a scatter plot with combination of three features which are Euler number, perimeter, area (b) scatter plot with combination of three features which are, area, area density, perimeter density (c) scatter plot with combination of three features which are length of bounding box, width of bounding box and orientation of bounding box (d) scatter plot with combination of LDA selected features: FvLDA1, FvLDA2 and FvLDA3. The data were classified into 4 species: Sinodiplectanotrema malayanus (Sm), Trianchoratus pahangensis (Tp), Metahaliotrema mizellei (Mmi) and Metahaliotrema sp. (Mma)
Confusion matrix of testing dataset used for system evaluation
| Species | Results | Accuracy % | |||
|---|---|---|---|---|---|
| Sm | Tp | Mmi | Mma | ||
| Sm | 9 | 1 | 0 | 0 | 90 |
| Tp | 0 | 8 | 0 | 2 | 80 |
| Mmi | 0 | 0 | 10 | 0 | 100 |
| Mma | 0 | 1 | 0 | 9 | 90 |
| Overall | 90 | ||||
The confusion matrices showing the classification of 4 species of Monogeneans with k = 10
The data was classified into 4 species: Sinodiplectanotrema malayanus (Sm), Trianchoratus pahangensis (Tp), Metahaliotrema mizellei (Mmi) and Metahaliotrema sp. (Mma)
Confusion matrix of leave one out cross validation
| Species | Results | Accuracy % | |||
|---|---|---|---|---|---|
| Sm | Tp | Mmi | Mma | ||
| Sm | 19 | 1 | 0 | 0 | 95 |
| Tp | 0 | 18 | 0 | 2 | 90 |
| Mmi | 0 | 0 | 20 | 0 | 100 |
| Mma | 0 | 4 | 0 | 16 | 80 |
| Overall | 91.25 | ||||
The data was classified into 4 species: Sinodiplectanotrema malayanus (Sm), Trianchoratus pahangensis (Tp), Metahaliotrema mizellei (Mmi) and Metahaliotrema sp. (Mma)
Confusion matrix of 10 fold cross validation
| Species | Results | Accuracy % | |||
|---|---|---|---|---|---|
| Sm | Tp | Mmi | Mma | ||
| Sm | 19 | 0 | 1 | 0 | 95 |
| Tp | 0 | 18 | 2 | 0 | 90 |
| Mmi | 0 | 0 | 20 | 0 | 100 |
| Mma | 0 | 3 | 0 | 17 | 85 |
| Overall | 92. 5 | ||||
The data was classified into 4 species: Sinodiplectanotrema malayanus (Sm), Trianchoratus pahangensis (Tp), Metahaliotrema mizellei (Mmi) and Metahaliotrema sp. (Mma)