| Literature DB >> 33584121 |
Himanshu Mittal1, Avinash Chandra Pandey1, Mukesh Saraswat1, Sumit Kumar2, Raju Pal1, Garv Modwel3.
Abstract
Image segmentation is an essential phase of computer vision in which useful information is extracted from an image that can range from finding objects while moving across a room to detect abnormalities in a medical image. As image pixels are generally unlabelled, the commonly used approach for the same is clustering. This paper reviews various existing clustering based image segmentation methods. Two main clustering methods have been surveyed, namely hierarchical and partitional based clustering methods. As partitional clustering is computationally better, further study is done in the perspective of methods belonging to this class. Further, literature bifurcates the partitional based clustering methods into three categories, namely K-means based methods, histogram-based methods, and meta-heuristic based methods. The survey of various performance parameters for the quantitative evaluation of segmentation results is also included. Further, the publicly available benchmark datasets for image-segmentation are briefed.Entities:
Keywords: Benchmark datasets; Clustering methods; Image segmentation; Performance parameters
Year: 2021 PMID: 33584121 PMCID: PMC7870780 DOI: 10.1007/s11042-021-10594-9
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.577
Fig. 1Challenges in image segmentation a Illumination variation [2] b Intra-class variation [2] c Background complexity [3]
Fig. 2Example of clustering based image segmentation a Original image b Segmented image
Fig. 3Classification of clustering based image segmentation methods [85]
Fig. 4Types of hierarchical clustering [66]
Hierarchical clustering methods for image segmentation
| Sub-categories | Methods | Remarks |
|---|---|---|
| Divisive | DIVCLUS-T [ | Nested grouping at different similarity levels; |
| DHCDC [ | Number of clusters needed to be preset. | |
| Agglomerative | SLINK [ | Sensitive to noise and outliers; |
| CURE [ | Poor performance on overlapping clusters; | |
| Chameleon [ | Computationally expensive for large dataset. |
Partitional clustering methods for image segmentation
| Sub-categories | Methods | Remarks |
|---|---|---|
| Soft | FCM, FCS, | Grouping based on objective function; |
| FLAME | Preferred for large datasets. | |
| Hard | Kmeans-based | Low time complexity; |
| histogram-based | Number of clusters need to be known priorly; | |
| Metaheuristic-based | Dependance over the initial clusters. |
Fig. 5Classification of hard clustering methods
Fig. 61D view of grey level histogram of an image taken from BSDS300 [49] a Image b 1D histogram
Fig. 7Grey-local 2D histogram of the image, depicted in Fig. 6a a Three dimensional view b Two dimensional view
Fig. 8Grey-gradient 2D histogram of an image, depicted in Fig. 6a a Three dimensional view b Two dimensional view
Various performance parameters for the quantitative evaluation of an image segmentation method [51]
| Parameters | Formulation |
|---|---|
| Boundary Displacement Error (BDE) |
|
| Probability Rand Index (PRI) |
|
| Variation of Information (VoI) | |
| Global Consistency Error (GCE) |
|
| Structural Similarity Index (SSIM) |
|
| Feature Similarity Index (FSIM) |
|
| Root Mean Squared Error (RMSE) |
|
| Peak Signal to Noise Ratio (PSNR) |
|
| Normalized Cross-Correlation (NCC) | NCC = |
| Average Difference (AD) |
|
| Maximum Difference (MD) | |
| Normalized Absolute Error (NAE) |
|
Confusion matrix
| True Positive | True Negative | |
|---|---|---|
| Predicted Positive | TP | FP |
| Predicted Negative | FN | TN |
Benchmark datasets for image segmentation [23]
| S.No. | Dataset | Purpose | Links |
|---|---|---|---|
| 1 | Aberystwyth Leaf Evaluation Dataset | Segmentation of Leaf | [ |
| 2 | ADE20K Dataset | Segmentation of object and different parts of the objects | [ |
| 3 | Berkeley Segmentation Dataset and Benchmark (BSDS) | Segmentation of objects | [ |
| 4 | Brain MRI Dataset | Segmentation of FLAIR abnormalitiy | [ |
| 5 | CAD120 Affordance Dataset | Segmentation of Affordance of objects | [ |
| 6 | CoVID-19 CT-images Segmentation Dataset | Segmentation of infected region | [ |
| 7 | Crack Detection Dataset | Segmentation of cracks | [ |
| 8 | Daimler Pedestrian Segmentation Benchmark | Segmentation of pedestrian | [ |
| 9 | Epithelium Segmentation Dataset | Segmentation of epithelium region | [ |
| 10 | EVIMO Dataset | Motion segmentation | [ |
| 11 | Liver Tumor Segmentation Dataset | Segmentation of Liver and Liver Tumor | [ |
| 12 | Materials in Context Dataset | Material segmentation | [ |
| 13 | Nuclei Segmentation Dataset | Segmentation of breast cancer nuclei | [ |
| 14 | Objects with Thin and Elongated parts | Segmentation of objects with thin and elongated parts | [ |
| 15 | OpenSurfaces Dataset | Surface segmentation | [ |
| 16 | Oxford-IIIT Pet | Segmentation of pet animals | [ |
| 17 | PetroSurf3D Dataset | Segmentation of petroglyphs | [ |
| 18 | Segmentation Evaluation Database | Segmentation of single or two objects | [ |
| 19 | Sky Dataset | Segmentation of sky | [ |
| 20 | TB-roses-v1 Dataset | Segmentation of rose stems | [ |
| 21 | Tsinghua Road Markings Dataset | Segmentation of road markings | [ |
An overview of the different clustering-based image segmentation methods [85]
| Category | Method | Time | Suitable | Suitable | Noise/outlier | Scalability | Suitable for | Suitable for |
|---|---|---|---|---|---|---|---|---|
| complexity | data-type | data-shape | sensitive | high-dimension data | large-scale data | |||
| Hierarchy | BIRCH | O(n) | Numerical | Convex | Low | High | No | Yes |
| CURE | O( | Numerical | Arbitrary | Low | High | Yes | No | |
| ROCK | O( | Categorical | Arbitrary | Low | Average | Yes | No | |
| Chameleon | O( | Numerical/ Categorical | Arbitrary | Low | High | No | Yes | |
| Partition | FCM | O(n) | Numerical | Convex | High | Average | No | No |
| FCS | Kernal | Numerical | Arbitrary | High | Low | No | No | |
| MM | O( | Numerical | Arbitrary | Low | Low | No | No | |
| K-means | O(knt) | Numerical | Convex | High | Average | No | Yes | |
| K-medoids | O( | Numerical | Convex | Low | Low | No | No | |
| PAM | O( | Numerical | Convex | Low | Low | No | No | |
| CLARA | O( | Numerical | Convex | Low | High | No | Yes | |
| CLARANS | O( | Numerical | Convex | Low | Average | No | Yes | |
| Histogram | O( | Numerical | Arbitrary | Average | Low | Yes | No | |
| Metaheuristic | O(P x t x C) | Numerical/ Categorical | Arbitrary | Low | Low | Yes | Yes |
n: Number of data items; k: number of formed clusters; t: number of iterations; s: number of sample data items
P: Population-size; C: complexity of the objective function; L: number of levels; M: number of partitions