| Literature DB >> 35161652 |
Manzoor Ahmed Hashmani1, Mehak Maqbool Memon1, Kamran Raza2, Syed Hasan Adil2, Syed Sajjad Rizvi3, Muhammad Umair1.
Abstract
Super-pixels represent perceptually similar visual feature vectors of the image. Super-pixels are the meaningful group of pixels of the image, bunched together based on the color and proximity of singular pixel. Computation of super-pixels is highly affected in terms of accuracy if the image has high pixel intensities, i.e., a semi-dark image is observed. For computation of super-pixels, a widely used method is SLIC (Simple Linear Iterative Clustering), due to its simplistic approach. The SLIC is considerably faster than other state-of-the-art methods. However, it lacks in functionality to retain the content-aware information of the image due to constrained underlying clustering technique. Moreover, the efficiency of SLIC on semi-dark images is lower than bright images. We extend the functionality of SLIC to several computational distance measures to identify potential substitutes resulting in regular and accurate image segments. We propose a novel SLIC extension, namely, SLIC++ based on hybrid distance measure to retain content-aware information (lacking in SLIC). This makes SLIC++ more efficient than SLIC. The proposed SLIC++ does not only hold efficiency for normal images but also for semi-dark images. The hybrid content-aware distance measure effectively integrates the Euclidean super-pixel calculation features with Geodesic distance calculations to retain the angular movements of the components present in the visual image exclusively targeting semi-dark images. The proposed method is quantitively and qualitatively analyzed using the Berkeley dataset. We not only visually illustrate the benchmarking results, but also report on the associated accuracies against the ground-truth image segments in terms of boundary precision. SLIC++ attains high accuracy and creates content-aware super-pixels even if the images are semi-dark in nature. Our findings show that SLIC++ achieves precision of 39.7%, outperforming the precision of SLIC by a substantial margin of up to 8.1%.Entities:
Keywords: Euclidean measure; clustering; geodesic measure; similarity measure
Mesh:
Year: 2022 PMID: 35161652 PMCID: PMC8838179 DOI: 10.3390/s22030906
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Critical analysis of gradient-ascent super-pixel creation algorithms.
| Method | Complexity | Pixel Manipulation Strategy | Distance Measure | Dataset | Semi-Dark Image Mentions | Year | Ref. |
|---|---|---|---|---|---|---|---|
| Meanshift |
| Mode seeking to locate local maxima | Euclidean | Not mentioned | ✘ | 2002 | [ |
| Medoidshift |
| Approximates local gradient using weighted estimates of medoids. | Euclidean | Not mentioned | ✘ | 2007 | [ |
| Quickshift |
| Parzen’s density estimation for pixel values | Non-Euclidean | Caltech-4 | ✘ | 2008 | [ |
| TurboPixel |
| Geometric flows for limited pixels | Gradient calculation for boundary pixels only | Berkeley Dataset | ✘ | 2009 | [ |
| Scene Shape Super-pixel (SSP) |
| Shortest path manipulation with prior information of boundary. | Probabilistic modeling plus Euclidean space manipulation | Dynamic road scenes. No explicit mentions of semi-dark images but we suspect presence of semi | ✓ | 2009 | [ |
| Compact Super-pixels (CS) | - | Approximation of distance between pixels and further optimization with graph cut methods | Euclidean | 3D images | ✘ | 2010 | [ |
| Compact Intensity Super-pixels | - | Same as CS, With added color constant information. | Euclidean | 3D images | ✘ | 2010 | [ |
| SLIC |
| Gradient optimization after every iteration | Euclidean | Berkeley Dataset | ✘ | 2012 | [ |
| SEEDS | - | Energy optimization for super-pixel is based on hill-climbing. | Euclidean | Berkeley Dataset | ✘ | 2012 | [ |
| Structure Sensitive Super-pixels |
| Super-pixel densities are checked, and energy minimization is conformed. | Geodesic | Berkeley Dataset | ✘ | 2013 | [ |
| Depth-adaptive super-pixels |
| Super-pixel density identification, followed by sampling and finally k-means to create final clusters | Euclidean | RGB-D dataset consisting of 11 images | ✘ | 2013 | [ |
| Contour Relaxed Super-pixels |
| Uses pre-segmentation technique to create homogeneity constraint | Not mentioned | Not mentioned | ✘ | 2013 | [ |
| Saliency-based super-pixel |
| Super-pixel creation followed by merging operator based on saliency. | Euclidean | Not mentioned | ✘ | 2014 | [ |
| Linear Spectral Clustering |
| Two-fold pixel manipulation strategy of optimization based on graph and clustering based algorithms. | Euclidean | Berkeley Dataset | ✘ | 2015 | [ |
| Manifold SLIC |
| Same as SLIC but with mapping over manifold. | Euclidean | Berkeley Dataset | ✘ | 2016 | [ |
| BASS (Boundary-Aware Super-pixel Segmentation) |
| Extension of SLIC initially creates boundary then uses SLIC with different distance measures, along with optimization of initialization parameters. | Euclidean + Geodesic | Fashionista, |
| 2016 | [ |
| BSLIC |
| Extension of SLIC initializes seed within a hexagonal space rather than square | Euclidean | Berkeley Dataset | ✘ | 2017 | [ |
| Intrinsic Manifold SLIC |
| Extension of Manifold SLIC with geodesic distance measure | Geodesic | Berkeley Dataset | ✘ | 2017 | [ |
| Similarity Ratio based Super-pixels |
| Extension of SLIC. Proposes automatic scaling of coordinate axes. | Mahanlanobis | SAR Image dataset |
| 2017 | [ |
| Scalable SLIC |
| Optimized initialization parameters such as ‘n’ number of super-pixels, focused research to parallelization of sequential implementation. | Euclidean | cyrosection Visible Human Male | ✘ | 2018 | [ |
| Content adaptive super-pixel segmentation |
| Work on prior transformation of image with highlighted edges created by edge filters | Euclidean (with graph-based transformation) | Berkeley Dataset | ✘ | 2019 | [ |
| BASS (Bayesian Adaptive Super-Pixel Segmentation) |
| Uses probabilistic methods to intelligently initialize the super-pixel seeds. | Euclidean | Berkeley Dataset | ✘ | 2019 | [ |
| Super-pixel segmentation with fully convolutional networks |
| Attempts to use neural networks for automatic seed initialization over grid. | Euclidean | Berkeley Dataset, SceneFlow Dataset | ✘ | 2020 | [ |
| Texture-aware and structure preserving Super-pixels |
| The seed initialization takes place in circular grid. | Three different distance measure (without explicit details) | Berkeley Dataset | ✘ | 2021 | [ |
| Efficient Image-Warping Framework for Content-Adaptive Super-pixels Generation |
| Warping transform is used along with SLIC for creation of adaptive super-pixels. | Euclidean | Berkeley Dataset | ✘ | 2021 | [ |
| Edge aware super-pixel segmentation with unsupervised CNN |
| Edges are detected using unsupervised convolutional neural networks then passed to super-pixel segmentation algorithms | Entropy based clustering | Berkeley Dataset | ✘ | 2021 | [ |
Figure 1Irrelevance of Euclidean distance measure for super-pixel creation relating to image content.
Figure 2Restricted Image search area for super-pixel creation specified by input argument for image window under consideration [31].
Summary statistics of average performance of SLIC++ for varying weights.
| Row | Ratio | Precision | Recall | Score | ||
|---|---|---|---|---|---|---|
| Test Case 1 (Image ID = 14037): | ||||||
| 1 | 10:90 | 0.1123 | 0.8877 | 0.47882 | 0.88930 | 0.62248 |
| 2 | 70:30 | 0.6825 | 0.3175 | 0.38850 | 0.92210 | 0.54660 |
| 3 | 50:50 | 0.4863 | 0.5137 | 0.37780 | 0.93040 | 0.53740 |
|
|
|
|
|
|
|
|
| 5 | 90:10 | 0.8877 | 0.1123 | 0.38840 | 0.87340 | 0.53770 |
| Test Case 2 (Image ID = 26031): | ||||||
| 6 | 10:90 | 0.1123 | 0.8877 | 0.21623 | 0.78808 | 0.33935 |
| 7 | 70:30 | 0.6825 | 0.3175 | 0.18370 | 0.82790 | 0.30070 |
| 8 | 50:50 | 0.4863 | 0.5137 | 0.18910 | 0.85000 | 0.31000 |
|
|
|
|
|
|
|
|
| 10 | 90:10 | 0.8877 | 0.1123 | 0.18650 | 0.79000 | 0.30220 |
| Test Case 3 (Image ID = 108082): | ||||||
| 11 | 10:90 | 0.1123 | 0.8877 | 0.27023 | 0.89832 | 0.41548 |
| 12 | 70:30 | 0.6825 | 0.3175 | 0.21840 | 0.82160 | 0.34510 |
| 13 | 50:50 | 0.4863 | 0.5137 | 0.22640 | 0.86800 | 0.35920 |
|
|
|
|
|
|
|
|
| 15 | 90:10 | 0.8877 | 0.1123 | 0.22360 | 0.79470 | 0.34900 |
Performance analysis of SLIC extensions.
| Row |
|
|
| Parameters | Score | Precision | Recall | Distance Measure | |
|---|---|---|---|---|---|---|---|---|---|
| Test Case 1 (Image ID = 14037): | |||||||||
| 1 | 500 | 10 | 10 | 0.54430 | 0.39120 | 0.89390 | Euclidean—SLIC | ||
| 2 | 500 | 10 | 10 | 0.61234 | 0.46563 | 0.89406 | Chessboard—SLIC+ | ||
| 3 | 500 | 10 | 10 | 0.59713 | 0.44407 | 0.91118 | Cosine—SLIC+ | ||
| 4 | 500 | 10 | 10 |
| 0.62792 | 0.47345 | 0.93199 | Min4—SLIC+ | |
| 5 | 500 | 10 | 10 | 0.56128 | 0.43777 | 0.78186 | Geodesic—SLIC+ | ||
|
|
|
|
|
|
|
|
|
|
|
| Test Case 2 (Image ID = 26031): | |||||||||
| 7 | 500 | 10 | 10 | 0.30420 | 0.18690 | 0.81740 | Euclidean—SLIC | ||
| 8 | 500 | 10 | 10 | 0.35454 | 0.22098 | 0.89623 | Chessboard—SLIC+ | ||
| 9 | 500 | 10 | 10 | 0.35698 | 0.22329 | 0.88957 | Cosine—SLIC+ | ||
| 10 | 500 | 10 | 10 |
| 0.34057 | 0.20959 | 0.90798 | Min4—SLIC+ | |
| 11 | 500 | 10 | 10 | 0.33715 | 0.21369 | 0.79842 | Geodesic—SLIC+ | ||
|
|
|
|
|
|
|
|
|
|
|
| Test Case 3 (Image ID = 108082): | |||||||||
| 13 | 500 | 10 | 10 | 0.35410 | 0.22720 | 0.80260 | Euclidean—SLIC | ||
| 14 | 500 | 10 | 10 | 0.42099 | 0.27720 | 0.87476 | Chessboard—SLIC+ | ||
| 15 | 500 | 10 | 10 | 0.38368 | 0.24251 | 0.91811 | Cosine—SLIC+ | ||
| 16 | 500 | 10 | 10 |
| 0.42465 | 0.27694 | 0.91004 | Min4—SLIC+ | |
| 17 | 500 | 10 | 10 | 0.40382 | 0.26764 | 0.82216 | Geodesic—SLIC+ | ||
|
|
|
|
|
|
|
|
|
|
|
Figure 3SLIC v/s SLIC++ performance over different number of pixels: (a) precision values; (b) recall value; (c) score values.
Summary statistics of average performance for Berkeley dataset.
| Algorithm | Score | Precision | Recall |
|---|---|---|---|
|
| 0.47020 | 0.31604 | 0.97719 |
|
|
|
|
|
|
| 0.55705 | 0.57573 | 0.68416 |
Semi-dark perceptual results conforming boundary retrieval.
| Row ID | Image | Groundtruth Image | Prediction | Prediction Map Compared with Groundtruth |
|---|---|---|---|---|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 4Input image with highlighted regions for detailed analysis.
Detailed perceptual analysis with increasing parameters.
| Number of Super-Pixels/Algorithm | 500 | 1000 | 1500 | 2000 |
|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
|
| |||
Figure 5Zoomed in view of test case image for content-aware super-pixel analysis created by SLIC++ (b) against SLIC (a).