| Literature DB >> 27034616 |
Abstract
A large variety of well-known scale-invariant texture recognition methods is tested with respect to their scale invariance. The scale invariance of these methods is estimated by comparing the results of two test setups. In the first test setup, the images of the training and evaluation set are acquired under same scale conditions and in the second test setup, the images in the evaluation set are gathered under different scale conditions than those of the training set. For the first test setup, scale invariance is not needed, whereas for the second test setup, scale invariance is obviously crucial. The difference between the results of these two test setups indicates the scale invariance of a method (the higher the scale invariance the lower the difference). The scale invariance of the methods is additionally estimated by analyzing the similarity of the feature vectors of images and their scaled versions. Additionally to the scale invariance, we also test eventual viewpoint and illumination invariance of the methods. As texture databases for our tests we use the KTH-TIPS database and the CUReT database. Results imply that many of the considered methods are not as scale-invariant as expected.Entities:
Keywords: CUReT database; KTH-TIPS database; Scale invariance; Texture recognition
Year: 2014 PMID: 27034616 PMCID: PMC4768293 DOI: 10.1007/s10044-014-0435-1
Source DB: PubMed Journal: Pattern Anal Appl ISSN: 1433-7541 Impact factor: 2.580
Fig. 1Characteristic changes by rotation (b), varying illumination (c) or by scaling (d) of the material bread from the KTH-TIPS database
Fig. 2Images of the materials linen and cracker from the KTH-TIPS database with a scale difference of factor
Fig. 3Cyclic shifting of the means of the subbands across the scale dimension
Fig. 4a Different scaling factors are used for the training set images and for the evaluation set images. Each node denotes two elements, a sum of means and a sum of standard deviations. b The sliding of evaluation set image feature vector along augmented training set image vector
Fig. 5Fractal dimension in 2D space. a Smooth spiral curve with , b the checkerboard with and c the Sierpinski-Triangle with
Fig. 6The process of constructing and discretizing the orientation histogram when using the neighborhood of size
Fig. 7Generating binary blob images
Fig. 8The top row shows one texture image per material (as material numbers 2, 11, 12, and 14) from the CUReT database (originally scaled), while the bottom row shows these textures with a higher zoom factor (as material numbers 29, 30, 31, and 32)
OCR results for the two experiments on the CUReT database. The (relative) differences between the results indicate the scale invariance of the methods
| Method | Setup 1 | Setup 2 | Diff. |
|---|---|---|---|
| Dominant scale approach | 97.0 | 89.9 | 7.3 |
| Slide matching | 99.2 | 81.1 | 18.3 |
| Log-polar approach | 82.3 | 75.4 | 8.4 |
| Multi-fractal spectrum | 100 | 87.7 | 12.3 |
| Fractal analysis using filter banks | 99.9 | 73.9 | 26.0 |
| Fractal dim. for O. histograms | 97.6 | 71.8 | 26.4 |
| Dense SIFT features | 93.7 | 59.8 | 36.2 |
| ICM | 97.1 | 72.2 | 25.6 |
| SCM | 100 | 94.2 | 5.8 |
| Multiscale blob feat. (shape and | 100 | 86.8 | 13.2 |
| Multiscale blob feat. (shape) | 99.7 | 93.5 | 6.2 |
| Local affine regions | 98.2 | 88.7 | 9.7 |
| Local binary pattern | 99.9 | 79.1 | 20.8 |
| Gabor wavelet | 99.8 | 89.5 | 10.3 |
| DT-CWT | 100 | 97.9 | 2.1 |
Fig. 9The 10 materials of the KTH-TIPS database
Fig. 10Scale levels 2–6 (from left to right) for the materials cracker and orange peel of the KTH-TIPS database
Results for the KTH-TIPS database
The column ‘Sum’ indicates the scale invariance of the methods
Gray letters indicate worst results
Fig. 11The most frequently occurring types of misclassifications (mc’s) caused by scale change
The most fatal misclassifications (mc’s) of the methods caused by scale changes
| Type of mc | Affected methods |
|---|---|
| M1 | Dense sift features (71), slide matching (60), |
| Fractal dim. for O. histograms (44) and ICM (41) | |
| M2 | LBP (35) and ICM (31) |
| M3 | Fractal analysis using filter banks (45) and local affine regions (30) |
| M4 | Multiscale blob feat. (shape and |
Fig. 12RoSS of the CUReT and KTH-TIPS database
The nine images within each scale of a texture class in the KTH-TIPS database
| Image number | Viewing direction | Illumination direction | ||||
|---|---|---|---|---|---|---|
| Frontal | 22.5° right | 22.5° left | Frontal | 45° from top | 45° from side | |
| 1 | x | x | ||||
| 2 | x | x | ||||
| 3 | x | x | ||||
| 4 | x | x | ||||
| 5 | x | x | ||||
| 6 | x | x | ||||
| 7 | x | x | ||||
| 8 | x | x | ||||
| 9 | x | x | ||||
Fig. 13Two rank-based measures to test the influence of scaling and varying viewpoints (a) or to test the influence of scaling and varying illumination conditions (b) by means of the KTH-TIPS database
Fig. 14RoS as a measure to indicate the methods’ ability for texture recognition and the influence of scale changes
Fig. 15Comparing the RoS of the methods with the inverted accuracies of the methods
The methods’ scale invariance, viewpoint invariance and illumination invariance
| Method | Invariance | ||
|---|---|---|---|
| Scale | Viewpoint | Illumination | |
| Dominant scale approach |
|
|
|
| Slide matching |
|
|
|
| Log-polar approach |
|
|
|
| Multi-fractal spectrum |
|
|
|
| Fractal analysis using filter banks |
|
|
|
| Fractal dim. for O. histograms |
|
|
|
| Dense SIFT features |
|
|
|
| ICM |
|
|
|
| SCM |
|
|
|
| Multiscale blob feat. (shape and |
|
|
|
| Multiscale blob feat. (shape) |
|
|
|
| Local affine regions |
|
|
|
| Local binary pattern |
|
|
|
| Gabor wavelet |
|
|
|
| DT-CWT |
|
|
|
Fig. 16Results (Accuracies) of the performed experiments in Sect. 4 on the two databases
Results of the performed experiments on the two databases in Sect. 4, which are indicating the scale invariance of the methods
Bold letters indicate good results
Gray letters indicate worst results
Italic letters indicate results that are hard to interpret
Results of testing the scale invariance for three different subsets of the KTH-TIPS database with only 4 materials per class (column “4 Materials”), and the results for only 9 different viewing and illumination conditions per material of the CUReT database (column “9 Conditions”)
Bold letters indicate good results
Gray letters indicate worst results
Fig. 17Subsets of the KTH-TIPS database consisting of 4 materials