| Literature DB >> 32240408 |
Anitha Ganesan1, Anbarasu Balasubramanian2.
Abstract
In the context of improved navigation for micro aerial vehicles, a new scene recognition visual descriptor, called spatial color gist wavelet descriptor (SCGWD), is proposed. SCGWD was developed by combining proposed Ohta color-GIST wavelet descriptors with census transform histogram (CENTRIST) spatial pyramid representation descriptors for categorizing indoor versus outdoor scenes. A binary and multiclass support vector machine (SVM) classifier with linear and non-linear kernels was used to classify indoor versus outdoor scenes and indoor scenes, respectively. In this paper, we have also discussed the feature extraction methodology of several, state-of-the-art visual descriptors, and four proposed visual descriptors (Ohta color-GIST descriptors, Ohta color-GIST wavelet descriptors, enhanced Ohta color histogram descriptors, and SCGWDs), in terms of experimental perspectives. The proposed enhanced Ohta color histogram descriptors, Ohta color-GIST descriptors, Ohta color-GIST wavelet descriptors, SCGWD, and state-of-the-art visual descriptors were evaluated, using the Indian Institute of Technology Madras Scene Classification Image Database two, an Indoor-Outdoor Dataset, and the Massachusetts Institute of Technology indoor scene classification dataset [(MIT)-67]. Experimental results showed that the indoor versus outdoor scene recognition algorithm, employing SVM with SCGWDs, produced the highest classification rates (CRs)-95.48% and 99.82% using radial basis function kernel (RBF) kernel and 95.29% and 99.45% using linear kernel for the IITM SCID2 and Indoor-Outdoor datasets, respectively. The lowest CRs-2.08% and 4.92%, respectively-were obtained when RBF and linear kernels were used with the MIT-67 dataset. In addition, higher CRs, precision, recall, and area under the receiver operating characteristic curve values were obtained for the proposed SCGWDs, in comparison with state-of-the-art visual descriptors.Entities:
Keywords: Micro aerial vehicle; Navigation; Scene recognition; Support vector machine; Visual descriptors
Year: 2019 PMID: 32240408 PMCID: PMC7099533 DOI: 10.1186/s42492-019-0030-9
Source DB: PubMed Journal: Vis Comput Ind Biomed Art ISSN: 2524-4442
Fig. 1Parrot augmented reality drone2 quadrotor
Fig. 2Illustration of visual descriptors. a Scale-invariant feature transform key points detected in an outdoor image; b Speeded up robust features key points detected in an indoor image; c Histogram of oriented gradients features detected in an outdoor image; d Input indoor image; e Horizontal directional morphological gradient; f Vertical directional morphological gradient; g GIST descriptor
Fig. 3Census transformed output. a Indoor image; b Census transformed Indoor image; c Census transformed value
Fig. 4Wavelet descriptors. a CIELAB color space image; b 2 dimensional wavelet decomposition output
Fig. 5Visual illustration of Color-GIST descriptor.a RGB outdoor image; b Red channel (RGB image); c Green channel (RGB image); d Blue channel (RGB image); e-g Color-GIST descriptor extracted applying 32 Gabor filters at 4 scales and 8 orientations from red channel (e) green channel (f) and blue channel (g); h-j Color-GIST descriptor extracted applying 64 Gabor filters at 4 scales and 16 orientations from red channel (h) green channel (i) and blue channel (j)
Fig. 6The block diagram of the proposed method
Scene categorization performance on the Indian institute of technology madras-scene classification image database two dataset
| No | Algorithms | CR | Precision | Recall | F-measure | AUC |
|---|---|---|---|---|---|---|
| 1 | CENTRIST-spatial pyramid (RBF) | 92.93% | 92.93% | 92.91% | 92.92% | 92.92% |
| 2 | CENTRIST-spatial pyramid (linear) | 92.53% | 92.53% | 92.54% | 92.53% | 92.54% |
| 3 | Color-GIST descriptors (RBF) | 89.59% | 89.58% | 89.58% | 89.58% | 89.37% |
| 4 | Color-GIST descriptors (linear) | 81.73% | 81.77% | 81.68% | 81.73% | 81.68% |
| 5 | Wavelet descriptors (RBF) | 75.05% | 75.10% | 75.09% | 75.09% | 75.09% |
| 6 | CENTRIST (linear) | 85.06% | 85.08% | 85.09% | 85.08% | 85.10% |
| 7 | CENTRIST (RBF) | 86.24% | 86.27% | 86.28% | 86.28% | 86.28% |
| 8 | SIFT-ScSPM (linear) | 92.92% | 92.92% | 92.94% | 92.93% | 92.94% |
| 9 | SIFT-LLC (linear) | 92.53% | 92.52% | 92.54% | 92.53% | 92.55% |
| 10 | SIFT-SPM (Chi-square) | 86.05% | 87.42% | 85.84% | 86.62% | 85.84% |
| 11 | HOG-SPM (Chi-square) | 80.74% | 86.31% | 80.32% | 83.20% | 80.32% |
| 12 | SURF- SPM (Chi-square) | 84.47% | 88.34% | 84.13% | 86.19% | 84.14% |
| 13 | Enhanced-GIST (RBF) | 89.98% | 90.19% | 89.90% | 90.05% | 90% |
| 14 | Enhanced-GIST (linear) | 84.28% | 84.40% | 84.21% | 84.21% | 84.22% |
| 15 | Enhanced Ohta color histogram descriptors (RBF)-proposed | 76.03% | 77.04% | 76.23% | 76.63% | 76.23% |
| 16 | Ohta Color-GIST descriptors (RBF)-proposed | 90.57% | 90.56% | 90.56% | 90.56% | 90.57% |
| 17 | Ohta Color-GIST descriptors (linear)-proposed | 88.21% | 88.21% | 88.21% | 88.21% | 88.21% |
| 18 | Ohta Color-GIST wavelet descriptors (linear)-proposed | 88.61% | 88.61% | 88.59% | 88.60% | 88.59% |
| 19 | Ohta Color-GIST wavelet descriptors (RBF)-proposed | 90.57% | 90.56% | 90.56% | 90.56% | 90.57% |
| 20 | spatial color-gist wavelet descriptors (linear) -proposed | 95.29% | 95.28% | 95.28% | 95.28% | 95.28% |
| 21 | spatial color-gist wavelet descriptors (RBF)-proposed | 95.48% | 95.50% | 95.47% | 95.48% | 95.47% |
AUC Area under the receiver operating characteristic curve, CR Classification rate, RBF Radial basis function kernel, SIFT-LLC SIFT with locality-constrained linear coding, SIFT-ScSPM SIFT with sparse coding based spatial pyramid matching, SIFT-SPM SIFT with spatial pyramid matching, SPM Spatial pyramid matching, HOG Histogram of oriented gradients, SURF Speeded up robust features, CENTRIST Census transform histogram
Scene categorization performance on the Indoor Outdoor dataset
| No | Algorithms | CR | Precision | Recall | F-measure | AUC |
|---|---|---|---|---|---|---|
| 1 | CENTRIST-spatial pyramid (RBF) | 97.27% | 97.47% | 95.62% | 96.54% | 95.63% |
| 2 | CENTRIST-spatial pyramid (linear) | 95.81% | 95.36% | 94% | 94.67% | 94% |
| 3 | Color-GIST descriptors (RBF) | 89.63% | 92.89% | 81.41% | 86.77% | 81.42% |
| 4 | Color-GIST descriptors (linear) | 83.27% | 78.84% | 81.83% | 80.31% | 81.83% |
| 5 | Wavelet descriptors (RBF) | 74.54% | 69.11% | 71.45% | 70.26% | 71.46% |
| 6 | CENTRIST (linear) | 93.63% | 92.88% | 90.83% | 91.84% | 90.83% |
| 7 | CENTRIST (RBF) | 97.27% | 96.14% | 97.08% | 96.61% | 97.08% |
| 8 | SIFT-ScSPM (linear) | 98.72% | 98.49% | 98.29% | 98.39% | 98.29% |
| 9 | SIFT-LLC (linear) | 99.27% | 98.88% | 99.29% | 99.08% | 99.29% |
| 10 | SIFT-SPM (Chi-square) | 57.81% | 51.75% | 52.04% | 51.89% | 52.04% |
| 11 | HOG-SPM (Chi-square) | 71.81% | 51.79% | 50.20% | 50.98% | 50.21% |
| 12 | SURF- SPM (Chi-square) | 73.27% | 86.56% | 51% | 64.18% | 51.00% |
| 13 | Enhanced-GIST (RBF) | 97.81% | 96.73% | 97.87% | 97.30% | 97.88% |
| 14 | Enhanced-GIST (linear) | 96.72% | 94.98% | 97.12% | 96.04% | 97.12% |
| 15 | Enhanced Ohta color histogram descriptors (RBF)-proposed | 73.45% | 65.11% | 61.33% | 63.16% | 61.33% |
| 16 | Ohta Color-GIST descriptors (RBF)-proposed | 98.73% | 98.29% | 98.50% | 98.39% | 98.50% |
| 17 | Ohta Color-GIST descriptors (linear)-proposed | 96.72% | 94.86% | 97.33% | 96.08% | 97.33% |
| 18 | Ohta Color-GIST wavelet descriptors (linear)-proposed | 97.27% | 95.70% | 97.70% | 96.69% | 97.71% |
| 19 | Ohta Color-GIST wavelet descriptors (RBF)-proposed | 98.73% | 98.29% | 98.50% | 98.39% | 98.50% |
| 20 | Spatial Color-gist wavelet descriptors (linear) -proposed | 99.45% | 99.21% | 99.42% | 99.31% | 99.42% |
| 21 | spatial color-gist wavelet descriptors (RBF)-proposed | 99.82% | 99.67% | 99.87% | 99.77% | 99.88% |
AUC Area under the receiver operating characteristic curve, CR Classification rate, RBF Radial basis function kernel, SIFT-LLC SIFT with locality-constrained linear coding, SIFT-ScSPM SIFT with sparse coding based spatial pyramid matching, SIFT-SPM SIFT with spatial pyramid matching, SPM Spatial pyramid matching, HOG Histogram of oriented gradients, SURF Speeded up robust features, CENTRIST Census transform histogram
Computational cost calculation
| Type of feature | SVM classifier method | IITM-SCID2 Dataset | Indoor-Outdoor Dataset | MIT-67 indoor scene classification Dataset | |||
|---|---|---|---|---|---|---|---|
| Recognition rate | Average time elapsed, in seconds per frame | Recognition rate | Average time elapsed, in seconds per frame | Recognition rate | Average time elapsed, in seconds per frame | ||
| Spatial color gist wavelet descriptors | linear kernel | 95.29% | 2.58 | 99.45% | 2.80 | 4.92% | 1.74 |
| RBF kernel | 95.48% | 2.60 | 99.82% | 2.89 | 2.08% | 2.12 | |
SVM Support vector machine, IITM-SCID2 Indian institute of technology madras-scene classification image database two, RBF Radial basis function kernel
Fig. 7Confusion matrices obtained for spatial color gist wavelet descriptors on Indian institute of technology madras-scene classification image database two dataset. a Linear kernel; b Radial basis function kernel
Fig. 8Confusion matrices obtained for Spatial color gist wavelet descriptors on Indoor Outdoor dataset. a Linear kernel; b Radial basis function kernel