| Literature DB >> 33495686 |
Manish Kumar1, Kota Solomon Raju2, Dinesh Kumar1, Nitin Goyal3, Sahil Verma4, Aman Singh5.
Abstract
Smart city surveillance systems are the battery operated light weight Internet of Things (IoT) devices. In such devices, automatic face recognition requires a low powered memory efficient visual computing system. For these real time applications in smart cities, efficient visual recognition systems are need of the hour. In this manuscript, efficient fast subspace decomposition over Chi Square transformation is proposed for IoT based on smart city surveillance systems. The proposed technique extracts the features for visual recognition using local binary pattern histogram. The redundant features are discarded by applying the fast subspace decomposition over the Gaussian distributed Local Binary Pattern (LBP) features. This redundancy is major contributor to memory and time consumption for battery based surveillance systems. The proposed technique is suitable for all visual recognition applications deployed in IoT based surveillance devices due to higher dimension reduction. The validation of proposed technique is proved on the basis of well-known databases. The technique shows significant results for all databases when implemented on Raspberry Pi. A comparison of the proposed technique with already existing/reported techniques for the similar applications has been provided. Least error rate is achieved by the proposed technique with maximum feature reduction in minimum time for all the standard databases. Therefore, the proposed algorithm is useful for real time visual recognition for smart city surveillance.Entities:
Keywords: Fast subspace decomposition; Feature reduction; Local binary pattern; Minimum error rate
Year: 2021 PMID: 33495686 PMCID: PMC7816836 DOI: 10.1007/s11042-020-10471-x
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.757
Comparison of different algorithms with respect to database and real time implementation
| Technique Used | Accuracy | Databases Tested | Real Time | Complexity |
|---|---|---|---|---|
| Optimal [ | Recognition accuracy for face database ORL is 93%, for Facial Recognition Technology (FERET) is 79%, for Extended Yale face database is 83% achieved | ORL, FERET, Extended Yale B, Carnegie Mellon University (CMU), Pose, Illumination and Expression (PIE) and Aleix and Robert (AR) databases. | No | High |
| Local [ | achieving a face verification rate of 88.1% at 0.1% false accept rate | Extended Yale-B, Chinese Academy of Sciences-Pose, Expression, Accessories, and Lighting (CAS-PEAL-R1) and Face Recognition Grand Challenge (FRGC)-204 data set. | Yes | Low |
| Local Binary [ | Accuracy is more than 95% | FERET Database | No | Low |
| Noise-resistant [ | Optimum accuracy is achieved. | AR Database extended Yale dataset, O2FN Dataset. With noise | No | Medium |
| Chi-squared [ | Error rate reduced to 0.38% for AR dataset. 1.5% for O2FN 9.5% for LFW 8.6% for 2D Hela | AR Database, LFW dataset, O2FN Dataset. 2D hela database also texture database | No | Medium |
| Karhunen–Loeve transform and SVM [ | Accuracy achieved 97.5%, 97.5%, 91.5% and 90.6% | ORL database | No | High |
| Haar wavelet [ | Accuracy achieved up to optimum level. | Faces 94 Directory dataset | No | High |
| Compound dictionary learning based classification (CDLBC) [ | Accuracy achieved with number of the training samples of each Subject is 90.14%, 65.4%, 63.88% and 37.01% respectively | ORL, FERET, Georgia Tech (GT), LFW database, | No | High |
| Linear representation-based classification method using Fisher discriminant [ | Accuracy achieved for ORL database is 85%, for XM2VTS face database 95% | ORL, AR and XM2VTS (Multi Modal Verification for Teleservices and Security applications) face databases. | No | High |
Fig. 1Block of real time vision system on Raspberry Pi board
Fig. 2a Extracted feature image, b Histogram vector
Fig. 3Proposed architecture
Fig. 4AR dataset [5]
Comparison of the existing approaches on different database
| S. No. | Algorithms | AR Dataset Error Rate (%) | O2FN Dataset Error Rate (%) | LFW Dataset Error Rate (%) | Dyn Tex++ Dataset Error rate (%) |
|---|---|---|---|---|---|
| 1 | Nearest-Neighbor Classifier with Euclidean distance (NNC-ED) [ | 3.43 | 33.20 | 28.70 | 10.43 |
| 2 | Nearest-Neighbor Classifier with Chi Squared Distance (NNC-Chi2D) [ | 1.33 | 30.00 | 26.35 | 5.92 |
| 3 | Principal Component Analysis with Mahalanobis distance (PCA-mMDC) [ | 2.48 | 5.00 | 21.05 | 5.48 |
| 4 | Chi Squared Transformation-Asymmetric Principal Component Analysis (CST-APCA) [ | 0.38 | 1.50 | 9.50 | 3.02 |
| 5 | Principal Component Analysis with Mahalanobis distance (SR-PCA-mMDC) [ | 1.33 | 2.80 | 15.88 | 4.60 |
| 6 | Power mean support vector machine with Chi Squared Distance (PmSVM-Chi2) [ | 0 | 4.00 | 13.67 | 11.79 |
| 7 | Power mean support vector machine with Histogram intersection kernel (PmSVM-HI) [ | 0 | 4.10 | 13.43 | 11.83 |
| 8 | Joint Bayesian [ | – | – | 15.92 | – |
| 9 | Distance Learning Pegasos (DL-PEGASOS) [ | 36.30 | |||
| 10 | Dynamic fractal analysis (DFS) [ | – | – | – | 10.10 |
| 11 | Dimensionality Reduced Local Directional Pattern (DR-LDP) [ | – | – | 26.87 ± 0.7 | – |
| 12 | Modified Dimensionality Reduced Local Directional Pattern (MDR-LDP) [ | 21.28 ± 0.6 | |||
| 13 | Compound Dictionary Learning Based Classification (CDLBC) [ | 2.10 | 4.20 | 20.8 | |
| 14 | Hybrid feature extraction (HFE) [ | 1.9 | 3.9 | 13.7 | |
| 15 |
Fig. 5O2FN mobile dataset [23]
Fig. 6LFW dataset [41]
Fig. 7Percentage reduced features with respect to percentage error rate
Fig. 8Performance of different algorithms for standard databases
Fig. 9a Analysis of precision, sensitivity and F-measure of AR dataset of the proposed approach with existing algorithms. b Analysis of precision, sensitivity and F-measure of O2FN mobile dataset of the proposed approach with existing algorithms. c Analysis of precision, sensitivity and F-measure of LFW dataset of the proposed approach with existing algorithms. d Analysis of precision, sensitivity and F-measure of Dynamic Texture dataset of the proposed approach with existing algorithms
Fig. 10a TAR Vs FAR comparative analysis of Dynamic Texture database of existing and proposed algorithms. b TAR Vs FAR comparative analysis of LFW database of existing and proposed algorithms. c TAR Vs FAR comparative analysis of O2FN database database of existing and proposed algorithms. d TAR Vs FAR comparative analysis of AR database of existing and proposed algorithms
Fig. 11Percentage accuracy comparisons
Feature comparison of the proposed algorithm with the existing approaches in the literature
| Sr. No. | Technique(s) Applied | No. of Bins | Feature Reduction in Comparison to basic LBP |
| 1. | Basic LBP [ | 256 × 8 × 8 = 214 = 16,384 | |
| 2. | LBP, CST and PCA [ | 59 × 8 × 8 = 3776 reduced by APCA | Greater than 213.6645 |
| 3. | LBP-Three Orthogonal Plane [ | 3 × 2P × 8 × 8 = 49,152 | Nil (−32,738) |
| 4. | LBPHF [ | 59 × 8 × 8 = 211.885 = 3776 | 213.622 |
| 5. | Noise Resistance LBP [ | 59 × 8 × 8 = 211.885 = 3776 | 213.622 |
| Extended NRLBP [ | 107 × 8 × 8 = 212.6545 | 213.2785 | |
| 6. | Concatenation of RLBP and DLBP [ | 196 × 8 × 8 = 12,544 | |
| 7. | Maximum conditional mutual [ | 256 × 8 × 8 = 214 = 16,384 | Up to some extent |
| 59 × 8 × 8 = 3776 reduced by Fast Subspace Technique | Greater than 213.718 ~ 13,476 |