| Literature DB >> 35572051 |
Nayaneesh Kumar Mishra1, Sumit Kumar1, Satish Kumar Singh1.
Abstract
In wake of COVID-19, the world has adapted to a new order. People have started wearing mask on their faces to prevent getting infected. The present face recognition models are no longer proving to be efficient in the current circumstances. This is because, most of the informative part of the face is covered by mask. The periocular recognition therefore holds the key to future of face recognition. However, the periocular region proves to be insufficiently enough to generate highly discriminative features. Also, most of the pre-COVID-19 algorithms fail to work in cases, where the number of training images available is very less. We propose a lightweight periocular recognition framework that uses thermo-visible features and ensemble subspace network classifier to improve upon the existing periocular recognition systems named as Masked Mobile Lightweight Thermo-visible Face Recognition (MmLwThV). The framework successfully improves the accuracy over a single visible modality by mitigating the effect of noise present in the thermo-visible features. The experiments on WHU-IIP dataset and an in-house collected dataset named, CVBL masked dataset, successfully validate the efficacy of our proposed framework. The MmLwFR framework is lightweight and can be easily deployed on mobile phones with a visible and an infrared camera.Entities:
Keywords: COVID-19; Ensemble of networks; Masked face recognition; Periocular recognition; Random subspace sampling; Thermo visible fusion
Year: 2022 PMID: 35572051 PMCID: PMC9084274 DOI: 10.1007/s10489-022-03517-0
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
Fig. 1Registered thermal and visible images of WHU-IIP dataset. The WHU-IIP dataset has no masked faces
Fig. 2Registered thermal and visible images of CVBL Masked dataset. The dataset is unconstrained and captured in real world environment. There is high variation in illumination, pose, distance of subject of the camera. The subjects have real masks on their faces. The quality of thermal images is also affected by the intensity of the day light when the image was captured
Comparative Results of Face Recognition for MASKED (M) and UNMASKED (Un) faces on CVBL Masked Dataset. The experiment has been performed on features from Visible (V) and Thermal (T) images
| Classifiers | M/Un | LBP | LBDP | LDP | LVP | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| V | T | V+T | V | T | V+T | V | T | V+T | V | T | V+T | ||
| Minimum | |||||||||||||
| Distance | Un | 81.78 | 81.78 | 81.78 | 87.71 | 72.46 | 92.80 | 71.19 | 58.90 | 84.32 | 76.69 | 55.51 | 86.44 |
| Classifier | M | 19.41 | 5.91 | 10.97 | 5.06 | 21.94 | 9.70 | 13.08 | 13.08 | 15.61 | 10.55 | 14.35 | 16.46 |
| Linear SVM | Un | 64.4 | 65.7 | 64.4 | 81.8 | 73.7 | 89.4 | 58.5 | 51.3 | 72.5 | 57.2 | 50.8 | 70.8 |
| M | 21.1 | 4.64 | 7.59 | 10.97 | 21.52 | 15.19 | 13.92 | 14.77 | 12.66 | 12.24 | 16.03 | 12.24 | |
| Quadratic SVM | Un | 67.8 | 67.8 | 68.2 | 84.7 | 77.5 | 90.3 | 64.4 | 56.4 | 77.1 | 67.8 | 55.1 | 79.2 |
| M | 21.10 | 4.22 | 7.59 | 10.13 | 20.25 | 12.66 | 15.19 | 16.46 | 13.92 | 13.08 | 16.88 | 12.24 | |
| Cubic SVM | Un | 68.2 | 66.9 | 67.4 | 84.3 | 75.8 | 89.4 | 62.3 | 50.0 | 76.3 | 64.8 | 54.2 | 76.7 |
| M | 20.68 | 4.22 | 8.02 | 8.86 | 20.68 | 14.35 | 13.62 | 15.61 | 13.50 | 13.50 | 17.72 | 12.24 | |
| Fine | Un | 7.6 | 8.1 | 7.2 | 9.7 | 11.4 | 7.6 | 6.4 | 7.2 | 6.4 | 6.8 | 6.4 | 6.8 |
| Gaussian SVM | M | 4.22 | 4.22 | 4.22 | 4.22 | 7.59 | 4.22 | 4.22 | 4.22 | 4.22 | 4.22 | 4.22 | 4.22 |
| Medium | Un | 55.5 | 56.8 | 57.2 | 72.5 | 69.1 | 75.8 | 53.0 | 41.5 | 64.8 | 49.2 | 48.3 | 62.3 |
| Gaussian SVM | M | 9.28 | 6.33 | 6.75 | 5.06 | 21.52 | 4.64 | 10.13 | 15.19 | 14.77 | 10.97 | 15.61 | 13.50 |
| Coarse Gaussian | Un | 29.2 | 29.2 | 30.1 | 46.2 | 32.6 | 46.6 | 24.2 | 24.6 | 30.9 | 22.9 | 28.0 | 29.2 |
| SVM | M | 15.61 | 3.80 | 8.86 | 6.33 | 20.68 | 13.50 | 9.28 | 12.24 | 13.50 | 6.75 | 11.81 | 17.72 |
| Fine KNN | Un | 63.1 | 62.3 | 60.6 | 85.6 | 62.3 | 90.3 | 58.5 | 55.1 | 78.0 | 63.6 | 57.2 | 76.3 |
| M | 19.41 | 4.64 | 10.97 | 9.28 | 18.14 | 13.92 | 7.59 | 16.88 | 13.92 | 13.50 | 15.61 | 17.30 | |
| Medium KNN | Un | 61.9 | 57.6 | 60.2 | 71.2 | 59.3 | 79.2 | 58.5 | 45.8 | 63.6 | 55.5 | 53.0 | 70.3 |
| M | 17.30 | 5.91 | 11.39 | 4.64 | 14.77 | 11.81 | 12.66 | 18.57 | 13.50 | 12.24 | 20.25 | 19.41 | |
| Coarse KNN | Un | 23.7 | 25.8 | 25.0 | 20.3 | 24.6 | 23.3 | 27.5 | 16.1 | 22.5 | 28.8 | 20.0 | 23.7 |
| M | 18.99 | 6.75 | 14.35 | 5.91 | 16.03 | 9.28 | 6.75 | 12.24 | 10.13 | 8.44 | 17.72 | 14.35 | |
| Cosine KNN | Un | 58.5 | 57.2 | 58.1 | 72.5 | 58.5 | 83.9 | 45.8 | 31.4 | 67.4 | 59.7 | 42.4 | 69.1 |
| M | 15.19 | 7.17 | 11.39 | 4.22 | 19.83 | 13.92 | 9.28 | 14.77 | 16.03 | 11.39 | 16.88 | 21.70 | |
| Cubic KNN | Un | 51.7 | 46.2 | 51.7 | 65.3 | 48.3 | 69.1 | 51.7 | 49.2 | 61.9 | 53.0 | 51.7 | 68.2 |
| M | 15.19 | 5.49 | 11.39 | 3.38 | 13.50 | 7.59 | 13.92 | 18.57 | 14.35 | 12.24 | 16.88 | 17.72 | |
| Weighted KNN | Un | 64.8 | 60.6 | 63.6 | 76.7 | 61.9 | 84.7 | 58.5 | 52.1 | 69.1 | 59.3 | 53.0 | 73.7 |
| M | 14.77 | 7.17 | 11.81 | 5.49 | 18.14 | 12.24 | 10.97 | 18.99 | 12.24 | 11.39 | 20.25 | 20.25 | |
| Ensemble | |||||||||||||
| Subspace | Un | 79.2 | 78.4 | 71.6 | 78.8 | 63.6 | 89.8 | 73.3 | 65.3 | 86.4 | 77.1 | 66.1 | 84.3 |
| Discriminant | M | 24.47 | 9.70 | 12.24 | 8.02 | 27.00 | 13.92 | 15.61 | 22.36 | 18.99 | 19.83 | 23.21 | 21.10 |
| Ensemble | Un | 82.6 | 82.6 | 83.1 | 87.7 | 75.4 | 92.8 | 71.2 | 62.7 | 83.1 | 79.7 | 58.1 | 86.4 |
| Subspace KNN | M | 22.36 | 5.91 | 10.13 | 3.80 | 22.36 | 12.66 | 13.50 | 15.61 | 15.19 | 11.39 | 14.77 | 16.88 |
Recognition Accuracy for different face regions for Thermal and Visible Images using Euclidean distance based prediction
| Face | Thermal | Visible | Thermo- |
|---|---|---|---|
| Region | Image | Image | visible |
| Face | 98.99 | 98.23 | 100 |
| Left Eye | 81.75 | 56.05 | 92.05 |
| Right Eye | 75.51 | 58.59 | 91.29 |
Fig. 3Complete block diagram of MmLwThV framework
Fig. 4The Anchor, Positive and Negative Visible Images. For correct classification, the Intra-class distance should be less than the Inter-class distance
Fig. 5The figure explains how the mask causes noise in the thermo-visible features. The noise affects the discriminative ability of the features
Fig. 6Random Subspace Sampling Method
Results of MmLwThV framework on WHU-IIP dataset on Thermal(T) and Visible (V) Images for different Features
| Classifiers | LBP | LBDP | LDP | LVP | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| V | T | V+T | V | T | V+T | V | T | V+T | V | T | V+T | |
| Minimum Distance | ||||||||||||
| Classifier | 97.1 | 97.73 | 98.86 | 83.46 | 95.33 | 97.60 | 49.87 | 26.26 | 54.8 | 91.54 | 54.17 | 91.41 |
| Linear SVM | 96.6 | 90.8 | 99.1 | 71.7 | 87.4 | 93.6 | 83.7 | 73.5 | 92.6 | 91.7 | 81.3 | 96.1 |
| Quadratic SVM | 97.0 | 90.9 | 99.2 | 73.6 | 86.9 | 93.3 | 85.2 | 75.4 | 94.1 | 92.6 | 83.3 | 96.7 |
| Cubic SVM | 96.7 | 89.9 | 98.9 | 70.8 | 84.8 | 91.9 | 84.5 | 73.7 | 93.8 | 92.6 | 82.6 | 96.8 |
| Fine Gaussian SVM | 5.2 | 5.1 | 4.8 | 6.3 | 6.1 | 6.4 | 3.4 | 2.8 | 3.7 | 3.0 | 2.8 | 3.3 |
| Medium Gaussian SVM | 95.7 | 90.7 | 98.4 | 74.5 | 83.0 | 90.7 | 82.8 | 68.3 | 92.7 | 92.7 | 81.4 | 96.2 |
| Coarse Gaussian SVM | 16.2 | 15.7 | 16.7 | 14.0 | 13.5 | 16.2 | 15.0 | 13.6 | 14.8 | 15.4 | 14.0 | 16.0 |
| Fine KNN | 74.2 | 46.1 | 73.7 | 59.6 | 76.4 | 85.9 | 39.5 | 19.4 | 44.2 | 71.5 | 34.8 | 74.7 |
| Medium KNN | 69.6 | 45.6 | 63.5 | 51.6 | 74.0 | 78.7 | 37.1 | 17.6 | 37.8 | 72.0 | 30.3 | 71.8 |
| Coarse KNN | 39.1 | 23.1 | 32.4 | 21.7 | 31.9 | 38.5 | 21.7 | 9.2 | 17.7 | 51.4 | 18.7 | 53.9 |
| Cosine KNN | 92.0 | 82.3 | 97.5 | 58.2 | 77.1 | 84.0 | 70.6 | 51.3 | 83.2 | 84.0 | 69.4 | 91.4 |
| Cubic KNN | 51.9 | 34.5 | 44.9 | 40.8 | 62.8 | 65.2 | 21.0 | 12.1 | 20.3 | 54.8 | 20.1 | 46.5 |
| Weighted KNN | 74.5 | 50.1 | 68.7 | 56.8 | 76.4 | 81.7 | 40.2 | 17.0 | 40.8 | 74.6 | 32.4 | 77.1 |
| Ensemble Subspace | ||||||||||||
| Discriminant | 99.1 | 97.2 | 68.6 | 85.1 | 88.8 | 91.4 | 76.4 | 96.7 | 98.1 | 89.5 | 98.5 | |
| Ensemble | ||||||||||||
| Subspace KNN | 97.0 | 98.1 | 83.5 | 97.1 | 98.2 | 87.6 | 75.4 | 93.4 | 96.6 | 79.2 | 96.6 | |
Results of MmLwThV framework on CVBL Masked dataset on Thermal(T) and Visible (V) Images for different Features
| Classifiers | LBP | LBDP | LDP | LVP | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| V | T | V+T | V | T | V+T | V | T | V+T | V | T | V+T | |
| Minimum Distance | ||||||||||||
| Classifier | 65.11 | 26.81 | 65.95 | 56.17 | 34.47 | 53.19 | 36.60 | 20.43 | 34.04 | 36.6 | 31.49 | 42.98 |
| Linear SVM | 45.11 | 29.36 | 42.98 | 31.91 | 28.51 | 37.02 | 34.04 | 21.70 | 34.89 | 34.04 | 25.53 | 34.47 |
| Quadratic SVM | 50.64 | 29.79 | 45.53 | 35.32 | 27.23 | 38.72 | 39.15 | 24.26 | 40.85 | 39.15 | 29.36 | 42.55 |
| Cubic SVM | 50.64 | 30.21 | 44.68 | 37.45 | 29.79 | 39.15 | 38.30 | 22.98 | 40.85 | 37.02 | 26.81 | 41.28 |
| Fine Gaussian SVM | 4.26 | 4.26 | 4.26 | 4.26 | 8.09 | 4.26 | 4.26 | 4.26 | 4.26 | 4.26 | 4.26 | 4.26 |
| Medium Gaussian SVM | 41.28 | 23.83 | 32.77 | 20.43 | 23.83 | 22.98 | 28.94 | 23.40 | 28.94 | 32.77 | 29.79 | 37.02 |
| Coarse Gaussian SVM | 17.02 | 13.19 | 17.02 | 17.02 | 20.00 | 19.57 | 10.64 | 16.17 | 17.45 | 11.49 | 16.17 | 17.87 |
| Fine KNN | 36.17 | 22.55 | 33.62 | 48.09 | 30.64 | 39.15 | 29.79 | 24.68 | 32.77 | 29.79 | 31.91 | 36.17 |
| Medium KNN | 34.47 | 22.55 | 34.89 | 37.87 | 30.21 | 35.32 | 31.91 | 25.11 | 33.62 | 32.77 | 28.51 | 36.60 |
| Coarse KNN | 18.30 | 14.89 | 17.02 | 12.77 | 15.74 | 16.60 | 12.34 | 14.89 | 25.96 | 14.47 | 14.47 | 19.57 |
| Cosine KNN | 42.13 | 33.62 | 45.53 | 40.00 | 25.96 | 40.43 | 34.04 | 22.13 | 34.04 | 31.91 | 27.23 | 34.89 |
| Cubic KNN | 23.40 | 23.40 | 28.09 | 34.04 | 21.28 | 29.36 | 30.64 | 24.26 | 31.06 | 31.91 | 27.66 | 36.17 |
| Weighted KNN | 33.62 | 23.83 | 37.02 | 40.00 | 32.34 | 40.00 | 34.04 | 25.11 | 31.06 | 32.77 | 29.36 | 38.30 |
| Ensemble Subspace | ||||||||||||
| Discriminant | 48.94 | 36.60 | 38.72 | 28.94 | 51.06 | 48.09 | 37.87 | 51.49 | 48.94 | 40.43 | 54.47 | |
| Ensemble | ||||||||||||
| Subspace KNN | 68.94 | 28.94 | 54.47 | 35.74 | 54.47 | 36.17 | 22.13 | 36.60 | 37.87 | 30.64 | 42.98 | |
Results of Wilcoxon Signed-ranks Test for the MmLwThV framework on WHU-IIT masked dataset. Ensemble Subspace Discriminant is compared with all other classifiers. The other classifiers are represented by the letter in brackets. Ensemble subspace KNN (B), Minimum distance classifier (C), Linear SVM (D), Quadratic SVM(E), Cubic SVM (F), Fine Gaussian SVM (G) , Medium Gaussian SVM (H), Core Gaussian SVM (J), Fine KNN (K), Medium KNN (L), Course KNN (M), Cosine KNN (N), Cubic KNN (O), Weighted KNN (P). Also, S stands for similar and D stands for Different
| Parameters | B | C | D | E | F | G | H | J | K | L | M | N | O | P |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R+ | 44 | 56 | 62 | 59.5 | 66 | 78 | 68 | 78 | 78 | 78 | 78 | 78 | 78 | 78 |
| R- | 34 | 22 | 16 | 18.5 | 12 | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Stats | 34 | 22 | 16 | 18.5 | 12 | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| T | S | S | S | S | D | D | D | D | D | D | D | D | D | D |
Results of Wilcoxon Signed-ranks Test for the MmLwThV framework on CVBL masked dataset. Ensemble Subspace KNN is compared with all other classifiers. The other classifiers are represented by the letter in brackets. Ensemble subspace KNN(A), Ensemble subspace discriminant (B), Minimum distance classifier (C), Linear SVM (D), Quadratic SVM(E) , Cubic SVM (F), Fine Gaussian SVM (G) , Medium Gaussian SVM (H), Core Gaussian SVM (J), Fine KNN (K), Medium KNN (L), Course KNN (M), Cosine KNN (N), Cubic KNN (O), Weighted KNN (P). Also, S stands for similar and D stands for Different
| Parameters | B | C | D | E | F | G | H | J | K | L | M | N | O | P |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R+ | 29 | 56.5 | 77 | 54.5 | 61.5 | 77 | 76 | 78 | 75 | 75.5 | 78 | 72 | 77 | 75 |
| R- | 49 | 9.5 | 1 | 23.56 | 16.5 | 0 | 2 | 0 | 3 | 2.5 | 0 | 6 | 1 | 3 |
| Stats | 29 | 9.5 | 1 | 23.5 | 16.5 | 0 | 2 | 0 | 3 | 2.5 | 0 | 6 | 1 | 3 |
| T | S | D | D | S | S | D | D | D | D | D | D | D | D | D |
Comparison with state-of-the-art methods
| Method | Accuracy in % |
|---|---|
| Alexnet [ | 42.55 |
| GoogleNet [ | 43.83 |
| VGG-16 [ | 39.15 |
| ResNet-50 [ | 61.70 |
| ResNet-18 [ | 55.31 |
| ResNet-34 [ | 61.27 |
| ResNet-18 [ | 15.74 |
| ResNet-34 [ | 9.78 |
| MmLwThV framework | 70.64 |