| Literature DB >> 34764616 |
Hoai Nam Vu1, Mai Huong Nguyen2, Cuong Pham1.
Abstract
Face recognition is one of the most common biometric authentication methods as its feasibility while convenient use. Recently, the COVID-19 pandemic is dramatically spreading throughout the world, which seriously leads to negative impacts on people's health and economy. Wearing masks in public settings is an effective way to prevent viruses from spreading. However, masked face recognition is a highly challenging task due to the lack of facial feature information. In this paper, we propose a method that takes advantage of the combination of deep learning and Local Binary Pattern (LBP) features to recognize the masked face by utilizing RetinaFace, a joint extra-supervised and self-supervised multi-task learning face detector that can deal with various scales of faces, as a fast yet effective encoder. In addition, we extract local binary pattern features from masked face's eye, forehead and eyebow areas and combine them with features learnt from RetinaFace into a unified framework for recognizing masked faces. In addition, we collected a dataset named COMASK20 from 300 subjects at our institution. In the experiment, we compared our proposed system with several state of the art face recognition methods on the published Essex dataset and our self-collected dataset COMASK20. With the recognition results of 87% f1-score on the COMASK20 dataset and 98% f1-score on the Essex dataset, these demonstrated that our proposed system outperforms Dlib and InsightFace, which has shown the effectiveness and suitability of the proposed method. The COMASK20 dataset is available on https://github.com/tuminguyen/COMASK20 for research purposes.Entities:
Keywords: Face recognition; Local binary pattern; Masked face recognition
Year: 2021 PMID: 34764616 PMCID: PMC8363871 DOI: 10.1007/s10489-021-02728-1
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
Fig. 1Proposed method pipeline
Fig. 2Angle between two vectors indicating the difference
The proposed model’s performance for different parameter values of K and distance functions on COMASK20
| Euclidean | Cosine | LBP-voting | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | F1-score | Precision | Recall | F1-score | Precision | Recall | F1-score | ||
| faces94 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces96 | 0.99 | 0.89 | 0.94 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| grimace | 0.95 | 0.94 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | |
| k = 1 | COMASK20 | 0.97 | 0.91 | 0.94 | 0.97 | 0.94 | 0.95 | 0.96 | 0.95 | 0.95 |
| faces94 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces96 | 0.99 | 0.89 | 0.94 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| grimace | 0.95 | 0.94 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | |
| k = 3 | COMASK20 | 0.97 | 0.89 | 0.93 | 0.97 | 0.92 | 0.94 | 0.95 | 0.94 | 0.94 |
| faces94 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces96 | 0.99 | 0.88 | 0.93 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| grimace | 0.95 | 0.94 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | |
| k = 5 | COMASK20 | 0.85 | 0.69 | 0.76 | 0.93 | 0.82 | 0.87 | 0.87 | 0.87 | 0.87 |
| faces94 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces96 | 0.99 | 0.88 | 0.93 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| grimace | 0.95 | 0.94 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | |
| k = 6 | COMASK20 | 0.83 | 0.66 | 0.74 | 0.92 | 0.80 | 0.86 | 0.86 | 0.86 | 0.86 |
| faces94 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| faces96 | 0.99 | 0.89 | 0.94 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| grimace | 0.95 | 0.94 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | |
| k = 10 | COMASK20 | 0.75 | 0.54 | 0.63 | 0.88 | 0.74 | 0.80 | 0.81 | 0.81 | 0.81 |
Fig. 3Localization of landmark points for input images (top) on the whole face (middle) and on specific ROIs (bottom)
Fig. 4Binary code and value are made for a pixel
Fig. 5Feature histogram created by the LBP pipeline
Essex’s face recognition dataset summary
| Resolution | Initial reported | After manual check | |
|---|---|---|---|
| Faces94 | 180 x 200 pixels | 153 | 152 |
| Faces95 | 180 x 200 pixels | 72 | 72 |
| Faces96 | 196 x 196 pixels | 152 | 152 |
| Grimace | 180 x 200 pixels | 18 | 18 |
Fig. 6Original face images (first row) and Masked face images (second row) in COMASK20 dataset
Fig. 7Original face images (first row) and Masked face images (second row) in Essex dataset
Comparison table between our purposed distance functions vs Euclidean and Cosine on COMASK 20
| Distance function | Precision | Recall | F1-score | Inference time |
|---|---|---|---|---|
| Euclidean | 0.85 | 0.69 | 0.76 | 1.11337 |
| Cosine | 0.93 | 0.82 | 0.87 | 1.14166 |
| Our proposal | 0.87 | 0.87 | 0.87 | 1.01947 |
The face recognition performance for 512-D vs 1024-D feature vectors
| Precision | Recall | F1-core | Inference time | |
|---|---|---|---|---|
| 512-D | 0.87 | 0.87 | 0.87 | 0.96046 |
| 1024-D | 0.89 | 0.87 | 0.88 | 1.29443 |
Comparison of our proposed method vs. other methods on the Essex dataset (K = 5)
| Precision | Recall | F1-score | ||
|---|---|---|---|---|
| faces94 | Dlib | 0.52 | 0.31 | 0.39 |
| Dlib + LBP-based voting | 0.78 | 0.56 | 0.65 | |
| InsightFace | 0.72 | 0.59 | 0.65 | |
| Insightface (euclidean) + LBP | 0.99 | 0.99 | 0.99 | |
| Insightface (cosine) + LBP | 0.99 | 0.99 | 0.99 | |
| Our proposed method | 0.99 | 0.99 | 0.99 | |
| faces95 | Dlib | 0.67 | 0.39 | 0.49 |
| Dlib + LBP-based voting | 0.86 | 0.57 | 0.69 | |
| InsightFace | 0.76 | 0.55 | 0.64 | |
| Insightface (euclidean) + LBP | 0.99 | 0.98 | 0.98 | |
| Insightface (cosine) + LBP | 0.99 | 0.99 | 0.99 | |
| Our proposed method | 0.99 | 0.99 | 0.99 | |
| faces96 | Dlib | 0.59 | 0.27 | 0.37 |
| Dlib + LBP-based voting | 0.82 | 0.41 | 0.55 | |
| InsightFace | 0.64 | 0.34 | 0.44 | |
| Insightface (euclidean) + LBP | 0.99 | 0.88 | 0.93 | |
| Insightface (cosine) + LBP | 0.99 | 0.97 | 0.98 | |
| Our proposed method | 0.99 | 0.97 | 0.98 | |
| grimace | Dlib | 0.79 | 0.49 | 0.6 |
| Dlib + LBP-based voting | 0.79 | 0.51 | 0.62 | |
| InsightFace | 0.79 | 0.47 | 0.59 | |
| Insightface (euclidean) + LBP | 0.95 | 0.94 | 0.94 | |
| Insightface (cosine) + LBP | 1 | 1 | 1 | |
| Our proposed method | 1 | 1 | 1 |
Comparison of our proposed method vs other methods on COMASK20 (K = 5)
| Precision | Recall | F1-score | Prediction time | |
|---|---|---|---|---|
| Dlib | 0.12 | 0.13 | 0.12 | 1.54875 |
| Dlib + LBP-based voting | 0.22 | 0.20 | 0.21 | 1.57038 |
| InsightFace | 0.58 | 0.46 | 0.51 | 0.99812 |
| Insightface (euclidean) + LBP | 0.85 | 0.69 | 0.76 | 1.11337 |
| Insightface (cosine) + LBP | 0.93 | 0.82 | 0.87 | 1.14166 |
| Insightface + Gabor (1 kernel) | 0.87 | 0.82 | 0.84 | 1.34166 |
| Insightface + Gabor (80 kernels) | 0.88 | 0.82 | 0.85 | 3.24166 |
| Our proposed method | 0.87 | 0.87 | 0.87 | 1.01947 |
Results of K values and distance methods on the Essex dataset
| Distance function | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Euclidean | Cosine | LBP voting | ||||||||
| Precision | Recall | F1-score | Precision | Recall | F1-score | Precision | Recall | F1-score | ||
| Neighbors | k = 1 | 0.98 | 0.92 | 0.95 | 0.99 | 0.98 | 0.98 | 0.99 | 0.98 | 0.98 |
| k = 3 | 0.98 | 0.92 | 0.95 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| k = 5 | 0.98 | 0.92 | 0.95 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| k = 6 | 0.96 | 0.91 | 0.93 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
| k = 10 | 0.96 | 0.9 | 0.93 | 0.99 | 0.97 | 0.98 | 0.99 | 0.97 | 0.98 | |
Fig. 8The ROC curves of ours vs other methods