| Literature DB >> 34903950 |
Xueping Su1, Meng Gao1, Jie Ren1, Yunhong Li1, Mian Dong1, Xi Liu2.
Abstract
Wearing a mask is an important way of preventing COVID-19 transmission and infection. German researchers found that wearing masks can effectively reduce the infection rate of COVID-19 by 40%. However, the detection of face mask-wearing in the real world is affected by factors such as light, occlusion, and multi-object. The detection effect is poor, and the wearing of cotton masks, sponge masks, scarves and other items greatly reduces the personal protection effect. Therefore, this paper proposes a new algorithm for mask detection and classification that fuses transfer learning and deep learning. Firstly, this paper proposes a new algorithm for face mask detection that integrates transfer learning and Efficient-Yolov3, using EfficientNet as the backbone feature extraction network, and choosing CIoU as the loss function to reduce the number of network parameters and improve the accuracy of mask detection. Secondly, this paper divides the mask into two categories of qualified masks (N95 masks, disposable medical masks) and unqualified masks (cotton masks, sponge masks, scarves, etc.), creates a mask classification data set, and proposes a new mask classification algorithm that the combines transfer learning and MobileNet, enhances the generalization of the model and solves the problem of small data size and easy overfitting. Experiments on the public face mask detection data set show that the proposed algorithm has a better performance than existing algorithms. In addition, experiments are performed on the created mask classification data set. The mask classification accuracy of the proposed algorithm is 97.84%, which is better than other algorithms.Entities:
Keywords: COVID-19; Mask classification; Masked face dataset; Masked face detection
Year: 2021 PMID: 34903950 PMCID: PMC8656443 DOI: 10.1007/s11042-021-11772-5
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.577
Fig. 1Architecture of DarkNet53
Fig. 2Architecture of Efficient-Yolov3
Fig. 3Transfer learning framework
Fig. 4Two ways of overlapping two rectangles with exactly the same IOU value
Fig. 5Depth separable convolution
Fig. 6Transfer learning-mobilenet
Fig. 7Dataset annotation example diagram
Sample image annotation data
| Object | Box |
|---|---|
| face | 381,207,618,561 |
| face_mask | 156,141,393,418 |
Fig. 8Mask classification dataset
Detection average precision of Efficient-Yolov3
| Methods | face(%) | face_mask(%) | mAP(%) | FPS(F | Params(M) |
|---|---|---|---|---|---|
| Yolov3 | 90.82 | 95.26 | 93.04 | 12.91 | 61.58 |
| Ours | 93.88 | 98.18 | 96.03 | 14.62 | 15.91 |
Fig. 9Part of detection results
Comparison with related works
| Methods | face(%) | face_mask(%) | mAP(%) | FPS(F | params(M) |
|---|---|---|---|---|---|
| Baseline [ | 89.60 | 91.90 | 90.75 | 23.12 | 1.01 |
| Yolov3 [ | 90.82 | 95.26 | 93.04 | 12.91 | 61.58 |
| Yolov4 [ | 91.22 | 96.59 | 93.91 | 11.32 | 64.01 |
| Yolov5 [ | 93.03 | 98.14 | 95.59 | 18.45 | 7.07 |
| Ours | 93.88 | 98.18 | 96.03 | 14.62 | 15.91 |
Fig. 10Classification results of different models in the test set
Fig. 11The consumption of each epoch training time
Fig. 12The confusion matrix of ResNet50 and TransferLearning-MobileNet on test set
Ablation study of Yolov3
| Backbone | face | face_mask | mAP | FPS |
|---|---|---|---|---|
| Darknet53-CIOU | 90.82% | 95.26% | 93.04% | 12.91 |
| EfficientnetB2 | 75.64% | 89.96% | 82.80% | 12.99 |
| EfficientnetB2-Transer Learning | 92.79% | 97.23% | 95.01% | 13.08 |
| EfficientnetB2-Transer Learning- CIOU | 93.88% | 98.18% | 96.03% | 14.62 |
| EfficientnetB4-Transer Learning- CIOU | 88.62% | 94.86% | 91.74% | 11.87 |
Ablation study of MobileNet
| model | OK-mask | NG-mask | Accuracy | ||
|---|---|---|---|---|---|
| Precision | Recall | Precision | Recall | ||
| MobileNet | 80.62% | 76.47% | 83.59% | 86.70% | 82.41% |
| MobileNet-Transer Learning | 97.08% | 97.79% | 98.40% | 97.87% | 97.84% |