| Literature DB >> 35321193 |
Muhammad Zubair Asghar1,2, Fahad R Albogamy3, Mabrook S Al-Rakhami4, Junaid Asghar5, Mohd Khairil Rahmat1, Muhammad Mansoor Alam1,6,7,8,9, Adidah Lajis1, Haidawati Mohamad Nasir1.
Abstract
Deep neural networks have made tremendous strides in the categorization of facial photos in the last several years. Due to the complexity of features, the enormous size of the picture/frame, and the severe inhomogeneity of image data, efficient face image classification using deep convolutional neural networks remains a challenge. Therefore, as data volumes continue to grow, the effective categorization of face photos in a mobile context utilizing advanced deep learning techniques is becoming increasingly important. In the recent past, some Deep Learning (DL) approaches for learning to identify face images have been designed; many of them use convolutional neural networks (CNNs). To address the problem of face mask recognition in facial images, we propose to use a Depthwise Separable Convolution Neural Network based on MobileNet (DWS-based MobileNet). The proposed network utilizes depth-wise separable convolution layers instead of 2D convolution layers. With limited datasets, the DWS-based MobileNet performs exceptionally well. DWS-based MobileNet decreases the number of trainable parameters while enhancing learning performance by adopting a lightweight network. Our technique outperformed the existing state of the art when tested on benchmark datasets. When compared to Full Convolution MobileNet and baseline methods, the results of this study reveal that adopting Depthwise Separable Convolution-based MobileNet significantly improves performance (Acc. = 93.14, Pre. = 92, recall = 92, F-score = 92).Entities:
Keywords: Depthwise Separable Convolutions; MobileNet; deep learning; face mask detection; facial image classification
Mesh:
Year: 2022 PMID: 35321193 PMCID: PMC8936807 DOI: 10.3389/fpubh.2022.855254
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Investigative research questions.
|
|
|
|---|---|
| RQ1. How can the Depthwise Separable Convolutional Neural Network based on MobileNet be utilized to successfully categorize photos for facial mask detection? | Investigate the Depthwise Separable Convolution Neural Network based on MobileNet to learn how it may be used to classify facial photos for mask recognition. |
| RQ2. How efficient is the suggested technique in contrast to the traditional CNN model in terms of many performance assessment measures? | Examine the usefulness of the proposed deep learning model, MobileNet-based Depthwise Separable Convolution Neural Network, which classifies face photos in terms of mask recognition using a variety of performance metrics such as accuracy, recall, F1 measure, and precision. |
| RQ3 What is the effectiveness of the suggested approach in comparison to comparable approaches? | Compare the efficacy of the proposed mobile-based deep learning model employing depth-wise separable convolution in categorizing face pictures to baseline testing using a variety of assessment measures including precision, recall, F1-score, and accuracy. |
A partial list of literature review works.
|
|
|
|
|
|---|---|---|---|
| Toppo et al. ( | Mobile NetV2 | 88% | Revised parameter settings can improve the system performance |
| Kaur et al. ( | CNN-based approach | 86% | Light weight DWS-based CNN can provide more efficient results |
| Fan et al. ( | Residual contextual awareness module | 91% (Acc.) | Due to the constraints of the datasets, more processing is necessary to generate visualizations. |
| Bhuiyan et al. ( | YOLO-v3 model | 86% (Acc) | YOLOv4 needs to be compared using the proposed model. |
| Mata ( | CNN model | 60 % (Acc) | More effective techniques required for improved results |
| Balaji et al. ( | VGG-16 CNN | N/A | DWS solution can provide better results |
Figure 1Generic Diagram of proposed system.
Figure 2Proposed Framework with detailed view.
Figure 3(A,B) A sample dataset of mask and no-mask images.
Figure 4Preprocessing by drawing rectangles.
Figure 5Convolution in the conventional sense.
Figure 6Depthwise separable convolution.
Figure 7Depth-wise: separable convolutions with depth-wise and pointwise layers, then batchnorm and ReLU, are the next steps.
Layered process time distribution and classification assessment tasks.
|
|
|
|
|---|---|---|
| Conv1 | 41.01% | 21.1% |
| DWS_Conv1 | 2.9% | 0.08% |
| Pw_Conv1 | 3.5% | 8.1% |
| DWS_Conv2 | 2.8% | 0.02% |
| Pw_Conv2 | 3.3% | 7.7% |
| Pooling_avg | 0.2% | 0.1% |
Figure 8Confusion matrix.
DWS-based (MobileNet) (proposed) and full convolution MobileNet performance comparison.
|
|
|
| |
|---|---|---|---|
| Full convolution MobileNet | 92.008 | 8.428 | 14, 362 |
| MobileNet (DWS-based) (proposed) | 93.164 | 2.106 | 6, 844 |
Cross validation results.
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
| SVM | 92.008 | 0.06 | 85 | 0.05 | 89 | 0.07 | 87 | 0.07 |
| CNN | 91.03 | 0.06 | 86 | 0.05 | 88 | 0.06 | 86 | 0.06 |
| MobileNet (DWS-based) | 93.164 | 0.05 | 90 | 0.04 | 91 | 0.05 | 89 | 0.05 |
Figure 9Three techniques' classification maps using AIZOO FACE MASKS dataset. (A) MobileNet (DWS-based) (proposed), (B) Full Convolution MobileNet, and (C) CNN.
DWS (based on MobileNet) results for facial image (mask/no mask) classification.
|
|
|
|
|
|---|---|---|---|
| Mask | 96 | 96 | 95 |
| No-Mask | 97 | 96 | 97 |
Comparative results to benchmark work.
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| Mask | 88 | 90 | 89 | 90 | 91 | 91 | 95 | 93 | 94 |
| N-Mask | 90 | 89 | 89 | 91 | 90 | 90 | 93 | 92 | 92 |