| Literature DB >> 34697552 |
Murat Koklu1, Ilkay Cinar1, Yavuz Selim Taspinar2.
Abstract
CONTEXT: The COVID-19 virus, exactly like in numerous other diseases, can be contaminated from person to person by inhalation. In order to prevent the spread of this virus, which led to a pandemic around the world, a series of rules have been set by governments that people must follow. The obligation to use face masks, especially in public spaces, is one of these rules.Entities:
Keywords: AlexNet; BiLSTM; Convolutional neural network; LSTM; Transfer learning; VGG16
Year: 2021 PMID: 34697552 PMCID: PMC8527867 DOI: 10.1016/j.bspc.2021.103216
Source DB: PubMed Journal: Biomed Signal Process Control ISSN: 1746-8094 Impact factor: 3.880
Comparative summary of the studies found in the literature based on face mask detection.
| Model | The Number of classes | The Number of images | Classification Accuracy (%) | References |
|---|---|---|---|---|
| MobileNetMask | 2 | 3835 | 93 | |
| YOLOv3 | 2 | 600 | 96 | |
| ResNet50 + SVM | 2 | 15,000 | between 92 and 98 | |
| CNN | 2 | 1376 | 98 | |
| CNN | 2 | 135,849 | 99.83 | |
| CNN | 2 | 1539 | 98.7 | |
| R-CNN Inception ResNet V2 | 2 | 1853 | 99.8 | |
| ResNet | 2 | 95,000 | 97 | |
| MobilNetV2 + SVM | 2 | 1376 | 97.1 | |
| YoloV2 + ResNet-50 | 2 | 1415 | 81 (Precision) | |
| MobileNet V2 | 2 | 3165 | between 85 and 95 | |
| VGG-16 | 2 | 20,000 | 97 | |
| MobileNetV2 | 2 | 3846 | 96.85 | |
| InceptionV3 | 2 | 1570 | 100 | |
| Yolo + CNN | 2 | – | 95 | |
| DenseNet-121 | 2 | 7855 | 99.49 | |
| CNN | 2 | 95,000 | 97 | |
| YoloV3 | 3 | 680 | 98 | |
| VGG-16 | 2 | 25,000 | 96 | |
| MobileNet | 2 | 3216 | 94.2 |
Fig. 1LSTM architecture.
Fig. 2LSTM cell structure.
Fig. 3Bi-directional RNN structure.
Fig. 4The example of BiLSTM architecture in 3 consecutive steps.
Dataset features.
Fig. 5Block diagram of the proposed models.
Fig. 6TrAlexNet architecture.
Layers and parameters of the proposed TrAlexNet.
| conv1 | convolution 2d | 11,11 | 4,4 | 0 | 96 | relu |
| pool1 | max pooling 2d | 3,3 | 2,2 | 0 | 96 | – |
| conv2 | grouped convolution 2d | 5,5 | 1,1 | 2 | 256 | relu |
| pool2 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| conv3 | convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv4 | grouped convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv5 | grouped convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool5 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Fig. 7TrAlexNet + LSTM architecture.
Layers and parameters of the proposed TrAlexNet + LSTM.
| conv1 | convolution 2d | 11,11 | 4,4 | 0 | 96 | relu |
| pool1 | max pooling 2d | 3,3 | 2,2 | 0 | 96 | – |
| conv2 | grouped convolution 2d | 5,5 | 1,1 | 2 | 256 | relu |
| pool2 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| conv3 | convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv4 | grouped convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv5 | grouped convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool5 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc8 | fully connected | – | – | – | 1000 | relu |
| flatten | flatten | – | – | – | – | – |
| lstm | lstm | – | – | – | – | – |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Fig. 8TrAlexNet + BiLSTM architecture.
Layers and parameters of the proposed TrAlexNet + BiLSTM.
| conv1 | convolution 2d | 11,11 | 4,4 | 0 | 96 | relu |
| pool1 | max pooling 2d | 3,3 | 2,2 | 0 | 96 | – |
| conv2 | grouped convolution 2d | 5,5 | 1,1 | 2 | 256 | relu |
| pool2 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| conv3 | convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv4 | grouped convolution 2d | 3,3 | 1,1 | 1 | 384 | relu |
| conv5 | grouped convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool5 | max pooling 2d | 3,3 | 2,2 | 0 | 256 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc8 | fully connected | – | – | – | 1000 | relu |
| flatten | flatten | – | – | – | – | – |
| bilstm | bilstm | – | – | – | – | – |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Fig. 9TrVGG16 architecture.
Layers and parameters of the proposed TrVGG16.
| conv1_1 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| conv1_2 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| pool1 | max pooling 2d | 2,2 | 2,2 | 0 | 64 | – |
| conv2_1 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| conv2_2 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| pool2 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv3_1 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_2 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_3 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool3 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv4_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool4 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| conv5_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool5 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Fig. 10TrVGG16 + LSTM architecture.
Layers and parameters of the proposed TrVGG16 + LSTM.
| conv1_1 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| conv1_2 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| pool1 | max pooling 2d | 2,2 | 2,2 | 0 | 64 | – |
| conv2_1 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| conv2_2 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| pool2 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv3_1 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_2 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_3 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool3 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv4_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool4 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| conv5_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool5 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc8 | fully connected | – | – | – | 1000 | relu |
| flatten | flatten | – | – | – | – | – |
| lstm | lstm | – | – | – | – | – |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Fig. 11TrVGG16 + BiLSTM architecture.
Layers and parameters of the proposed TrVGG16 + BiLSTM.
| conv1_1 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| conv1_2 | convolution 2d | 3,3 | 1,1 | 1 | 64 | relu |
| pool1 | max pooling 2d | 2,2 | 2,2 | 0 | 64 | – |
| conv2_1 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| conv2_2 | convolution 2d | 3,3 | 1,1 | 1 | 128 | relu |
| pool2 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv3_1 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_2 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| conv3_3 | convolution 2d | 3,3 | 1,1 | 1 | 256 | relu |
| pool3 | max pooling 2d | 2,2 | 2,2 | 0 | 256 | – |
| conv4_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv4_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool4 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| conv5_1 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_2 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| conv5_3 | convolution 2d | 3,3 | 1,1 | 1 | 512 | relu |
| pool5 | max pooling 2d | 2,2 | 2,2 | 0 | 512 | – |
| fc6 | fully connected | – | – | – | 4096 | relu |
| fc7 | fully connected | – | – | – | 4096 | relu |
| fc8 | fully connected | – | – | – | 1000 | relu |
| flatten | flatten | – | – | – | – | – |
| bilstm | bilstm | – | – | – | – | – |
| fc_optimized | fully connected | – | – | – | 4 | softmax |
Training parameters for all proposed models.
| sgdm | 0.0001 | 5 | 8 | 11 |
Parameters of fc_optimized layer for all proposed models.
| 10 | 1 | 10 | 0 | glorot | zeros |
Parameters of LSTM and BiLSTM layer for all proposed models.
| 100 | last | tanh | sigmoid |
Fig. 12Training and validation graphs of model.
Classification accuracies of models and training times of the network.
| TrAlexNet | 90.33 | 8 min 38 sec |
| TrAlexNet + LSTM | 90.55 | 7 min 25 sec |
| TrAlexNet + BiLSTM | 91.17 | 7 min 43 sec |
| TrVGG16 | 94.00 | 74 min 55 sec |
| TrVGG16 + LSTM | 94.17 | 74 min 23 sec |
| TrVGG16 + BiLSTM | 95.67 | 75 min 23 sec |
Classification accuracy achieved by the TrVGG16 + BiLSTM model for all classes.
| Masked | 95.65 |
| No_Mask | 98.55 |
| Masked but nose open | 94.3 |
| Masked but under the chin | 94.2 |