| Literature DB >> 33870073 |
Mohd Aquib Ansari1, Dushyant Kumar Singh1.
Abstract
COVID-19 is a severe epidemic that has put the world in a global crisis. Over 42 Million people are infected, and 1.14 Million deaths are reported worldwide as on Oct 23, 2020. A deeper understanding of the epidemic suggests that a person's negligence can cause widespread harm that would be difficult to negate. Since no vaccine is yet developed, social distancing must be practiced to detain COVID-19 spread. Therefore, we aim to develop a framework that tracks humans for monitoring the social distancing being practiced. To accomplish this objective of social distance monitoring, an algorithm is developed using object detection method. Here, CNN based object detector is explored to detect human presence. The object detector's output is used for calculating distances between each pair of humans detected. This approach of social distancing algorithm will red mark the persons who are getting closer than a permissible limit. Experimental results prove that CNN based object detectors with our proposed social distancing algorithm exhibit promising outcomes for monitoring social distancing in public areas. © Bharati Vidyapeeth's Institute of Computer Applications and Management 2021.Entities:
Keywords: CNN; COVID-19; Human Detection; Social Distancing; Surveillance
Year: 2021 PMID: 33870073 PMCID: PMC8044502 DOI: 10.1007/s41870-021-00658-2
Source DB: PubMed Journal: Int J Inf Technol ISSN: 2511-2104
Fig. 1Total Cases vs. Total Deaths [1]
Fig. 2Complete block diagram for practicing social distancing
Proposed models configuration
| Model | Model 1 | Model 2 | |||
|---|---|---|---|---|---|
| Layer | Size | Output shape | Parameters | Output shape | Parameters |
| Conv2D | 32 filters | (None, 126, 62, 32) | 896 | (None, 126, 62, 32) | 896 |
| MaxPooling2 | (2, 2) | (None, 63, 31, 32) | 0 | (None, 63, 31, 32) | 0 |
| Conv2D | 48 filters | (None, 61, 29, 48) | 13,872 | (None, 61, 29, 48) | 13,872 |
| MaxPooling2 | (2, 2) | (None, 30, 14, 48) | 0 | (None, 30, 14, 48) | 0 |
| Conv2D | 64 filters | – | – | (None, 28, 12, 64) | 27,712 |
| MaxPooling2 | (2, 2) | – | – | (None, 14, 6, 64) | 0 |
| Flatten | – | (None, 20,160) | 0 | (None, 5376) | 0 |
| FC1 | 512 | (None, 512) | 10,322,432 | (None, 512) | 2,753,024 |
| Dropout | 0.30 | (None, 512) | 0 | (None, 512) | 0 |
| FC2 | 128 | (None, 128) | 65,664 | (None, 128) | 65,664 |
| Output Layer | 1 | (None, 1) | 129 | (None, 1) | 129 |
| Total trainable parameter | 10,402,993 | 2,861,297 | |||
Fig. 3Proposed CNN architecture of Model 2
Hyper parameter tuning for proposed models
| Model | Batch Size | Drop-out | Activation Function | Optimizer | Epochs | Environment | ||
|---|---|---|---|---|---|---|---|---|
| Convolutional Layer | FC Layer | Output Layer | ||||||
| Model 1 | 8 | 0.30 | Relu | Relu | Sigmoid | Adam | 120 | Our System and Google Colab |
| Model 2 | 8 | 0.50 | Relu | Tanh | Sigmoid | Ada-Delta | 120 | Our System and Google Colab |
Outcomes for Model 1 and Model 2
| Model | True positive | True negative | False positive | False negative | Training accuracy | Training loss | Validation accuracy | Validation loss |
|---|---|---|---|---|---|---|---|---|
| Model 1 | 100 | 94 | 0 | 6 | 0.9957 | 0.0213 | 0.9700 | 0.00178 |
| Model 2 | 100 | 97 | 0 | 3 | 0.9981 | 0.0057 | 0.9850 | 9.6017e-05 |
Time Comparision between Model 1 and Model 2
| Model | Model 1 | Model 2 | ||
|---|---|---|---|---|
| Enviroment | Our system | Google colab | Our system | Google colab |
| Training time | 8.51 h | 34.34 min | 9.36 h | 37.99 min |
| Testing time | 9.25 s | 1.34 s | 9.49 s | 1.41 s |
Fig. 4Accuracy and loss curve w.r.t. epochs for Model 1 and Model 2
Comparison with Existing Human Detection Approaches
| Authors and Year | Techniques used | Accuracy (%) |
|---|---|---|
| Fu-Chun Hsu et al. [ | HOG + SVM | 65.52 |
| HOOF + SVM | 88.48 | |
| HOG + HOOF + SVM | 86.26 | |
| Vijay and Shashikant [ | Edgelet Features + Cascade Structure of K-Means Clustering | 95.00 |
| Suman Kumar Choudhury et al. [ | Background Subtraction + Silhouette Orientation Histogram + Golden Ratio Based Partition + HIKSVM | 98.36 |
| Seemanthini and Manjunath [ | Cluster Segmentation + Temporal Tracking + HOG + SVM | 89.59 |
| Aichun, Tian and Qiao [ | Candidate Region Convolutional Neural Network | 86.00 |
| D. K. Singh et al. [ | Background Subtraction + HOG + SVM | 81.00 |
| Proposed model 1 | ||
| Proposed model 2 | ||
Fig. 5Outcomes of Model 2 for practicing social distancing