| Literature DB >> 35890782 |
Roman Bhuiyan1, Junaidi Abdullah1, Noramiza Hashim1, Fahmid Al Farid1, Wan Noorshahida Mohd Isa1, Jia Uddin2, Norra Abdullah3.
Abstract
Almost two million Muslim pilgrims from all around the globe visit Mecca each year to conduct Hajj. Each year, the number of pilgrims grows, creating worries about how to handle such large crowds and avoid unpleasant accidents or crowd congestion catastrophes. In this paper, we introduced deep Hajj crowd dilated convolutional neural network (DHCDCNNet) for crowd density analysis. This research also presents augmentation technique to create additional dataset based on the hajj pilgrimage scenario. We utilized a single framework to extract both high-level and low-level features. For creating additional dataset we divide the process of images augmentation into two routes. In the first route, we utilized magnitude extraction followed by the polar magnitude. In the second route, we performed morphological operation followed by transforming the image into skeleton. This paper presented a solution to the challenge of measuring crowd density using a surveillance camera pointed at a distance. An FCNN-based technique for crowd analysis is included in the proposed methodology, particularly for classifying crowd density. There are several obstacles in video analysis when there are a large number of pilgrims moving around the tawaf area, with densities of between 7 and 8 per square meter. The proposed DHCDCNNet method has achieved accuracy of 97%, 89% and 100% for the JHU-CROWD dataset, the UCSD dataset and the proposed Hajj-Crowd dataset, respectively. The proposed Hajj-Crowd dataset, the UCSD dataset, and the JHU-CROW dataset all had accuracy of 98%, 97% and 97%, respectively, using the VGGNet approach. Using the ResNet50 approach, the proposed Hajj-Crowd dataset, the UCSD dataset, and the JHU-CROW dataset all had an accuracy of 99%, 91% and 97%, respectively.Entities:
Keywords: FCNN; Hajj-Crowd dataset; crowd density classification; deep augmentation; morphological operation
Mesh:
Year: 2022 PMID: 35890782 PMCID: PMC9320336 DOI: 10.3390/s22145102
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The proposed architecture of DHCDCNNet.
The proposed DHCDCNNet structure.
| Layer Type | Output Shape | Param |
|---|---|---|
| conv2d (Conv2D) | (None, 118, 118, 32) | 896 |
| max_pooling2d (MaxPooling2D) | (None, 59, 59, 32) | 0 |
| conv2d_1 (Conv2D) | (None, 57, 57, 16) | 4624 |
| max_pooling2d_1 (MaxPooling2 | (None, 28, 28, 16) | 0 |
| dropout (Dropout) | (None, 28, 28, 16) | 0 |
| flatten (Flatten) | (None, 12544) | 0 |
| dense (Dense) | (None, 128) | 1,605,760 |
| dropout_1 (Dropout) | (None, 128) | 0 |
| dropout_2 (Dropout) | (None, 128) | 0 |
| dense_1 (Dense) | (None, 50) | 6450 |
| dense_2 (Dense) | (None, 5) | 255 |
| Total params: | N/A | 1,617,985 |
| Trainable params: | N/A | 1,617,985 |
Figure 2Proposed Augmentation Techniques.
Figure 3Output of the Augmentation: (a) Input Image (b) Mask (c) Dilation (d) Erosion (e) Polar Magnitude and (f) Skeleton of Image.
Figure 4The dilation of rates 1, 2 or 3 with 3 × 3 convolution kernels.
Figure 5Model of Crowd Density Classification.
Comparison with state-of-the-art public datasets.
| Dataset | No. of Image | Resolution |
|---|---|---|
| UCSD [ | 2500 | 158 × 238 |
| UCF-CC-50 [ | 50 | 2101 × 2888 |
| Mall [ | 2000 | 480 × 640 |
| WorldExpo’10 [ | 3980 | 576 × 720 |
| ShanghaiTech Part A [ | 482 | 589 × 868 |
| ShanghaiTech Part B [ | 716 | 768 × 1024 |
| JHU-CROWD [ | 4250 | 1450 × 900 |
| NWPU-Crowd [ | 5109 | 2311 × 3383 |
| PROPOSED HAJJ-CROWD | 27,000 | 1914 × 922 |
Figure 6Graphical Representations: (a) Data Augmentation applied (b) Five Classes Results.
Result for the five classes proposed Hajj-Crowd dataset using FCNN Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 1080 |
| LOW | 1.00 | 1.00 | 1.00 | 1080 |
| MEDIUM | 1.00 | 1.00 | 1.00 | 1080 |
| HIGH | 1.00 | 1.00 | 1.00 | 1080 |
| VHIGH | 1.00 | 1.00 | 1.00 | 1080 |
| micro avg | 1.00 | 1.00 | 1.00 | 5400 |
| macro avg | 1.00 | 1.00 | 1.00 | 5400 |
| avg | 1.00 | 1.00 | 1.00 | 5400 |
Result for the five classes UCSD dataset using FCNN Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 100 |
| LOW | 1.00 | 0.46 | 0.63 | 100 |
| MEDIUM | 1.00 | 1.00 | 1.00 | 100 |
| HIGH | 1.00 | 1.00 | 1.00 | 100 |
| VHIGH | 0.67 | 1.00 | 0.80 | 100 |
| micro avg | 0.90 | 0.89 | 0.90 | 500 |
| macro avg | 0.93 | 0.89 | 0.89 | 500 |
Result for the five classes JHU-CROWD dataset using FCNN Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 160 |
| LOW | 0.81 | 1.00 | 0.89 | 160 |
| MEDIUM | 1.00 | 0.76 | 0.86 | 160 |
| HIGH | 1.00 | 1.00 | 1.00 | 160 |
| VHIGH | 1.00 | 1.00 | 1.00 | 160 |
| micro avg | 0.95 | 0.95 | 0.95 | 800 |
| macro avg | 0.96 | 0.95 | 0.95 | 800 |
Comparision with the state-of-the-art dataset and proposed DHCDCNNet model.
| Exp. | Dataset | Model | Final Accuracy |
|---|---|---|---|
| 1 | Hajj-Crowd | DHCDCNNet | 100% |
| 1 | UCSD | DHCDCNNet | 89% |
| 1 | JHU-CROWD | DHCDCNNet | 97% |
Result for the five classes Hajj-Crowd dataset using VGGNet Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 1080 |
| LOW | 0.82 | 1.00 | 0.88 | 1080 |
| MEDIUM | 1.00 | 1.00 | 1.00 | 1080 |
| HIGH | 1.00 | 1.00 | 1.00 | 1080 |
| VHIGH | 1.00 | 1.00 | 1.00 | 1080 |
| micro avg | 0.95 | 0.95 | 0.95 | 5400 |
| macro avg | 0.96 | 0.95 | 0.95 | 5400 |
Result for the five classes UCSD dataset using VGGNet Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 100 |
| LOW | 0.88 | 1.00 | 0.93 | 100 |
| MEDIUM | 1.00 | 0.86 | 0.92 | 100 |
| HIGH | 1.00 | 1.00 | 1.00 | 100 |
| VHIGH | 1.00 | 1.00 | 1.00 | 100 |
| micro avg | 0.97 | 0.97 | 0.97 | 500 |
| macro avg | 0.98 | 0.97 | 0.97 | 500 |
Result for the five classes JHU-CROWD dataset using VGGNet Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 160 |
| LOW | 0.81 | 1.00 | 0.89 | 160 |
| MEDIUM | 1.00 | 0.76 | 0.86 | 160 |
| HIGH | 1.00 | 1.00 | 1.00 | 160 |
| VHIGH | 1.00 | 1.00 | 1.00 | 160 |
| micro avg | 0.95 | 0.95 | 0.95 | 800 |
| macro avg | 0.96 | 0.95 | 0.95 | 800 |
Comparision with the state-of-the-art dataset and VGGNet.
| Exp. | Dataset | Model | Final Accuracy |
|---|---|---|---|
| 2 | Hajj-Crowd | VGGNet | 98% |
| 2 | UCSD | VGGNet | 97% |
| 2 | JHU-CROWD | VGGNet | 97% |
Result for the five classes Hajj-Crowd dataset using ResNet50 Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 1080 |
| LOW | 0.99 | 0.99 | 1.00 | 1080 |
| MEDIUM | 1.00 | 1.00 | 1.00 | 1080 |
| HIGH | 1.00 | 1.00 | 1.00 | 1080 |
| VHIGH | 1.00 | 1.00 | 0.99 | 1080 |
| micro avg | 1.00 | 1.00 | 1.00 | 5400 |
| macro avg | 1.00 | 1.00 | 1.00 | 5400 |
Result for the five classes UCSD dataset using ResNet50 Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 0.97 | 0.97 | 0.97 | 100 |
| LOW | 0.86 | 0.75 | 0.80 | 100 |
| MEDIUM | 0.86 | 0.95 | 0.90 | 100 |
| HIGH | 0.98 | 0.98 | 0.97 | 100 |
| VHIGH | 0.86 | 0.80 | 0.75 | 100 |
| micro avg | 0.90 | 0.90 | 0.90 | 500 |
| macro avg | 0.91 | 0.91 | 0.91 | 500 |
Result for the five classes JHU-CROWD dataset using ResNet50 Model.
| Class | Precision | Recall | f1-Score | Support |
|---|---|---|---|---|
| VLOW | 1.00 | 1.00 | 1.00 | 160 |
| LOW | 0.81 | 1.00 | 0.89 | 160 |
| MEDIUM | 0.97 | 0.97 | 0.97 | 160 |
| HIGH | 0.97 | 1.00 | 0.94 | 160 |
| VHIGH | 1.00 | 095 | 097 | 160 |
| micro avg | 0.96 | 0.96 | 0.96 | 800 |
| macro avg | 0.96 | 0.96 | 0.96 | 800 |
Comparison with the state-of-the-art dataset and ResNet50.
| Exp. | Dataset | Model | Final Accuracy |
|---|---|---|---|
| 3 | Hajj-Crowd | ResNet50 | 99% |
| 3 | UCSD | ResNet50 | 91% |
| 3 | JHU-CROWD | ResNet50 | 97% |