| Literature DB >> 34960274 |
Irfan Azhar1, Muhammad Sharif1, Mudassar Raza1, Muhammad Attique Khan2, Hwan-Seung Yong3.
Abstract
The recent development in the area of IoT technologies is likely to be implemented extensively in the next decade. There is a great increase in the crime rate, and the handling officers are responsible for dealing with a broad range of cyber and Internet issues during investigation. IoT technologies are helpful in the identification of suspects, and few technologies are available that use IoT and deep learning together for face sketch synthesis. Convolutional neural networks (CNNs) and other constructs of deep learning have become major tools in recent approaches. A new-found architecture of the neural network is anticipated in this work. It is called Spiral-Net, which is a modified version of U-Net fto perform face sketch synthesis (the phase is known as the compiler network C here). Spiral-Net performs in combination with a pre-trained Vgg-19 network called the feature extractor F. It first identifies the top n matches from viewed sketches to a given photo. F is again used to formulate a feature map based on the cosine distance of a candidate sketch formed by C from the top n matches. A customized CNN configuration (called the discriminator D) then computes loss functions based on differences between the candidate sketch and the feature. Values of these loss functions alternately update C and F. The ensemble of these nets is trained and tested on selected datasets, including CUFS, CUFSF, and a part of the IIT photo-sketch dataset. Results of this modified U-Net are acquired by the legacy NLDA (1998) scheme of face recognition and its newer version, OpenBR (2013), which demonstrate an improvement of 5% compared with the current state of the art in its relevant domain.Entities:
Keywords: NLDA; OpenBR; Spiral-Net; U-Net; Vgg-19 net; convolutional neural network; face recognition; sketch synthesis; smart cities
Mesh:
Year: 2021 PMID: 34960274 PMCID: PMC8708226 DOI: 10.3390/s21248178
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Schematic diagram of the proposed method.
Figure 2Architecture of Spiral-Net as a sketch compilation network.
Details of initial datasets.
| Dataset | Total Pairs | Train | Test | |
|---|---|---|---|---|
| CUFS | CUHK [ | 188 | 88 | 100 |
| AR [ | 123 | 80 | 43 | |
| XM2VTS [ | 295 | 100 | 195 | |
| CUFSF | 1194 | 250 | 944 | |
| Total Pairs | 1800 | 518 | 1282 | |
Parameters for processing.
| S No | Item | CUFS | CUFSF | |
|---|---|---|---|---|
| 1 | Hardware | Core i-7 ®, 7th Gen, NVIDIA 1060 (6GB) GPU | ||
| 2 | OS | Ubuntu Linux | ||
| 3 | Environment | PyCharm (CE), Torch 1.4.0 | ||
| 4 | Moderating Weights |
| 1 | 1 |
|
| 103 | 103 | ||
|
| 10−5 | 10−2 | ||
| 5 | Learning Weights | 10−3 to 10−5 reducing by a factor of 10−1 | ||
| 6 | Batch Sizes | 4 to 2 for different iterations | ||
| 7 | Processing Time | See respective tables | ||
Distribution of synthesized sketches by the NLDA procedure of face recognition.
| Dataset | Total Pairs | Train | Test |
|---|---|---|---|
| CUFS | 338 | 150 | 188 |
| CUFSF | 944 | 300 | 644 |
Comparison of SSIM and FSIM Values for CUFS.
| Type | MRF [ | MWF [ | LLE [ | SSD [ | FCN [ | GAN [ | RSLCR [ | Face2Sketch [ | BiL-STM [ | Proposed Spiral-Net |
|---|---|---|---|---|---|---|---|---|---|---|
| Proc Time (msec/photo) | Not presented by the original works | 7.57 | ||||||||
| SSIM | 51.31 | 53.92 | 52.58 | 54.19 | 52.13 | 49.38 | 55.71 | 54.41 |
| 54.42 |
| FSIM | 70.46 | 71.45 | 70.32 | 69.59 | 69.36 | 71.54 | 69.66 |
| 67.77 |
|
Comparison of face recognition scores for CUFS.
| Type | MRF [ | MWF [ | LLE [ | SSD [ | FCN [ | GAN [ | RSLCR [ | Face2Sketch [ | BiL-STM [ | Proposed Spiral-Net |
|---|---|---|---|---|---|---|---|---|---|---|
| NLDA Score (Equal/Best) | 87.34 | 92.10 | 90.61 | 90.61 | 96.99 | 93.48 |
| 97.82 | 94.87 | |
| No. of Features (Equal/Best) | 138 | 148 | 144 | 144 | 137 | 139 | 142 | 95 | - |
Comparison of SSIM and FSIM Values for CUFSF.
| Type | MRF [ | MWF [ | LLE [ | SSD [ | FCN [ | GAN [ | RSLCR [ | Face2Sketch [ | BiL-STM [ | Proposed Spiral-Net |
|---|---|---|---|---|---|---|---|---|---|---|
| Proc Time (msec/photo) | Not presented by the original works | 4.37 | - | 7.89 | ||||||
| SSIM | 35.36 | 40.83 | 39.66 | 41.88 | 34.39 | 34.81 | 42.69 | 38.97 |
| 38.32 |
| FSIM | 66.06 | 66.76 | 66.89 | 64.81 | 62.91 | 67.05 | 63.16 | 66.87 | 68.04 |
|
Comparison of Face Recognition Scores for CUFSF.
| Type | MRF [ | MWF [ | LLE [ | SSD [ | FCN [ | GAN [ | RSLCR [ | Face2Sketch [ | BiL-STM [ | Proposed Spiral-Net |
|---|---|---|---|---|---|---|---|---|---|---|
| NLDA Score (Equal/Best) | 46.03 | 74.15 | 70.92 | 61.76 | 70.14 | 71.48 | 73.05/75.94 | 73.05 | 71.35 | 73.14/ |
| No. of Features (Equal/Best) | 223 | 293 | 266 | 274 | 226 | 164 | 102/296 | 217 | - |
Figure 3Comparative view of NLDA scores by different techniques on CUFS dataset.
Details of augmented datasets.
| Dataset | Total Pairs | Train | Test | |
|---|---|---|---|---|
| VSC | CUHK [ | 188 | 88 | 100 |
| AR [ | 123 | 80 | 43 | |
| XM2VTS [ | 295 | 100 | 195 | |
| IIIT-D | 234 | 94 | 140 | |
| Total Pairs | 840 | 362 | 478 | |
| VSF | CUFSF | 1194 | 250 | 944 |
| IIIT-D | 234 | 94 | 140 | |
| Total Pairs | 1428 | 344 | 1084 | |
Comparative values of performance for augmented datasets using SNET and proposed Spiral-Net.
| Type | VSC-SNET | VSC-Spiral-Net | VSF-SNET | VSF-Spiral-Net |
|---|---|---|---|---|
| Proc Time (msec/photo) | 4.3033 | 8.5619 | 4.3113 | 8.1858 |
| SSIM | 38.18 |
| 40.33 | 40.51 |
| FSIM | 67.65 |
| 70.25 | 70.13 |
| NLDA Score (1998) (%) | 67.82 |
| 65.99 | 65.44 |
| OpenBR_FR Score (2013) (%) | 66 |
| 30.7 | 30.4 |
Figure 4Comparative view of NLDA scores by different techniques on CUFSF dataset.