| Literature DB >> 35840636 |
Lu Cheng1,2,3, Yicai Ji4,5,6, Chao Li1,2,3, Xiaojun Liu1,2, Guangyou Fang1,2,3.
Abstract
With the strengthening of global anti-terrorist measures, it is increasingly important to conduct security checks in public places to detect concealed objects carried by the human body. Research in recent years has shown that deep learning is helpful for detecting concealed objects in passive terahertz images. However, previous studies have failed to achieve superior accuracy and performance for real-time labeling. Our research aims to propose a novel method for accurate and real-time detection of concealed objects in terahertz images. To reach this goal we trained and tested a promising detector based on deep residual networks using human image data collected by passive terahertz devices. Specifically, we replaced the backbone network of the SSD (Single Shot MultiBox Detector) algorithm with a more representative residual network to reduce the difficulty of network training. Aiming at the problems of repeated detection and missed detection of small targets, a feature fusion-based terahertz image target detection algorithm was proposed. Furthermore, we introduced a hybrid attention mechanism in SSD to improve the algorithm's ability to acquire object details and location information. Finally, the Focal Loss function was introduced to improve the robustness of the model. Experimental results show that the accuracy of the SSD algorithm is improved from 95.04 to 99.92%. Compared with other current mainstream models, such as Faster RCNN, YOLO, and RetinaNet, the proposed method can maintain high detection accuracy at a faster speed. This proposed method based on SSD achieves a mean average precision of 99.92%, an F1 score of 0.98, and a prediction speed of 17 FPS on the validation subset. This proposed method based on SSD-ResNet-50 can provide a technical reference for the application and development of deep learning technology in terahertz smart security systems. In the future, it can be widely used in some public scenarios with real-time security inspection requirements.Entities:
Mesh:
Year: 2022 PMID: 35840636 PMCID: PMC9287380 DOI: 10.1038/s41598-022-16208-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1The network structure of SSD.
Figure 2Improved SSD network architecture.
Figure 3Flowchart of the algorithm.
Figure 4Comparison diagram of traditional structure and residual structure. (a) The traditional structure. (b) The residual structure.
Comparison of network complexity between VGGNet-16 and ResNet-50.
| Network | Parameter quantity | Number of floating-point operations |
|---|---|---|
| VGGNet-16 | 138 | 15.5 |
| ResNet-50 | 25.5 | 3.8 |
Figure 5Overview of the proposed feature fusion strategy.
Figure 6The overview of CBAM attention mechanism module.
Figure 7Diagram of channel attention module.
Figure 8Diagram of spatial attention module.
Figure 9Curve of focal loss function[22].
Figure 10Passive terahertz hidden object imaging. (a) The 0.2 THz passive terahertz imaging device. (b) the metal pistol model and the mobile phone model. (c) the passive terahertz imaging results.
Figure 11Definitions of true positive, false positive, false negative and true negative[39]. For example, “True Positive” means that one actual positive instance was predicted to be positive by one method.
Figure 12Change of loss function.
Ablation experiment results on the passive terahertz dataset.
| NET | Module | mAP(%) | ||||
|---|---|---|---|---|---|---|
| VGGNet-16 | ResNet-50 | Feature Fusion | CBAM | Focal Loss | ||
| SSD | 95.04 | |||||
| Model-1 | 98.65 | |||||
| Model-2 | 98.77 | |||||
| Model-3 | 98.24 | |||||
| Model-4 | 95.37 | |||||
| Model-5 | 98.86 | |||||
| SSD(ours) | 99.92 | |||||
Figure 13Comparision of mAP values of different models.
Comparison of accuracy with other advanced algorithms.
| Model | AP(%) | mAP(%) | F1 score | Time(ms) | FPS | |
|---|---|---|---|---|---|---|
| Gun | Phone | |||||
| Faster R-CNN | 98.30 | 97.20 | 97.75 | 0.95 | 9 | 4.5568 |
| YOLO v5 | 99.24 | 97.48 | 98.36 | 0.96 | 42 | 12.2265 |
| Retinanet | 96.47 | 93.64 | 95.06 | 0.94 | 25 | 10.1952 |
| SSD | 96.10 | 93.98 | 95.04 | 0.93 | 23 | 11.2652 |
| SSD(our) | 99.94 | 99.90 | 99.92 | 0.98 | 17 | 8.2624 |
Figure 14Visualized results of the passive terahertz dataset. (a) Detection results of the original SSD algorithm. (b) Detection results of the improved SSD algorithm.
Figure 15ROC curve.