| Literature DB >> 35498206 |
Guiyi Yang1, Zhengyou Wang2, Shanna Zhuang1, Hui Wang2.
Abstract
Occlusion pedestrian detection is an important and difficult task in pedestrian detection. At present, the main method to deal with occlusion pedestrian detection usually adopts pedestrian parts or human body relationship methods. However, in the scene of crowd occlusion or severe pedestrian occlusion, only small parts of the body can be used for detection. Pedestrian parts or human body relationship methods cannot effectively address these issues. In view of the above problems, this paper abandoned the occlusion processing method of pedestrian parts or human body relationship. Considering that it is difficult to establish the relationship between parts and key points. The scale of visible parts of the occlusion pedestrian is small, and the scale of no occlusion pedestrian and occlusion pedestrian in the same picture is different. A multiscale feature attention fusion network named parallel feature fusion with CBAM (PFF-CB) is proposed for occlusion pedestrian detection. Feature information of different scales can be integrated effectively in the PFF-CB module. PFF-CB module uses a convolutional block attention module (CBAM) to enhance the important feature information in space and channel. A parallel feature fusion module based on FPN is used to enhance key features. The performance of the proposed module was tested on two common data sets of occlusion pedestrians with different occlusion types. The results show that the PFF-CB module makes a good performance in occlusion pedestrian detection tasks.Entities:
Mesh:
Year: 2022 PMID: 35498206 PMCID: PMC9050291 DOI: 10.1155/2022/3798060
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Faster R-CNN.
Figure 2Cascade R-CNN.
Figure 3Residual structure.
Figure 4ResNet structure.
Figure 5FPN structure.
Figure 6SE module.
Figure 7CBAM module.
Figure 8PFF structure.
Figure 9Block1 structure.
Figure 10Block1, block3, and block4 structure.
Figure 11CBAM refined PFF features.
Figure 12Pedestrian detection network added PFF.
Figure 13CrowdHuman data sets.
Figure 14CityPersons data sets.
Hardware configuration.
| Hardware | Configuration |
|---|---|
| CPU | Intel (R) I7 CPU 8700K |
| GPU | 2080Ti |
| HDD | 480 GB |
Software environment.
| Software | Configuration |
|---|---|
| System | Ubuntu 18.04 |
| Libraries | Cudnn, CUDA, OpenCV, Cython, Numpy |
| Language | Python |
| Framework | Pytorch 1.4.0, Megengine 0.3.1, mmdetection |
The result of PFF-3 and PFF-4
| Module | AP (%) | MR−2 (%) |
|---|---|---|
| Faster R-CNN + FPN | 85.80 | 42.90 |
| Faster R-CNN + PFF-3 |
|
|
| Faster R-CNN + PFF-4 | 86.42 | 43.06 |
The result of PFF in faster R-CNN and cascade R-CNN in CrowdHuman data sets.
| Module | AP (%) | MR−2 (%) |
|---|---|---|
| Faster R-CNN + FPN [ | 85.80 | 42.90 |
| Faster R-CNN + PFF |
|
|
| Cascade R-CNN [ | 85.60 | 43.00 |
| Cascade R-CNN + PFF |
|
|
Compared PFF with the most advanced methods in the CrowdHuman data set.
| Module | MR−2 (%) |
|---|---|
| Faster R-CNN | 50.49 |
| Adaptive NMS [ | 49.73 |
| IterDet [ | 49.12 |
| IterDet2 [ | 49.22 |
| Faster R-CNN + PFF (ours) |
|
| NOH-NMS [ | 43.90 |
| CrowdDet [ | 41.40 |
| DeFCN (POTO + 3DMF + AUX) | 48.90 |
| Cascade R-CNN + FPN | 43.00 |
| Cascade R-CNN + PFF (ours) |
|
Experimental results about faster R-CNN and cascade R-CNN in CityPersons.
| Module | R-H (%) |
|---|---|
| Faster R-CNN + FPN | 40.55 |
| Faster R-CNN + PFF |
|
| Cascade R-CNN | 40.76 |
| Cascade R-CNN + PFF |
|
Figure 15Detection results in faster R-CNN in CityPersons.
Figure 16Detection results in cascade R-CNN in CityPersons.
Final experimental results in CityPersons.
| Module | R-H (%) |
|---|---|
| Faster R-CNN + FPN | 40.55 |
| Faster R-CNN + PFF | 39.80 |
| Faster R-CNN + PFF-CB |
|
Experimental results in CityPersons data set
| Module | H (%) |
|---|---|
| Rep loss [ | 56.90 |
| OR-CNN [ | 55.70 |
| TLL | 53.60 |
| TLL + MRF | 52.00 |
| ALFNet [ | 51.90 |
| CSP | 49.30 |
| NOH-NMS [ | 53.00 |
| Faster R-CNN | 49.24 |
| Beat R-CNN [ | 47.10 |
| PRF-Ped [ | 47.30 |
| Faster R-CNN + PFF (ours) | 47.29 |
| Faster R-CNN + PFF-CB (ours) |
|