| Literature DB >> 30769813 |
Chao Zhu1,2, Xu-Cheng Yin3,4.
Abstract
Significant progress has been achieved in the past few years for the challenging task of pedestrian detection. Nevertheless, a major bottleneck of existing state-of-the-art approaches lies in a great drop in performance with reducing resolutions of the detected targets. For the boosting-based detectors which are popular in pedestrian detection literature, a possible cause for this drop is that in their boosting training process, low-resolution samples, which are usually more difficult to be detected due to the missing details, are still treated equally importantly as high-resolution samples, resulting in the false negatives since they are more easily rejected in the early stages and can hardly be recovered in the late stages. To address this problem, we propose in this paper a robust multi-resolution detection approach with a novel group cost-sensitive boosting algorithm, which is derived from the standard AdaBoost algorithm to further explore different costs for different resolution groups of the samples in the boosting process, and to place greater emphasis on low-resolution groups in order to better handle the detection of multi-resolution targets. The effectiveness of the proposed approach is evaluated on the Caltech pedestrian benchmark and KAIST (Korea Advanced Institute of Science and Technology) multispectral pedestrian benchmark, and validated by its promising performance on different resolution-specific test sets of both benchmarks.Entities:
Keywords: group cost-sensitive boosting; multi-resolution; pedestrian detection
Year: 2019 PMID: 30769813 PMCID: PMC6412415 DOI: 10.3390/s19040780
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Example images and ground truth annotations in the Caltech pedestrian benchmark. Note that the resolutions of the pedestrians are in a wide range.
Figure 2Comparison of different cost-sensitive boosting strategies.
A list of the terms that appear in our approach.
| Term | Definition | Meaning in Pedestrian Detection Problem |
|---|---|---|
|
| Feature representation of a sample | Image region that needs to be classified as pedestrian or not |
|
| Class label of a sample | Its value will be 1 if the corresponding image region |
|
| Detector (binary classifier) | Get label y given image region |
|
| Predictor (strong classifier learned via boosting) | Output score given image region |
|
| Loss function | A measurement for wrong detections (pedestrian region is classified as non-pedestrian or background region is classified as pedestrian) |
|
| Weak classifier in boosting learning | Simple classifier to decide if an image region is pedestrian (only slightly better than random) |
|
| Weight of each weak classifier | |
|
| Weight of each sample | Its value will be increased if detection is wrong, otherwise decreased |
|
| Costs in group cost-sensitive loss function | Different cost values are assigned to measure wrong detections in different resolution pedestrian samples |
|
| Groups of different resolution samples | Image regions are divided into groups according to different resolution pedestrians in it |
|
| Sum of weights of samples in each resolution group | |
|
| Classification error | Total loss of detections in each resolution pedestrian group |
Figure 3Comparison with popular approaches on the Caltech benchmark.
Figure 4Comparison with popular approaches on the KAIST benchmark.
Figure 5Influence of group number in GCS-LDCF and GCS-CCF on the Caltech and KAIST benchmarks, respectively.
Figure 6Log-average miss rate vs. runtime of different approaches on Caltech “Reasonable” setting (symbols closer to the bottom-right corner indicating that the corresponding approaches possess both better accuracy and faster runtime speed).