| Literature DB >> 27537888 |
Rui Sun1, Guanghai Zhang2, Xiaoxing Yan3, Jun Gao4.
Abstract
Vision-based pedestrian detection has become an active topic in computer vision and autonomous vehicles. It aims at detecting pedestrians appearing ahead of the vehicle using a camera so that autonomous vehicles can assess the danger and take action. Due to varied illumination and appearance, complex background and occlusion pedestrian detection in outdoor environments is a difficult problem. In this paper, we propose a novel hierarchical feature extraction and weighted kernel sparse representation model for pedestrian classification. Initially, hierarchical feature extraction based on a CENTRIST descriptor is used to capture discriminative structures. A max pooling operation is used to enhance the invariance of varying appearance. Then, a kernel sparse representation model is proposed to fully exploit the discrimination information embedded in the hierarchical local features, and a Gaussian weight function as the measure to effectively handle the occlusion in pedestrian images. Extensive experiments are conducted on benchmark databases, including INRIA, Daimler, an artificially generated dataset and a real occluded dataset, demonstrating the more robust performance of the proposed method compared to state-of-the-art pedestrian classification methods.Entities:
Keywords: CENTRIST; kernel method; pedestrian classification; pooling; sparse representation
Mesh:
Year: 2016 PMID: 27537888 PMCID: PMC5017461 DOI: 10.3390/s16081296
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Reconstructed human image from CENTRIST. (a) Original image; (b) Contour image; (c) Reconstruct image.
Figure 2Spatial pyramid for CENTRIST.
Figure 3Illustration of proposed HFE.
Average running time (s).
| Method | INRIA | Daimler with Occlusion |
|---|---|---|
| HOG + SVM | 0.1806 | 0.0682 |
| HFE + SRC | 0.1239 | 0.0403 |
| HFE − WKSR | 0.1372 | 0.0463 |
Parameters of HFE − WKSR.
| Procedure | Parameters | |
|---|---|---|
| Feature extraction | Hierarchical partition | P0 = 4, Q0 = 4 when S = 0 |
| Histogram bin number | 16 | |
| WKSR | Kernel function | Histogram intersection kernel |
| Weight | ||
| convergence |
| |
| Lagrange multiplier |
|
Figure 4Some samples of INRIA dataset.
Classification results for INRIA database.
| N | 20 | 50 | 100 |
|---|---|---|---|
| HOG + SVM | 45.2 | 53.6 | 62.5 |
| HOG + SRC | 72.8 | 77.1 | 82.9 |
| HFE + SRC | 84.2 | 88.9 | 91.3 |
| HFE + CRC | 85.3 | 87.9 | 90.8 |
| HFE + HIKSVM | 62.7 | 68.2 | 77.9 |
| HFE − WKSR | 90.3 | 94.4 | 97.5 |
Figure 5Some samples of Daimler dataset.
Classification Results on Daimler database.
| Group | Illumination | Background | Appearance |
|---|---|---|---|
| HOG + SVM | 58.7 | 55.2 | 46.3 |
| HOG + SRC | 75.4 | 86.6 | 73.5 |
| HFE + SRC | 84.5 | 86.4 | 83.2 |
| HFE + CRC | 85.4 | 85.5 | 81.2 |
| HFE + HIKSVM | 73.5 | 76.3 | 68.3 |
| HFE − WKSR | 94.6 | 92.5 | 90.3 |
| HFE − WKSR (without MP) | 88.3 | 87.1 | 84.5 |
Figure 6Examples of pedestrian images with random block occlusion. (a) 20% block occlusion; (b) 30% block occlusion; (c) 40% block occlusion.
Classification results on block occlusion.
| Occlusion | 10% | 20% | 30% | 40% | 50% |
|---|---|---|---|---|---|
| HOG + SVM | 57.2 | 53.6 | 42.9 | 38.3 | 32.4 |
| HOG + SRC | 72.3 | 68.2 | 55.4 | 48.2 | 47.9 |
| HFE + SRC | 83.2 | 80.8 | 76.3 | 72.5 | 68.1 |
| HFE + CRC | 81.3 | 76.5 | 73.2 | 71.6 | 67.2 |
| HFE + HIKSVM | 75.2 | 71.3 | 68.2 | 63.3 | 61.4 |
| HFE − WKSR | 93.2 | 91.5 | 88.2 | 82.3 | 75.4 |
Figure 7Examples of pedestrian images with real occlusion in Daimler partially occluded set.
Figure 8Classification Results on Daimler partially occluded set.