| Literature DB >> 25951341 |
Ji Hoon Lee1, Jong-Suk Choi2, Eun Som Jeon3, Yeong Gon Kim4, Toan Thanh Le5, Kwang Yong Shin6, Hyeon Chang Lee7, Kang Ryoung Park8.
Abstract
With the development of intelligent surveillance systems, the need for accurate detection of pedestrians by cameras has increased. However, most of the previous studies use a single camera system, either a visible light or thermal camera, and their performances are affected by various factors such as shadow, illumination change, occlusion, and higher background temperatures. To overcome these problems, we propose a new method of detecting pedestrians using a dual camera system that combines visible light and thermal cameras, which are robust in various outdoor environments such as mornings, afternoons, night and rainy days. Our research is novel, compared to previous works, in the following four ways: First, we implement the dual camera system where the axes of visible light and thermal cameras are parallel in the horizontal direction. We obtain a geometric transform matrix that represents the relationship between these two camera axes. Second, two background images for visible light and thermal cameras are adaptively updated based on the pixel difference between an input thermal and pre-stored thermal background images. Third, by background subtraction of thermal image considering the temperature characteristics of background and size filtering with morphological operation, the candidates from whole image (CWI) in the thermal image is obtained. The positions of CWI (obtained by background subtraction and the procedures of shadow removal, morphological operation, size filtering, and filtering of the ratio of height to width) in the visible light image are projected on those in the thermal image by using the geometric transform matrix, and the searching regions for pedestrians are defined in the thermal image. Fourth, within these searching regions, the candidates from the searching image region (CSI) of pedestrians in the thermal image are detected. The final areas of pedestrians are located by combining the detected positions of the CWI and CSI of the thermal image based on OR operation. Experimental results showed that the average precision and recall of detecting pedestrians are 98.13% and 88.98%, respectively.Entities:
Year: 2015 PMID: 25951341 PMCID: PMC4481953 DOI: 10.3390/s150510580
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparison of previous and proposed methods.
| Category | Method | Advantages | Disadvantage | |
|---|---|---|---|---|
| Single camera-based method | Using visible camera [ | By using spatial or temporal information only in visible light image | The performance of people detection in daytime of high temperature is higher due to the high resolution and quality of visible light image | The performance is affected by non-uniform illumination, shadow, and low external light during evening and night |
| Using thermal camera [ | By using spatial or temporal information only in thermal image | The performance is less affected by illumination change, shadow, and low external light during evening and night | The performance is affected by high background temperatures in daytime | |
| Dual camera-based method | Using stereo thermal cameras [ | By using spatial information in stereo thermal images | Higher performance of people detection than the single camera-based method | The performance is affected by high background temperatures in daytime |
| Using co-axial structure of visible and thermal cameras [ | Elaborately co-aligned structure of visible light and thermal cameras is used | Additional device of beamsplitter is required, and large beamsplitter increases the system size | ||
| Using visible light and thermal cameras | People detection inconstrained environment such as indoor [ | Not showing quantitative accuracies of people detection in various environments such as high background temperatures in daytime, non-uniform illumination, and shadow in outdoors | ||
| People detection in unconstrained environments (proposed method) | Robust to various environments without the additional device for combining two cameras | Lower processing speed than single camera-based method due to processing of two camera images | ||
Figure 1Proposed dual camera system.
Figure 2Calibration between two cameras based on geometric transform and accuracy measurements of the calibration. (a) Examples of calibration points used for calculating the matrix of geometric transform in the visible light (left) and thermal (right) images, respectively; (b) Points used for calculating the calibration error in the visible light (left) and thermal (right) images, respectively.
Figure 3Flow chart of the proposed system.
Figure 4Examples of results by morphological operation and size filtering with the binarized difference image between background and input visible light images. (a) Input visible light image; (b) Background image; (c) The binarized difference image between background and input visible light images; (d) Result image by morphological operation; (e) Result image by size filtering.
Figure 5Examples obtained from CWI. (a) First example of the current input images by visible light (left figure) and thermal cameras (right figure) in morning; (b) Detected CWIs in visible light (left figure) and the thermal input image (right figure) of (a); (c) Second example of the current input images by visible light (left figure) and thermal cameras (right figure) at afternoon; (d) Detected CWIs in visible light (left figure) and the thermal input image (right figure) of (c).
Figure 6Examples of the obtained CSI in the thermal image. (a) The CSI obtained from both Figure 5a and the left image of Figure 5b; (b) The CSI obtained from both Figure 5c and the left image of Figure 5d.
Figure 7Examples of CWI and CSI. (a) Visible light and thermal images in the morning; (b) Results of CWI in visible light (left figure) and thermal (right figure) images; (c) Result of CSI in a thermal image.
Figure 8Separation of one candidate region into two areas based on the horizontal histogram. (a) Detected candidate region and its horizontal histogram; (b) The separation result of one candidate region into two areas.
Figure 9Examples of combined image of CWI and CSI, and the final result of human detection. (a) Combined image of CWI (right image of Figure 7b) and CSI (Figure 7c); (b) Final result of human detection.
Figure 10Calibration error between the two cameras (example 1). Left and right figures of (a,b) are visible light and thermal images, respectively. In each image, the circle and crosshair represent the ground-truth and calculated points, respectively (a) When using the geometric transform matrix (from visible light to thermal images); (b) When using the geometric transform inverse matrix (from thermal to visible light images).
Result of calibration errors of Figure 10 (unit: pixel).
| Applying Geometric Transform Matrix | Average Pixel Error | Average RMS Error | ||
|---|---|---|---|---|
| From | To | X Direction | Y Direction | |
| Visible light image | Thermal image | 1 | 0.5 | 1.12 |
| Thermal image | Visible light image | 1.15 | 0.25 | 1.18 |
Figure 11Calibration error between the two cameras (example 2). Left and right figures of (a,b) are visible light and thermal images, respectively. In each image, the circle and crosshair represent the ground-truth and calculated points, respectively (a) When using the geometric transform matrix (from visible light to thermal images); (b) When using the geometric transform inverse matrix (from thermal to visible light images).
Result of calibration errors of Figure 11 (unit: pixel).
| Applying Geometric Transform Matrix | Average Pixel Error | Average RMS Error | ||
|---|---|---|---|---|
| From | To | X Direction | Y Direction | |
| Visible light image | Thermal image | 0.88 | 0.67 | 1.11 |
| Thermal image | Visible light image | 1.09 | 0.45 | 1.18 |
Figure 12Example of detection results in various environments. (a) Detection result in the morning; (b) Detection result in the afternoon; (c) Detection result at night; (d) Detection result on a rainy day.
Detection results using dual camera systems.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 786 | 15 | 87.43 | 98.13 |
| Afternoon | 500 | 730 | 677 | 5 | 92.74 | 99.27 |
| Night | 500 | 698 | 561 | 27 | 80.37 | 95.41 |
| Rainy day | 500 | 559 | 544 | 2 | 97.32 | 99.63 |
| Total | 2000 | 2886 | 2568 | 49 | 88.98 | 98.13 |
Detection result using only visible light camera.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 556 | 11 | 61.85 | 98.06 |
| Afternoon | 500 | 730 | 594 | 9 | 81.37 | 98.51 |
| Night | 500 | 698 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 500 | 559 | 254 | 523 | 45.44 | 32.69 |
| Total | 2000 | 2886 | 1404 | 543 | 48.65 | 72.11 |
Detection result using only thermal camera.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 759 | 22 | 84.43 | 97.18 |
| Afternoon | 500 | 730 | 252 | 2 | 34.52 | 99.21 |
| Night | 500 | 698 | 554 | 64 | 79.37 | 89.64 |
| Rainy day | 500 | 559 | 543 | 2 | 97.14 | 99.63 |
| Total | 2000 | 2886 | 2108 | 90 | 73.04 | 95.91 |
Figure 13Detection error case in our database: (a) The example of the current input images by visible light (left figure) and thermal cameras (right figure); (b) Result image (of Step (17) of Figure 3).
Processing time of our method.
| Steps of | Processing Time (ms) |
|---|---|
| Steps (1)–(4), (7) and (8) | 16.05 |
| Steps (5) and (6) | 2.44 |
| Steps (9)–(12) | 2.25 |
| Step (13) | 0.25 |
| Steps (14) and (15) | 0.72 |
| Steps (16)–(18) | 1.42 |
| Total | 23.13 |
Detection result using only thermal camera by previous method [22].
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 785 | 57 | 87.32 | 93.23 |
| Afternoon | 500 | 730 | 247 | 27 | 33.84 | 90.15 |
| Night | 500 | 698 | 517 | 131 | 74.07 | 79.78 |
| Rainy day | 500 | 559 | 541 | 37 | 96.78 | 93.60 |
| Total | 2000 | 2886 | 2090 | 252 | 72.42 | 89.24 |
Detection result using only visible light camera by previous method [8,14].
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 579 | 33 | 64.40 | 94.61 |
| Afternoon | 500 | 730 | 560 | 46 | 76.71 | 92.41 |
| Night | 500 | 698 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 500 | 559 | 248 | 501 | 44.36 | 33.11 |
| Total | 2000 | 2886 | 1387 | 580 | 48.06 | 70.51 |
Detection result using only thermal camera by previous method [8,14].
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 626 | 7 | 69.63 | 98.89 |
| Afternoon | 500 | 730 | 242 | 61 | 33.15 | 79.87 |
| Night | 500 | 698 | 507 | 10 | 72.64 | 98.07 |
| Rainy day | 500 | 559 | 429 | 2 | 76.74 | 99.54 |
| Total | 2000 | 2886 | 1804 | 80 | 62.51 | 95.75 |
Figure 14The results of background subtraction by our method and previous one [37]. Upper and lower figures of (a,b) are the results with the visible light and thermal images, respectively: (a) Results by our method; (b) Results by previous method [37].
Detection result using only visible light camera by previous method [37].
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 464 | 32 | 51.61 | 93.55 |
| Afternoon | 500 | 730 | 573 | 17 | 78.49 | 97.12 |
| Night | 500 | 698 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 500 | 559 | 143 | 445 | 25.58 | 24.32 |
| Total | 2000 | 2886 | 1180 | 494 | 40.89 | 70.49 |
Detection result using only thermal camera by previous method [37].
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 500 | 899 | 500 | 379 | 55.62 | 56.88 |
| Afternoon | 500 | 730 | 406 | 109 | 55.62 | 78.84 |
| Night | 500 | 698 | 590 | 43 | 84.53 | 93.21 |
| Rainy day | 500 | 559 | 109 | 653 | 19.50 | 14.30 |
| Total | 2000 | 2886 | 1605 | 1184 | 55.61 | 57.55 |
Figure 15Proposed dual camera system which is used for collecting database II.
Figure 16Examples of collected images in database II. Left and right figures of (a–d) are the images by visible light and thermal cameras, respectively. Image captured (a) in the morning; (b) in the afternoon; (c) at night; (d) on a rainy day.
Detection results using dual camera systems by our method with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 135 | 1 | 80.84 | 99.26 |
| Afternoon | 200 | 216 | 210 | 26 | 97.22 | 88.98 |
| Night | 200 | 269 | 254 | 2 | 94.42 | 99.22 |
| Rainy day | 200 | 181 | 180 | 72 | 99.45 | 71.43 |
| Total | 800 | 833 | 779 | 101 | 93.52 | 88.52 |
Detection result using only visible light camera by our method with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 48 | 16 | 28.74 | 75.00 |
| Afternoon | 200 | 216 | 132 | 28 | 61.11 | 82.50 |
| Night | 200 | 269 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 200 | 181 | 142 | 70 | 78.45 | 66.98 |
| Total | 800 | 833 | 322 | 114 | 38.66 | 73.85 |
Detection result using only thermal camera by our method with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 128 | 55 | 76.65 | 69.95 |
| Afternoon | 200 | 216 | 149 | 119 | 68.98 | 55.60 |
| Night | 200 | 269 | 241 | 35 | 89.59 | 87.32 |
| Rainy day | 200 | 181 | 180 | 5 | 99.45 | 97.30 |
| Total | 800 | 833 | 698 | 214 | 83.79 | 76.54 |
Processing time of our method with database II.
| Steps of | Processing Time (ms) |
|---|---|
| Steps (1)–(4), (7) and (8) | 0.003 |
| Steps (5) and (6) | 1.60 |
| Steps (9)–(12) | 18.10 |
| Step (13) | 0.97 |
| Steps (14) and (15) | 5.69 |
| Steps (16)–(18) | 0.68 |
| Total | 27.04 |
Detection result using only thermal camera by previous method [22] with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 80 | 103 | 47.90 | 43.72 |
| Afternoon | 200 | 216 | 177 | 83 | 81.94 | 68.08 |
| Night | 200 | 269 | 206 | 52 | 76.58 | 79.85 |
| Rainy day | 200 | 181 | 150 | 10 | 82.87 | 93.75 |
| Total | 800 | 833 | 613 | 248 | 73.59 | 71.20 |
Detection result using only visible light camera by previous method [8,14] with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 32 | 20 | 19.16 | 61.54 |
| Afternoon | 200 | 216 | 117 | 45 | 54.17 | 72.22 |
| Night | 200 | 269 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 200 | 181 | 147 | 92 | 81.22 | 61.51 |
| Total | 800 | 833 | 296 | 157 | 35.53 | 65.34 |
Detection result using only thermal camera by previous method [8,14] with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 108 | 48 | 64.67 | 69.23 |
| Afternoon | 200 | 216 | 121 | 98 | 56.02 | 55.25 |
| Night | 200 | 269 | 237 | 44 | 88.10 | 84.34 |
| Rainy day | 200 | 181 | 177 | 19 | 97.79 | 90.31 |
| Total | 800 | 833 | 643 | 209 | 77.19 | 75.47 |
Detection result using only visible light camera by previous method [37] with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 27 | 0 | 16.17 | 100 |
| Afternoon | 200 | 216 | 118 | 28 | 54.63 | 80.82 |
| Night | 200 | 269 | 0 | 0 | 0 | Cannot be calculated |
| Rainy day | 200 | 181 | 102 | 73 | 56.35 | 58.29 |
| Total | 800 | 833 | 247 | 101 | 29.65 | 70.98 |
Detection result using only thermal camera by previous method [37] with database II.
| Environment | #Frame | #People | #TP | #FP | Recall (%) | Precision (%) |
|---|---|---|---|---|---|---|
| Morning | 200 | 167 | 129 | 50 | 77.25 | 72.07 |
| Afternoon | 200 | 216 | 139 | 124 | 64.35 | 52.85 |
| Night | 200 | 269 | 178 | 34 | 66.17 | 83.96 |
| Rainy day | 200 | 181 | 180 | 5 | 99.45 | 97.30 |
| Total | 800 | 833 | 626 | 213 | 75.15 | 74.61 |