| Literature DB >> 27043564 |
Eun Som Jeon1, Jong Hyun Kim2, Hyung Gil Hong3, Ganbayar Batchuluun4, Kang Ryoung Park5.
Abstract
Recently, human detection has been used in various applications. Although visible light cameras are usually employed for this purpose, human detection based on visible light cameras has limitations due to darkness, shadows, sunlight, etc. An approach using a thermal (far infrared light) camera has been studied as an alternative for human detection, however, the performance of human detection by thermal cameras is degraded in case of low temperature differences between humans and background. To overcome these drawbacks, we propose a new method for human detection by using thermal camera images. The main contribution of our research is that the thresholds for creating the binarized difference image between the input and background (reference) images can be adaptively determined based on fuzzy systems by using the information derived from the background image and difference values between background and input image. By using our method, human area can be correctly detected irrespective of the various conditions of input and background (reference) images. For the performance evaluation of the proposed method, experiments were performed with the 15 datasets captured under different weather and light conditions. In addition, the experiments with an open database were also performed. The experimental results confirm that the proposed method can robustly detect human shapes in various environments.Entities:
Keywords: fuzzy system; generation of background image; human detection; thermal camera image
Year: 2016 PMID: 27043564 PMCID: PMC4850967 DOI: 10.3390/s16040453
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Overall procedure of the proposed method.
Figure 2Flow chart of generating a background image (model).
Figure 3Examples of generating the background image from database I: (a) preliminary background image obtained by median value from the sequence of images; (b) extracted candidate human area by binarization; (c) extracted human areas by labeling, size filtering and a morphological operation; and (d) the final background image.
Figure 4The first example for generating a background image from database III: (a) preliminary background image obtained by median value from the sequence of images; (b) extracted candidate human area by binarization; (c) extracted human areas by labeling, size filtering and a morphological operation; and (d) the final background image.
Figure 5The second example for obtaining a background image from database VIII: (a) preliminary background image obtained by median value from the sequence of images; (b) extracted candidate human area by binarization; (c) extracted human areas by labeling, size filtering and a morphological operation; and (d) the final background image.
Figure 6Fuzzy system for the proposed method to extract adaptive threshold for ROI extraction.
Figure 7Membership functions for fuzzy system to extract adaptive threshold for ROI extraction: (a) average value of the background image; (b) sum of difference values between background and input image; and (c) obtaining the output optimal threshold.
Fuzzy rules based on the characteristics of the background and input images.
| Input 1 ( | Input 2 ( | Output ( |
|---|---|---|
| L | L | L |
| L | H | VH |
| M | L | M |
| M | H | M |
| H | L | H |
| H | H | VL |
Figure 8The first example for output of membership functions for fuzzy system: outputs by (a) F1 and (b) F2.
The first example for fuzzy rules and min rule based on the characteristics of the background and input images.
| Value | ||
|---|---|---|
| 0.2 (L) | 0.136 (L) | 0.136 (L) |
| 0.2 (L) | 0.358 (H) | 0.2 (VH) |
| 0.1 (M) | 0.136 (L) | 0.1 (M) |
| 0.1 (M) | 0.358 (H) | 0.1 (M) |
Figure 9The first example for output optimal threshold based on the COG defuzzification method.
Figure 10The second example for output of membership functions for fuzzy system: outputs by (a) F1 and (b) F2.
The second example for fuzzy rules and min rule based on the characteristics of the background and input images.
| Value | ||
|---|---|---|
| 0.25 (M) | 0.394 (L) | 0.25 (M) |
| 0.25 (M) | 0.104 (H) | 0.104 (M) |
| 0.125 (H) | 0.394 (L) | 0.125 (H) |
| 0.125 (H) | 0.104 (H) | 0.104 (VL) |
Figure 11The second example for output optimal threshold based on the COG defuzzification method.
Figure 12Example of a difference image: (a) input image; (b) background image; (c) difference image.
Figure 13Example of a difference image: (a) input image; (b) background image; (c) difference image.
Figure 14Separation of the candidate region within an input image based on the horizontal histogram: (a) input image and detected candidate region; (b) detected candidate region and its horizontal histogram; and (c) the division result of the candidate region.
Figure 15Separation of the candidate region within an input image based on the vertical histogram: (a) input image and detected candidate region; (b) detected candidate region and its vertical histogram; and (c) the division result of the candidate region.
Figure 16Separation of the candidate region within an input image based on the width, height, size and ratio: (a) input image and detected candidate region; (b) detected candidate region and its vertical histogram; and (c) the division result of the candidate region.
Figure 17The examples of separation of one detected box into three (a,c) or four (b,d) ones by our method.
Figure 18Separation of the candidate region within an input image: (a) input image and candidate human regions; (b) result of connecting separated regions.
Figure 19Example of different sizes of human areas because of camera viewing direction.
Figure 20Example of procedures for detecting human regions: (a) input image; (b) binarized image by background subtraction; (c) result of connecting separated candidate regions; and (d) Final result of detected human area.
Descriptions of fifteen databases.
| Database | Condition | Detail Description |
|---|---|---|
| I (see in | 2 °C, morning, average −1 °C during the day, snowy, wind 3.6 mph | The behaviors of the humans include walking, running, standing, and sitting. The sequence was captured during a little snowfall. The intensity of the human is influenced by material of clothes. |
| II (see in | −2 °C, night, average −3 °C during the day, wind 2.4 mph | The behaviors of the humans include walking, running, standing, and sitting. Three or four people appear together in several frames. Example of this database is presented in |
| III (see in | −1 °C, morning, average 3 °C during the day, sunny after rainy at dawn time, wind 4.0 mph | The behaviors of the humans include walking, running, standing, and sitting. The brightness of the human is very different compared to that of the background. The pixel value of the human is much higher than that of the background. |
| IV (see in | −6 °C, night, average −3 °C during the day, sunny after rainy at dawn time, wind 4.0 mph | The behaviors of the humans include walking, running, standing, and sitting. The intensity of the human is variously affected by temperature. If a person just appears from a building (indoors), the brightness of the person is much greater than other objects. The day when the database was captured was too cold. |
| V (see in | −2 °C, night, average −2 °C during the day, sunny, wind 4.9 mph | The behaviors of the humans include walking, running, standing, and sitting. There is a person wearing thick clothes. Therefore, the brightness of human is similar to the background because the intensity of image captured by infrared camera depends on the emission of heat. |
| VI (see in | −1 °C, morning, average 2 °C during the day, sunny, wind 2.5 mph | The behaviors of the humans include walking, running, standing, and sitting. The Halo effect is shown below the regions of humans. It is shown distinctive to the background. The brightness of the humans is much higher than that of background. |
| VII (see in | 22 °C, indoor, average −12 °C during the day outside, no wind | The behaviors of the humans include walking, running, standing, and sitting. The brightness of an image captured indoors is brighter than that of an image captured outside. The reflected region is located under the human region. The size of the region is same with the human. It is influenced by the material of the floor. |
| VIII (see in | 26 °C, afternoon, average 21 °C during the day, sunny, wind 1 mph | The behaviors of the humans include walking, sitting, and waving. The intensity of the humans is much lower than the background. The intensity of some background regions is also similar to that of human. |
| IX (see in | 14 °C, morning, average −18 °C during the day, sunny, wind 2.4 mph | The behavior of humans is waving. There are two or four people in the sequence. Their sizes are very different. The intensity of the humans is higher than that of background. There is also a watering ground. |
| X (see in | 28 °C, afternoon, average −23 °C during the day, sunny, wind 5 mph | The behavior of humans is walking. The sequence is captured during a hot day. The intensity of the image is influenced by the camera module system. Therefore, the brightness of humans is much darker than that of the background. There are some occluded people that can be a cause of difficulty for detection of the proposed system. |
| XI (see in | 18 °C, night, average 19 °C during the day, sunny after rainfall during the daytime, wind 2 mph | The behaviors of the human include kicking and punching. The person that appeared in the sequence is wearing short sleeves. The intensity of the human is a little higher than that of the background. |
| XII (see in | 27 °C, afternoon, average 23 °C during the day, sunny, wind 4.3 mph | The behavior of the humans is walking. There is region whose brightness is very similar to humans. Intensity of humans is reflected because of the fences. Not only the size but also intensity of the reflection is very similar to those of humans. The sequence is captured during a hot day. The intensity of the image is influenced by the camera module system. There is a slight brightness change during recording because of a large vehicle. |
| XIII (see in | 27 °C, night, average 29 °C during the day, sunny after rainfall during morning, wind 2.4 mph | The behaviors of the humans include walking, waving, and punching. The intensity of the human is similar to that of the background. The detection result of the proposed method is affected by the little contrast between humans and the background. |
| XIV (see in | 33 °C, afternoon, average 29 °C during the day, sunny, wind 3.5 mph | The behaviors of the humans include walking, running, standing, punching, and kicking. The sequence is captured during a heat wave. The humans that appeared in the sequence are wearing short sleeves. The brightness of the humans is darker than that of the background. There is a region whose brightness is very similar to the background. There are two crosswalks whose intensity is a little darker than the surrounding region. There is a slight brightness change during the recording because of a large vehicle. |
| XV (see in | 30 °C, night, average 29 °C during the day, sunny, wind 2.5 mph | The behaviors of the human include walking, waving, kicking, and punching. The intensity of the human is much darker than the background. A human is shown relevant to the background region. There is a round iron piece in the middle of the images. There is a region whose brightness is very similar to the background. There are two crosswalks whose intensity is a little darker than the surrounding region. |
Figure 21Examples of databases: (a) database I; (b) database II; (c) database III; (d) database IV; (e) database V; (f) database VI; (g) database VII; (h) database VIII; (i) database IX; (j) database X; (k) database XI; (l) database XII; (m) database XIII; (n) database XIV; and (o) database XV.
Figure 22Comparisons of preliminary background images with database I. The left figure (a) is by [33,34,35] and right figure (b) is by the proposed method, respectively.
Figure 23Comparisons of created background images with database I. The left-upper, right-upper, left-lower and right-lower figures are by [24,27,28,29,30,31,33,34,35] and the proposed method, respectively.
Figure 24Comparisons of created background images with database III. The left-upper, right-upper, left-lower and right-lower figures are by [24,27,28,29,30,31,33,34,35] and the proposed method, respectively.
Figure 25Detection results with database (I–XV). Results of images in: (a) Database I; (b) Database II; (c) Database III; (d) Database IV; (e) Database V; (f) Database VI; (g) Database VII; (h) Database VIII; (i) Database IX; (j) Database X; (k) Database XI; (l) Database XII; (m) Database XIII; (n) Database XIV; and (o) Database XV.
Results of human detection by the proposed method with our database.
| Database No. | #Frames | #People | #TP | #FP | Sensitivity | PPV | F1-Score |
|---|---|---|---|---|---|---|---|
| I | 2609 | 3928 | 3905 | 48 | 0.9941 | 0.9879 | 0.9910 |
| II | 2747 | 4543 | 4536 | 135 | 0.9985 | 0.9711 | 0.9846 |
| III | 3151 | 5434 | 5433 | 60 | 0.9998 | 0.9891 | 0.9944 |
| IV | 3099 | 4461 | 4368 | 101 | 0.9792 | 0.9774 | 0.9783 |
| V | 4630 | 5891 | 5705 | 113 | 0.9684 | 0.9806 | 0.9745 |
| VI | 3427 | 3820 | 3820 | 70 | 1 | 0.9820 | 0.9909 |
| VII | 3330 | 3098 | 3046 | 14 | 0.9832 | 0.9954 | 0.9893 |
| VIII | 1316 | 1611 | 1505 | 58 | 0.9342 | 0.9629 | 0.9483 |
| IX | 905 | 2230 | 1818 | 0 | 0.8152 | 1 | 0.8982 |
| X | 1846 | 3400 | 3056 | 112 | 0.8988 | 0.9646 | 0.9306 |
| XI | 5599 | 6046 | 5963 | 162 | 0.9863 | 0.9736 | 0.9799 |
| XII | 2913 | 4399 | 3407 | 676 | 0.7745 | 0.8344 | 0.8033 |
| XIII | 3588 | 4666 | 4047 | 33 | 0.8673 | 0.9919 | 0.9255 |
| XIV | 5104 | 7232 | 7036 | 158 | 0.9729 | 0.9780 | 0.9755 |
| XV | 1283 | 1924 | 1913 | 148 | 0.9942 | 0.9282 | 0.9601 |
| Total | 45,546 | 62,683 | 59,558 | 1888 | 0.9501 | 0.9693 | 0.9596 |
Results of human detection categorized by human behaviors with our database.
| Behavior | #Frames | #People | #TP | #FP | Sensitivity | PPV | F1-Score |
|---|---|---|---|---|---|---|---|
| Walking | 17,380 | 22,315 | 20,186 | 1340 | 0.9046 | 0.9378 | 0.9209 |
| Running | 6274 | 3864 | 3776 | 153 | 0.9772 | 0.9611 | 0.9536 |
| Standing | 5498 | 10,430 | 10,356 | 67 | 0.9929 | 0.9936 | 0.9932 |
| Sitting | 6179 | 11,417 | 11,364 | 3 | 0.9954 | 0.9997 | 0.9975 |
| Waving | 1975 | 3611 | 3181 | 0 | 0.8809 | 1 | 0.9367 |
| Punching | 3932 | 5434 | 5117 | 96 | 0.9417 | 0.9816 | 0.9612 |
| Kicking | 4308 | 5612 | 5578 | 229 | 0.9939 | 0.9606 | 0.9770 |
Figure 26Detection error cases: (a) result of the proposed method with database X; (b) result of the proposed method with database XIII.
Comparative results of human detection by the proposed method and previous ones [24,32,37] with our database.
| DB No. | Sensitivity | PPV | F1-Score | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ours | Previous Method | Ours | Previous Method | Ours | Previous Method | |||||||
| [ | [ | [ | [ | [ | [ | [ | [ | [ | ||||
| I | 0.9941 | 0.9514 | 0.9351 | 0.9832 | 0.9879 | 0.9544 | 0.8713 | 0.9621 | 0.9910 | 0.9529 | 0.9021 | 0.9725 |
| II | 0.9985 | 0.9595 | 0.9406 | 0.9885 | 0.9711 | 0.9462 | 0.8623 | 0.9539 | 0.9846 | 0.9528 | 0.8998 | 0.9709 |
| III | 0.9998 | 0.9522 | 0.9366 | 0.9763 | 0.9891 | 0.9515 | 0.8711 | 0.9597 | 0.9944 | 0.9519 | 0.9027 | 0.9679 |
| IV | 0.9792 | 0.9386 | 0.9219 | 0.9698 | 0.9774 | 0.9497 | 0.8698 | 0.9678 | 0.9783 | 0.9441 | 0.8951 | 0.9688 |
| V | 0.9684 | 0.9257 | 0.9085 | 0.9559 | 0.9806 | 0.9605 | 0.8792 | 0.9681 | 0.9745 | 0.9428 | 0.8936 | 0.9620 |
| VI | 1 | 0.9601 | 0.9441 | 0.9913 | 0.9820 | 0.9525 | 0.8712 | 0.9696 | 0.9909 | 0.9563 | 0.9062 | 0.9803 |
| VII | 0.9832 | 0.9432 | 0.9231 | 0.9714 | 0.9954 | 0.9644 | 0.8823 | 0.9713 | 0.9893 | 0.9537 | 0.9022 | 0.9714 |
| VIII | 0.9342 | 0.9001 | 0.8792 | 0.9278 | 0.9629 | 0.9399 | 0.8581 | 0.9473 | 0.9483 | 0.9196 | 0.8685 | 0.9374 |
| IX | 0.8152 | 0.7653 | 0.7554 | 0.8049 | 1 | 0.9731 | 0.8923 | 0.9815 | 0.8982 | 0.8568 | 0.8182 | 0.8845 |
| X | 0.8988 | 0.8509 | 0.8325 | 0.8811 | 0.9646 | 0.9327 | 0.8498 | 0.9409 | 0.9306 | 0.8899 | 0.8411 | 0.9100 |
| XI | 0.9863 | 0.9414 | 0.9225 | 0.9709 | 0.9736 | 0.9497 | 0.8612 | 0.9573 | 0.9799 | 0.9455 | 0.8908 | 0.9641 |
| XII | 0.7745 | 0.7278 | 0.7105 | 0.7592 | 0.8344 | 0.8121 | 0.7193 | 0.8199 | 0.8033 | 0.7676 | 0.7149 | 0.7884 |
| XIII | 0.8673 | 0.8198 | 0.8019 | 0.8509 | 0.9919 | 0.9623 | 0.8802 | 0.9793 | 0.9255 | 0.8854 | 0.8392 | 0.9106 |
| XIV | 0.9729 | 0.9309 | 0.9113 | 0.9599 | 0.9780 | 0.9431 | 0.8621 | 0.9518 | 0.9755 | 0.9370 | 0.8860 | 0.9558 |
| XV | 0.9942 | 0.9502 | 0.9351 | 0.9825 | 0.9282 | 0.8976 | 0.8064 | 0.9056 | 0.9601 | 0.9232 | 0.8660 | 0.9423 |
| Avg | 0.9501 | 0.9064 | 0.8896 | 0.9376 | 0.9693 | 0.9409 | 0.8573 | 0.9505 | 0.9596 | 0.9234 | 0.8731 | 0.9437 |
Comparative results of human detection categorized by human behaviors by the proposed method and previous ones [24,32,37] with our database. (Behav.: Behavior, W: Walking, R: Running, St: Standing, Si: Sitting, Wav: Waving, P: Punching, K: Kicking).
| Behav. | Sensitivity | PPV | F1-Score | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ours | Previous Method | Ours | Previous Method | Ours | Previous Method | |||||||
| [ | [ | [ | [ | [ | [ | [ | [ | [ | ||||
| W | 0.9046 | 0.8612 | 0.8434 | 0.8923 | 0.9378 | 0.9084 | 0.8269 | 0.9175 | 0.9209 | 0.8842 | 0.8351 | 0.9047 |
| R | 0.9772 | 0.9331 | 0.9193 | 0.9629 | 0.9611 | 0.9034 | 0.8203 | 0.9103 | 0.9536 | 0.9180 | 0.8670 | 0.9359 |
| St | 0.9929 | 0.9474 | 0.9295 | 0.9735 | 0.9936 | 0.9652 | 0.8812 | 0.9713 | 0.9932 | 0.9562 | 0.9047 | 0.9724 |
| Si | 0.9954 | 0.9523 | 0.9378 | 0.9821 | 0.9997 | 0.9703 | 0.8903 | 0.9785 | 0.9975 | 0.9612 | 0.9134 | 0.9803 |
| Wav | 0.8809 | 0.8371 | 0.8198 | 0.8656 | 1 | 0.9702 | 0.8913 | 0.9798 | 0.9367 | 0.8987 | 0.8541 | 0.9192 |
| P | 0.9417 | 0.9005 | 0.8837 | 0.9334 | 0.9816 | 0.9527 | 0.8702 | 0.9605 | 0.9612 | 0.9259 | 0.8769 | 0.9468 |
| K | 0.9939 | 0.9492 | 0.9302 | 0.9793 | 0.9606 | 0.9311 | 0.8525 | 0.9556 | 0.9770 | 0.9401 | 0.8897 | 0.9673 |
Comparative results of human detection by the proposed method and previous ones [24,32,37] with OTCBVS database. (Seq. No.: Sequence Number).
| Seq. No. | Sensitivity | PPV | F1-Score | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ours | Previous Method | Ours | Previous Method | Ours | Previous Method | |||||||
| [ | [ | [ | [ | [ | [ | [ | [ | [ | ||||
| 1 | 1 | 1 | 0.97 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.9848 | 1 |
| 2 | 1 | 0.99 | 0.94 | 1 | 1 | 1 | 1 | 1 | 1 | 0.9949 | 0.9691 | 1 |
| 3 | 0.99 | 0.99 | 1 | 0.98 | 0.99 | 0.98 | 0.99 | 0.99 | 0.99 | 0.9850 | 0.9950 | 0.9850 |
| 4 | 1 | 1 | 0.98 | 1 | 0.99 | 1 | 0.99 | 0.97 | 0.9950 | 1 | 0.9850 | 0.9848 |
| 5 | 1 | 1 | 0.89 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.9418 | 1 |
| 6 | 0.99 | 1 | 0.96 | 0.98 | 1 | 1 | 1 | 1 | 0.9950 | 1 | 0.9796 | 0.9899 |
| 7 | 1 | 1 | 0.98 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.9899 | 1 |
| 8 | 1 | 1 | 0.76 | 1 | 1 | 0.99 | 0.99 | 1 | 1 | 0.9950 | 0.8599 | 1 |
| 9 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 10 | 1 | 0.97 | 0.98 | 1 | 1 | 0.97 | 0.97 | 1 | 1 | 0.97 | 0.9750 | 1 |
| Avg | 0.9980 | 0.9949 | 0.9459 | 0.9959 | 0.9980 | 0.9939 | 0.9936 | 0.9959 | 0.9980 | 0.9945 | 0.9680 | 0.9960 |