| Literature DB >> 36262150 |
Oluwakorede Monica Oluyide1, Jules-Raymond Tapamo1, Tom Mmbasu Walingo1.
Abstract
This article presents a semi-automatic algorithm that can detect pedestrians from the background in thermal infrared images. The proposed method is based on the powerful Graph Cut optimisation algorithm which produces exact solutions for binary labelling problems. An additional term is incorporated into the energy formulation to bias the detection framework towards pedestrians. Therefore, the proposed method obtains reliable and robust results through user-selected seeds and the inclusion of motion constraints. An additional advantage is that it enables the algorithm to generalise well across different databases. The effectiveness of our method is demonstrated on four public databases and compared with several methods proposed in the literature and the state-of-the-art. The method obtained an average precision of 98.92% and an average recall of 99.25% across the four databases considered and outperformed methods which made use of the same databases.Entities:
Keywords: Infrared video; Pedestrian detection; Video surveillance
Year: 2022 PMID: 36262150 PMCID: PMC9575872 DOI: 10.7717/peerj-cs.1064
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Figure 1Difference between topologically unconstrained and constrained solution using Graph Cut (A) Original image showing the pixels belonging to the object of interest (green diamonds) and the background (red circles) (B) topological unconstrained solution: all the pixels with similar properties to the object of interest are included in the result (C) topological constrained solution: only the pixels of the object of interest are included in the result.
Figure 2Overview of the proposed method.
Figure 3An illustration of a graph constructed over an image (A) original image (B) graph construction.
Figure 4(A) and (B) are two consecutive frames with an area of interest selected and (C) shows the directional difference images around that selected area.
The image energy is higher when the image is shifted to the right than to the left, and then when it is shifted downwards than upwards. So, without previous knowledge, we can tell the pedestrian is moving to the right and slightly downwards. Image credit: LITIV dataset; Torabi, Massé & Bilodeau (2012).
Edge weights of the graph constructed from the image.
| Edge | Weight | For |
|---|---|---|
| { |
|
|
| { |
|
|
|
|
| |
| 0 |
| |
| { |
|
|
|
|
| |
| 0 |
|
Figure 5Binary labelling of an image after energy minimization (A) shows the minimum cut separating the vertices (B) shows the binary labelling as a result of the cut.
Quantitative results for GC and MCE comparison.
| Method | LITIV | LTIR | Terravic | OSU | |
|---|---|---|---|---|---|
| Recall | GC | 0.9859 | 0.9562 | 0.9842 | 0.9933 |
| MCE | 0.9992 | 0.9771 | 0.9902 | 0.9995 | |
| Precision | GC | 0.9785 | 0.9504 | 0.9780 | 0.9801 |
| MCE | 0.9969 | 0.9805 | 0.9889 | 0.9907 |
Figure 6Chart showing the performance of GC and MCE using precision and recall.
Figure 7GC and MCE results on LTIR database (A) image (B) GC (C) MCE.
Image credit: Linkoping Thermal InfraRed (LTIR) dataset; Berg, Ahlberg & Felsberg (2015).
Figure 10MCE results on TMIR database (A) image (B) GC (C) MCE.
Image credit: Linkoping Thermal InfraRed (LTIR) dataset; Berg, Ahlberg & Felsberg (2015).
Table showing the weather conditions, true positive (TP) and false positive (FP) detection results for the OSU thermal database.
| Video | Atmospheric phenomenon | Time of day | Temp (°C) | Total pedestrians | TP | FP |
|---|---|---|---|---|---|---|
| 1 | Light rain | Afternoon | 13 | 91 | 90 | 0 |
| 2 | Partly cloudy | Morning | 5 | 100 | 98 | 2 |
| 3 | Partly cloudy | Morning | 21 | 101 | 98 | 3 |
| 4 | Fair | Morning | 9 | 109 | 109 | 0 |
| 5 | Partly cloudy | Morning | 25 | 101 | 99 | 1 |
| 6 | Mostly cloudy | Morning | 21 | 97 | 95 | 0 |
| 7 | Light rain | Afternoon | 36 | 94 | 92 | 1 |
| 8 | Light rain | Afternoon | 30 | 99 | 95 | 0 |
| 9 | Haze | Afternoon | 18 | 95 | 95 | 1 |
| 10 | Haze | Afternoon | 23 | 97 | 90 | 1 |
| 984 | 961 | 9 |
Legend for Tables 5 and 6 column names.
| Author | Letter |
|---|---|
|
| A |
|
| B |
|
| C |
|
| D |
|
| E |
Comparing the proposed method with other methods using the number of true positive (TP) detections on the OSU dataset.
| Video | #Pedestrians | A | B | C | D | E | MCE |
|---|---|---|---|---|---|---|---|
| 1 | (91) | 88 | 90 | 87 | 77 | 78 | 90 |
| 2 | (100) | 94 | 95 | 96 | 99 | 98 | 98 |
| 3 | (101) | 101 | 101 | 83 | 64 | - | 98 |
| 4 | (109) | 107 | 108 | 109 | 107 | 109 | 109 |
| 5 | (101) | 90 | 95 | 100 | 97 | 101 | 97 |
| 6 | (97) | 93 | 94 | 94 | 92 | 97 | 93 |
| 7 | (94) | 92 | 93 | 86 | 78 | 80 | 90 |
| 8 | (99) | 75 | 80 | 97 | 89 | 96 | 93 |
| 9 | (95) | 95 | 95 | 95 | 91 | 95 | 95 |
| 10 | (97) | 95 | 95 | 94 | 91 | 83 | 89 |
| 1–10 | (984) | 930 | 946 | 941 | 885 | 829 | 961 |
Comparing the proposed method with other methods using the number of false positive (FP) detections on the OSU dataset.
| Video | A | B | C | D | E | MCE |
|---|---|---|---|---|---|---|
| 1 | 0 | 0 | 5 | 3 | 0 | 0 |
| 2 | 0 | 0 | 14 | 2 | 2 | 2 |
| 3 | 1 | 1 | 27 | 90 | - | 3 |
| 4 | 1 | 0 | 18 | 7 | 10 | 0 |
| 5 | 0 | 0 | 13 | 16 | 16 | 1 |
| 6 | 0 | 0 | 2 | 8 | 0 | 0 |
| 7 | 0 | 0 | 4 | 8 | 0 | 1 |
| 8 | 1 | 1 | 3 | 8 | 0 | 0 |
| 9 | 0 | 0 | 2 | 4 | 0 | 1 |
| 10 | 3 | 3 | 8 | 18 | 16 | 1 |
| 1–10 | 6 | 5 | 96 | 164 | 44 | 9 |
Comparison of the proposed method with the state-of-the-art using precision and recall.
| Author | Metrics | LITIV | LTIR | Terravic | OSU |
|---|---|---|---|---|---|
|
| Precision | 0.9679 | – | – | 0.9737 |
| Recall | 0.7819 | – | – | 0.7375 | |
|
| Precision | – | 0.6700 | 0.9600 | 0.8600 |
| Recall | – | 0.7500 | 0.9500 | 0.8900 | |
|
| Precision | – | – | – | 0.7100 |
| Recall | – | – | – | 0.6100 | |
|
| Precision | – | – | – | 0.9920 |
| Recall | – | – | – | 0.9775 | |
| Proposed method | Precision | 0.9969 | 0.9805 | 0.9899 | 0.9907 |
| Recall | 0.9992 | 0.9771 | 0.9902 | 0.9995 |
Motion-constrained Graph Cut.
| 1: Compute the directional difference images using |
| 2: Compute the location estimate map |
| 3: Compute edge weights according to |
| 4: Minimise the energy using Boykov-Kolmogorov min-cut/max-flow algorithm |
| Figure | Database | Sequence | Image |
|---|---|---|---|
|
| LTIR | Saturated | 00000043 |
|
| LTIR | Saturated | 00000044 |
|
| LTIR | quadrocopter2 | 00000858 |
|
| LTIR | hiding | 00000023 |
|
| LTIR | crossing | 00000058 |
|
| LTIR | street | 00000056 |
|
| LITIV | SEQUENCE2 | in000333 |
|
| LITIV | SEQUENCE5 | in0003992 |
|
| LITIV | SEQUENCE8 | in000105 |
|
| LITIV | SEQUENCE4 | in000103 |
|
| OSU | 00003 | img_00008 |
|
| OSU | 00006 | img_00013 |
|
| OSU | 00008 | img_00013 |
|
| OSU | 00005 | img_00023 |
|
| Terravic | irw01 | 000272 |
|
| Terravic | irw10 | 000366 |
|
| Terravic | irw11 | 000970 |