| Literature DB >> 30974735 |
Hyun-Koo Kim1, Kook-Yeol Yoo2, Ju H Park3, Ho-Youl Jung4.
Abstract
A traffic light recognition system is a very important building block in an advanced driving assistance system and an autonomous vehicle system. In this paper, we propose a two-staged deep-learning-based traffic light recognition method that consists of a pixel-wise semantic segmentation technique and a novel fully convolutional network. For candidate detection, we employ a binary-semantic segmentation network that is suitable for detecting small objects such as traffic lights. Connected components labeling with an eight-connected neighborhood is applied to obtain bounding boxes of candidate regions, instead of the computationally demanding region proposal and regression processes of conventional methods. A fully convolutional network including a convolution layer with three filters of (1 × 1) at the beginning is designed and implemented for traffic light classification, as traffic lights have only a set number of colors. The simulation results show that the proposed traffic light recognition method outperforms the conventional two-staged object detection method in terms of recognition performance, and remarkably reduces the computational complexity and hardware requirements. This framework can be a useful network design guideline for the detection and recognition of small objects, including traffic lights.Entities:
Keywords: advanced driver assistance system; artificial neural networks; binary semantic segmentation; deep learning; traffic light detection; traffic light recognition
Year: 2019 PMID: 30974735 PMCID: PMC6479298 DOI: 10.3390/s19071700
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Proposed two-staged traffic light recognition method.
Analysis of traffic light size in evaluation dataset.
| Traffic Light’s Size | Proportion (%) | Width [pixel] | Height [pixel] | ||||
|---|---|---|---|---|---|---|---|
| Min | Max | Mean | Min | Max | Mean | ||
| Small (area | 89.01 | 2 | 40 | 9.89 | 4 | 76 | 23.81 |
| Non-small ( | 10.99 | 15 | 99 | 27.39 | 24 | 208 | 62.45 |
| Total | 100.00 | 2 | 99 |
| 4 | 208 |
|
Figure 2Proposed traffic light classification networks.
TL candidate detection performances.
| Measure Metrics | Faster R-CNN | BSSNet-Full-Size | BSSNet-Half-Size | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Small | Non-Small | Total | Small | Non-Small | Total | Small | Non-Small | |
| No. of Traffic Lights (GT) | 4306 | 3815 | 491 | 4306 | 3815 | 491 | 4306 | 3815 | 491 |
| No. of True Positive | 3685 | 3229 | 456 |
|
|
| 3908 | 3417 |
|
| No. of False Negative | 621 | 586 | 35 |
|
|
| 398 | 398 |
|
| No. of False Positive | 2619 | 1933 | 686 | 203 | 139 | 64 |
|
|
|
| Precision (%) | 58.45 | 62.55 | 39.93 | 95.27 | 96.28 | 88.47 |
|
|
|
| Recall (%) | 85.58 | 84.64 | 92.87 |
|
|
| 90.76 | 89.57 |
|
| F-measure (%) | 69.46 | 71.94 | 55.85 |
|
| 93.88 | 93.69 | 93.22 |
|
TL recognition performances (overall mAP and overall AP) on test set.
| TL Recognition Method | Overall mAP (%) | Overall AP (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Small | Non-Small | Green | Red | Yellow | Red-Left | Green-Left | Off | |
| Faster R-CNN [ | 20.40 | 15.85 | 36.15 | 33.46 | 23.81 | 4.75 | 34.69 | 17.59 | 8.08 |
| BSSNet-full-size & TLC1Net | 41.00 | 34.94 | 69.09 | 49.97 | 35.19 | 24.66 | 57.03 | 51.00 | 28.12 |
| BSSNet-full-size & TLC2Net | 42.96 | 36.93 | 72.31 | 52.07 | 37.79 | 26.47 | 59.04 | 53.00 | 29.39 |
| BSSNet-full-size & TLC3Net |
|
|
|
|
|
|
|
|
|
| BSSNet-half-size & TLC1Net | 31.04 | 25.55 | 66.53 | 41.81 | 25.5 | 19.34 | 47.38 | 38.76 | 13.46 |
| BSSNet-half-size & TLC2Net | 34.22 | 28.70 | 70.01 | 43.18 | 28.42 | 21.52 | 53.21 | 42.74 | 16.25 |
| BSSNet-half-size & TLC3Net | 36.32 | 30.69 | 73.62 | 44.80 | 29.60 | 22.65 | 54.74 | 48.02 | 18.10 |
TL recognition performances (mAP@0.5 and AP@0.5) on test set.
| TL Recognition Method | mAP@0.5 (%) | AP@0.5 (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Small | Non-Small | Green | Red | Yellow | Red-Left | Green-Left | Off | |
| Faster R-CNN [ | 38.48 | 31.27 | 57.79 | 70.56 | 52.12 | 8.49 | 59.11 | 27.13 | 13.44 |
| BSSNet-full-size & TLC1Net | 65.32 | 59.04 | 82.86 | 85.47 | 70.64 | 41.88 | 74.28 | 65.89 | 53.73 |
| BSSNet-full-size & TLC2Net | 67.67 | 62.26 | 86.65 | 88.05 | 72.92 | 45.71 | 76.95 | 67.48 | 54.93 |
| BSSNet-full-size & TLC3Net |
|
|
|
|
|
|
|
|
|
| BSSNet-half-size & TLC1Net | 50.67 | 44.94 | 79.60 | 76.58 | 54.43 | 31.59 | 63.29 | 50.36 | 27.78 |
| BSSNet-half-size & TLC2Net | 54.35 | 48.61 | 83.41 | 77.94 | 57.19 | 34.38 | 70.55 | 55.15 | 30.87 |
| BSSNet-half-size & TLC3Net | 57.73 | 52.19 | 87.90 | 80.55 | 59.26 | 36.18 | 72.45 | 62.16 | 35.79 |
Figure 3Average TL recognition performances according to ratio of training to test datasets.
Figure 4Average TL recognition performances according to dataset shift.
TL recognition performances on test dataset in LISA database.
| TL Recognition Method | mAP@0.5 (%) | AP@0.5 (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Small | Non-Small | Go | Go-Left | Warning | Warning-Left | Stop | Stop-Left | |
| Faster R-CNN | 44.34 | 40.05 | 48.63 | 53.22 | 34.84 | 42.97 | 33.97 | 52.26 | 43.78 |
| BSSNet-full-size & TLC3Net |
|
|
|
|
|
|
|
|
|
Hardware requirements according to the TL recognition.
| TL Recognition Method | Network Size [MB] | Comparison |
|---|---|---|
| Faster R-CNN | 242.2 | 1x |
| BSSNet-x-size & TLC1Net | 2.04 | 0.0084x |
| BSSNet-x-size & TLC2Net | 2.05 | 0.0085x |
| BSSNet-x-size & TLC3Net |
|
|
Detail network size for each sub-network of functional stage.
| Functional Stage | Sub-Network | # of Weight Parameters | Network Size [MB] |
|---|---|---|---|
| TL Candidate Detection | BSSNet-full-size | 366,482 | 1.76 |
| BSSNet-half-size | 366,482 | 1.76 | |
| TL Classification | TLC1Net | 65,807 | 0.28 |
| TLC2Net | 65,816 | 0.29 | |
| TLC3Net | 42,687 | 0.21 |
Computational complexity according to TL recognition.
| TL Recognition Method | Average Processing Time | Comparison | |
|---|---|---|---|
| [ms] | [fps] | ||
| Faster R-CNN | 525 | 1.90 | 1x |
| BSSNet-full-size & TLCxNet | 96 | 10.42 | 5.47x |
| BSSNet-half-size & TLCxNet |
|
|
|
Figure 5TL recognition examples of the Faster R-CNN and the proposed method.