| Literature DB >> 29382078 |
Roxana Velazquez-Pupo1, Alberto Sierra-Romero2, Deni Torres-Roman3, Yuriy V Shkvarko4, Jayro Santiago-Paz5, David Gómez-Gutiérrez6, Daniel Robles-Valdez7, Fernando Hermosillo-Reynoso8, Misael Romero-Delgado9.
Abstract
This paper presents a high performance vision-based system with a single static camera for traffic surveillance, for moving vehicle detection with occlusion handling, tracking, counting, and One Class Support Vector Machine (OC-SVM) classification. In this approach, moving objects are first segmented from the background using the adaptive Gaussian Mixture Model (GMM). After that, several geometric features are extracted, such as vehicle area, height, width, centroid, and bounding box. As occlusion is present, an algorithm was implemented to reduce it. The tracking is performed with adaptive Kalman filter. Finally, the selected geometric features: estimated area, height, and width are used by different classifiers in order to sort vehicles into three classes: small, midsize, and large. Extensive experimental results in eight real traffic videos with more than 4000 ground truth vehicles have shown that the improved system can run in real time under an occlusion index of 0.312 and classify vehicles with a global detection rate or recall, precision, and F-measure of up to 98.190%, and an F-measure of up to 99.051% for midsize vehicles.Entities:
Keywords: IoT vision system; One Class Support Vector Machine; adaptive Gaussian mixture model; adaptive Kalman filter; vehicle classification; vehicle detection; vehicle occlusion index
Year: 2018 PMID: 29382078 PMCID: PMC5856131 DOI: 10.3390/s18020374
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Related works in the detection of vehicles.
| Reference | Frames | Scenarios | Traffic Load | ||||
|---|---|---|---|---|---|---|---|
| Saunier, N.; Sayed, T. [ | 302 | 8360 | 3 | - | 88.4 | - | - |
| Hsieh, J.-W.; Yu, S.-H.; Chen, Y.-S.; Hu, W.-F. [ | 20,443 | 16,400 | 3 | - | 82.16 | - | - |
| Hu, Z.; Wang, C.; Uchimura, K. [ | 1074 | Not indicated | - | - | 99.3 | - | - |
| Zhang, W.; Wu, Q. M. J.; Yang, X.; Fang, X. [ | 427 | Not indicated | - | - | 93.87–84.43, 100–83.8 | - | - |
| Fang, W.; Zhao, Y.; Yuan, Y.; Liu, K [ | 226 | 3500 | 2 | - | 86.8, 100 | - | - |
| Arróspide, J.; Salgado, L.; Nieto, M. [ | 4000 | NA | - | - | 96.14, 89.92, 94.14 | - | - |
| Pham, H.V.; Lee, B.-R. [ | 672 | 18,000 | 1 | - | 97.17 | - | - |
| Shirazi, M.S.; Morris, B. [ | Not indicated | 1080 at 8 fps | 3 | - | 94 | - | - |
| Our System (2017) | 4111 | 92,160 at 25 fps | 5 | 1.34 | 82.42–99.24 | 68.7–99.5 | 74.6–98.3 |
Related works in the classification of vehicles.
| Reference | Sensors | Scenarios | Input Space | Result | Reported Metrics |
|---|---|---|---|---|---|
| Hsieh, J.-W.; Yu, S.-H.; Chen, Y.-S.; Hu, W.-F. [ | Camera only | Static side-road camera | Size and the “linearity” of a vehicle | Global TPR of up to 94.8% for cars, minivans, trucks, and van-trucks | |
| Feng, Z.; Mingzhe, W. [ | Anisotropic magnetoresistive (AMR) sensor | Vehicle passes through the sensor | Features of wave length, mean, variance, peak, valley, and acreage | 86%, 80%, 81%, and 89%
| |
| Changjun, Z.; Yuzong, C. [ | Acoustic signals | Vehicles on the road ramp | Set of frequency feature vectors | 95.12%
| |
| Chen, Z.; Pears, N.; Freeman, M.; Austin, J. [ | Stationary roadside (CCTV) camera | Static side-road camera | Size and width of the blob | 88.35%, 69.07%, and 73.47%
| |
| Moussa, G.S. [ | Laser sensor | Top-down laser over road (different scenarios from those presented here.) | Geometric-based features | 99.5%, 93.0%, and 97.5%
| |
| Liang, M.; Huang, X.; Chen, C.H.; Chen, X.; Tokuta, A. [ | Camera only | Static side-road camera | Low level features | 79.9%, 63.4%, and 92.7%,
| |
| Lamas-Seco, J.; Castro, P.; Dapena, A.; Vazquez-Araujo, F. [ | Inductive Loop detectors | Vehicle passes through the sensor | Fourier Transform of inductive signatures | Global
| |
| Kamkar, S.; Safabakhsh, R. [ | Camera only | Static side-road camera | Vehicle length and Grey-Level Co-occurrence matrix features | 71.9% Global
| |
| Our System (2017) | Camera only | Static side-road camera | 3-D geometric-based features | Global
|
Figure 1Block diagram of the proposed system.
Figure 2System initialization.
Figure 3Vehicle detection: (a) actual image, green lines indicate the ROI, and blue line the detection line; (b) background and (c) foreground mask.
Figure 4Occlusion handling when cases 1 and 2 are fulfilled, green lines indicate the ROI, and blue line the detection line. Actual image and foreground mask, (a,c) before applying the algorithm and (b,d) after applying the algorithm.
Figure 5Estimation of the lane width.
Videos analyzed in this work.
| Video | Frames | Vehicles per Second | Occlusion Index | Recording Place | Vehicle Direction | Weather |
|---|---|---|---|---|---|---|
| V1 | 16,925 | 1.24 | 0.312 | Ringroad, Guadalajara, Mexico | Front | Sunny |
| V2 | 5400 | 1.05 | 0.189 | Ringroad, Guadalajara, Mexico | Front | Sunny |
| V3 | 3875 | 0.75 | 0.124 | Ringroad, Guadalajara, Mexico | Front | 0 to 20 s Sunny, 21 to 140 s Cloudy |
| V4 | 7520 | 0.88 | 0.000 | M-30, Madrid, Spain | Rear | Sunny |
| V5 | 9390 | 0.63 | 0.000 | M-30, Madrid, Spain | Rear | Cloudy |
| V6 | 15,050 | 1.32 | 0.249 | M6 motorway, England | Front | Cloudy |
| V7 | 14,875 | 1.21 | 0.203 | M6 motorway, England | Front | Cloudy |
| V8 | 19,125 | 1.18 | 0.202 | M6 motorway, England | Front | Cloudy |
Figure 6Behavior of the selected geometric features of the detected vehicles, (a) area of the detected objects; (b) width of the detected objects, and (c) height of the detected objects.
Figure 7Projection of the vehicles into a classification line (yellow), green lines indicates the ROI.
Figure 8Traffic load (vehicles per second).
Experimental results of the detection stage with occlusion handling.
| Video | |||||||
|---|---|---|---|---|---|---|---|
| V1 | 842 | 694 | 324 | 148 | 82.422 | 68.172 | 74.623 |
| V2 | 228 | 202 | 104 | 26 | 88.596 | 66.013 | 75.655 |
| V3 | 116 | 103 | 30 | 13 | 88.793 | 77.44 | 82.730 |
| V4 | 264 | 262 | 7 | 2 | 99.242 | 97.397 | 98.311 |
| V5 | 236 | 228 | 1 | 8 | 96.610 | 99.563 | 98.064 |
| V6 | 797 | 761 | 53 | 36 | 95.483 | 93.488 | 94.475 |
| V7 | 725 | 686 | 43 | 39 | 94.620 | 94.101 | 94.360 |
| V8 | 903 | 862 | 82 | 41 | 95.459 | 91.313 | 93.340 |
Experimental results of the classification stage.
| Video | Class | Input Space | ||||||
|---|---|---|---|---|---|---|---|---|
| V1 | S | 179 | 179 | 132 | 0 | 100.000 | 57.556 | 73.061 |
| M | 789 | 669 | 20 | 120 | 84.790 | 97.097 | 90.527 | |
| L | 50 | 16 | 2 | 34 | 32.000 | 88.888 | 47.058 | |
| T | 1018 | 864 | 154 | 154 | 84.872 | 84.872 | 84.872 | |
| V2 | S | 35 | 34 | 26 | 1 | 97.142 | 56.666 | 71.578 |
| M | 210 | 177 | 5 | 33 | 84.285 | 97.252 | 90.306 | |
| L | 61 | 55 | 9 | 6 | 90.163 | 85.937 | 88.000 | |
| T | 306 | 266 | 40 | 40 | 86.928 | 86.928 | 86.928 | |
| V3 | S | 11 | 10 | 1 | 1 | 90.909 | 90.909 | 90.909- |
| M | 97 | 95 | 8 | 2 | 97.938 | 92.233 | 95.000 | |
| L | 25 | 18 | 1 | 7 | 72.000 | 94.736 | 81.818 | |
| T | 133 | 123 | 10 | 10 | 92.481 | 92.481 | 92.481 | |
| V4 | S | 16 | 15 | 12 | 1 | 93.750 | 55.555 | 69.767 |
| M | 233 | 222 | 4 | 11 | 95.279 | 98.230 | 96.732 | |
| L | 20 | 14 | 2 | 6 | 70.000 | 87.500 | 77.777 | |
| T | 269 | 251 | 18 | 18 | 93.308 | 93.308 | 93.308 | |
| V5 | S | 3 | 3 | 6 | 0 | 100.00 | 33.333 | 50.000 |
| M | 220 | 211 | 0 | 9 | 95.909 | 100.000 | 97.911 | |
| L | 6 | 4 | 5 | 2 | 66.666 | 44.444 | 53.333 | |
| T | 229 | 218 | 11 | 11 | 95.196 | 95.196 | 95.196 | |
| V6 | S | 3 | 2 | 2 | 1 | 66.667 | 50.000 | 57.142 |
| M | 766 | 755 | 1 | 11 | 98.564 | 99.867 | 99.211 | |
| L | 45 | 45 | 9 | 0 | 100.000 | 83.333 | 90.909 | |
| T | 814 | 802 | 12 | 12 | 98.525 | 98.525 | 98.525 | |
| V7 | S | 2 | 1 | 3 | 1 | 50.000 | 25.000 | 33.333 |
| M | 688 | 676 | 2 | 12 | 98.255 | 99.705 | 98.975 | |
| L | 39 | 37 | 10 | 2 | 94.871 | 78.723 | 86.046 | |
| T | 729 | 714 | 15 | 15 | 97.942 | 97.942 | 97.942 | |
| V8 | S | 5 | 4 | 9 | 1 | 80.000 | 30.769 | 44.444 |
| M | 882 | 867 | 3 | 15 | 98.299 | 99.655 | 98.972 | |
| L | 57 | 55 | 6 | 2 | 96.491 | 90.163 | 93.220 | |
| T | 944 | 926 | 18 | 18 | 98.093 | 98.093 | 98.093 |
Experimental results of the detection stage of videos V6, V7, and V8.
| Test | Video | |||||||
|---|---|---|---|---|---|---|---|---|
| Without occlusion handling | V6 | 797 | 653 | 6 | 144 | 81.932 | 99.089 | 89.697 |
| V7 | 725 | 624 | 12 | 101 | 86.069 | 98.113 | 91.697 | |
| V8 | 903 | 755 | 16 | 148 | 83.610 | 97.924 | 90.203 | |
| Total | 2425 | 2032 | 34 | 393 | 83.793 | 98.354 | 90.492 | |
| With occlusion handling | V6 | 797 | 761 | 53 | 36 | 95.483 | 93.488 | 94.475 |
| V7 | 725 | 686 | 43 | 39 | 94.620 | 94.101 | 94.360 | |
| V8 | 903 | 862 | 82 | 41 | 95.459 | 91.313 | 93.340 | |
| Total | 2425 | 2309 | 178 | 116 | 95.216 | 92.842 | 94.014 |
Figure 9Vehicle examples for every class: (a) small; (b) midsize and (c) large.
Experimental results of the classification stage of videos V6, V7, and V8 using different input spaces and classifiers.
| Test | Class | Input Space | ||||||
| With occlusion handling | S | 10 | 9 | 474 | 1 | 90.000 | 1.863 | 3.651 |
| M | 2336 | 1875 | 63 | 461 | 80.265 | 96.749 | 87.739 | |
| L | 141 | 39 | 27 | 102 | 27.659 | 59.090 | 37.681 | |
| Total | 2487 | 1923 | 564 | 564 | 77.322 | 77.322 | 77.322 | |
| Test | Class | Input Space | ||||||
| With occlusion handling | S | 10 | 10 | 247 | 0 | 100.00 | 3.891 | 7.490 |
| M | 2336 | 2079 | 23 | 257 | 88.998 | 98.905 | 93.690 | |
| L | 141 | 117 | 11 | 24 | 82.978 | 91.406 | 86.988 | |
| Total | 2487 | 2206 | 281 | 281 | 88.701 | 88.701 | 88.701 | |
| Test | Class | Input Space | ||||||
| With occlusion handling | S | 16 | 16 | 100 | 0 | 100.000 | 13.793 | 24.242 |
| M | 2333 | 2214 | 4 | 119 | 94.899 | 99.819 | 97.736 | |
| L | 138 | 133 | 20 | 5 | 96.376 | 86.928 | 91.408 | |
| Total | 2487 | 2363 | 124 | 124 | 95.014 | 95.014 | 95.014 | |
| Test | Class | Input Space | ||||||
| With occlusion handling | S | 10 | 7 | 14 | 3 | 70.000 | 33.333 | 45.161 |
| M | 2336 | 2298 | 6 | 38 | 98.373 | 99.739 | 99.051 | |
| L | 141 | 137 | 25 | 4 | 97.163 | 84.567 | 90.429 | |
| Total | 2487 | 2442 | 45 | 45 | 98.190 | 98.190 | 98.190 | |
Matrix confusion of the classification stage of videos V6, V7, and V8.
| Threshold | K-Means | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| S | M | L | T | S | M | L | T | ||
| S | 9 | 1 | 0 | 10 | S | 10 | 0 | 0 | 10 |
| M | 434 | 1875 | 27 | 2336 | M | 246 | 2079 | 11 | 2336 |
| L | 40 | 62 | 39 | 141 | L | 1 | 23 | 117 | 141 |
| T | 2487 | T | 2487 | ||||||
| (a) | (b) | ||||||||
| SVM | OC-SVM | ||||||||
| S | M | L | T | S | M | L | T | ||
| S | 16 | 0 | 0 | 16 | S | 7 | 3 | 0 | 10 |
| M | 99 | 2214 | 20 | 2333 | M | 13 | 2298 | 25 | 2336 |
| L | 1 | 4 | 133 | 138 | L | 1 | 3 | 137 | 141 |
| T | 2487 | T | 2487 | ||||||
| (c) | (d) | ||||||||