| Literature DB >> 28665360 |
Debojit Biswas1, Hongbo Su2, Chengyi Wang3, Jason Blankenship4, Aleksandar Stevanovic5.
Abstract
Automatic car counting is an important component in the automated traffic system. Car counting is very important to understand the traffic load and optimize the traffic signals. In this paper, we implemented the Gaussian Background Subtraction Method and OverFeat Framework to count cars. OverFeat Framework is a combination of Convolution Neural Network (CNN) and one machine learning classifier (like Support Vector Machines (SVM) or Logistic Regression). With this study, we showed another possible application area for the OverFeat Framework. The advantages and shortcomings of the Background Subtraction Method and OverFeat Framework were analyzed using six individual traffic videos with different perspectives, such as camera angles, weather conditions and time of the day. In addition, we compared the two algorithms above with manual counting and a commercial software called Placemeter. The OverFeat Framework showed significant potential in the field of car counting with the average accuracy of 96.55% in our experiment.Entities:
Keywords: Background Subtraction Method; Convolution Neural Network; OverFeat Framework; Placemeter; car counting
Year: 2017 PMID: 28665360 PMCID: PMC5539514 DOI: 10.3390/s17071535
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Basic approach of background subtraction method.
Figure 2A simple CNN architecture.
Figure 3The filter on the left might activate strongest when it encounters a horizontal line; the one in the middle for a vertical line and the right one for ‘L’ shape line.
Figure 4Convolving a small region of an image with a set of 5 filters of size F × F.
Figure 5Max pooling over a 2 × 2 region with stride of 2.
Figure 6Fully connected layer.
Architecture of the OverFeat network.
| Layer | 1 | 2 | 3 | 4 | 5 | 6 | 7 | Output 8 |
|---|---|---|---|---|---|---|---|---|
| Stage | conv + max | conv + max | conv | conv | conv + max | full | full | full |
| #channels | 96 | 256 | 512 | 1024 | 1024 | 3072 | 4096 | 1000 |
| Filter size | 11 × 11 | 5 × 5 | 3 × 3 | 3 × 3 | 3 × 3 | - | - | - |
| Conv. stride | 4 × 4 | 1 × 1 | 1 × 1 | 1 × 1 | 1 × 1 | - | - | - |
| Pooling size | 2 × 2 | 2 × 2 | - | - | 2 × 2 | |||
| Pooling stride | 2 × 2 | 2 × 2 | - | - | 2 × 2 | - | - | - |
| Zero-Padding size | - | - | 1 × 1 × 1 × 1 | 1 × 1 × 1 × 1 | 1 × 1 × 1 × 1 | - | - | - |
| Spatial input size | 231 × 231 | 24 × 24 | 12 × 12 | 12 × 12 | 12 × 12 | 6 × 6 | 1 × 1 | 1 × 1 |
Figure 7Background subtraction method results.
Figure 8OverFeat network results.
Accuracy assessment of the algorithms.
| Camera | Time Duration (Local Time) | Manual Counts | Placemeter | BSM | OverFeat |
|---|---|---|---|---|---|
| C34 | 10:00–11:00 | 879 | 582 (66.21%) | 597 (67.91%) | 910 (96.47%) |
| 18:00–19:00 | 2075 | 1467 (70.96%) | 1335 (64.33%) | 2120 (99.97%) | |
| C35 | 07:00–08:00 | 1862 | 1332 (71.53%) | 2236 (79.91%) | 1902 (97.85%) |
| C66 | 11:00–12:00 | 1978 | 1393 (70.42%) | 1674 (84.63%) | 1942 (98.17%) |
| 23:00–00:00 | 549 | 335 (61.02%) | 108 (19.67%) | 566 (99.96%) | |
| C73 | 11:00–11:10 (for 10 min) | 270 | 156 (57.77%) | 151 (52.92%) | 255 (94.44%) |
| C103 | 07:00–08:00 | 210 | 145 (69.04%) | 372 (22.85%) | 225 (92.85%) |
| 11:00–12:00 | 579 | 432 (74.61%) | 463 (79.96%) | 619 (93.09%) | |
| Under the bridge | 09:00-09:01 (1 min) | 52 | - | 50 (96.15%) | 54 (96.15%) |
| Average | - | - | 67.69% | 63.14% | 96.55% |
Figure 9The double counting issue, when a car moves between two RIOs.