| Literature DB >> 28394270 |
Xiaolei Ma1, Zhuang Dai2, Zhengbing He3, Jihui Ma4, Yong Wang5, Yunpeng Wang6.
Abstract
This paper proposes a convolutional neural network (CNN)-based method that learns traffic as images and predicts large-scale, network-wide traffic speed with a high accuracy. Spatiotemporal traffic dynamics are converted to images describing the time and space relations of traffic flow via a two-dimensional time-space matrix. A CNN is applied to the image following two consecutive steps: abstract traffic feature extraction and network-wide traffic speed prediction. The effectiveness of the proposed method is evaluated by taking two real-world transportation networks, the second ring road and north-east transportation network in Beijing, as examples, and comparing the method with four prevailing algorithms, namely, ordinary least squares, k-nearest neighbors, artificial neural network, and random forest, and three deep learning architectures, namely, stacked autoencoder, recurrent neural network, and long-short-term memory network. The results show that the proposed method outperforms other algorithms by an average accuracy improvement of 42.91% within an acceptable execution time. The CNN can train the model in a reasonable time and, thus, is suitable for large-scale transportation networks.Entities:
Keywords: convolutional neural network; deep learning; spatiotemporal feature; traffic speed prediction; transportation network
Year: 2017 PMID: 28394270 PMCID: PMC5422179 DOI: 10.3390/s17040818
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1An illustration of the traffic-to-image conversion on a network.
Figure 2Deep learning architecture of CNN in the context of transportation.
Figure 3Two sub-transportation networks for testing: (a) Network 1, the second ring of Beijing; and (b) Network 2, a network in Northeast Beijing
Figure 4Sample images with spatiotemporal traffic speeds for (a) Network 1; and (b) Network 2.
Different depths for CNN.
| Depth | Structures of Prediction Model |
|---|---|
| Depth-1 | A fully connected layer simply makes predictions using the input layer |
| Depth-2 | 64 conv |
| Depth-3 | 128 conv |
| Depth-4 | 256 conv |
Figure 5Results of CNN in different depths.
Hyperparameters of the CNN.
| Layer | Name | Parameters | Dimensions | Parameter Scale |
|---|---|---|---|---|
| Input | — | — | (1, 236, 20) | — |
| Layer 1 | Convolution | Filter (256, 3, 3) | (256, 236, 20) | 2304 |
| Layer 1 | Pooling | Pooling (2, 2) | (256, 118, 10) | 0 |
| Layer 2 | Convolution | Filter (128, 3, 3) | (128, 118, 10) | 1152 |
| Layer 2 | Pooling | Pooling (2, 2) | (128, 59, 5) | 0 |
| Layer 3 | Convolution | Filter (64, 3, 3) | (64, 59, 5) | 576 |
| Layer 3 | Pooling | Pooling (2, 2) | (64, 30, 3) | 0 |
| Layer 4 | Data flatten | — | (5760, ) | 0 |
| Layer 4 | Fully-connected | — | (1180, ) | 6,796,800 |
| Output | — | — | (1180, ) | — |
Prediction performance (MSE) of the CNN and other algorithms.
| Study Network | Model | MSE of Different Models (on Test Datasets) | |||
|---|---|---|---|---|---|
| Task 1 | Task 2 | Task 3 | Task 4 | ||
| Network 1 | CNN | ||||
| OLS | 27.047 | 31.273 | 41.334 | 48.107 | |
| KNN | 51.700 | 55.708 | 60.256 | 64.132 | |
| RF | 35.092 | 35.431 | 40.476 | 40.638 | |
| ANN | 67.764 | 52.339 | 58.797 | 57.225 | |
| SAE | 60.751 | 69.082 | 65.292 | 68.326 | |
| RNN | 33.408 | 36.833 | 40.551 | 39.038 | |
| LSTM NN | 37.759 | 33.218 | 42.909 | 42.865 | |
| Network 2 | CNN | ||||
| OLS | 33.741 | 41.657 | 50.123 | 62.282 | |
| KNN | 69.965 | 74.863 | 79.367 | 83.881 | |
| RF | 48.603 | 48.946 | 52.676 | 53.067 | |
| ANN | 124.937 | 147.489 | 133.299 | 168.136 | |
| SAE | 85.079 | 94.982 | 82.271 | 99.020 | |
| RNN | 48.877 | 47.470 | 52.577 | 52.114 | |
| LSTM NN | 43.304 | 45.657 | 50.928 | 48.345 | |
Note: * indicates the best result.
Figure 6Results of different algorithms: (a) MSEs on Network 1; and (b) MSEs on Network 2.
Prediction performance (accuracy) of the CNN and other algorithms.
| Study Network | Model | Accuracy Score of Different Models (on Test Datasets) | |||
|---|---|---|---|---|---|
| Task 1 | Task 2 | Task 3 | Task 4 | ||
| Network 1 | CNN | ||||
| OLS | 0.935 | 0.929 | 0.915 | 0.909 | |
| KNN | 0.901 | 0.897 | 0.893 | 0.890 | |
| RF | 0.917 | 0.917 | 0.910 | 0.910 | |
| ANN | 0.869 | 0.876 | 0.852 | 0.865 | |
| SAE | 0.867 | 0.870 | 0.866 | 0.866 | |
| RNN | 0.908 | 0.913 | 0.898 | 0.900 | |
| LSTM NN | 0.910 | 0.908 | 0.901 | 0.905 | |
| Network 2 | CNN | ||||
| OLS | 0.929 | 0.920 | 0.907 | 0.897 | |
| KNN | 0.886 | 0.884 | 0.879 | 0.876 | |
| RF | 0.898 | 0.898 | 0.893 | 0.892 | |
| ANN | 0.794 | 0.867 | 0.823 | 0.832 | |
| SAE | 0.846 | 0.835 | 0.848 | 0.825 | |
| RNN | 0.901 | 0.900 | 0.896 | 0.896 | |
| LSTM NN | 0.903 | 0.907 | 0.901 | 0.895 | |
Note: * indicates the best result.
Figure 7Training time of different algorithms: (a) training time on Network 1; and (b) training time on Network 2.