| Literature DB >> 35990123 |
Haifeng Huang1, Lei Huang1, Rongjia Song2, Feng Jiao1, Tao Ai1.
Abstract
The prediction of bus single-trip time is essential for passenger travel decision-making and bus scheduling. Since many factors could influence bus operations, the accurate prediction of the bus single-trip time faces a great challenge. Moreover, bus single-trip time has obvious nonlinear and seasonal characteristics. Hence, in order to improve the accuracy of bus single-trip time prediction, five prediction algorithms including LSTM (Long Short-term Memory), LR (Linear Regression), KNN (K-Nearest Neighbor), XGBoost (Extreme Gradient Boosting), and GRU (Gate Recurrent Unit) are used and examined as the base models, and three ensemble models are further constructed by using various ensemble methods including Random Forest (bagging), AdaBoost (boosting), and Linear Regression (stacking). A data-driven bus single-trip time prediction framework is then proposed, which consists of three phases including traffic data analysis, feature extraction, and ensemble model prediction. Finally, the data features and the proposed ensembled models are analyzed using real-world datasets that are collected from the Beijing Transportation Operations Coordination Center (TOCC). Through comparing the predicting results, the following conclusions are drawn: (1) the accuracy of predicting by using the three ensemble models constructed is better than the corresponding prediction results by using the five sub-models; (2) the Random Forest ensemble model constructed based on the bagging method has the best prediction accuracy among the three ensemble models; and (3) in terms of the five sub-models, the prediction accuracy of LR is better than that of the other four models.Entities:
Mesh:
Year: 2022 PMID: 35990123 PMCID: PMC9388288 DOI: 10.1155/2022/6831167
Source DB: PubMed Journal: Comput Intell Neurosci
Comparison of advantages and disadvantages of popular models.
| Category | Commonly used models | Advantages | Disadvantages | References |
|---|---|---|---|---|
| History of the same period | Smoothing method | Easy understandability, better results in normal conditions and with large time granularity | Excessive reliance on data patterns from historical data | Omkar and Kumar [ |
|
| ||||
| Time series | Kalman filtering | Applicable to time series data and interpretability | Unsuitable for capturing nonlinear data patterns | Zhou et al. [ |
| AR(Auto regressive) | Li et al. [ | |||
| ARIMA | Gummadi and Edara [ | |||
|
| ||||
| Machine learning | SVM SVR | Suitable for learning nonlinear features in data | Low computational efficiency at high data volumes | Li and Xu [ |
| K-nearest neighbor | Sun et al. [ | |||
| Linear regression | Khiari and Olaverri-Monreal [ | |||
| Decision tree | Alajali et al. [ | |||
| Random forest | Zhou et al. [ | |||
|
| ||||
| Deep learning | RNN | Applicable for learning linear and nonlinear patterns with good data fitting capability | Low interpretability and low efficiency | Pang et al. [ |
| LSTM | Agafonov and Yumaganov [ | |||
| GRU | Shu et al. [ | |||
|
| ||||
| Ensembled model | AdaBoost | Applicable to select the appropriate base model for ensemble according to the characteristics of different datasets | Prone to overfitting, low interpretability, and poor results when data are unbalanced | Zhou et al. [ |
| Bootstrapped aggregation | Vaish et al. [ | |||
| Stacked generalization | Sharma et al. [ | |||
| Gradient boosting Machines, GBM | Monego et al. [ | |||
| Gradient boosted regression Trees, GBRT | Chen et al. [ | |||
|
| ||||
| Combined model | Direct averaging, weighted averaging, and other combinations | High applicability with various sub-models and combinations | Subjective on choosing the combination method and sub-models | Yan et al. [ |
Figure 1Structure of the ensemble model based on Random Forest.
Figure 2Structure of AdaBoost-based ensemble model.
Figure 3Structure of ensemble model based on Linear Regression.
Figure 4MLR schematic diagram.
Figure 5KNN algorithm principle.
Figure 6XGBoost principle.
Figure 7Comparison of the internal structure of LSTM and GRU.
Figure 8Prediction steps.
Figure 9Bus single-trip time data trend graph (daily, weekly, monthly).
Figure 10Weather data trend graph.
Figure 11Data distribution.
Figure 12Weather data box plot.
Figure 13KDE plots (single-trip time-visibility, single-trip time-temperature).
Figure 14Ensemble model prediction results (timestep = 4).
Figure 15Ensemble model prediction results (timestep = 6).
Model evaluation indices.
| Ensemble models | TimeStep = 4 | TimeStep = 6 | ||||
|---|---|---|---|---|---|---|
| MAE | RMSE |
| MAE | RMSE |
| |
| LR-stacking | 4.317347 | 5.236942 | 0.155475 | 4.062111 | 4.808697 | 0.196314 |
| RF-bagging | 3.232055 | 4.313371 | 0.909258 | 3.895147 | 4.671306 | 0.883880 |
| AdaBoost-boosting | 4.086188 | 5.040191 | 0.489591 | 3.973029 | 4.724360 | 0.404368 |
Figure 16All model prediction results (timestep = 4).
Figure 17All model prediction results (timestep = 6).
Model training and prediction time.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| Training | Prediction (ms) | Training | Prediction (ms) | |
| LSTM | 115 min | 1257 | 121 min | 1471 |
| GRU | 119 min | 727 | 126 min | 1103 |
| LR | 7 ms | 2 | 16 ms | 3 |
| KNN | 77 ms | 63 | 98 ms | 82 |
| XGBoost | 3619 ms | 48 | 5192 ms | 73 |
| LR-stacking | 58 ms | 11 | 52 ms | 56 |
| RF-bagging | 329 ms | 79 | 417 ms | 136 |
| AdaBoost-boosting | 2351 ms | 92 | 3162 ms | 141 |
Model evaluation indices.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| MAE | RMSE | MAE | RMSE | |
| LSTM | 11.130382 | 11.977166 | 5.008087 | 5.851334 |
| GRU | 4.696761 | 5.694506 | 4.307110 | 5.269999 |
| LR | 4.324053 | 5.275761 | 4.038138 | 4.842730 |
| KNN | 4.931067 | 6.001118 | 4.690769 | 5.538258 |
| XGBoost | 5.112036 | 6.396894 | 4.604660 | 5.554114 |
| LR-stacking | 4.317347 | 5.236942 | 4.062111 | 4.808697 |
| RF-bagging | 3.232055 | 4.313371 | 3.895147 | 4.671306 |
| AdaBoost-boosting | 4.086188 | 5.040191 | 3.973029 | 4.724360 |
Wilcoxon Single-Rank and Friedman test result.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| Wilcoxon single-rank test ( | Friedman test ( | Wilcoxon single-rank test ( | Friedman test ( | |
| LSTM | 0.00004 | 0.000298 | 0.00251 | 0.000463 |
| GRU | 0.00891 | 0.00830 | ||
| LR | 0.00916 | 0.00962 | ||
| KNN | 0.00612 | 0.00973 | ||
| XGBoost | 0.00315 | 0.00988 | ||
| LR-stacking | 0.00979 | 0.01374 | ||
| AdaBoost-boosting | 0.01693 | 0.01421 | ||
Figure 18Ensemble model prediction results (timestep = 4).
Figure 19Ensemble model prediction results (timestep = 6).
Model evaluation indices.
| Ensemble model | TimeStep = 4 | TimeStep = 6 | ||||
|---|---|---|---|---|---|---|
| MAE | RMSE |
| MAE | RMSE |
| |
| LR-stacking | 4.129265 | 5.092021 | 0.123339 | 4.098628 | 4.870725 | 0.205406 |
| RF-bagging | 4.146337 | 5.069491 | 0.836024 | 3.935297 | 4.683990 | 0.846075 |
| AdaBoost-boosting | 4.347092 | 5.273396 | 0.412592 | 4.139669 | 4.920567 | 0.390879 |
Figure 20All model prediction results (timestep = 4).
Figure 21All model prediction results (timestep = 6).
Model training and prediction time.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| Training | Prediction (ms) | Training | Prediction (ms) | |
| LSTM | 85 min | 986 | 106 min | 1326 |
| GRU | 93 min | 645 | 114 min | 735 |
| LR | 4 ms | 1 | 6 ms | 2 |
| KNN | 64 ms | 52 | 71 ms | 68 |
| XGBoost | 2639 ms | 32 | 3283 ms | 42 |
| LR-stacking | 21 ms | 6 | 35 ms | 21 |
| RF-bagging | 214 ms | 51 | 257 ms | 82 |
| AdaBoost-boosting | 1427 ms | 72 | 1974 ms | 102 |
Model evaluation indices.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| MAE | RMSE | MAE | RMSE | |
| LSTM | 4.306787 | 5.199513 | 4.481952 | 5.300563 |
| GRU | 4.861043 | 5.786797 | 4.392949 | 5.170590 |
| LR | 4.539661 | 5.596035 | 4.361890 | 5.276523 |
| KNN | 4.954903 | 6.027883 | 4.684654 | 5.524954 |
| XGBoost | 5.214021 | 6.529260 | 4.698066 | 5.579520 |
| LR-stacking | 4.129265 | 5.092021 | 4.098628 | 4.870725 |
| RF-bagging | 4.146337 | 5.069491 | 3.935297 | 4.683990 |
| AdaBoost-boosting | 4.347092 | 5.273396 | 4.139669 | 4.920567 |
Wilcoxon Single-Rank and Friedman test results.
| Models | TimeStep = 4 | TimeStep = 6 | ||
|---|---|---|---|---|
| Wilcoxon single-rank test ( | Friedman test ( | Wilcoxon single-rank test ( | Friedman test ( | |
| LSTM | 0.00942 | 0.000327 | 0.00887 | 0.000409 |
| GRU | 0.00725 | 0.00891 | ||
| LR | 0.00932 | 0.00736 | ||
| KNN | 0.00462 | 0.00623 | ||
| XGBoost | 0.00152 | 0.00693 | ||
| LR-stacking | 0.00979 | 0.01729 | ||
| AdaBoost-boosting | 0.00957 | 0.00932 | ||