| Literature DB >> 34945907 |
Zheng Fang1, David L Dowe1, Shelton Peiris2, Dedi Rosadi3.
Abstract
Modeling and analysis of time series are important in applications including economics, engineering, environmental science and social science. Selecting the best time series model with accurate parameters in forecasting is a challenging objective for scientists and academic researchers. Hybrid models combining neural networks and traditional Autoregressive Moving Average (ARMA) models are being used to improve the accuracy of modeling and forecasting time series. Most of the existing time series models are selected by information-theoretic approaches, such as AIC, BIC, and HQ. This paper revisits a model selection technique based on Minimum Message Length (MML) and investigates its use in hybrid time series analysis. MML is a Bayesian information-theoretic approach and has been used in selecting the best ARMA model. We utilize the long short-term memory (LSTM) approach to construct a hybrid ARMA-LSTM model and show that MML performs better than AIC, BIC, and HQ in selecting the model-both in the traditional ARMA models (without LSTM) and with hybrid ARMA-LSTM models. These results held on simulated data and both real-world datasets that we considered.We also develop a simple MML ARIMA model.Entities:
Keywords: Bayesian statistics; deep learning; long short-term memory; minimum message length; neural network; probabilistic modeling; time series
Year: 2021 PMID: 34945907 PMCID: PMC8699874 DOI: 10.3390/e23121601
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1LSTM Structure.
Figure 2LSTM Overlapping.
RMSE in LSTM for simulated data () with different time steps and N = 100.
| No. of LSTM Time Steps |
|
|
|
|
|---|---|---|---|---|
| 1 | 1.2519 | 1.3677 | 1.4962 | 1.3911 |
| 2 | 1.1794 | 1.2442 | 1.3863 | 1.2718 |
| 3 | 1.3372 | 1.6324 | 1.2256 | 1.3018 |
| 4 | 1.2195 | 1.2301 | 1.3284 | 1.3951 |
| 5 | 1.1341 | 1.6294 | 1.4276 | 1.4494 |
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.108 | 1.112 | 1.033 | 1.217 | 1.215 | 1.234 | ||
|
| 1.133 | 1.172 | 1.166 | 1.301 | 1.366 | 1.384 | ||
|
| 1.027 | 1.024 | 1.029 | 1.025 | 1.012 | 1.021 | ||
|
| 1.333 | 1.278 | 1.286 | 1.241 | 1.211 | 1.194 | ||
|
| 0.955 | 0.956 | 0.951 | 0.975 | 0.971 | 0.986 | ||
|
| 1.293 | 1.241 | 1.245 | 1.114 | 1.211 | 1.172 | ||
|
| 0.901 | 0.916 | 0.913 | 0.948 | 0.944 | 0.945 | ||
|
| 1.207 | 1.226 | 1.224 | 1.252 | 1.261 | 1.257 | ||
|
| 1.006 | 0.907 | 0.911 | 1.122 | 1.117 | 1.112 | ||
|
| 1.052 | 1.054 | 1.061 | 1.042 | 1.027 | 1.046 | ||
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.234 | 1.208 | 1.221 | 1.132 | 1.135 | 1.138 | ||
|
| 1.571 | 1.553 | 1.556 | 1.555 | 1.549 | 1.551 | ||
|
| 1.041 | 1.044 | 1.043 | 1.02 | 1.025 | 1.037 | ||
|
| 1.353 | 1.327 | 1.322 | 1.274 | 1.257 | 1.268 | ||
|
| 0.947 | 0.918 | 0.901 | 1.018 | 0.966 | 0.989 | ||
|
| 1.06 | 1.065 | 1.048 | 1.149 | 1.137 | 1.141 | ||
|
| 1.083 | 1.063 | 1.075 | 1.081 | 1.035 | 1.061 | ||
|
| 1.121 | 1.112 | 1.124 | 1.093 | 1.095 | 1.096 | ||
|
| 1.279 | 1.244 | 1.264 | 1.169 | 1.167 | 1.172 | ||
|
| 0.903 | 0.882 | 0.877 | 1.053 | 1.033 | 1.046 | ||
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.263 | 1.253 | 1.256 | 1.217 | 1.125 | 1.192 | ||
|
| 2.641 | 2.631 | 2.694 | 1.848 | 1.822 | 1.803 | ||
|
| 1.221 | 1.186 | 1.199 | 1.102 | 1.094 | 1.124 | ||
|
| 1.044 | 1.145 | 1.093 | 1.138 | 1.153 | 1.148 | ||
|
| 1.086 | 1.066 | 1.073 | 1.038 | 1.036 | 1.036 | ||
|
| 1.112 | 1.139 | 1.101 | 1.202 | 1.153 | 1.166 | ||
|
| 1.053 | 1.038 | 1.044 | 1.058 | 1.055 | 1.063 | ||
|
| 1.263 | 1.247 | 1.251 | 1.204 | 1.191 | 1.211 | ||
|
| 1.613 | 1.679 | 1.669 | 1.541 | 1.531 | 1.539 | ||
|
| 1.092 | 1.047 | 1.052 | 1.074 | 1.041 | 1.045 | ||
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.189 | 1.191 | 1.193 | 1.182 | 1.164 | 1.091 | 1.155 | 1.173 |
|
| 2.307 | 2.308 | 2.305 | 2.298 | 1.862 | 1.868 | 1.889 | 1.852 |
|
| 1.113 | 1.092 | 1.095 | 1.094 | 1.058 | 1.045 | 1.172 | 1.059 |
|
| 1.191 | 1.189 | 1.192 | 1.191 | 1.176 | 1.178 | 1.183 | 1.201 |
|
| 1.094 | 1.093 | 1.095 | 1.097 | 1.101 | 1.061 | 1.065 | 1.093 |
|
| 1.127 | 1.123 | 1.129 | 1.125 | 1.121 | 1.129 | 1.126 | 1.132 |
|
| 1.188 | 1.189 | 1.185 | 1.192 | 1.136 | 1.095 | 1.099 | 1.139 |
|
| 1.232 | 1.221 | 1.222 | 1.212 | 1.268 | 1.19 | 1.197 | 1.203 |
|
| 1.593 | 1.521 | 1.533 | 1.528 | 1.331 | 1.275 | 1.277 | 1.338 |
|
| 1.051 | 1.033 | 1.029 | 1.032 | 1.035 | 1.021 | 1.029 | 1.023 |
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.068 | 1.071 | 1.073 | 1.067 | 1.198 | 1.122 | 1.137 | 1.175 |
|
| 1.994 | 1.994 | 2.056 | 2.04 | 1.93 | 1.932 | 1.926 | 1.921 |
|
| 1.242 | 1.242 | 1.274 | 1.235 | 1.106 | 1.116 | 1.119 | 1.154 |
|
| 1.185 | 1.183 | 1.196 | 1.232 | 1.163 | 1.194 | 1.172 | 1.254 |
|
| 1.348 | 1.254 | 1.269 | 1.304 | 1.257 | 1.139 | 1.212 | 1.256 |
|
| 1.283 | 1.283 | 1.281 | 1.291 | 1.198 | 1.198 | 1.211 | 1.215 |
|
| 1.263 | 1.251 | 1.288 | 1.044 | 1.091 | 1.079 | 1.096 | 1.129 |
|
| 0.987 | 0.987 | 0.989 | 0.989 | 1.007 | 1.017 | 1.022 | 0.999 |
|
| 1.533 | 1.426 | 1.454 | 1.464 | 1.227 | 1.178 | 1.192 | 1.254 |
|
| 1.101 | 1.098 | 1.111 | 1.137 | 1.061 | 1.068 | 1.072 | 1.08 |
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.244 | 1.277 | 1.286 | 1.248 | 1.153 | 1.13 | 1.146 | 1.151 |
|
| 1.359 | 1.359 | 1.366 | 1.359 | 1.474 | 1.491 | 1.477 | 1.474 |
|
| 0.927 | 0.915 | 0.916 | 0.92 | 0.939 | 0.955 | 0.969 | 0.933 |
|
| 1.184 | 1.191 | 1.193 | 1.189 | 1.134 | 1.114 | 1.116 | 1.106 |
|
| 1.137 | 1.136 | 1.129 | 1.117 | 1.082 | 1.082 | 1.088 | 1.085 |
|
| 0.915 | 1.038 | 1.011 | 0.991 | 1.088 | 1.083 | 1.075 | 1.054 |
|
| 1.199 | 1.166 | 1.174 | 1.19 | 1.086 | 1.109 | 1.115 | 1.107 |
|
| 1.108 | 1.101 | 1.132 | 1.129 | 1.184 | 1.186 | 1.192 | 1.184 |
|
| 1.581 | 1.584 | 1.584 | 1.586 | 1.383 | 1.391 | 1.396 | 1.382 |
|
| 1.123 | 1.101 | 1.107 | 1.101 | 1.063 | 1.069 | 1.065 | 1.063 |
Simulated data with and from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 1.024 | 1.028 | 1.029 | 1.031 | 1.033 | 1.02 | 1.021 | 1.024 |
|
| 2.008 | 1.995 | 1.996 | 1.988 | 1.709 | 1.72 | 1.725 | 1.72 |
|
| 1.022 | 1.025 | 1.027 | 1.016 | 1.011 | 1.012 | 1.017 | 1.014 |
|
| 1.172 | 1.168 | 1.171 | 1.166 | 1.164 | 1.177 | 1.172 | 1.17 |
|
| 0.886 | 0.868 | 0.882 | 0.865 | 0.964 | 0.932 | 0.952 | 0.914 |
|
| 1.07 | 1.068 | 1.065 | 1.059 | 1.096 | 1.095 | 1.097 | 1.092 |
|
| 1.215 | 1.191 | 1.194 | 1.184 | 1.22 | 1.091 | 1.124 | 1.166 |
|
| 1.191 | 1.167 | 1.172 | 1.162 | 1.182 | 1.188 | 1.184 | 1.184 |
|
| 1.169 | 1.159 | 1.161 | 1.152 | 0.997 | 1.071 | 1.011 | 1.01 |
|
| 0.874 | 0.846 | 0.852 | 0.844 | 0.936 | 0.939 | 0.935 | 0.938 |
Simulated data with & from Section 6.1.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Order of Stationary ARMA | ARMA | ARMA-LSTM | ||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
|
| 0.988 | 0.966 | 0.967 | 0.968 | 1.016 | 1.012 | 1.017 | 1.014 |
|
| 1.546 | 1.549 | 1.552 | 1.562 | 1.841 | 1.838 | 1.838 | 1.838 |
|
| 1.002 | 1.017 | 1.017 | 1.016 | 1.008 | 1.008 | 1.008 | 1.05 |
|
| 1.156 | 1.165 | 1.163 | 1.165 | 1.167 | 1.156 | 1.159 | 1.156 |
|
| 1.091 | 1.093 | 1.099 | 1.09 | 1.064 | 1.06 | 1.066 | 1.058 |
|
| 1.23 | 1.235 | 1.235 | 1.235 | 1.197 | 1.209 | 1.21 | 1.209 |
|
| 1.041 | 1.07 | 1.069 | 1.063 | 1.135 | 1.053 | 1.096 | 1.139 |
|
| 1.253 | 1.256 | 1.261 | 1.255 | 1.134 | 1.136 | 1.131 | 1.126 |
|
| 1.559 | 1.551 | 1.55 | 1.541 | 1.159 | 1.199 | 1.196 | 1.161 |
|
| 1.073 | 1.068 | 1.071 | 1.068 | 1.083 | 1.062 | 1.067 | 1.062 |
Average of RMSE in forecast window size T = 3, 10, 30, and 50.
| Average of RMSE | ||||||||
|---|---|---|---|---|---|---|---|---|
| ARMA | ARMA-LSTM | |||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
| 1.086 | 1.076 | 1.090 |
| 1.121 | 1.123 | 1.134 |
| |
| 1.149 | 1.136 | 1.144 |
| 1.159 | 1.136 | 1.143 |
| |
| 1.338 | 1.331 | 1.340 |
| 1.234 | 1.221 | 1.224 |
| |
| 1.308 | 1.296 | 1.297 |
| 1.225 |
| 1.219 | 1.221 | |
Figure 3Comparison in forecast window sizes T = 3, 10, 30, and 50.
Average of RMSE for in sample size N = 50, 100, 200, 300, and 500 with forecast window size T = 10.
| Average of RMSE (and Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| ARMA | ARMA-LSTM | |||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
| 1.301 | 1.291 | 1.299 |
| 1.224 |
| 1.216 | 1.244 | |
| 1.149 | 1.136 | 1.144 |
| 1.159 | 1.136 | 1.143 |
| |
| 1.177 | 1.187 | 1.189 |
| 1.159 | 1.161 | 1.164 |
| |
| 1.163 | 1.152 | 1.155 |
| 1.131 | 1.125 | 1.124 |
| |
|
| 1.197 | 1.1984 | 1.196 | 1.180 |
| 1.179 | 1.181 | |
Figure 4Comparison in the in-sample size N = 50, 200, 300, and 500.
Mean, standard deviation, PACF lag 1 to 3 for ten selected stocks.
| Mean | S.D | PACF1 | PACF2 | PACF3 | |
|---|---|---|---|---|---|
| AAPL | 66.440217 | 37.060808 | 0.996875 | 0.044454 | −0.004848 |
| BA | 258.704781 | 82.478194 | 0.995870 | −0.031231 | −0.061804 |
| CSCO | 40.585947 | 8.595774 | 0.994585 | 0.073202 | −0.016488 |
| GS | 227.095242 | 56.820929 | 0.993579 | 0.039741 | −0.043412 |
| IBM | 124.851224 | 10.369478 | 0.982339 | 0.070195 | −0.040622 |
| INTC | 46.269478 | 9.305502 | 0.992194 | 0.178757 | −0.053398 |
| JNJ | 130.715314 | 18.399352 | 0.993930 | 0.050988 | −0.031304 |
| JPM | 104.046116 | 24.467471 | 0.993854 | 0.067756 | −0.049235 |
| KO | 44.519034 | 6.089778 | 0.993828 | 0.031639 | −0.039178 |
| MMM | 173.550240 | 20.467854 | 0.991641 | 0.004475 | 0.026664 |
Figure 5Log prices for ten selected stocks.
RMSE for forecast window sizes T = 3, 5, 10, 30, 50, 70, 100, 130, 150, and 200.
| Average of RMSE (& Standard Deviation) | ||||||||
|---|---|---|---|---|---|---|---|---|
| ARIMA | ARIMA-LSTM | |||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
| 3.027 | 2.914 | 3.075 | 4.414 | 4.302 | 4.375 | |||
| 4.077 | 4.126 | 4.163 | 4.024 | 3.966 | 4.081 | |||
| 4.748 | 4.712 | 4.868 | 5.359 | 5.261 | 5.272 | |||
| 5.872 | 5.91 | 5.994 | 5.754 | 5.726 | 5.643 | |||
| 7.834 | 7.609 | 7.726 | 7.411 | 7.405 | 7.384 | |||
| 9.991 | 9.909 | 10.024 | 10.393 | 10.221 | 10.42 | |||
| 14.465 | 13.991 | 14.197 | 9.304 | 9.235 | 9.253 | |||
| 14.482 | 14.301 | 17.672 | 13.811 | 13.9 | 14.581 | |||
| 22.985 | 22.985 | 23.021 | 17.778 | 17.526 | 17.98 | |||
| 31.144 | 30.502 | 30.712 | 26.831 | 26.662 | 26.507 | |||
LSTM with different time steps for financial data in varying forecast windows.
| No. Steps | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 8.5789 | 10.1965 | 56.7817 | 104.3681 | 123.4805 | 119.2673 | 151.1338 | 107.2951 | 114.8106 | 73.2335 |
| 3 | 5.7604 | 3.5166 | 3.6097 | 13.325 | 10.6368 | 31.9361 | 33.4419 | 26.0112 | 31.5578 | 26.6354 |
| 5 | 4.0695 | 3.0575 | 8.5064 | 11.9009 | 15.5075 | 17.3077 | 19.0942 | 48.0012 | 30.0622 | 36.6099 |
| 7 | 3.9708 | 6.4145 | 10.6368 | 6.8547 | 13.2163 | 16.5474 | 19.0724 | 32.7076 | 20.5954 | 44.1875 |
| 10 | 5.3985 | 6.4576 | 5.9597 | 13.8295 | 16.0972 | 20.6271 | 12.8859 | 28.2251 | 28.2803 | 25.5409 |
Figure 6Comparison in different forecast windows.
RMSE for forecast window sizes T = 3, 5, 10, 30, 50, 70, 100, 130, 150, and 200.
| Average of RMSE & Standard Deviation | ||||||||
|---|---|---|---|---|---|---|---|---|
| ARIMA | ARIMA-LSTM | |||||||
| AIC | BIC | HQ | MML87 | AIC | BIC | HQ | MML87 | |
| 26.805 | 26.689 | 26.569 | 25.768 | 23.066 | 24.791 | |||
| 28.036 | 27.538 | 27.479 | 24.636 | 22.309 | 24.113 | |||
| 30.633 | 31.502 | 31.585 | 26.970 | 27.566 | 27.924 | |||
| 40.730 | 40.788 | 40.157 | 31.022 | 29.382 | 31.689 | |||
| 39.097 | 39.007 | 42.986 | 35.639 | 36.036 | 40.568 | |||
| 34.004 | 33.551 | 34.773 | 48.942 | 45.305 | 49.723 | |||
| 32.002 | 32.444 | 37.925 | 56.024 | 51.543 | 59.705 | |||
| 44.023 | 44.162 | 43.635 | 36.183 | 39.401 | 46.496 | |||
| 44.463 | 44.736 | 43.928 | 32.574 | 31.225 | 33.584 | |||
| 42.150 | 42.372 | 43.75 | 46.711 | 53.393 | 46.363 | |||
LSTM for PM2.5 Beijing data in different time steps and forecast windows.
| No. Steps | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3.0976 | 5.7806 | 16.5048 | 47.8431 | 53.7436 | 67.2412 | 81.7044 | 92.6897 | 73.4192 | 71.5536 |
| 3 | 4.4983 | 8.1565 | 17.0462 | 34.0492 | 36.3896 | 47.2558 | 64.68533 | 90.7986 | 78.9972 | 78.5648 |
| 5 | 4.7719 | 9.2208 | 18.7955 | 33.6065 | 50.4786 | 56.7465 | 59.3666 | 75.4321 | 102.0098 | 88.1695 |
| 7 | 5.9126 | 9.8355 | 15.3696 | 25.8551 | 38.6874 | 53.06845 | 50.962 | 87.998 | 92.101 | 101.2337 |
| 10 | 8.4289 | 11.4749 | 11.4479 | 38.3303 | 44.5138 | 65.6299 | 70.6415 | 74.6879 | 90.1211 | 84.0196 |