| Literature DB >> 33967398 |
Ioannis E Livieris1, Stavros Stavroyiannis2, Lazaros Iliadis3, Panagiotis Pintelas1.
Abstract
Time-series analysis and forecasting problems are generally considered as some of the most challenging and complicated problems in data mining. In this work, we propose a new complete framework for enhancing deep learning time-series models, which is based on a data preprocessing methodology. The proposed framework focuses on conducting a sequence of transformations on the original low-quality time-series data for generating high-quality time-series data, "suitable" for efficiently training and fitting a deep learning model. These transformations are performed in two successive stages: The first stage is based on the smoothing technique for the development of a new de-noised version of the original series in which every value contains dynamic knowledge of the all previous values. The second stage of transformations is performed on the smoothed series and it is based on differencing the series in order to be stationary and be considerably easier fitted and analyzed by a deep learning model. A number of experiments were performed utilizing time-series datasets from the cryptocurrency market, energy sector and financial stock market application domains on both regression and classification problems. The comprehensive numerical experiments and statistical analysis provide empirical evidence that the proposed framework considerably improves the forecasting performance of a deep learning model.Entities:
Keywords: Deep Learning; Forecasting; Stationarity; Time-series
Year: 2021 PMID: 33967398 PMCID: PMC8096631 DOI: 10.1007/s00521-021-06043-1
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.606
Fig. 1Daily CCI30 price trend and the corresponding smoothed series for
Smoothing and stationarity framework to enhance deep learning in time-series forecasting
| /* Phase I: Smoothing */ | |
| Step 1. Import time-series training data. | |
| Step 2. Select value of smoothing parameter | |
| Step 3. Apply smoothing on time-series data using ( | |
| /* Phase II: Differencing-Training */ | |
| Step 4. Apply the ADF unit root test. | |
| Step 5. | |
| Step 6. | |
| Step 7. Apply the transformation based on first differences. | |
| Step 8. Apply the ADF unit root test. | |
| Step 9. | |
| Step 10. Train the deep learning model DL using the transformed time series. | |
| Step 11. | |
| Step 12. Train the deep learning model DL using the smoothed time series. | |
| Step 13. Calculate the model’s predictions on the smoothed training data. | |
| Step 14. Calculate the residuals between the smoothed training data and the model’s predictions. | |
| Step 15. | |
| Step 16. Apply the transformation based on first differences. | |
| Step 17. Re-train the deep learning model DL using the transformed time series. | |
| Step 18. | |
| Step 19. |
Descriptive statistics for CCI30, Brent and DJIA datasets
| Data | Minimum | Maximum | Mean | Std. Dev. | Median | Skewness | Kurtosis |
|---|---|---|---|---|---|---|---|
| Training set | 276.35 | 20796.64 | 4316.56 | 3332.19 | 3548.02 | 1.90 | 4.35 |
| Testing set | 1938.49 | 4760.54 | 3307.14 | 738.22 | 3427.91 | −0.10 | −1.05 |
| Training set | 26.01 | 118.90 | 70.69 | 24.55 | 63.57 | 0.53 | −1.06 |
| Testing set | 14.85 | 70.25 | 57.66 | 12.12 | 61.15 | −1.97 | 3.03 |
| Training set | 13328.85 | 27359.16 | 19721.59 | 3899.00 | 18041.55 | 0.47 | −1.19 |
| Testing set | 18591.93 | 29551.42 | 27064.85 | 2261.92 | 27576.29 | −1.90 | 3.40 |
The number of up and down movements of CCI30, Brent and DJIA datasets
| Data | Up | Down | ||
|---|---|---|---|---|
| CCI30 | ||||
| Training set | 606 | 55.39% | 488 | 44.61% |
| Testing set | 55 | 53.92% | 47 | 46.08% |
| Brent | ||||
| Training set | 850 | 49.97% | 851 | 50.03% |
| Testing set | 68 | 45.64% | 81 | 54.36% |
| DJIA | ||||
| Training set | 925 | 54.35% | 777 | 45.65% |
| Testing set | 81 | 54.36% | 68 | 45.64% |
ADF unit root test of all time series under consideration
| Time series | CCI30 | Brent | DJIA | |||
|---|---|---|---|---|---|---|
| Levels | −2.217 | 0.200 | −1.695 | 0.434 | −0.784 | 0.824 |
| Smoothed ( | −2.256 | 0.187 | −1.712 | 0.425 | −0.768 | 0.828 |
| Smoothed ( | −2.297 | 0.173 | −1.713 | 0.424 | −0.763 | 0.830 |
| Smoothed ( | −2.339 | 0.160 | −1.715 | 0.423 | −0.759 | 0.831 |
| Smoothed ( | −2.380 | 0.147 | −1.717 | 0.422 | −0.753 | 0.832 |
| Smoothed ( | −2.230 | 0.195 | −1.719 | 0.421 | −0.747 | 0.834 |
| Smoothed ( | −2.217 | 0.200 | −1.724 | 0.419 | −0.738 | 0.837 |
| Smoothed ( | −2.210 | 0.203 | −1.731 | 0.415 | −0.723 | 0.841 |
| Smoothed ( | −2.245 | 0.190 | −1.746 | 0.408 | −0.694 | 0.848 |
| Smoothed ( | −2.372 | 0.150 | −1.788 | 0.387 | −0.614 | 0.868 |
ADF unit root test of all differenced time series
| Time series | CCI30 | Brent | DJIA | |||
|---|---|---|---|---|---|---|
| First-differenced | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
| Smoothed | 0.000 | 0.000 | 0.000 | |||
Performance comparison for CCI30 dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 112.70 | 190.58 | 0.931 | 52.60 | 0.512 | 24.754 | 0.683 | 0.341 |
| Smoothed | 112.38 | 192.07 | 0.931 | 53.44 | 0.523 | 24.538 | 0.664 | 0.382 |
| Smoothed | 112.68 | 191.50 | 0.929 | 53.75 | 0.526 | 24.829 | 0.667 | 0.384 |
| Smoothed | 111.77 | 190.81 | 0.934 | 55.52 | 0.544 | 26.016 | 0.677 | 0.411 |
| Smoothed | 113.40 | 190.06 | 0.934 | 55.38 | 0.535 | 26.761 | 0.644 | 0.425 |
| Smoothed | 112.75 | 190.48 | 0.934 | 55.94 | 0.553 | 26.717 | 0.621 | 0.484 |
| Smoothed | 113.84 | 190.65 | 0.933 | 54.58 | 0.539 | 25.785 | 0.619 | 0.459 |
| Smoothed | 116.13 | 192.90 | 0.929 | 53.44 | 0.527 | 25.079 | 0.615 | 0.439 |
| Smoothed | 117.03 | 192.47 | 0.929 | 54.17 | 0.532 | 25.641 | 0.650 | 0.414 |
| Smoothed | 120.79 | 195.82 | 0.927 | 51.35 | 0.511 | 25.742 | 0.546 | 0.475 |
Performance comparison for CCI30 dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 121.85 | 197.56 | 0.919 | 50.42 | 0.491 | 25.123 | 0.646 | 0.336 |
| Smoothed | 118.51 | 196.86 | 0.919 | 51.88 | 0.508 | 25.060 | 0.642 | 0.373 |
| Smoothed | 122.25 | 200.21 | 0.919 | 52.50 | 0.516 | 25.925 | 0.619 | 0.414 |
| Smoothed | 121.81 | 196.20 | 0.924 | 53.98 | 0.512 | 26.381 | 0.606 | 0.418 |
| Smoothed | 120.27 | 195.20 | 0.928 | 54.27 | 0.537 | 26.713 | 0.612 | 0.461 |
| Smoothed | 120.96 | 195.70 | 0.927 | 54.06 | 0.538 | 25.923 | 0.575 | 0.500 |
| Smoothed | 122.29 | 195.19 | 0.924 | 52.71 | 0.523 | 25.875 | 0.575 | 0.491 |
| Smoothed | 123.66 | 195.67 | 0.924 | 52.92 | 0.523 | 24.773 | 0.594 | 0.452 |
| Smoothed | 123.22 | 196.36 | 0.921 | 54.79 | 0.546 | 25.249 | 0.567 | 0.525 |
| Smoothed | 124.81 | 194.35 | 0.922 | 52.60 | 0.525 | 24.871 | 0.542 | 0.507 |
Performance comparison for Brent dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 1.19 | 1.80 | 0.974 | 54.59 | 0.538 | 38.795 | 0.435 | 0.640 |
| Smoothed | 1.19 | 1.80 | 0.974 | 54.53 | 0.537 | 38.383 | 0.428 | 0.645 |
| Smoothed | 1.19 | 1.78 | 0.976 | 54.73 | 0.542 | 38.126 | 0.472 | 0.611 |
| Smoothed | 1.20 | 1.79 | 0.976 | 56.22 | 0.556 | 39.499 | 0.479 | 0.633 |
| Smoothed | 1.20 | 1.78 | 0.977 | 55.41 | 0.543 | 38.884 | 0.449 | 0.634 |
| Smoothed | 1.20 | 1.79 | 0.976 | 54.93 | 0.543 | 38.408 | 0.466 | 0.630 |
| Smoothed | 1.20 | 1.79 | 0.976 | 55.68 | 0.550 | 37.361 | 0.468 | 0.633 |
| Smoothed | 1.23 | 1.82 | 0.975 | 54.26 | 0.533 | 37.149 | 0.410 | 0.655 |
| Smoothed | 1.25 | 1.87 | 0.975 | 51.35 | 0.507 | 35.418 | 0.431 | 0.584 |
| Smoothed | 1.26 | 1.91 | 0.975 | 51.42 | 0.506 | 35.302 | 0.402 | 0.610 |
Performance comparison for Brent dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 1.36 | 2.03 | 0.971 | 53.04 | 0.526 | 38.470 | 0.475 | 0.578 |
| Smoothed | 1.33 | 1.97 | 0.972 | 51.76 | 0.516 | 38.806 | 0.500 | 0.533 |
| Smoothed | 1.31 | 1.93 | 0.974 | 51.89 | 0.518 | 38.983 | 0.502 | 0.534 |
| Smoothed | 1.30 | 1.92 | 0.974 | 53.61 | 0.529 | 40.122 | 0.479 | 0.579 |
| Smoothed | 1.30 | 1.89 | 0.974 | 50.74 | 0.505 | 38.340 | 0.469 | 0.540 |
| Smoothed | 1.28 | 1.86 | 0.974 | 51.42 | 0.509 | 39.116 | 0.444 | 0.574 |
| Smoothed | 1.26 | 1.84 | 0.973 | 51.05 | 0.533 | 36.368 | 0.438 | 0.628 |
| Smoothed | 1.25 | 1.82 | 0.975 | 51.62 | 0.509 | 36.496 | 0.424 | 0.595 |
| Smoothed | 1.24 | 1.85 | 0.973 | 52.43 | 0.511 | 35.964 | 0.347 | 0.675 |
| Smoothed | 1.25 | 1.87 | 0.973 | 51.96 | 0.519 | 36.496 | 0.513 | 0.525 |
Performance comparison for DJIA dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 320.24 | 594.22 | 0.928 | 48.18 | 0.472 | 33.495 | 0.580 | 0.363 |
| Smoothed | 317.68 | 589.16 | 0.930 | 48.58 | 0.479 | 32.292 | 0.549 | 0.409 |
| Smoothed | 316.30 | 585.59 | 0.931 | 48.58 | 0.481 | 32.057 | 0.532 | 0.430 |
| Smoothed | 318.67 | 588.26 | 0.930 | 48.11 | 0.475 | 35.161 | 0.543 | 0.406 |
| Smoothed | 318.56 | 586.56 | 0.931 | 48.72 | 0.482 | 35.825 | 0.533 | 0.431 |
| Smoothed | 317.16 | 591.56 | 0.930 | 51.55 | 0.503 | 36.847 | 0.552 | 0.419 |
| Smoothed | 316.83 | 588.37 | 0.931 | 50.74 | 0.498 | 34.674 | 0.594 | 0.403 |
| Smoothed | 316.85 | 589.08 | 0.930 | 51.49 | 0.513 | 33.413 | 0.533 | 0.493 |
| Smoothed | 315.21 | 588.47 | 0.931 | 51.62 | 0.504 | 33.790 | 0.631 | 0.378 |
| Smoothed | 315.78 | 587.02 | 0.930 | 50.20 | 0.494 | 33.753 | 0.575 | 0.354 |
Performance comparison for DJIA dataset ()
| MAE | RMSE | Acc (%) | AUC | GM | Sen | Spe | ||
|---|---|---|---|---|---|---|---|---|
| First-differenced | 328.08 | 602.64 | 0.927 | 47.23 | 0.463 | 34.583 | 0.551 | 0.348 |
| Smoothed | 331.26 | 613.32 | 0.924 | 48.38 | 0.471 | 35.821 | 0.535 | 0.379 |
| Smoothed | 326.80 | 600.03 | 0.928 | 49.53 | 0.486 | 35.191 | 0.540 | 0.379 |
| Smoothed | 323.17 | 595.67 | 0.929 | 48.85 | 0.484 | 37.211 | 0.516 | 0.463 |
| Smoothed | 326.95 | 600.71 | 0.929 | 48.92 | 0.486 | 36.510 | 0.528 | 0.430 |
| Smoothed | 322.70 | 598.46 | 0.929 | 49.19 | 0.491 | 36.874 | 0.533 | 0.460 |
| Smoothed | 326.72 | 610.53 | 0.926 | 48.65 | 0.483 | 36.221 | 0.474 | 0.512 |
| Smoothed | 330.66 | 615.92 | 0.924 | 51.76 | 0.514 | 36.221 | 0.512 | 0.482 |
| Smoothed | 321.74 | 595.43 | 0.922 | 52.57 | 0.526 | 34.797 | 0.532 | 0.466 |
| Smoothed | 334.46 | 647.65 | 0.925 | 49.39 | 0.488 | 34.452 | 0.498 | 0.510 |
FAR test and Finner post hoc test based on MAE metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 9.917 | − | − |
| Smoothed | 10 | 0.9837 | Accepted |
| Smoothed | 12.417 | 0.6883 | Accepted |
| First differenced | 17.667 | 0.1632 | Accepted |
FAR test and Finner post hoc test based on RMSE metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 9.5 | − | − |
| Smoothed | 10.583 | 0.8820 | Accepted |
| Smoothed | 10.75 | 0.8820 | Accepted |
| First differenced | 19.167 | 0.0497 | Rejected |
FAR test and Finner post hoc test based on R metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 7.5 | − | − |
| Smoothed | 10.167 | 0.551661 | Accepted |
| Smoothed | 10.833 | 0.551661 | Accepted |
| First differenced | 21.5 | 0.001814 | Rejected |
FAR test and Finner post hoc test based on accuracy metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 9 | − | − |
| Smoothed | 9.5 | 0.9025 | Accepted |
| Smoothed | 13 | 0.4481 | Accepted |
| First differenced | 18.5 | 0.0487 | Rejected |
FAR test and Finner post hoc test based on AUC metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 7.917 | − | − |
| Smoothed | 11 | 0.4501 | Accepted |
| Smoothed | 12.25 | 0.3998 | Accepted |
| First differenced | 18.833 | 0.0223 | Rejected |
FAR test and Finner post hoc test based on GM metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 8.833 | − | − |
| Smoothed | 10.167 | 0.8465 | Accepted |
| Smoothed | 10.333 | 0.8465 | Accepted |
| First differenced | 20.667 | 0.0112 | Rejected |
FAR test and Finner post hoc test based on SenSpe metric
| Series | Friedman | Finner post hoc test | |
|---|---|---|---|
| Ranking | |||
| Smoothed | 7.667 | − | − |
| Smoothed | 12.833 | 0.5676 | Accepted |
| Smoothed | 10 | 0.2920 | Accepted |
| First differenced | 19.5 | 0.0112 | Rejected |