| Literature DB >> 29391864 |
Haimin Yang1, Zhisong Pan1, Qing Tao2.
Abstract
Online time series prediction is the mainstream method in a wide range of fields, ranging from speech analysis and noise cancelation to stock market analysis. However, the data often contains many outliers with the increasing length of time series in real world. These outliers can mislead the learned model if treated as normal points in the process of prediction. To address this issue, in this paper, we propose a robust and adaptive online gradient learning method, RoAdam (Robust Adam), for long short-term memory (LSTM) to predict time series with outliers. This method tunes the learning rate of the stochastic gradient algorithm adaptively in the process of prediction, which reduces the adverse effect of outliers. It tracks the relative prediction error of the loss function with a weighted average through modifying Adam, a popular stochastic gradient method algorithm for training deep neural networks. In our algorithm, the large value of the relative prediction error corresponds to a small learning rate, and vice versa. The experiments on both synthetic data and real time series show that our method achieves better performance compared to the existing methods based on LSTM.Entities:
Mesh:
Year: 2017 PMID: 29391864 PMCID: PMC5748146 DOI: 10.1155/2017/9478952
Source DB: PubMed Journal: Comput Intell Neurosci
Algorithm 1Different values of r.
|
|
| |
|---|---|---|
| Outlier | Normal | |
| Outlier | (1) | (2) |
| Normal | (2) | (1) |
Figure 1True value of data sets.
RMSE on synthetic data and real time series.
| Algorithm | Data | |||
|---|---|---|---|---|
| Synthetic | ECG | HandOutlines | DJIA | |
| RLSTM | 0.7606 | 0.8505 | 0.9756 | 1.8454 |
| SR-LSTM | 0.7329 | 0.8323 | 0.9411 | 1.7574 |
| RN-LSTM | 0.7218 | 0.8217 | 0.9376 | 1.6218 |
| RoAdam |
|
|
|
|
Figure 2Prediction value of different algorithms on synthetic data.
Figure 3Prediction value of different algorithms on ECG.
Figure 4Prediction value of different algorithms on HandOutlines.
Figure 5Prediction value of different algorithms on DJIA.