| Literature DB >> 35350478 |
Abstract
Exchange rates are affected by the impact of disparate types of new information as well as the couplings between these modalities. Previous work mainly predicted exchange rates solely based on market indicators and therefore achieved unsatisfactory results. In response to such an issue, this study develops an inventive multimodal fusion-based long short-term memory (MF-LSTM) model to forecast the USD/CNY exchange rate. Our model consists of two parallel LSTM modules that extract abstract features from each modality of information and a shared representation layer that fuses these features. In terms of the text modality, bidirectional encoder representations from transformers (BERT) is applied to conduct a sentiment analysis on social media microblogs. Compared to previous studies, we incorporate not only market indicators but also investor sentiments into consideration, treating the two types of data differently to match their exclusive characteristics. In addition, we apply the multimodal fusion technique and contrive a deep coupled model rather than a shallow and simple model to reflect the couplings between the two modalities. As a consequence, the experimental results obtained over a 15-month period exhibit the superiority of the proposed approach over nine baseline algorithms. The purpose of our study is to demonstrate that it is practicable and effective to incorporate multimodal fusion into financial time series forecasting.Entities:
Keywords: Exchange rate forecasting; MF-LSTM; Multimodal fusion; Sentiment analysis
Year: 2022 PMID: 35350478 PMCID: PMC8949836 DOI: 10.1007/s10489-022-03342-5
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Fig. 1Hidden mechanisms beneath the exchange rate
Fig. 2Flowchart of our work
Fig. 3Structure of the MF-LSTM model
Model settings
| Algorithms | Sentiment series | Market indicators | Historical USD/CNY rate | Parameters |
|---|---|---|---|---|
| ARIMA | X | X | O | Order = (2,1,2) |
| BPNN | O | O | O | Hidden units: 52; Activation function: ReLU; Learning rate: 0.05} |
| ELM | O | O | O | Hidden units: 77; Activation function: sigmoid} |
| SVR | O | O | O | Kernel function: sigmoid; C: 100.0; |
| CNN | O | O | O | Convolutional layer: Conv1D, Filter: 64, Kernal size: 2; Pooling layer: MaxPooling1D, Pool size: 2; Optimizer: Adam; Activate function: ReLU |
| LSTM-single | X | X | O | Hidden units: 32; Learning rate: 0.0001; Epoch: 200 |
| LSTM-market | O | O | O | Hidden units: 60; Learning rate: 0.0001; Epoch: 200 |
| LSTM-sentiment | O | X | O | Hidden units: 50; Learning rate: 0.0001; Epoch: 200 |
| LSTM-all | X | O | O | Hidden units: 64; Learning rate: 0.0001; Epoch:200 |
| MF-LSTM | O | O | O | Hidden units: 50( |
O means that the factor is applied, while X means not
Selected market indicators and social media keywords
| Category | Market indicator | Social media keyword | |
|---|---|---|---|
| Sina Weibo | |||
| Commodity market | Gold price | “Gold price” | |
| Silver Price | “Silver price” | ||
| WTI crude oil price | “Oil price” | ||
| Stock market | Shanghai Securities Composite Index | N/A | “CN stock market” |
| Shenzhen Component Index | |||
| Dow Jones Index | “US stock market” | N/A | |
| S&P 500 Index | |||
| NASDAQ Index | |||
| NYSE Index | |||
| Bond market | China 10-year bond yield | N/A | N/A |
| U.S. 10-year Treasury bond yield | “US bond market” | ||
| Interest rate | Shibor O/N rate | N/A | N/A |
| Federal fund rate | “US interest rate” | ||
| Exchange rate | USD/CNY rate | “Dollar index” | “USD/CNY” |
Evaluation dataset of sentiment analysis
| Social media keyword | Positive | Neutral | Negative | Total |
|---|---|---|---|---|
| “Gold price” | 76 | 194 | 119 | 389 |
| “Silver price” | 53 | 89 | 71 | 213 |
| “Oil price” | 108 | 109 | 184 | 401 |
| “CN stock market” | 384 | 1,207 | 1,882 | 3,473 |
| “US stock market” | 285 | 563 | 173 | 1,021 |
| “US bond market” | 201 | 637 | 299 | 1,137 |
| “US interest rate” | 162 | 387 | 313 | 862 |
| “Dollar index” | 276 | 538 | 183 | 997 |
| “USD/CNY” | 1,156 | 695 | 673 | 2,524 |
Sentiment evaluation results
| Evaluation methods | BERT-wwm | RoBERTa |
|---|---|---|
| Precision | 0.83 | 0.81 |
| Recall | 0.84 | 0.79 |
| F1_score | 0.83 | 0.80 |
Forecasting performance with different lag orders
| Lag time | MAE | MSE | RMSE | |
|---|---|---|---|---|
| 1 | 0.9788 | 0.0145 | 0.0004 | 0.0208 |
| 2 | 0.9767 | 0.0147 | 0.0004 | 0.0212 |
| 3 | 0.9702 | 0.0154 | 0.0006 | 0.0249 |
| 4 | 0.9612 | 0.0265 | 0.0013 | 0.0362 |
| 5 | 0.9537 | 0.0384 | 0.0021 | 0.0457 |
Technical results of different algorithms
| Algorithms | MAE | MSE | RMSE | |
|---|---|---|---|---|
| ARIMA | 0.9392 | 0.0271 | 0.0014 | 0.0369 |
| SVR | 0.9582 | 0.0248 | 0.0011 | 0.0335 |
| BPNN | 0.9447 | 0.0256 | 0.0012 | 0.0347 |
| ELM | 0.9578 | 0.0239 | 0.0009 | 0.0312 |
| CNN | 0.9603 | 0.0232 | 0.0009 | 0.0299 |
| LSTM-single | 0.9655 | 0.0235 | 0.0009 | 0.0304 |
| LSTM-sentiment | 0.9689 | 0.0234 | 0.0008 | 0.0286 |
| LSTM-market | 0.9723 | 0.0191 | 0.0006 | 0.0240 |
| LSTM-all | 0.9788 | 0.0145 | 0.0004 | 0.0208 |
| MF-LSTM-US | 0.9765 | 0.0193 | 0.0006 | 0.0248 |
| MF-LSTM-CN | 0.9693 | 0.0227 | 0.0008 | 0.0289 |
| MF-LSTM |
Six-month window length
Fig. 4Forecasting results of different algorithms with a six-month window length
Fig. 5Seasonal forecasting RMSE
Seasonal forecasting RMSE of different algorithms
| Algorithms | 2020Q1 | 2020Q2 | 2020Q3 | 2020Q4 | 2021Q1 |
|---|---|---|---|---|---|
| ARIMA | 0.0469 | 0.0275 | 0.0385 | 0.0419 | 0.0442 |
| SVR | 0.0423 | 0.0382 | 0.0361 | 0.0379 | 0.0288 |
| BPNN | 0.0402 | 0.0378 | 0.0365 | 0.0399 | 0.0270 |
| ELM | 0.0401 | 0.0338 | 0.0361 | 0.0389 | 0.0224 |
| CNN | 0.0380 | 0.0326 | 0.0311 | 0.0340 | 0.0216 |
| LSTM-single | 0.0321 | 0.0302 | 0.0348 | 0.0362 | 0.0234 |
| LSTM-sentiment | 0.0281 | 0.0308 | 0.0314 | 0.0334 | 0.0210 |
| LSTM-market | 0.0259 | 0.0273 | 0.0260 | 0.0261 | 0.0182 |
| LSTM-all | 0.0195 | 0.0236 | 0.0220 | 0.0242 | 0.0161 |
| MF-LSTM |
Technical results of different algorithms (robustness tests on out-of-time data)
| Algorithms | MAE | MSE | RMSE | |
|---|---|---|---|---|
| ARIMA | 0.9388 | 0.0275 | 0.0014 | 0.0372 |
| SVR | 0.9587 | 0.0240 | 0.0011 | 0.0328 |
| BPNN | 0.9466 | 0.0261 | 0.0011 | 0.0339 |
| ELM | 0.9580 | 0.0243 | 0.0010 | 0.0317 |
| CNN | 0.9611 | 0.0237 | 0.0009 | 0.0303 |
| LSTM-single | 0.9657 | 0.0233 | 0.0009 | 0.0298 |
| LSTM-sentiment | 0.9691 | 0.0229 | 0.0008 | 0.0282 |
| LSTM-market | 0.9746 | 0.0188 | 0.0006 | 0.0237 |
| LSTM-all | 0.9785 | 0.0147 | 0.0004 | 0.0211 |
| MF-LSTM |
Six-month window length
Statistical results of different algorithms
| Algorithms | PT statistic | DM statistic |
|---|---|---|
| ARIMA | 0.8125 | − 5.7468*** |
| SVR | 1.2039 | − 5.2486*** |
| BPNN | 1.1433 | − 5.3276*** |
| ELM | 1.2918* | − 5.1937*** |
| CNN | 1.9922** | − 5.1329*** |
| LSTM-single | 2.6138** | − 4.4173*** |
| LSTM-sentiment | 3.0192*** | − 3.5404*** |
| LSTM-market | 3.9765*** | − 3.0719*** |
| LSTM-all | 5.2011*** | − 2.3561** |
| MF-LSTM |
*** denotes rejection of the null hypothesis at 1% significance level. ** denotes rejection of the null hypothesis at 5% significance level. * denotes rejection of the null hypothesis at 10% significance level