| Literature DB >> 35603041 |
Weijun Xu1, Zhineng Fu1, Hongyi Li2, Jinglong Huang1, Weidong Xu3, Yiyang Luo4.
Abstract
Coronavirus 2019 (COVID-19) has caused violent fluctuation in stock markets, and led to heated discussion in stock forums. The rise and fall of any specific stock is influenced by many other stocks and emotions expressed in forum discussions. Considering the transmission effect of emotions, we propose a new Textual Multiple Auto Regressive Moving Average (TM-ARMA) model to study the impact of COVID-19 on the Chinese stock market. The TM-ARMA model contains a new cross-textual term and a new cross-auto regressive (AR) term that measure the cross impacts of textual emotions and price fluctuations, respectively, and the adjacent matrix which measures the relationships among stocks is updated dynamically. We compute the textual sentiment scores by an emotion dictionary-based method, and estimate the parameter matrices by a maximum likelihood method. Our dataset includes the textual posts from the Eastmoney Stock Forum and the price data for the constituent stocks of the FTSE China A50 Index. We conduct a sliding-window online forecast approach to simulate the real-trading situations. The results show that TM-ARMA performs very well even after the attack of COVID-19.Entities:
Keywords: COVID‐19; FTSE China A50 Index; multiple ARMA; stock forum; textual sentiment
Year: 2022 PMID: 35603041 PMCID: PMC9111149 DOI: 10.1002/sam.11582
Source DB: PubMed Journal: Stat Anal Data Min ISSN: 1932-1864 Impact factor: 1.247
List of symbols and the corresponding descriptions
| Variable | Definition | Value or size |
|---|---|---|
|
| Number of stocks | 50 |
|
| Number of trading days | 323 |
|
| Number of days arranged for initialization | 25 |
|
| The length of training window | 20 |
|
| Hyperparameters |
|
|
| Learning rate | 0.001 |
|
| Returns in day |
|
|
| Textual sentiment in day |
|
|
| Predicted value of |
|
|
| Random vector for noise |
|
|
| Prediction error in day |
|
|
| Parameter vector for intercept |
|
|
| Parameter matrix for the cross‐AR term |
|
|
| Parameter matrix for MA term |
|
|
| Parameter matrix for the cross‐textual term |
|
|
| Series of daily returns |
|
|
| Series of textual sentiment |
|
|
| Series of prediction error |
|
|
| Adjacency matrix |
|
|
| Covariance matrix of |
|
|
| The set of parameter matrices | ( |
FIGURE 1Sliding‐window online forecast approach
Number of posts for constituent stocks of Chinese major stock indices
| Total number of posts | Average daily posts per stock | |||
|---|---|---|---|---|
| Stock index | Pre‐COVID‐19 | Post‐COVID‐19 | Pre‐COVID‐19 | Post‐COVID‐19 |
| FTSE China A50 | 1,054,631 | 1,514,325 | 87 | 124 |
| SSE 50 | 1,055,387 | 1,382,526 | 87 | 113 |
| CSI 100 | 1,911,830 | 2,514,643 | 79 | 103 |
| CSI 300 | 5,247,094 | 6,923,634 | 72 | 95 |
FIGURE 2Data used for forecasting every day
Average performance indicators of the all hyperparameter combinations
| Model |
|
|
|
|
|---|---|---|---|---|
| AR | 26.625 | 0.115 | 0.245 | 0.483 |
| VAR | 24.926 | 0.126 | 0.189 | 0.702 |
| ARMA | 23.851 | 0.118 | 0.184 | 0.685 |
| EM‐ARMA | 21.428 | 0.146 | 0.162 | 0.925 |
| TM‐AR | 22.245 | 0.164 | 0.149 | 1.164 |
| TM‐ARMA | 19.354 | 0.173 | 0.142 | 1.235 |
Performance indicators of the hyperparameter combination with the best RMSE
| Model |
|
|
|
|
|---|---|---|---|---|
| AR | 25.925 | 0.124 | 0.251 | 0.494 |
| VAR | 21.672 | 0.107 | 0.192 | 0.557 |
| ARMA | 22.349 | 0.115 | 0.184 | 0.625 |
| EM‐ARMA | 21.135 | 0.143 | 0.16 | 0.894 |
| TM‐AR | 21.529 | 0.158 | 0.152 | 1.039 |
| TM‐ARMA | 18.872 | 0.165 | 0.148 | 1.115 |
Performance indicators of hyperparameter combination with the best ARR
| Model |
|
|
|
|
|---|---|---|---|---|
| AR | 26.435 | 0.176 | 0.204 | 0.863 |
| VAR | 24.742 | 0.192 | 0.174 | 1.103 |
| ARMA | 23.674 | 0.182 | 0.173 | 1.055 |
| EM‐ARMA | 21.537 | 0.262 | 0.134 | 1.955 |
| TM‐AR | 22.105 | 0.273 | 0.137 | 1.993 |
| TM‐ARMA | 19.093 | 0.309 | 0.134 | 2.297 |
FIGURE 3Net asset value curves of hyperparameter combinations with the best ARR
Performance indicators in the two stages with hyperparameter combination of the best ARR
| ARR | Excess ARR | |||
|---|---|---|---|---|
| Models | Pre‐COVID | Post‐COVID | Pre‐COVID | Post‐COVID |
| AR | 0.297 | 0.235 | 0.050 | 0.002 |
| VAR | 0.197 | 0.256 | −0.050 | 0.023 |
| ARMA | 0.197 | 0.244 | −0.051 | 0.11 |
| EM‐ARMA | 0.397 | 0.354 | 0.150 | 0.121 |
| TM‐AR | 0.412 | 0.370 | 0.165 | 0.137 |
| TM‐ARMA | 0.442 | 0.420 | 0.195 | 0.187 |