| Literature DB >> 34421218 |
Jing-You Lu1, Hsu-Chao Lai2, Wen-Yueh Shih2, Yi-Feng Chen2, Shen-Hang Huang1, Hao-Han Chang3, Jun-Zhe Wang4, Jiun-Long Huang1,2, Tian-Shyr Dai5.
Abstract
Pairs trading is an effective statistical arbitrage strategy considering the spread of paired stocks in a stable cointegration relationship. Nevertheless, rapid market changes may break the relationship (namely structural break), which further leads to tremendous loss in intraday trading. In this paper, we design a two-phase pairs trading strategy optimization framework, namely structural break-aware pairs trading strategy (SAPT), by leveraging machine learning techniques. Phase one is a hybrid model extracting frequency- and time-domain features to detect structural breaks. Phase two optimizes pairs trading strategy by sensing important risks, including structural breaks and market-closing risks, with a novel reinforcement learning model. In addition, the transaction cost is factored in a cost-aware objective to avoid significant reduction of profitability. Through large-scale experiments in real Taiwan stock market datasets, SAPT outperforms the state-of-the-art strategies by at least 456% and 934% in terms of profit and Sortino ratio, respectively.Entities:
Keywords: Continuous wavelet CNN; Deep reinforcement learning; Pairs trading strategy; Structural break detection
Year: 2021 PMID: 34421218 PMCID: PMC8369334 DOI: 10.1007/s11227-021-04013-x
Source DB: PubMed Journal: J Supercomput ISSN: 0920-8542 Impact factor: 2.474
Fig. 1Stock price of 2337.TW and 2344.TW
Fig. 2Example of the normalized spread of paired stocks
Fig. 3Example of the structural break
Fig. 4Architecture of SAPT
Fig. 5Architecture of SWANet
Fig. 6An example of scalogram
An example of action set A
| Trading boundary | |||||||
| Stop-loss boundary |
Fig. 7Index of TAIEX from 2018/1 to 2020/5
Fig. 8Illustration of formation and trading period in the Taiwan stock market
Fig. 9SWANet-S architecture
Performance of each method ( min)
| Miss rate (%) | True detection rate (%) | Partial detection rate (%) | False detection rate (%) | Average delay (min) | |
|---|---|---|---|---|---|
| 3-std | 30.5 | 24.4 | 19.6 | 44.9 | 22.310 |
| ADF | 27.4 | 15.6 | 39.3 | 54.8 | 24.729 |
| BCD | 52.8 | 11.4 | 24.5 | 34.6 | 22.342 |
| LSTM | 22.4 | 44.7 | 3.6 | 5.7 | 18.284 |
| SWANet-S | 46.0 | 8.4 | 13.2 | 17.319 | |
| SWANet | 21.0 |
The best method in each case/experiment is marked as bold. It is a convention of the machine learning field
Fig. 10Delay distribution
Fig. 11Occurrence probability of structural break
Fig. 12True detection rate of each method
Fig. 13Case 1
Fig. 14Case 2
Fig. 15An example of training and testing with sliding window
Fig. 16Feature influence on cumulative net profit from 2018/1 to 2020/5
Fig. 17Trade volume of SAPT and PTDQN in each month from 2018/1 to 2020/5
Trade counts and net profit of SAPT and PTDQN in 2020/5
| Normal close count | Stop-loss close count | Exit close count | Profit | |
|---|---|---|---|---|
| SAPT | 7656 | 461 | 267 | 743,485 |
| PTDQN | 8342 | 342 | 358 | 647,546 |
Risk indicators of each method
| Sharpe ratio | Sortino ratio | MDD | |
|---|---|---|---|
| SAPT | |||
| SAPT w/o Break | 3.45 | 9.53 | 0.044 |
| SAPT w/o Time | 3.42 | 9.78 | 0.043 |
| SAPT w/o Hold | 3.07 | 6.71 | 0.090 |
| PTDQN | 1.01 | 1.41 | 0.169 |
| SAPT-3-std | 1.07 | 1.77 | 0.143 |
| SAPT-ADF | − 0.23 | − 0.32 | 0.127 |
| SAPT-BCD | − 3.15 | − 2.95 | 0.250 |
| SAPT-LSTM | − 1.32 | − 1.49 | 0.297 |
The best method in each case/experiment is marked as bold. It is a convention of the machine learning field
Fig. 18VIX index and 0050.TW price during COVID-19
Fig. 19Pairs trading during COVID-19
Fig. 20Pairs trading in high-volatility market on March 17, 2020
Fig. 21A case of SAPT (top) and PTDQN (middle) undergoing structural breaks (bottom). Only the trading period is displayed
Fig. 22A case of forced close by exit