| Literature DB >> 35634049 |
Pengfei Li1, Jungang Xu1, Keyao Li1.
Abstract
With the development of artificial intelligence technology, an increasing number of researchers try to apply different machine learning and deep learning methods to quantitative trading fields to obtain more stable and efficient trading models. As a typical quantitative trading strategy, stock selection has also attracted a lot of attention. There are many studies and applications on stock selection. However, the existing research and application cannot meet the continuous expansion of the scale and dimension of stock selection data set and cannot meet the needs in terms of efficiency and accuracy of stock selection. A convolutional neural network has been applied to image classification and achieved better results than the traditional methods. In this study, we first constructed a multifactor stock selection data set based on China's stock market. Then, we apply the convolutional neural network model to analyze stock selection data and select stocks. The main contribution of this study is that we build a stock multifactor data set, construct a "factor picture," and classify them by convolutional neural network to select stocks. This study also makes comparative experiments on the decision tree, support vector machine, and feedforward neural network in stock selection on the same data set constructed in this study. The results show that the stock selection method based on the convolutional neural network outperforms other methods in terms of the annual return, sharp ratio, and max drawdown.Entities:
Mesh:
Year: 2022 PMID: 35634049 PMCID: PMC9142321 DOI: 10.1155/2022/4743427
Source DB: PubMed Journal: Comput Intell Neurosci
Algorithm 1Factor selection.
Figure 1Factor picture.
Figure 2Stock selection model based on CNN.
Algorithm 2Stock selection model based on CNN.
Results of decision tree under different parameters.
| Training size | Leaf nodes | Depth | Benchmark | Annual return | Sharp ratio | Max drawdown |
|---|---|---|---|---|---|---|
| 24 | 50 | 20 | 29864 | 15956 | −0.0073 | 0.1946 |
| 36 | 50 | 20 | 45285 | 53892 | 0.02464 | 0.2008 |
| 48 | 50 | 20 | 35711 | 2871 | 0.0050 | 0.2199 |
| 24 | 50 | 25 | 29864 | 7042 | −0.0027 | 0.2186 |
| 36 | 50 | 25 | 45285 | 57700 | 0.0236 | 0.2072 |
| 48 | 50 | 25 | 35711 | 33672 | −0.0072 | 0.2689 |
| 24 | 50 | 30 | 29864 | 9193 | −0.0083 | 0.1953 |
| 36 | 50 | 30 | 45285 | 44739 | 0.0168 | 0.2077 |
| 48 | 50 | 30 | 35711 | 5885 | 0.0078 | 0.2010 |
| 24 | 30 | 25 | 29864 | 43062 | 0.0059 | 0.2087 |
| 36 | 30 | 25 | 45285 | 35755 | 0.0145 | 0.2244 |
| 48 | 30 | 25 | 35711 | 5259 | 0.0051 | 0.2631 |
| 24 | 50 | 25 | 29864 | 10775 | −0.0020 | 0.2065 |
| 36 | 50 | 25 | 45285 | 49842 | 0.0164 | 0.1878 |
| 48 | 50 | 25 | 35711 | 177 | 0.0092 | 0.2265 |
| 24 | 60 | 25 | 29864 | 2796 | −0.0136 | 0.2071 |
| 36 | 60 | 25 | 45285 | 35754 | 0.0101 | 0.2066 |
| 48 | 60 | 25 | 35711 | 10825 | 0.0134 | 0.2355 |
Results of SVM under different parameters.
| Training size | Regular | Benchmark | Annual return | Sharp ratio | Max drawdown |
|---|---|---|---|---|---|
| Coefficient | |||||
| 24 | 1 | 29864 | 38751 | −0.0152 | 0.2381 |
| 36 | 5 | 45285 | 35300 | 0.0150 | 0.2218 |
| 48 | 10 | 35711 | 34711 | 0.0236 | 0.2348 |
| 24 | 1 | 29864 | 38751 | 0.0144 | 0.2303 |
| 36 | 5 | 45285 | −7198 | −0.0060 | 0.2135 |
| 48 | 10 | 35711 | −11128 | −0.0056 | 0.2609 |
| 24 | 1 | 29864 | 19916 | 0.0082 | 0.2191 |
| 36 | 5 | 45285 | 2116 | −0.0071 | 0.2188 |
| 48 | 10 | 35711 | 30951 | 0.0124 | 0.2273 |
Results of FNN under different parameters.
| Training size | Hidden layer | Benchmark | Annual return | Sharp ratio | Max drawdown |
|---|---|---|---|---|---|
| 12 | 1 | 64367 | 5237 | 0.0023 | 0.2080 |
| 24 | 1 | 29864 | 1408 | 0.0192 | 0.2203 |
| 48 | 1 | 35711 | 11342 | 0.0043 | 0.2309 |
| 12 | 2 | 64367 | 32881 | 0.0472 | 0.2114 |
| 24 | 2 | 29864 | 4636 | 0.0150 | 0.2207 |
| 48 | 2 | 35711 | 27498 | 0.0236 | 0.2348 |
| 12 | 3 | 64367 | 38002 | 0.0352 | 0.2114 |
| 24 | 3 | 29864 | 26367 | 0.0250 | 0.1913 |
| 48 | 3 | 35711 | 29701 | 0.0236 | 0.1967 |
Results of CNN under different parameters.
| Training size | Sample width | Type | Benchmark | Annual return | Sharp ratio | Max drawdown |
|---|---|---|---|---|---|---|
| 36 | 3 | 1 | 45285 | 64351 | 0.0146 | 0.2176 |
| 36 | 3 | 2 | 45285 | 61248 | 0.0196 | 0.2420 |
| 36 | 3 | 3 | 45285 | 20171 | 0.0134 | 0.2188 |
| 24 | 3 | 1 | 29864 | 12915 | −0.0011 | 0.2437 |
| 24 | 3 | 2 | 29864 | 75060 | 0.0243 | 0.2225 |
| 24 | 3 | 3 | 29864 | 15195 | −0.0135 | 0.2548 |
| 36 | 6 | 1 | 45285 | 6464 | −0.0003 | 0.2171 |
| 36 | 6 | 2 | 45285 | 27737 | 0.0159 | 0.2060 |
| 36 | 6 | 3 | 45285 | 58898 | 0.0194 | 0.2041 |
| 24 | 6 | 1 | 29864 | −6565 | −0.0038 | 0.2429 |
| 24 | 6 | 2 | 29864 | −24867 | −0.0210 | 0.2394 |
| 24 | 6 | 3 | 29864 | 12791 | −0.0013 | 0.2159 |
Best performance of four models.
| Model | Benchmark | Annual return | Sharpe ratio | Max drawdown |
|---|---|---|---|---|
| Decision tree | 45285 | 64267 | 0.0506 | 0.1929 |
| SVM | 45285 | 66376 | 0.0619 | 0.2326 |
| FNN | 45285 | 38001 | 0.0335 | 0.2114 |
| CNN | 45285 | 71248 | 0.0696 | 0.2150 |
Figure 3The comparison of different stock selection methods.
The Stock Selection factors.
| No. | Factor | Type of factor | Description |
|---|---|---|---|
| 1 | TMV | Value | Total market value |
| 2 | CPs | Value | The average daily closing price of individual stocks in the last month is taken as logarithm |
| 3 | EPS | Value | Earnings per share |
| 4 | TOIS | Value | Total operating income per share |
| 5 | PE | Value | Price-to-earnings ratio, PE = total market value/net profit = PRICE/EPS |
| 6 | PECut | Value | Price-to-earnings cut ratio, PECut = total market value/net profit after deducting nonrecurring profit and loss |
| 7 | PB | Value | Price-to-book ratio, PB = total market value/net assets = = PRICE/BPS |
| BPS : BOOK PER SHARE | |||
| 8 | PS | Value | Price-to-sales ratio, PS = total market value/operating income = PRICE/SPS |
| SPS : SALE PER SHARE | |||
| 9 | PNFC | Value | Price-to-net cash flow, PNFC = total market value/net cash flow |
| 10 | POCF | Value | Price-to-operating cash flow, POCF = total market value/operating cash flow |
| 11 | ORYY | Growth ability | Year-to-year growth rate of operating revenue |
| 12 | NPYY | Growth ability | Year-to-year growth rate of net profit |
| 13 | OCFYY | Growth ability | Year-to-year growth rate of operating cash flow |
| 14 | ROEYY | Growth ability | Year-to-year growth rate of ROE |
| 15 | ROE | Financial quality | Return on equity, ROE = net profit/net assets |
| 16 | ROA | Financial quality | Return on assets, ROA = [net income + interest |
| 17 | GPM | Financial quality | Gross profit margin, GPM = gross profit/operating income |
| 18 | NPM | Financial quality | Net profit margin, NPM = net profit/operating income |
| 19 | AT | Financial quality | Asset turnover, AT = total revenue/total asset |
| 20 | RTR | Financial quality | Receivables turnover ratio |
| 21 | CAT | Financial quality | Current assets turnover |
| 22 | IT | Financial quality | Inventory turnover |
| 23 | FAT | Financial quality | Fixed assets turnover |
| 24 | TANA | Debt-paying ability | TANA = total assets/net assets |
| 25 | NCLNA | Debt-paying ability | NCLNA = noncurrent liabilities/net assets |
| 26 | Cash ratio | Debt-paying ability | Cash ratio = (cash + cash equivalents)/current liabilities |
| 27 | Current ratio | Debt-paying ability | Current ratio = current assets/current liabilities |
| 28 | Quick ratio | Debt-paying ability | Quick ratio = liquid capital/current liabilities |
| 29 | Equity ratio | Debt-paying ability | Equity ratio = total liabilities/total owners' equity |
| 30–33 | AANm | Profitability | The arithmetic average value of the daily turnover rate multiplied by the daily yield of individual stocks in the last N months, |
| 34–37 | SDNm | Profitability | Standard deviation of daily return series of individual stocks in the last N months, |
| 38–41 | ARNm | Profitability | Average value of daily return series of individual stocks in the last N months, |
| 42–45 | AT_Nm | Profitability | Average value of daily turnover rate series of individual stocks in the last N months, |
| 46 | RSC | Profitability | Ratio of sales to cost, RSC = cost of sales/net sales |
| 47 | DAR | Profitability | Debt asset ratio, DAR = total indebtedness/total assets |
| 48 | EBIT | Profitability | Earnings before interest and tax, EBIT = net profit + income tax + interest |
| 49 | ROA_Y | Profitability | Return on assets in one year |
| 50 | ROIC_Y | Profitability | Return on invested capital in one year, ROIC = (net income − tax)/total capital |
| 51 | EPSYY | Profitability | Year-to-year growth rate of EPS |
| 52 | NCFYY | Profitability | Year-to-year growth rate of net cash flow from operating activities per share |
| 53 | SPYY | Profitability | Year-to-year growth rate of sales profit |
| 54 | NPYY | Profitability | Year-to-year growth rate of net profit attributable to shareholders of the parent company |
| 55 | NATY | Profitability | Growth rate of net assets per share relative to the beginning of the year |
| 56 | TATY | Profitability | Growth rate of total assets relative to the beginning of the yea |
| 57 | TORYY | Profitability | Year-to-year growth rate of total operating revenue |
| 58 | TORYYS | Profitability | Year-to-year growth rate of total operating revenue in single quarter |
| 59 | OPYYS | Profitability | Year-to-year growth rate of operating profit in single quarter |
| 60 | NPYYS | Profitability | Year-to-year growth rate of net profit attributable to shareholders of the parent company in single quarter |
| 61 | MACD | Technical indicators | Moving average convergence divergence |
| 62 | RSI | Technical indicators | Relative strength index |
| 63 | PSY | Technical indicators | Psychological line |
| 64 | BIAS | Technical indicators |
We constructed a multi-factor stock selection data set containing 64 factors by the method based on Information Coefficient.