| Literature DB >> 35498190 |
Guofeng Li1, Zuojuan Li1, Zheji Wang1, Ke Zhang1.
Abstract
Illegal insider trading identification is of great significance to the healthy development of the securities market. However, with the development of information technology, problems such as multidata sources and noise bring challenges to the insider trading identification work. Moreover, most of the current research on insider trading identification is based on single-task learning, which treats enterprises in different industries as a whole. This may ignore the differences between insider trading identification in different industries. In this article, we collect indicators from multiple sources to help regulators identify insider trading and then use information gain and correlation analysis to screen the indicators. Finally, we propose a multitask deep neural network with insider trading identification in different industries as different subtasks. The proposed model takes into account the correlations and differences between different tasks. Results of experiments show that compared with logistic, support vector machine, deep neural network, random forest, and extreme gradient boosting model, the proposed model can identify insider trading of enterprises in different industries more accurately and efficiently. This article provides new ideas for market regulators to maintain the order of the securities market through intelligent means.Entities:
Mesh:
Year: 2022 PMID: 35498190 PMCID: PMC9054416 DOI: 10.1155/2022/4874516
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Flow chart of indicators screening.
Figure 2Identification of insider trading based on MTL-DNN.
Confusion matrix.
| Class | Predicted class | ||
|---|---|---|---|
| 1 | 0 | ||
| Actual | 1 | TP (True positives) | FN (false negatives) |
| Class | 0 | FP (false positives) | TN (true negatives) |
Description of relevant indicators.
| Indicator category | Indicator dimension | Indicator name | Indicator description |
|---|---|---|---|
| Stock performance | Earning | Average daily return | Average daily return of 30 trading days before the material information announcement date |
| Yield amplitude | Difference between the maximum return and minimum of 30 trading days before the material information announcement date | ||
| Cumulative excess return | Accumulation excess return of the 30 trading days before the material information announcement date | ||
| Cumulative excess return amplitude | Difference between the maximum cumulative excess return and the minimum of the 30 trading days before the material information announcement date | ||
| Liquidity | Relative turnover rate | The ratio of the sample shares actual turnover and the reference turnover from T-150 days to T-30 days | |
| Relative spread | The ratio of the actual spread and the average daily spread from T-150 days to T-30 days | ||
| Relative trading volume | The ratio of the actual trading volume and the average daily trading volume from T-150 days to T-30 days | ||
| Relative illiquidity ratio | The ratio of the actual noncurrent ratio and the average daily noncurrent ratio from T-150 days to T-30 days | ||
| Relative amplitude | The ratio of the actual amplitude and the average daily amplitude from T-150 days to T-30 days | ||
| Volatility | Earnings volatility | The standard deviation of the sample stocks returns for the 30 trading days before the information release | |
| Riskiness | BETA | Average daily beta of 30 trading days before the material information announcement date | |
|
| |||
| Corporate governance | Meeting status | The meeting times of the board of directors, the meeting times of the board of supervisors, etc. | |
| Executive information | The first three directors' total compensation, the first three executives' total compensation, independent director number, number of directors holding shares, shareholding ratio of the supervisory board, etc. | ||
| Shareholding status | CR-5 index, CR-10 index, | ||
|
| |||
| Financial indices | Solvency | Current ratio, quick ratio, cash ratio, asset debt ratio, interest coverage ratio, property right ratio | |
| Operating capacity | Inventory turnover, current asset turnover, fixed asset turnover ratio, total asset turnover ratio, accounts payable turnover ratio | ||
| Profitability | Net assets per share, earnings per share, operating margin, cost margin, return on assets, etc. | ||
| Growth capacity | Total asset growth rate, operating revenue growth rate, operating profit growth rate, capital preservation and appreciation rate, capital accumulation rate, etc. | ||
|
| |||
| Media coverage | Media attention | The number of news reports in the 3 months before the material information announcement date | |
| Negative media sentiment | The sentiment score of each news report in the 3 months before material information announcement date is calculated, and then, we calculate the proportion of reports with negative sentiment scores | ||
|
| |||
| Insider trading | Insider trading | A binary variable that takes value of 1 if insider trading occurred and 0 otherwise | |
Figure 3Information gain ranking of chemical manufacturing indicators.
Correlation analysis of chemical manufacturing indicators.
| Number | Retained indicators | Deleted indicators | Correlation coefficient | ||
|---|---|---|---|---|---|
| Indicators | Information gain | Indicators | Information gain | ||
| 1 | CR-10 index | 0.056402 | CR-5 index | 0.032328 | 0.997 |
| 2 | Capital preservation and appreciation rate | 0.050051 | Capital accumulation rate | 0.038245 | 0.995 |
The set of insider trading indicators at a threshold of 70%.
| Indicator category | Indicator dimension | Indicator name |
|---|---|---|
| Stock performance | Earning | Average daily return |
| Cumulative excess return | ||
| Volatility | Earnings volatility | |
| Liquidity | Relative spread | |
| Riskiness | BETA | |
|
| ||
| Corporate governance | Meeting status | The meeting times of the board of supervisors |
| Shareholding status | CR-10 index | |
| Shareholding ratio of the largest shareholder | ||
|
| ||
| Media coverage | Media coverage | Negative media sentiment |
|
| ||
| Financial indices | Growth capacity | Operating profit growth rate |
| Capital preservation and appreciation rate | ||
| Operating capacity | Inventory turnover ratio | |
| Fixed assets turnover ratio | ||
| Accounts payable turnover ratio | ||
| Profitability | Earnings per share | |
| Net assets per share | ||
| Cost margin | ||
| Solvency | Gearing ratio | |
| Equity ratio | ||
Comparison of modeling effects for different index sets.
| Threshold | 60% | 70% | 80% | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Methods | STL-DNN | Pool-DNN | MTL-DNN | STL-DNN | Pool-DNN | MTL-DNN | STL-DNN | Pool-DNN | MTL-DNN |
| Accuracy | 0.871 | 0.883 | 0.884 | 0.818 | 0.901 | 0.934 | 0.701 | 0.741 | 0.888 |
| Recall | 0.883 | 0.885 | 0.889 | 0.830 | 0.841 | 0.906 | 0.803 | 0.761 | 0.889 |
| AUC | 0.849 | 0.884 | 0.861 | 0.828 | 0.907 | 0.932 | 0.798 | 0.883 | 0.890 |
| F1-score | 0.893 | 0.881 | 0.897 | 0.887 | 0.898 | 0.944 | 0.809 | 0.879 | 0.886 |
Comparison of multiple models in insider trading identification.
| Methods | Logistic | SVM | RF | XGBoost | DNN | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pool | STL | Pool | STL | Pool | STL | Pool | STL | Pool | STL | MTL | ||
| Accuracy | Industry1 | — | 0.837 | — | 0.811 | — | 0.963 | — | 0.974 | — | 0.889 | 0.868 |
| Industry2 | — | 0.849 | — | 0.939 | — | 0.849 | — | 0.859 | — | 0.846 | 0.969 | |
| Industry3 | — | 0.875 | — | 0.854 | — | 0.965 | — | 0.931 | — | 0.667 | 0.909 | |
| Industry4 | — | 0.794 | — | 0.830 | — | 0.766 | — | 0.765 | — | 0.788 | 0.960 | |
| Industry5 | — | 0.800 | — | 0.789 | — | 0.903 | — | 0.974 | — | 0.902 | 0.966 | |
| Average | 0.856 | 0.831 | 0.813 | 0.845 | 0.901 | 0.889 | 0.874 | 0.900 | 0.901 | 0.818 | 0.934 | |
|
| ||||||||||||
| Recall | Industry1 | — | 0.615 | — | 0.732 | — | 0.867 | — | 0.946 | — | 0.889 | 0.871 |
| Industry2 | — | 0.529 | — | 0.861 | — | 0.652 | — | 0.582 | — | 0.861 | 0.911 | |
| Industry3 | — | 0.750 | — | 0.940 | — | 0.843 | — | 0.615 | — | 0.708 | 0.893 | |
| Industry4 | — | 0.511 | — | 0.760 | — | 0.538 | — | 0.500 | — | 0.789 | 0.939 | |
| Industry5 | — | 0.600 | — | 0.952 | — | 0.819 | — | 0.964 | — | 0.903 | 0.917 | |
| Average | 0.682 | 0.601 | 0.858 | 0.849 | 0.758 | 0.744 | 0.724 | 0.721 | 0.841 | 0.830 | 0.906 | |
|
| ||||||||||||
| AUC | Industry1 | — | 0.782 | — | 0.771 | — | 0.914 | — | 0.973 | — | 0.892 | 0.871 |
| Industry2 | — | 0.868 | — | 0.890 | — | 0.734 | — | 0.869 | — | 0.868 | 0.967 | |
| Industry3 | — | 0.920 | — | 0.750 | — | 0.909 | — | 0.977 | — | 0.685 | 0.896 | |
| Industry4 | — | 0.769 | — | 0.915 | — | 0.651 | — | 0.690 | — | 0.790 | 0.960 | |
| Industry5 | — | 0.726 | — | 0.722 | — | 0.911 | — | 0.987 | — | 0.902 | 0.966 | |
| Average | 0.763 | 0.813 | 0.791 | 0.810 | 0.879 | 0.824 | 0.902 | 0.899 | 0.907 | 0.828 | 0.932 | |
|
| ||||||||||||
|
| Industry1 | — | 0.667 | — | 0.826 | — | 0.836 | — | 0.972 | — | 0.930 | 0.878 |
| Industry2 | — | 0.546 | — | 0.872 | — | 0.634 | — | 0.650 | — | 0.949 | 0.969 | |
| Industry3 | — | 0.692 | — | 0.836 | — | 0.906 | — | 0.762 | — | 0.874 | 0.973 | |
| Industry4 | — | 0.632 | — | 0.867 | — | 0.557 | — | 0.533 | — | 0.763 | 0.958 | |
| Industry5 | — | 0.643 | — | 0.936 | — | 0.825 | — | 0.947 | — | 0.921 | 0.945 | |
| Average | 0.677 | 0.636 | 0.814 | 0.867 | 0.789 | 0.752 | 0.846 | 0.773 | 0.898 | 0.887 | 0.944 | |
Figure 4Comparison of MTL-DNN with different number of tasks.