| Literature DB >> 29765399 |
Ching-Hsue Cheng1, Chia-Pang Chan1, Jun-He Yang1.
Abstract
The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.Entities:
Mesh:
Year: 2018 PMID: 29765399 PMCID: PMC5885405 DOI: 10.1155/2018/1067350
Source DB: PubMed Journal: Comput Intell Neurosci
Financial crisis related researches for research methods and results.
| Author | Research methods | Data period | Sample ratio (crisis : health) | Accuracy |
|---|---|---|---|---|
| Frydman et al. [ | Repeated segmentation logic | 1971–1981 | 58 : 142 | 85.0–94.0 |
|
| ||||
| Mossman et al. [ | LR | 1970–1976 | Moody's Industrial Manual 23 : 23 | 82.6–84.9 |
|
| ||||
| Atiya [ | NN | 1993–1996 | US 444 : 716 | 60.0–90.8 |
|
| ||||
| Chen [ | PCA + DT, PCA + LR | 2000–2007 | Taiwan 50 : 50 | 85.1–97.0 |
|
| ||||
| Li et al. [ | RSBL, MDA, Logit, Probit | N/A | China 135 : 135, | 71.6–88.5 |
|
| ||||
| Korol [ | MDA, DT, NN | 1993–1996 | Warsaw 50 : 135 | 74.1–96.8 |
|
| ||||
| Geng et al. [ | DT, SVM, NN | Open dataset | China 344 : 344 | 70.9–92.1 |
|
| ||||
| Liang et al. [ | SVM, RBF SVM, | Open dataset | China 344 : 344 | 70.9–92.1 |
MDA: multivariate discriminant analysis, DT: decision tree, NN: neural network, PCA: principal component analysis, LR: logistic regression, NB: Naive Bayes, MLP: multilayer perceptron neural network, CART: classification and regression tree, SVM: support vector machines, and RSBL: random subspace binary logit.
Financial ratio used in related organization and researches.
| Source | Financial ratio name |
|---|---|
| Financial holding | (1) Operating capacity, (2) profitability, (3) financial structure, (4) solvency |
|
| |
| Stock exchange information observatory | (1) Financial analysis, (2) profit analysis, (3) gross profit ratio |
|
| |
| Major industries | (1) Financial structure, (2) solvency, (3) operational effectiveness, (4) profitability, (5) multiples, (6) asset-liability analysis, (7) cash flow analysis |
|
| |
| Beaver [ | (1) Cash flow ratio, (2) net profit ratio, (3) debt ratio, (4) current ratio, (5) quick ratio, (6) turnover ratio |
|
| |
| Altman [ | (1) Ratio of operating funds to total assets |
|
| |
| Foster [ | (1) Liquidity ratio, (2) capital structure ratio, (3) profitability ratio, (4) turnover rate |
|
| |
| Wu [ | (1) Financial structure, (2) solvency, (3) operational capacity, (4) profitability |
|
| |
| Li et al. [ | 30 financial ratios |
|
| |
| Korol [ | 14 financial ratios |
|
| |
| Geng et al. [ | 31 financial ratios |
Figure 1Flowchart of time-series financial crisis prediction model proposed.
Figure 2The procedure of computation step for gene expression programming.
Variables and records of time-series dataset.
| Dataset | Variable | Record |
|---|---|---|
| Original data | 54 | 13,452 |
| Feature selection | 35 (including class) | 8,278 |
| Training | 35 (including class) | 5,518 |
| Testing | 35 (including class) | 2,760 |
Definition and formulate of variable [25].
| Financial ratios | Formulate | Variables |
|---|---|---|
| ROA (A) before tax interest | (Continued operating profit and loss + interest expense |
|
|
| ||
| ROA (B) before tax interest depreciation before | Pretax interest before depreciation recurring net profit/total assets |
|
|
| ||
| ROA (C) pretax interest before depreciation | Pretax interest before depreciation recurring net profit/total assets |
|
|
| ||
| ROE (A) after tax | Continue business unit profit and loss/average net |
|
|
| ||
| Operating Expense Ratio | Operating expenses/net operating income |
|
|
| ||
| Cash flow ratio | Cash flow/current Liabilities from operations |
|
|
| ||
| Cash flow per share | (Cash flows from operations − dividends issued by special units)/weighted average number of shares |
|
|
| ||
| Turnover per share (yuan) | Net operating income/(ordinary share capital + special share capital + share dividends to be distributed − treasury shares |
|
|
| ||
| Revenue growth rate | (Net operating income − net operating income for the same period of previous year)/ABS (net operating income for the same period of previous year) |
|
|
| ||
| Pretax net profit growth | (Net profit before tax − net profit before tax for the same period of previous year)/ABS (net profit before tax for the same period of previous year) |
|
|
| ||
| Total assets growth rate | (Total assets − total assets of the same period of last year)/ABS (total assets of same period of previous year) |
|
|
| ||
| Current ratio | Current assets/current liabilities |
|
|
| ||
| Quick ratio | (Cash and cash equivalents + financial assets at fair value through profit or loss − current + available-for-sale financial assets − current + held-to-maturity financial assets − current + hedged derivative financial assets − current + measured financial assets − floating + bond investments with no market-floating + accounts receivable & notes + other receivables + loans to others − mobility)/current liabilities |
|
|
| ||
| Debt ratio% | Total liabilities/total assets |
|
|
| ||
| Net worth/assets | Shareholders' equity/total assets |
|
|
| ||
| The total number of assets turnover | Restore full year revenue/average total assets |
|
|
| ||
| Accounts receivable turnover times | Restore full year revenue/average accounts receivable and bills |
|
|
| ||
| Inventory turnover (times) | Operating costs/average inventory |
|
|
| ||
| Fixed asset turnover times | Restore full year revenue/average real estate plant and equipment |
|
|
| ||
| Working capital to total assets ratio | (Current assets − current liabilities)/total assets |
|
|
| ||
| EBIT to total assets ratio | Pretax interest before net interest/total assets |
|
|
| ||
| Cash flow to total liability ratio | Net cash flow-operating/total liabilities |
|
|
| ||
| Liquidity ratio | Current assets/total assets |
|
|
| ||
| Cash/total assets | Cash/total assets |
|
|
| ||
| Current liabilities/total assets | Current liabilities/total assets |
|
|
| ||
| Fixed Assets/liabilities and shareholders' equity | Real estate plant and equipment/liabilities and shareholders' equity |
|
|
| ||
| Shareholders' equity/total assets | Total shareholders' equity/total assets |
|
|
| ||
| Total liabilities | Current liabilities + noncurrent liabilities |
|
|
| ||
| Net profit after tax | Consolidated profit/loss/net income |
|
|
| ||
| Pretax net profit margin | Net profit before tax/net operating income | |
|
| ||
| Operating margin | Operating margin/net operating income |
|
|
| ||
| Operating profit margin | Operating profit/net operating income |
|
|
| ||
| ROE (B) - regular gain | (Continued operating unit profit or loss − gain on cheap purchases − disposal of property, plant, and equipment benefits + disposal of real estate, plant, and equipment losses − disposal of investment benefits + disposal of investment losses − gains or losses on financial assets (liabilities) measured at fair value through profit or loss + financial assets (liabilities) loss measured at fair value − asset evaluation benefit + asset evaluation loss − rotation interest deduction of financial assets + losses from impairment of financial assets − rotation interest of assets impairment + losses of assets impairment)/ABS (total shareholders' equity in the period + shareholders total equity) |
|
|
| ||
| EPS/total assets | EPS/total assets |
|
|
| ||
| Health: P; distress: N | Asset earning power = earnings before taxes (EBT)/total assets | Class |
Selected attributes for 14 methods.
| Linear | Nonlinear | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chi-sq. | DT | KNN | LDA | Logit | Join | Disjoin | Nave | SVM | RBF | RS | MLP | Join | Disjoin |
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
| ||||||||||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
| |||||||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
| ||||||||
|
|
| ||||||||||||
|
|
|
|
|
|
| ||||||||
|
|
| ||||||||||||
|
|
|
|
|
|
|
| |||||||
|
|
| ||||||||||||
|
|
| ||||||||||||
|
|
|
| |||||||||||
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
| ||||||||||
|
|
|
|
|
|
|
|
| ||||||
|
|
|
|
| ||||||||||
|
|
|
|
|
| Nave |
|
|
| |||||
The results for financial distress dataset.
| Feature selection | Classifier | Accuracy | Type I | Type II |
|---|---|---|---|---|
| Linear | ||||
| Chi-square | GEP | 98.96 | 0.010916 | 0.009292 |
| Decision tree | 98.85 | 0.01279 | 0.008168 | |
| MLP | 95.35 | 0.043108 | 0.054842 | |
| SVM | 97.94 | 0.009018 | 0.051282 | |
| RBF | 77.43 | 0.727498 | 0.018755 | |
| KNN | 87.57 | 0.225761 | 0.078842 | |
| Decision tree | GEP | 99.06 | 0.009967 | 0.00813 |
| Decision tree | 98.98 | 0.011369 | 0.007001 | |
| MLP | 97.43 | 0.024633 | 0.028005 | |
| SVM | 97.91 | 0.008543 | 0.051103 | |
| RBF | 75.21 | 0.838425 | 0.010161 | |
| KNN | 89.61 | 0.218475 | 0.057228 | |
| KNN | GEP | 98.96 | 0.011391 | 0.00813 |
| Decision tree | 98.95 | 0.011369 | 0.008168 | |
| MLP | 97.16 | 0.002369 | 0.092182 | |
| SVM | 95.18 | 0.060161 | 0.018673 | |
| RBF | 82.42 | 0.598425 | 0.048435 | |
| KNN | 98.14 | 0.025846 | 0.085834 | |
| LDA | GEP | 98.96 | 0.011391 | 0.00813 |
| Decision tree | 98.95 | 0.011369 | 0.008168 | |
| MLP | 96.52 | 0.001895 | 0.115519 | |
| SVM | 98.48 | 0.012228 | 0.032914 | |
| RBF | 85.14 | 0.493883 | 0.012481 | |
| KNN | 97.75 | 0.039478 | 0.013478 | |
| Logistic | GEP | 99.02 | 0.011391 | 0.005807 |
| Decision tree | 98.75 | 0.012912 | 0.011453 | |
| MLP | 97.60 | 0.002842 | 0.075846 | |
| SVM | 97.97 | 0.018475 | 0.024504 | |
| RBF | 95.17 | 0.147651 | 0.012779 | |
| KNN | 98.03 | 0.038432 | 0.013497 | |
|
| ||||
| Nonlinear | ||||
| MLP | GEP | 98.25 | 0.041766 | 0.034843 |
| Decision tree | 97.87 | 0.006158 | 0.058343 | |
| MLP | 98.04 | 0.007106 | 0.050175 | |
| SVM | 94.64 | 0.009000 | 0.163361 | |
| RBF | 97.49 | 0.058742 | 0.012411 | |
| KNN | 97.09 | 0.052413 | 0.018754 | |
| Naive Bayes | GEP | 98.52 | 0.010916 | 0.024390 |
| Decision tree | 97.97 | 0.006158 | 0.054842 | |
| MLP | 95.78 | 0.040739 | 0.045508 | |
| SVM | 94.81 | 0.005685 | 0.165694 | |
| RBF | 88.58 | 0.283871 | 0.052137 | |
| KNN | 86.37 | 0.262475 | 0.092571 | |
| RBF network | GEP | 98.99 | 0.00813 | 0.010916 |
| Decision tree | 98.95 | 0.011369 | 0.008168 | |
| MLP | 91.00 | 0.000474 | 0.310385 | |
| SVM | 96.36 | 0.039792 | 0.028005 | |
| RBF | 97.70 | 0.048742 | 0.012378 | |
| KNN | 97.97 | 0.028773 | 0.012475 | |
| Rough set | GEP | 99.02 | 0.010916 | 0.006969 |
| Decision tree | 98.88 | 0.01279 | 0.007001 | |
| MLP | 93.93 | 0.074372 | 0.026838 | |
| SVM | 97.91 | 0.008543 | 0.051103 | |
| RBF | 78.07 | 0.728475 | 0.013418 | |
| KNN | 88.48 | 0.217582 | 0.073591 | |
| SVM | GEP | 98.92 | 0.010067 | 0.012941 |
| Decision tree | 98.85 | 0.011369 | 0.011669 | |
| MLP | 73.28 | 0.03837 | 0.169195 | |
| SVM | 98.11 | 0.016106 | 0.025671 | |
| RBF | 73.10 | 0.884217 | 0.023458 | |
| KNN | 69.13 | 0.592195 | 0.201728 | |
|
| ||||
| Linear | ||||
| Join | GEP | 98.96 | 0.010916 | 0.009292 |
| Decision tree | 98.95 | 0.011369 | 0.008168 | |
| MLP | 97.20 | 0.002369 | 0.091015 | |
| SVM | 96.09 | 0.044529 | 0.025671 | |
| RBF | 78.09 | 0.724258 | 0.022889 | |
| KNN | 86.32 | 0.262479 | 0.091348 | |
| Average | 92.60 | 0.1759866 | 0.041397 | |
| Disjoin | GEP | 98.99 | 0.010916 | 0.00813 |
| Decision tree | 98.88 | 0.01279 | 0.007001 | |
| MLP | 92.38 | 0.085741 | 0.052509 | |
| SVM | 98.15 | 0.008068 | 0.027944 | |
| RBF | 77.43 | 0.732561 | 0.023457 | |
| KNN | 87.57 | 0.262877 | 0.083271 | |
| Average | 92.23 | 0.185492 | 0.033718 | |
|
| ||||
| Nonlinear | ||||
| Join | GEP | 98.96 | 0.011391 | 0.00813 |
| Decision tree | 98.95 | 0.011369 | 0.008168 | |
| MLP | 97.16 | 0.002369 | 0.092182 | |
| SVM | 95.18 | 0.060161 | 0.01867 | |
| RBF | 82.42 | 0.60247 | 0.002431 | |
| KNN | 98.14 | 0.034237 | 0.012798 | |
| Average | 95.14 | 0.120332 | 0.023723 | |
| Disjoin | GEP | 98.96 | 0.010916 | 0.009292 |
| Decision tree | 98.85 | 0.012808 | 0.008168 | |
| MLP | 95.30 | 0.043108 | 0.054842 | |
| SVM | 97.94 | 0.009018 | 0.04878 | |
| RBF | 77.11 | 0.763741 | 0.012885 | |
| KNN | 86.94 | 0.242298 | 0.083179 | |
| Average | 92.51 | 0.180314 | 0.036191 | |
Note. ∗∗ denotes the best result in accuracy, Type I, and Type II, respectively. ∗ denotes the better result for average of join and disjoin in accuracy, Type I, and Type II, respectively.
Figure 3The tree structure of the proposed GEP model.