| Literature DB >> 35602638 |
Mirza Rizwan Sajid1, Arshad Ali Khan2, Haitham M Albar3, Noryanti Muhammad4, Waqas Sami5,6, Syed Ahmad Chan Bukhari7, Iram Wajahat8.
Abstract
Machine learning (ML) often provides applicable high-performance models to facilitate decision-makers in various fields. However, this high performance is achieved at the expense of the interpretability of these models, which has been criticized by practitioners and has become a significant hindrance in their application. Therefore, in highly sensitive decisions, black boxes of ML models are not recommended. We proposed a novel methodology that uses complex supervised ML models and transforms them into simple, interpretable, transparent statistical models. This methodology is like stacking ensemble ML in which the best ML models are used as a base learner to compute relative feature weights. The index of these weights is further used as a single covariate in the simple logistic regression model to estimate the likelihood of an event. We tested this methodology on the primary dataset related to cardiovascular diseases (CVDs), the leading cause of mortalities in recent times. Therefore, early risk assessment is an important dimension that can potentially reduce the burden of CVDs and their related mortality through accurate but interpretable risk prediction models. We developed an artificial neural network and support vector machines based on ML models and transformed them into a simple statistical model and heart risk scores. These simplified models were found transparent, reliable, valid, interpretable, and approximate in predictions. The findings of this study suggest that complex supervised ML models can be efficiently transformed into simple statistical models that can also be validated.Entities:
Mesh:
Year: 2022 PMID: 35602638 PMCID: PMC9119773 DOI: 10.1155/2022/5475313
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Proposed methodology for exploration of supervised ML models.
Performance assessment of ML models for the development of NLHRS.
| Models | ANN | Linear SVM | RBF SVM | Random forest | LR | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Confusion matrix | Case | Control | Case | Control | Case | Control | Case | Control | Case | Control |
| Case | 185 | 45 | 187 | 43 | 187 | 43 | 191 | 39 | 188 | 42 |
| Control | 35 | 195 | 46 | 184 | 50 | 180 | 56 | 174 | 50 | 180 |
| Accuracy | 82.61 | 80.65 | 79.80 | 79.30 | 80.00 | |||||
| Sensitivity | 0.791 | 0.813 | 0.813 | 0.830 | 0.817 | |||||
| Specificity | 0.848 | 0.800 | 0.783 | 0.757 | 0.783 | |||||
| Kappa statistic | 0.653 | 0.613 | 0.595 | 0.587 | 0.600 | |||||
| AUC | 0.883 | 0.881 | 0.870 | 0.857 | 0.873 | |||||
| RMSE | 0.365 | 0.372 | 0.379 | 0.411 | 0.380 | |||||
| Number of criteria fulfilled | 5/6 | 5/6 | 1/6 | 1/6 | Baseline risk prediction model | |||||
Figure 2Illustration of finalized ANN model for prediction of disease status (DS).
Extraction of relative feature weights using ANN and linear SVM.
| Features | Artificial neural network | Linear support vector machine | ||
|---|---|---|---|---|
| Sum of input feature contribution | Relative feature weights (%) | Original feature weights | Relative feature weights (%) | |
| Age groups | 0.836 | 11.944 | 1.085 | 8.828 |
| Parental history of CVDs | 0.401 | 5.735 | 0.330 | 2.683 |
| Self-reported general stress | 0.360 | 5.138 | 0.732 | 5.957 |
| Consumption of high salty foods | 0.429 | 6.128 | 1.024 | 8.338 |
| Low fruit consumption | 0.666 | 9.517 | 0.832 | 6.773 |
| Physical inactivity | 0.562 | 8.024 | 1.167 | 9.499 |
| High fried foods/trans fats | 0.404 | 5.770 | 0.857 | 6.976 |
| Abdominal obesity | 0.406 | 5.794 | 1.046 | 8.512 |
| Diabetes mellitus | 0.446 | 6.371 | 0.978 | 7.961 |
| Hypertension | 0.737 | 10.527 | 1.386 | 11.283 |
| Smoking history | 0.635 | 9.067 | 0.969 | 7.887 |
| Low vegetables consumption | 0.588 | 8.394 | 0.894 | 7.273 |
| Red meat/poultry consumption | 0.531 | 7.591 | 0.987 | 8.030 |
|
|
|
|
|
|
Performance assessment of NLHRS-based risk prediction models.
| Models | ANN-RS | SVM-RS | ||
|---|---|---|---|---|
| Confusion matrix | Case | Control | Case | Control |
| Case | 190 | 40 | 190 | 40 |
| Control | 48 | 182 | 36 | 194 |
| Accuracy | 81.000 | 83.500 | ||
| Sensitivity | 0.826 | 0.826 | ||
| Specificity | 0.791 | 0.843 | ||
| Kappa statistic | 0.620 | 0.670 | ||
| AUC | 0.876 | 0.888 | ||
| RMSE | 0.378 | 0.362 | ||
Validation of NLHRS-based risk prediction models.
| Assessment | Test statistic/criteria | Ideal value | ANN-RS | SVM-RS |
|---|---|---|---|---|
| Overall discrimination and calibration | Brier mean probability score | 0 | 0.142 | 0.115 |
| Overall comparison of models | Spiegelhalter's Z-statistic ( | 0 ( | −1.791 (0.073) | −1.443 (0.150) |
| Calibration | H-statistic ( | <20 ( | 13.719 (0.089) | 14.427 (0.071) |
| Discrimination | AUC | 1 | 0.876 | 0.888 |
Figure 3Discrimination strength of ANN-RS- and SVM-RS-based risk prediction models.
Figure 4Calibration chart for NLHRS-based risk prediction models.