| Literature DB >> 24498276 |
Salah Bouktif1, Eileen Marie Hanna2, Nazar Zaki2, Eman Abu Khousa3.
Abstract
UNLABELLED: Prediction and classification techniques have been well studied by machine learning researchers and developed for several real-word problems. However, the level of acceptance and success of prediction models are still below expectation due to some difficulties such as the low performance of prediction models when they are applied in different environments. Such a problem has been addressed by many researchers, mainly from the machine learning community. A second problem, principally raised by model users in different communities, such as managers, economists, engineers, biologists, and medical practitioners, etc., is the prediction models' interpretability. The latter is the ability of a model to explain its predictions and exhibit the causality relationships between the inputs and the outputs. In the case of classification, a successful way to alleviate the low performance is to use ensemble classiers. It is an intuitive strategy to activate collaboration between different classifiers towards a better performance than individual classier. Unfortunately, ensemble classifiers method do not take into account the interpretability of the final classification outcome. It even worsens the original interpretability of the individual classifiers. In this paper we propose a novel implementation of classifiers combination approach that does not only promote the overall performance but also preserves the interpretability of the resulting model. We propose a solution based on Ant Colony Optimization and tailored for the case of Bayesian classifiers. We validate our proposed solution with case studies from medical domain namely, heart disease and Cardiotography-based predictions, problems where interpretability is critical to make appropriate clinical decisions. AVAILABILITY: The datasets, Prediction Models and software tool together with supplementary materials are available at http://faculty.uaeu.ac.ae/salahb/ACO4BC.htm.Entities:
Mesh:
Year: 2014 PMID: 24498276 PMCID: PMC3911928 DOI: 10.1371/journal.pone.0086456
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Graph for the Solution Construction Mechanism.
The confusion matrix of a decision function . is the number of cases in the evaluation dataset with real label classified as .
| Predicted label | |||||
|
|
|
|
| ||
|
|
|
|
|
| |
| real |
|
|
|
|
|
| label |
|
|
|
|
|
|
|
|
|
|
| |
The symptom attributes used to predict HD in the experiment.
| Name | Description |
| AGE | age of a patient |
| SEX | sex of patient (1 = male; 0 = female) |
| CPT | Chest Pain Type |
| – Value 1: typical angina | |
| – Value 2: atypical angina | |
| – Value 3: non-angina pain | |
| – Value 4: asymptomatic | |
| TRESTBPS | : resting blood pressure (in mm Hg on admission to the hospital) |
| CHOL | Serum Cholesterol in mg/dl |
| FBS | (Fasting Blood Sugar |
| RESTECG | Resting Electrocardiographic results |
| – Value 0: normal | |
| – Value 1: having ST-T wave abnormality (T wave inve- | |
| rsions and/or ST elevation or depression of | |
| – Value 2: showing probable or definite left ventricular | |
| hypertrophy by Estes’ criteria | |
| THALACH | maximum heart rate achieved |
| EXANG | exercise induced angina (1 = yes; 0 = no) |
| OLDPEAK | ST depression induced by exercise relative to rest |
| SLOPE | the slope of the peak exercise ST segment |
| – Value 1: up-sloping | |
| – Value 2: flat | |
| – Value 3: down-sloping | |
| CA | number of major vessels (0–3) colored by fluoroscope |
| THAL | 3 = normal; 6 = fixed defect; 7 = reversible defect |
Dataset description.
| dataset Name | Location | Size | Reference |
|
| Cleveland Clinic |
| |
| Foundation, Ohio | |||
|
| Hungarian Institute |
| |
| of Cardiology, Budapest | |||
| V.A. Medical Center, |
| ||
|
| Long Beach, California |
The CTG attributes used to predict potential fetal pathologies.
| Name | Description |
| FHRBL | Fetal Heart Rate (FHR) Baseline (beats per minute) |
| AC | # of accelerations |
| FM | # of fetal movements per second |
| UC | # of uterine contractions per second |
| DL | # of light decelerations per second |
| DS | # of severe decelerations per second |
| DP | # of prolonged decelerations per second |
| ASTV | percentage of time with abnormal short |
| term variability | |
| MSTV | mean value of short term variability |
| ALTV | percentage of time with abnormal long |
| term variability | |
| MLTV | mean value of long term variability |
| Width | width of FHR histogram |
| Min | minimum of FHR histogram |
| Max | maximum of the histogram |
| Nmax | # of histogram peaks |
| Nzeros | # of histogram zeros |
| Mode | histogram mode |
| Mean | histogram mean |
| Median | histogram median |
| Variance | histogram variance |
| Tendency | histogram tendency: −1 = left assymetric; |
| 0 = symmetric; 1 = right assymetric |
Experiments description.
| Experiment# | Prediction | Individual BCs | Population |
| Problem | learned on | (Context dataset) | |
| 1 | HD |
|
|
| 2 | HD |
|
|
|
| ( | ||
| 3 | HD |
|
|
|
| ( | ||
| 4 | CTG |
|
|
| ( |
ACO parameters setting.
| Experiment# |
|
|
|
|
|
|
| 1 | 150 | 100 | 1.0 | 0.02 | 2.0 | 1.0 |
| 2 | 120 | 70 | 1.0 | 0.04 | 3.0 | 2.0 |
| 3 | 150 | 100 | 1.0 | 0.02 | 2.0 | 1.0 |
| 4 | 100 | 50 | 2.0 | 0.03 | 2.0 | 2.0 |
Experimental results for HD prediction problem. Accuracy percentage values of ACO and Benchmark approaches in the context of Cleveland population, ( is the classifier compared to ).
|
| |||||
|
|
|
|
|
| |
|
| 73.33 | 54.45 | 61.08 | 51.61 | 66.19 |
|
| 11.95 | 13.78 | 12.78 | 11.50 | 13.61 |
|
| – | 0.003 | 0.040 | 0.001 | 0.23 |
|
| (Two-tail) | ||||
Experimental results for HD prediction problem. Accuracy percentage values of ACO and Benchmark approaches in the context of Hungarian population, ( is the classifier compared to ).
|
| |||||
|
|
|
|
|
| |
|
| 73.27 | 57.53 | 59.86 | 69.14 | 69.40 |
|
| 4.60 | 3.74 | 4.65 | 4.80 | 100 |
|
| – | 0.007 | 0.039 | 0.514 | 0.476 |
|
| (Two-tail) | ||||
Experimental results for CTG prediction problem. Accuracy percentage values of ACO and benchmark approaches in the context of CTG, ( is the classifier compared to ).
|
| |||||
|
|
|
|
|
| |
|
| 74.60 | 64.05 | 59.16 | 55.00 | 60.32 |
|
| 11.61 | 15.16 | 25.99 | 21.35 | 16.13 |
|
| – | 0.049 | 0.056 | 0.011 | 0.018 |
|
| (Two-tail) | ||||
Figure 2Evaluation in HD case: Prediction accuracies in the context of Cleveland population ACO-based approach Vs. Best model, data-combination Model, Boosting and Bagging.
Figure 3Evaluation in HD case: Prediction accuracies in the context of Hungarian population ACO-based approach Vs. Best model, data-combination Model, Boosting and Bagging.
Figure 4Evaluation in HD case: Prediction accuracies in the context of Long-Beach population ACO-based approach Vs. Best model, data-combination Model, Boosting and Bagging.
Figure 5Evaluation in CTG case: Prediction accuracies in the CTG context, ACO-based approach Vs. Best model, data-combination Model, Boosting and Bagging.
Experimental results for HD prediction problem. Accuracy percentage values of ACO and benchmark approaches in the context of Long-Beach population, ( is the classifier compared to ).
|
| |||||
|
|
|
|
|
| |
|
| 69.63 | 44.70 | 56.12 | 67.13 | 55.36 |
|
| 15.77 | 9.01 | 12.13 | 16.82 | 13.71 |
|
| – | 0.0023 | 0.04 | 0.04 | 0.39 |
|
| (Two-tail) | ||||