| Literature DB >> 27525160 |
Wiharto Wiharto1, Hari Kusnanto2, Herianto Herianto3.
Abstract
OBJECTIVES: The interpretation of clinical data for the diagnosis of coronary heart disease can be done using algorithms in data mining. Most clinical data interpretation systems for diagnosis developed using data mining algorithms with a black-box approach cannot recognize examination attribute relationships with the incidence of coronary heart disease.Entities:
Keywords: Algorithms; Data Mining; Decision Tree; Diagnosis; Heart Diseases
Year: 2016 PMID: 27525160 PMCID: PMC4981579 DOI: 10.4258/hir.2016.22.3.186
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Atribute leveland dataset for coronary heart disease
Figure 1The model-based clinical data interpretation system C4.5 algorithm for the diagnosis of coronary heart disease.
Confusion matric multiclass
Comparison of the performance of the system
PPV: positive prediction value, NPV: negative prediction value, AUC: the area under the curve.
The bold numbers indicate the highest value of the three models (C4.5, mSMOTE+C4.5, mSMOTE+IG+C4.5).
The results of t-test (p-value) of significance difference with C4.5 system
PPV: positive prediction value, NPV: negative prediction value, AUC: area under the curve.
Figure 2Knowledge-based in decision tree C4.5+mSMOTE+IG.
Comparison of performance with feauture selection
TPR: true positive rate.
The bold numbers indicate the highest value of each level/type in previous study and proposed study.
Comparison of performance without feature selection
TPR: true positive rate.
The bold numbers indicate the highest value of each level/type in both previous and proposed studies.
The results of t-test (p-value) of significance difference with the proposed system
TPR: true positive rate.
The bold numbers indicate the probability less than 0.05 (p < 0.05).
Comparison of accuracy performance without feature selection
The bold numbers indicate the highest value of each level/type in previous study and proposed study.
Comparison of accuracy performance with feature selection
The bold numbers indicate the highest value of each level/type in previous study and proposed study.