Qi Zhenya1, Zuoru Zhang2. 1. College of Management and Economics, Tianjin University, Nankai District, Tianjin, 300072, People's Republic of China. 2. School of Mathematical Science, Hebei Normal University, Yuhua District, Shijiazhuang, 050024, People's Republic of China. zhangzuoru@tju.edu.cn.
Abstract
BACKGROUND: Heart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What's more, the misclassification cost could be very high. METHODS: A cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method contains five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine and k-nearest neighbor. T-test was used to investigate if the performance of the ensemble was better than individual classifiers and the contribution of Relief algorithm. RESULTS: The best performance was achieved by the proposed method according to ten-fold cross validation. The statistical tests demonstrated that the performance of the proposed ensemble was significantly superior to individual classifiers, and the efficiency of classification was distinctively improved by Relief algorithm. CONCLUSIONS: The proposed ensemble gained significantly better results compared with individual classifiers and previous studies, which implies that it can be used as a promising alternative tool in medical decision making for heart disease diagnosis.
BACKGROUND:Heart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What's more, the misclassification cost could be very high. METHODS: A cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method contains five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine and k-nearest neighbor. T-test was used to investigate if the performance of the ensemble was better than individual classifiers and the contribution of Relief algorithm. RESULTS: The best performance was achieved by the proposed method according to ten-fold cross validation. The statistical tests demonstrated that the performance of the proposed ensemble was significantly superior to individual classifiers, and the efficiency of classification was distinctively improved by Relief algorithm. CONCLUSIONS: The proposed ensemble gained significantly better results compared with individual classifiers and previous studies, which implies that it can be used as a promising alternative tool in medical decision making for heart disease diagnosis.
Authors: Ryan J Urbanowicz; Melissa Meeker; William La Cava; Randal S Olson; Jason H Moore Journal: J Biomed Inform Date: 2018-07-18 Impact factor: 6.317
Authors: R Detrano; A Janosi; W Steinbrunn; M Pfisterer; J J Schmid; S Sandhu; K H Guppy; S Lee; V Froelicher Journal: Am J Cardiol Date: 1989-08-01 Impact factor: 2.778
Authors: Maria Lukács Krogager; Regitze Kuhr Skals; Emil Vincent R Appel; Theresia M Schnurr; Line Engelbrechtsen; Christian Theil Have; Oluf Pedersen; Thomas Engstrøm; Dan M Roden; Gunnar Gislason; Henrik Enghusen Poulsen; Lars Køber; Steen Stender; Torben Hansen; Niels Grarup; Charlotte Andersson; Christian Torp-Pedersen; Peter E Weeke Journal: PLoS One Date: 2018-12-19 Impact factor: 3.240
Authors: Jasjit S Suri; Mrinalini Bhagawati; Sudip Paul; Athanasios D Protogerou; Petros P Sfikakis; George D Kitas; Narendra N Khanna; Zoltan Ruzsa; Aditya M Sharma; Sanjay Saxena; Gavino Faa; John R Laird; Amer M Johri; Manudeep K Kalra; Kosmas I Paraskevas; Luca Saba Journal: Diagnostics (Basel) Date: 2022-03-16