| Literature DB >> 35211190 |
K S Archana1, B Sivakumar2, Ramya Kuppusamy3, Yuvaraja Teekaraman4, Arun Radhakrishnan5.
Abstract
Accurate prediction of cardiovascular disease is necessary and considered to be a difficult attempt to treat a patient effectively before a heart attack occurs. According to recent studies, heart disease is said to be one of the leading origins of death worldwide. Early identification of CHD can assist to reduce death rates. When it comes to prediction using traditional methodologies, the difficulty arises in the intricacy of the data and relationships. This research is aimed at applying recent machine learning technology to identify heart disease from past medical data to uncover correlations in data that can greatly improve the accuracy of prediction rates using various machine learning models. Models have been implemented using naive Bayes, random forest algorithms, and the combinations of two models such as naive Bayes and random forest methods. These methods offer numerous attributes associated with heart disease. This proposed system foresees the chance of rising heart disease. The proposed system uses 14 parameters such as age, sex, quick blood sugar, chest discomfort, and other medical parameters which are used in the proposed system. Our proposed systems find the probability of developing heart disease in percentages as well as the accuracy level (accuracy of 93%). Finally, this proposed method will support the doctors to analyze the heart patients competently.Entities:
Mesh:
Year: 2022 PMID: 35211190 PMCID: PMC8863449 DOI: 10.1155/2022/9797844
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Correlation matrix of features in complete dataset.
Figure 2Architecture for identifying heart disease.
Feature information and details of the heart disease dataset.
| S. no | Feature name | Feature code | Details |
|---|---|---|---|
| 1 | Patient's age | Age | Patients are calculated in years |
| 2 | Patient's sex | Sex | 0 = female; 1 = male |
| 3 | Variations of chest pain | CP | 1—typical angina |
| 4 | Level of BP | RBP | mmHg admitted at the hospital |
| 5 | Cholesterol status | SCL | Shows cholesterol level in mg/dl |
| 6 | Blood sugar fasting > 120 mg/dl | FBS | Fasting blood sugar > 120 mg/dl (0—false; 1—true) |
| 7 | Resting electrocardiography | RES | 0—normal |
| 8 | Maximum heart rate | MHR | Over heart rate |
| 9 | Exercise-induced angina | EIA | 1—yes |
| 10 | ST depression | ST | Exercise to rest |
| 11 | ST segment | PES | 1—upsloping |
| 12 | Size of vessels | VCA | (0–3) colored by fluoroscopy |
| 13 | Thalassemia | THA | 3—normal |
| 14 | Target | Class | 0—no risk |
Figure 3Feature analysis to identify the heart problem.
Predicting label of confusion matrix.
| Positive | Negative |
|---|---|
| True positive (TP) | False negative (TN) |
| False positive (FP) | True negative (FN) |
Figure 4Confusion matrix and ROC graph of the naïve Bayes classifier.
Figure 5Confusion matrix and ROC graph of the random forest classifier.
Figure 6Confusion matrix and ROC graph of the fusion Naïve Bayes and random forest classifier.
Algorithm 1Hybrid algorithm to identify the heart problem.
Performance comparison for various models.
| Models | Accuracy (%) | Specificity (%) | Sensitivity (%) |
|---|---|---|---|
| Naïve Bayes classifier | 85 | 84 | 86 |
| Random forest | 89 | 85 | 85 |
| Proposed method | 92 | 85 | 84 |
Figure 7Comparison graph of all classifiers.