| Literature DB >> 35351098 |
Sahar Shariatnia1, Majid Ziaratban2, Abdolhalim Rajabi3, Aref Salehi4, Kobra Abdi Zarrini5, Mohammadali Vakili6.
Abstract
PURPOSE: Coronary artery disease (CAD) is one of the most significant cardiovascular diseases that requires accurate angiography to diagnose. Angiography is an invasive approach involving risks like death, heart attack, and stroke. An appropriate alternative for diagnosis of the disease is to use statistical or data mining methods. The purpose of the study was to predict CAD by using discriminant analysis and compared with the logistic regression.Entities:
Keywords: Coronary artery disease; Discriminant analysis; Logistic regression
Mesh:
Year: 2022 PMID: 35351098 PMCID: PMC8966192 DOI: 10.1186/s12911-022-01823-8
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Variables of study dataset
| Type variables | Variable name |
|---|---|
| Demographic | Age |
| Gender | |
| Blood group | |
| Antigen | |
| Weight | |
| Height | |
| BMI (body mass index kg/m2) | |
| Hypertension history | |
| Family history of heart disease in first-degree relatives | |
| History of diabetes | |
| Smoking | |
| Illicit drug abuse | |
| Alcohol consumption | |
| Clinical | Systolic blood pressure |
| Diastolic blood pressure | |
| Laboratory | fasting blood sugar (FBS) |
| Creatinine (Cr) | |
| Blood urea nitrogen (BUN) | |
| Low-density lipoprotein (LDL) | |
| Triglyceride (TG) | |
| Total cholesterol (TC) | |
| High-density lipoprotein (HDL) |
Univariate logistic regression the association of independent variables with coronary artery disease
| Parameters | Coefficient (β) | S.E(β) | OR CI (0.95%) | P-value |
|---|---|---|---|---|
| Gender | ||||
| Female | Ref | Ref | ||
| Male | 1.25 | 0.16 | 3.50 (2.55–4.82) | < 0.001 |
| Smoking | ||||
| No | Ref | Ref | ||
| Yes | 0.96 | 0.24 | 2.63 (1.61–4.29) | < 0.001 |
| Illicit drug abuse | ||||
| No | Ref | Ref | ||
| Yes | 1.26 | 0.29 | 3.50 (1.96–6.34) | < 0.001 |
| Blood group | ||||
| A | − 0.15 | 0.18 | 0.85 (0.59–1.23) | 0.40 |
| B | − 0.07 | 0.19 | 0.93 (0.62–1.37) | 0.71 |
| AB | 0.03 | 0.32 | 1.03 (0.54–1.96) | 0.90 |
| O | Ref | Ref | ||
| Antigen | ||||
| Negative | Ref | Ref | ||
| Positive | 0.31 | 0.29 | 1.36 (0.76–2.41) | 0.28 |
| History of Blood pressure | ||||
| No | Ref | Ref | ||
| Yes | 0.13 | 0.15 | 1.14 (0.84–1.56) | 0.37 |
| Family history of heart disease | ||||
| No | Ref | Ref | ||
| Yes | 0.37 | 0.20 | 1.46 (0.97–2.18) | 0.06 |
| History of Blood pressure | ||||
| No | Ref | Ref | ||
| Yes | 0.18 | 0.18 | 1.20 (0.84–1.72) | 0.30 |
| Alcohol use | ||||
| No | Ref | Ref | ||
| Yes | 0.81 | 0.55 | 2.25 (0.75–6.74) | 0.14 |
| Age | 0.06 | 0.008 | 1.06 (1.04–1.08) | < 0.001 |
| BMI | − 0.09 | 0.01 | 0.91 (0.88–0.94) | < 0.001 |
| FBS | 0.005 | 0.001 | 1.005 (1.002–1.009) | 0.001 |
| TC | 0.001 | 0.001 | 1.001 (0.99–1.00) | 0.33 |
| TG | − 0.000 | 0.000 | 0.99 (0.99–1.00) | 0.14 |
| LDL | 0.004 | 0.002 | 1.00 (0.99–1.00) | 0.05 |
| HDL | − 0.03 | 0.009 | 0.96 (0.94–0.98) | < 0.001 |
| BUN | 0.07 | 0.01 | 1.07 (1.04–1.11) | < 0.001 |
| Cr | 1.88 | 0.39 | 6.56 (3.02–14.26) | < 0.001 |
| Systolic blood pressure | 0.02 | 0.005 | 1.02 (1.01–1.03) | < 0.001 |
| Diastolic blood pressure | 0.02 | 0.007 | 1.02 (1.008–1.03) | 0.002 |
Multivariate logistic regression the association of independent variables with coronary artery disease
| Parameters | Coefficient (β) | S.E(β) | OR CI (0.95%) | |
|---|---|---|---|---|
| Intercept | − 6.21 | 1.23 | 0.002 | < 0.001 |
| Gender | ||||
| Female | Ref | – | ||
| Male | 1.39 | 0.21 | 4.01 (2.67–6.01) | < 0.001 |
| Family history of heart disease | ||||
| No | Ref | – | ||
| Yes | 0.66 | 0.24 | 1.93 (1.20–3.09) | 0.006 |
| Illicit drug abuse | ||||
| No | Ref | – | ||
| Yes | 0.78 | 0.33 | 2.17 (1.14–4.13) | 0.019 |
| Age | 0.08 | 0.01 | 1.08 (1.06–1.10) | < 0.001 |
| BMI | − 0.05 | 0.02 | 0.95 (0.91–0.99) | 0.016 |
| FBS | 0.007 | 0.002 | 1.007 (1.003–1.011) | < 0.001 |
| HDL | − 0.035 | 0.012 | 0.96 (0.94–0.99) | 0.004 |
| LDL | 0.013 | 0.003 | 1.013 (1.007–1.019) | < 0.001 |
| Systolic blood pressure | 0.018 | 0.006 | 1.018 (1.007–1.03) | 0.001 |
The accuracy, sensitivity, specificity and AUC models
| Methods | Model | Accuracy % | Sensitivity % | Specificity % | AUC % |
|---|---|---|---|---|---|
| Classical | LDA | 78.6 | 81.3 | 71.3 | 81.9 |
| QDA | 64.6 | 88.2 | 48.2 | 81 | |
| LR | 77 | 87.6 | 55.6 | 82 | |
| Non-classical | KNN | 74 | 77.5 | 63.7 | 82 |
LDA linear discriminant analysis, QDA quadratic discriminant analysis, KNN K-nearest neighbor, LR logistic regression
Fig. 1ROC curve for logistic regression models, linear discriminant analysis, quadratic discriminant analysis, and K nearest neighbor
Differences in surface area under the ROC curve in linear discriminant analysis, quadratic discriminant analysis, K nearest neighbor and logistic regression
| Two by two comparison | Difference between areas | Standard error | 95% CI | z statistic | |
|---|---|---|---|---|---|
| KNN | |||||
| LDA | 0.0009 | 0.011 | (− 0.021 to 0.02) | 0.782 | 0.93 |
| LR | 0.00003 | 0.011 | (− 0.023 to 0.023) | 0.002 | 0.99 |
| QDA | 0.01 | 0.014 | (− 0.017 to 0.039) | 0.73 | 0.46 |
| LDA | |||||
| LR | 0.0008 | 0.001 | (− 0.001 to 0.0036) | 0.61 | 0.54 |
| QDA | 0.009 | 0.014 | (− 0.01 to 0.03) | 0.69 | 0.49 |
| LR | |||||
| QDA | 0.01 | 0.01 | (− 0.016 to 0.037) | 0.75 | 0.44 |
LDA linear discriminant analysis, QDA quadratic discriminant analysis, KNN K-nearest neighbor, LR logistic regression