| Literature DB >> 29065583 |
Abstract
BACKGROUND: Of the machine learning techniques used in predicting coronary heart disease (CHD), neural network (NN) is popularly used to improve performance accuracy.Entities:
Mesh:
Year: 2017 PMID: 29065583 PMCID: PMC5606055 DOI: 10.1155/2017/2780501
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Study design.
Confusion matrix.
| Confusion matrix | Prediction | ||
|---|---|---|---|
| Positive | Negative | ||
| Actual | Positive | True positive (TP) | False positive (FP) |
| Negative | False negative (FN) | True negative (TN) | |
Figure 2A schematic diagram of calculating the feature sensitivity using NN.
Pseudocode 1
Figure 3An example of NN predictor using the feature correlation analysis.
Characteristics (continuous variable: mean; categorical variable: count).
| Feature | Low risk (3031 people) | High risk (1115 people) |
|
|---|---|---|---|
| Age | 48.600 | 63.110 | 0.326 |
| Sex | 0.893 | ||
| Male | 1301 | 476 | |
| Female | 1739 | 639 | |
| BMI | 23.594 | 25.004 | 0.001 |
| To_chole | 191.738 | 188.898 | 0.024 |
| HDL | 52.642 | 49.671 | 0.001 |
| SBP | 115.583 | 128.210 | 0.001 |
| DBP | 75.610 | 76.397 | 0.035 |
| Triglyceride | 131.570 | 160.074 | 0.001 |
| Hemoglobin | 14.057 | 13.989 | 0.206 |
| TD | 0.370 | ||
| No | 2981 | 1092 | |
| Yes | 50 | 23 | |
| CRF | 0.002 | ||
| No | 3027 | 1092 | |
| Yes | 4 | 23 | |
| H_B | 0.933 | ||
| No | 3010 | 1107 | |
| Yes | 21 | 8 | |
| H_C | 0.801 | ||
| No | 3029 | 1114 | |
| Yes | 2 | 1 | |
| Cirrhosis | 0.349 | ||
| No | 3025 | 1111 | |
| Yes | 6 | 4 | |
| Smoking | 0.001 | ||
| No | 2350 | 972 | |
| Yes | 681 | 143 | |
| Diabetes | 0.001 | ||
| No | 2167 | 458 | |
| Impaired fasting glucose | 671 | 323 | |
| Diabetes | 193 | 334 |
Figure 4Calculation process of the feature sensitivity.
Results of feature sensitivity analysis.
| Features | Sensitivity | NNoutput-NNoutput( | Rank |
|---|---|---|---|
| NN | 0.815 | ||
| NN | 0.734 | 0.081 | 2 |
| NN | 0.726 | 0.008 | 11 |
| NN | 0.769 | 0.038 | 5 |
| NN | 0.677 | 0.100 | 1 |
| NN | 0.703 | 0.013 | 8 |
| NN | 0.729 | 0.073 | 3 |
| NN | 0.693 | 0.049 | 4 |
| NN | 0.753 | 0.013 | 7 |
| NN | 0.796 | 0.006 | 12 |
| NN | 0.806 | 0.003 | 13 |
| NN | 0.802 | 0.010 | 10 |
| NN | 0.813 | 0.001 | 15 |
| NN | 0.813 | 0.001 | 16 |
| NN | 0.812 | 0.002 | 14 |
| NN | 0.802 | 0.012 | 9 |
| NN | 0.786 | 0.024 | 6 |
Results of NNs eliminating the lowest ranked features (%).
| Without features | Accuracy |
|---|---|
| Without 1 (H_C) feature | 77.743 |
| Without 2 (H_C and H_B) features | 78.518 |
| Without 3 (without 2 features and cirrhosis) features | 80.644 |
| Without 4 (without 3 features and TD) features | 80.920 |
| Without 5 (without 4 features and hemoglobin) features | 81.120 |
| Without 6 (without 5 features and sex) features | 81.141 |
| Without 7 (without 6 features and CRF) feature | 81.163 |
| Without 8 (without 7 features and smoking) features | 81.018 |
| Without 9 (without 8 features and HDL) features | 80.921 |
| Without 10 (without 9 features and triglyceride) features | 80.222 |
| Without 11 (without 10 features and diabetes) features | 79.522 |
| Without 12 (without 11 features and BMI) features | 79.209 |
Figure 5The process of feature correlation analysis.
Nine features (age, BMI, To_chole, HDL, SBP, DBP, triglyceride, smoking, and diabetes) are selected and used for feature correlation analysis.
| Input dataset | Learned NN | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| Age | 0.080 | 0.009 | 0.016 | 0.004 | 0.011 | 0.008 | 0.009 |
| 0.019 |
| BMI | 0.031 | 0.038 | 0.019 |
|
|
| 0.037 | 0.010 |
|
| To_chole | 0.021 |
| 0.094 |
|
|
|
| 0.013 |
|
| HDL | 0.011 | 0.010 | 0.011 | 0.011 | 0.010 | 0.008 | 0.009 | 0.009 | 0.002 |
| SBP | 0.012 | 0.007 | 0.001 |
| 0.041 |
| 0.008 | 0.013 | 0.016 |
| DBP |
|
|
|
|
| 0.021 | 0.001 |
|
|
| Triglyceride | 0.009 | 0.005 | 0.008 | 0.008 | 0.003 | 0.005 | 0.009 | 0.005 | 0.006 |
| Smoking | 0.005 | 0.004 | 0.004 | 0.003 | 0.003 | 0.008 | 0.002 | 0.012 | 0.007 |
| Diabetes | 0.002 | 0.006 | 0.003 | 0.007 | 0.008 | 0.019 | 0.005 | 0.009 | 0.019 |
|
| |||||||||
| Average | 0.074 | 0.012 | 0.022 | 0.011 | 0.017 | 0.022 | 0.010 | 0.014 | 0.024 |
|
| |||||||||
| Candidates of correlated feature | DBP | To_chole, DBP | DBP | BMI, To_chole, SBP, DBP | BMI, To_chole, DBP | BMI, To_chole, SBP | To_chole | Age, DBP | BMI, To_chole, DBP |
Figure 6NN-based CHD prediction using feature correlation analysis.
Results of performance measure with training set (%).
| Training set | Validation set | |||||
|---|---|---|---|---|---|---|
| PPV | NPV | Accuracy | PPV | NPV | Accuracy | |
| LR | 57.24 | 87.63 | 86.11 | 67.53 | 83.63 | 80.32 |
| NN | 63.04 | 88.67 | 87.04 | 67.55 | 85.08 | 81.09 |
| FRS | 2.54 | 85.48 | 6.67 | 21.49 | 54.41 | 28.87 |
| NN_FCA | 67.57 | 89.00 | 87.63 | 71.29 | 85.70 | 82.51 |
Results of ROC curve using validation set.
| 95% Confidence Interval | ||||
|---|---|---|---|---|
| ROC curve |
| Lower bound | Upper bound | |
| LR | 0.713 ± 0.010 | 0.001 | 0.693 | 0.732 |
| NN | 0.735 ± 0.010 | 0.001 | 0.716 | 0.754 |
| FSNN | 0.741 ± 0.010 | 0.001 | 0.722 | 0.760 |
| FRS | 0.393 ± 0.010 | 0.001 | 0.373 | 0.414 |
| NN_FCA | 0.749 ± 0.010 | 0.001 | 0.731 | 0.768 |
Figure 7Result of ROC curve (a) compared to LR, NN, and FRS; (b) compared to NN and NN_FCA.