| Literature DB >> 29854369 |
Peng Lu1,2, Saidi Guo2,3, Hongpo Zhang2,4, Qihang Li1,2, Yuchen Wang1, Yingying Wang2,3, Lianxin Qi2,3.
Abstract
Quantitative analysis and prediction can help to reduce the risk of cardiovascular disease. Quantitative prediction based on traditional model has low accuracy. The variance of model prediction based on shallow neural network is larger. In this paper, cardiovascular disease prediction model based on improved deep belief network (DBN) is proposed. Using the reconstruction error, the network depth is determined independently, and unsupervised training and supervised optimization are combined. It ensures the accuracy of model prediction while guaranteeing stability. Thirty experiments were performed independently on the Statlog (Heart) and Heart Disease Database data sets in the UCI database. Experimental results showed that the mean of prediction accuracy was 91.26% and 89.78%, respectively. The variance of prediction accuracy was 5.78 and 4.46, respectively.Entities:
Mesh:
Year: 2018 PMID: 29854369 PMCID: PMC5966666 DOI: 10.1155/2018/8954878
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Restricted Boltzmann machine undirected configuration diagram.
Figure 2DBNs training flow chart.
Figure 3Calculation flowchart of DBNs depth.
Data set attributes.
| Features | Description | Data types | Normalization | Value |
|---|---|---|---|---|
| Age | Age | Continuous data | Min-max scaling | 16–80; 0∼1 |
|
| ||||
| Sex | Gender | Text-based data | Direct mapping | 0: female |
| 1: male | ||||
|
| ||||
| Cp | Chest pain type | Text-based data | Direct mapping | 0: typical angina |
| 1: typical type angina | ||||
| 2: nonangina pain | ||||
| 3: asymptomatic | ||||
|
| ||||
| Trestbps | Trest blood pressure | Range data | Improved min-max scaling | MmHg on admission to the hospital |
|
| ||||
| Chol | Serum cholesterol | Range data | Improved min-max scaling | (mg/dl) |
|
| ||||
| Fbs | Fasting blood sugar | Hierarchical data | Hierarchical mapping | 0: <120 mg/dl |
| 1: >120 mg/dl | ||||
|
| ||||
| Restecg | Resting electrographic results | Text-based data | Direct mapping | 0: normal |
| 1: having ST-T wave abnormality | ||||
| 2: showing probable or definite left ventricular hypertrophy | ||||
|
| ||||
| Thalach | Maximum heart rate achieved | Range data | Improved min-max scaling | — |
|
| ||||
| Exang | Exercise-induced angina | Text-based data | Direct mapping | 0 = no |
| 1 = yes | ||||
|
| ||||
| Oldpeak | ST depression induced by exercise relative to rest | Range data | Improved min-max scaling | |
|
| ||||
| Slope | Slope of the peak exercise ST segment | Text-based data | Direct mapping | 0: unsloping |
| 1: flat | ||||
| 2: downsloping | ||||
|
| ||||
| Ca | Number of major vessels colored by fluoroscopy | Text-based data | Direct mapping | 0–3 |
|
| ||||
| Thal | Text-based data | Direct mapping | 0: normal | |
| 1: fixed defect | ||||
| 2: reversible defect | ||||
|
| ||||
| Num | Predicted attribute | — | — | 0, 1, 2, 3, 4 |
Algorithm implementation steps.
|
| Use the input data to construct a hidden unit state |
|
| Reconstruct the input using the hidden layer structure |
|
| Construct the hidden layer with the reconstructed input again |
Figure 4The Rerror curve of each layer RBM on Statlog (Heart).
Figure 5The Rerror curve of each layer RBM on Heart Disease Database.
Figure 6Reconstruction error of RBM1 with different number of hidden units.
DBNs test results at different network depths.
| Data set | Network depth |
| Accuracy (%) | Runtime (s) |
|---|---|---|---|---|
| Statlog (Heart) | 2 | 1.8700 | 81.43 | 17.30 |
| 3 | 0.5189 | 85.71 | 19.90 | |
| 4 | 0.4912 | 91.26 | 22.70 | |
| 5 | 0.4306 | 87.14 | 25.20 | |
| 6 | 0.3877 | 82.58 | 29.50 | |
|
| ||||
| Heart Disease Database | 2 | 1.9450 | 80.40 | 21.40 |
| 3 | 1.1931 | 83.20 | 22.90 | |
| 4 | 0.6594 | 85.60 | 25.20 | |
| 5 | 0.6372 | 89.78 | 27.80 | |
| 6 | 0.5765 | 87.20 | 31.50 | |
Comparison of classification results of different techniques on the Statlog (Heart) data set.
| Classification algorithms | Accuracy (%) |
|---|---|
| SVM linear [ | 86.62 |
| SFM [ | 83.31 |
| OCSFM [ | 86.73 |
| Naïve Bayes [ | 78.93 |
| WAC [ | 84.00 |
| ANN [ | 86.04 |
| PSO-SVM-SMO-RBF [ | 88.24 |
| Naïve Bayes [ | 82.31 |
| Decision tree [ | 84.35 |
| LVQNN [ | 74.12 |
| BPNN [ | 85.00 |
| Improved DBN | 91.26 |