| Literature DB >> 31536581 |
Ji Min Sung1, In-Jeong Cho2, David Sung3, Sunhee Kim4, Hyeon Chang Kim5,6, Myeong-Hun Chae7, Maryam Kavousi8, Oscar L Rueda-Ochoa8,9, M Arfan Ikram8,10, Oscar H Franco8, Hyuk-Jae Chang5,11.
Abstract
OBJECTIVES: Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely known as an outstanding algorithm for analyzing time-series datasets. The objective of this study was to show the improved accuracy of deep learning by comparing the performance of a Cox hazard regression and RNN-LSTM based on survival analysis. METHODS ANDEntities:
Year: 2019 PMID: 31536581 PMCID: PMC6752799 DOI: 10.1371/journal.pone.0222809
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The process for selecting study subjects.
Baseline characteristics of the training set.
| Variable | Training set | |
|---|---|---|
| Male | Female | |
| Age, years | 51.2 ± 8.9 | 52.8 ± 9.2 |
| Current smoking, n (%) | 90,677 (55.28) | 3,582 (2.87) |
| Exercise, n (%) | 82,851 (50.51) | 40,903 (32.73) |
| Alcohol intake, n (%) | 107,962 (65.82) | 21,741 (17.40) |
| Body mass index, kg/m2 | 24.0 ± 2.8 | 23.9 ± 3.0 |
| Systolic blood pressure, mmHg | 128.2 ± 17.1 | 124.2 ± 18.3 |
| Diastolic blood pressure, mmHg | 81.2 ± 11.3 | 77.3 ± 11.6 |
| Fasting plasma glucose, mg/dL | 99.1 ± 34.4 | 94.8 ± 31.0 |
| Total cholesterol, mg/dL | 199.0 ± 37.8 | 201.7 ± 39.1 |
| Hemoglobin, g/dL | 14.8 ± 1.1 | 12.9 ± 1.2 |
| Aspartate transaminase, U/L | 29.0 ± 19.5 | 24.1 ± 14.1 |
| Alanine transaminase, U/L | 30.1 ± 23.2 | 21.1 ± 17.1 |
| Gamma-glutamyl transpeptidase, U/L | 50.1 ± 63.2 | 20.9 ± 22.2 |
| Urine protein, n (%) | 2,781 (1.70) | 2,192 (1.75) |
| History, n (%) | ||
| Diabetes mellitus | 5,989 (3.65) | 4,008 (3.21) |
| Hypertension | 8,919 (5.44) | 9,715 (7.77) |
| Etc (include cancer) | 16,841 (10.27) | 12,262 (9.81) |
| Family history, n (%) | ||
| Stroke | 9,565 (5.83) | 6,262 (5.01) |
| Diabetes mellitus | 10,185 (6.21) | 7.985 (6.39) |
| Heart disease | 4,614 (2.81) | 3,367 (2.69) |
| Hypertension | 12,345 (7.53) | 10,925 (8.74) |
| Etc (include cancer) | 23,328 (14.22) | 18,966 (15.18) |
| Number of periodic health examinations | 3.1 ± 1.1 | 2.6 ± 0.9 |
Rank of risk factors in deep learning model.
| Feature name | Sum of ranks | Feature name | Mean of values |
|---|---|---|---|
| Age | 233,322 | Age | 0.405 |
| Systolic blood pressure | 359,881 | Systolic blood pressure | 0.262 |
| Sex | 390,006 | Diastolic blood pressure | 0.153 |
| Diastolic blood pressure | 548,049 | Sex | 0.116 |
| Fasting plasma glucose | 584,936 | Fasting plasma glucose | 0.111 |
| Gamma-glutamyl transpeptidase | 664,941 | Current smoking | 0.111 |
| Aspartate transaminase | 668,470 | Exercise | 0.105 |
| Hemoglobin | 683,408 | Aspartate transaminase | 0.074 |
| Total cholesterol | 757,839 | Gamma-glutamyl transpeptidase | 0.066 |
| Exercise | 776,784 | Hemoglobin | 0.061 |
| Alcohol intake | 814,943 | Alcohol intake | 0.052 |
| Body mass index | 837,221 | Total cholesterol | 0.045 |
| Urine protein | 867,499 | Body mass index | 0.032 |
| Alanine transaminase | 973,370 | Urine protein | 0.028 |
| Family history of etc (include cancer) | 1,150,578 | History of Hypertension | 0.026 |
| Family history of Stroke | 1,187,298 | Family history of etc (include cancer) | 0.025 |
| Family history of Diabetes mellitus | 1,299,493 | History of etc (include cancer) | 0.022 |
| Family history of Heart disease | 1,392,376 | Alanine transaminase | 0.015 |
| Family history of Hypertension | 1,412,000 | History of Diabetes mellitus | 0.012 |
| Current smoking | 1,486,150 | Family history of Hypertension | 0.004 |
| History of Hypertension | 1,546,467 | Family history of Diabetes mellitus | 0.002 |
| History of Diabetes mellitus | 1,567,078 | Family history of Stroke | 0.001 |
| History of etc (include cancer) | 1,585,386 | Family history of Heart disease | 0.000 |
Sum of ranks indicate ranking each sample by absolute value of LRP, then ascending order by summing the ranks by variables in all samples. Mean of values indicate calculate the mean for the absolute value of LRP by variable in all samples and sort in descending order.