| Literature DB >> 30961614 |
Yafeng Ren1, Hao Fei2, Xiaohui Liang3, Donghong Ji1, Ming Cheng4.
Abstract
BACKGROUND: Disease prediction based on Electronic Health Records (EHR) has become one hot research topic in biomedical community. Existing work mainly focuses on the prediction of one target disease, and little work is proposed for multiple associated diseases prediction. Meanwhile, a piece of EHR usually contains two main information: the textual description and physical indicators. However, existing work largely adopts statistical models with discrete features from numerical physical indicators in EHR, and fails to make full use of textual description information.Entities:
Keywords: Disease prediction; Electronic health records; Kidney disease; Long short-term memory; Neural network
Mesh:
Year: 2019 PMID: 30961614 PMCID: PMC6454594 DOI: 10.1186/s12911-019-0765-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1A piece of EHR from one hypertension patient
Fig. 2The proposed neural network framework
Parameter values of the model in the experiments
| Parameter |
|
|
|
|
|---|---|---|---|---|
| Value | 100 | 128 | 6 | 2 |
| Parameter |
|
|
|
|
| Value | 0.001 | 0.5 | 16 | 15 |
Experimental results of the discrete baseline models
| Models | Features | Accuracy(%) |
|---|---|---|
| LR | Numerical | 64.9 |
| LR | Textual | 71.5 |
| LR | Textual+Numerical | 72.2 |
| NB | Numerical | 67.8 |
| NB | Textual | 71.1 |
| NB | Textual+Numerical | 76.6 |
| SVM | Numerical | 42.6 |
| SVM | Textual | 66.1 |
| SVM | Textual+Numerical | 71.3 |
| GBDT | Numerical | 71.1 |
| GBDT | Textual | 77.8 |
| GBDT | Textual+Numerical | 81.2 |
Experimental results of the neural models
| Models | Features | Accuracy(%) |
|---|---|---|
| CNN | Numerical | 75.9 |
| CNN | Textual | 83.8 |
| CNN | Textual+Numerical | 86.2 |
| BiLSTM | Numerical | 74.8 |
| BiLSTM | Textual | 84.2 |
| BiLSTM | Textual+Numerical | 87.6 |
| CNN+AE | Textual+Numerical | 88.3 |
| BiLSTM+AE | Textual+Numerical | 89.7 |
Fig. 3Comparison of output probability