| Literature DB >> 31856806 |
Fenglong Ma1, Yaqing Wang2, Houping Xiao3, Ye Yuan4, Radha Chitta5, Jing Zhou6, Jing Gao2.
Abstract
BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient.Entities:
Keywords: Diagnosis prediction; Healthcare informatics; Medical code embeddings
Mesh:
Year: 2019 PMID: 31856806 PMCID: PMC6921390 DOI: 10.1186/s12911-019-0961-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Base models for diagnosis prediction
| Base model | Visit modeling | Attention mechanism | ||
|---|---|---|---|---|
| FC | GRU | Location | Graph | |
| MLP | ||||
| RNN [ | ||||
| RNN | ||||
| Dipole [ | ||||
| RETAIN [ | ||||
| GRAM [ | ||||
Fig. 1An Example of CNN Architecture for Diagnosis Code Embedding. The word window sizes are 2 (red line) and 3 (blue line) respectively, i.e., q=2. For each word window, there are 2 filters in the example, i.e., m=2. The dimensionality of this code embedding is 4, i.e., d=mq=4
Statistics of MIMIC-III and heart failure datasets
| Dataset | MIMIC-III | Heart failure |
|---|---|---|
| # of patients | 7,499 | 4,925 |
| # of visits | 19,911 | 341,865 |
| Avg. visits per patient | 2.66 | 69.41 |
| # of unique ICD9 codes | 4,880 | 6,747 |
| Avg. # of diagnosis codes per visit | 13.06 | 3.92 |
| Max # of diagnosis codes per visit | 39 | 54 |
| # of words in code descriptions | 2,800 | 3,397 |
| # of category codes | 171 | 149 |
| Avg. # of category codes per visit | 10.16 | 3.33 |
| Max # of category codes per visit | 30 | 33 |
The visit-level precision @k of diagnosis prediction task
| Dataset | MLP | MLP + | RNN | RNN + | RNN | RNN | Dipole | Dipole + | RETAIN | RETAIN + | GRAM | GRAM + | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MIMIC-III | 5 | 0.6939 | 0.7124 | 0.6616 | 0.7160 | 0.6504 | 0.7083 | 0.6599 | 0.7074 | 0.6835 | 0.7167 ∗ | 0.6885 | 0.7132 |
| 10 | 0.6441 | 0.6603 | 0.6145 | 0.6565 | 0.6021 | 0.6527 | 0.6116 | 0.6539 | 0.6361 | 0.6623 ∗ | 0.6424 | 0.6596 | |
| 15 | 0.6812 | 0.6926 ∗ | 0.6546 | 0.6906 | 0.6412 | 0.6856 | 0.6524 | 0.6903 | 0.6777 | 0.6918 | 0.6828 | 0.6918 | |
| 20 | 0.7420 | 0.7544 ∗ | 0.7199 | 0.7511 | 0.7109 | 0.7455 | 0.7159 | 0.7483 | 0.7403 | 0.7501 | 0.7434 | 0.7513 | |
| 25 | 0.7939 | 0.8070 ∗ | 0.7755 | 0.8019 | 0.7697 | 0.8009 | 0.7723 | 0.8020 | 0.7912 | 0.8010 | 0.7941 | 0.8028 | |
| 30 | 0.8357 | 0.8460 | 0.8186 | 0.8456 | 0.8142 | 0.8445 | 0.8169 | 0.8453 | 0.8335 | 0.8445 | 0.8377 | 0.8468 ∗ | |
| Heart failure | 5 | 0.4451 | 0.4947 | 0.4890 | 0.5172 | 0.4976 | 0.5103 | 0.4964 | 0.5111 | 0.3751 | 0.5140 | 0.5341 | 0.5365 ∗ |
| 10 | 0.6122 | 0.6206 | 0.6585 | 0.6879 | 0.6675 | 0.6817 | 0.6689 | 0.6829 | 0.5378 | 0.6828 | 0.7123 | 0.7159 ∗ | |
| 15 | 0.6996 | 0.7060 | 0.7436 | 0.7683 | 0.7496 | 0.7631 | 0.7514 | 0.7648 | 0.6372 | 0.7613 | 0.7901 | 0.7939 ∗ | |
| 20 | 0.7606 | 0.7643 | 0.8006 | 0.8213 | 0.8050 | 0.8174 | 0.8070 | 0.8167 | 0.7088 | 0.8143 | 0.8402 | 0.8442 ∗ | |
| 25 | 0.8100 | 0.8140 | 0.8425 | 0.8593 | 0.8453 | 0.8560 | 0.8476 | 0.8557 | 0.7655 | 0.8533 | 0.8761 | 0.8789 ∗ | |
| 30 | 0.8477 | 0.8511 | 0.8743 | 0.8879 | 0.8770 | 0.8857 | 0.8785 | 0.8846 | 0.8102 | 0.8826 | 0.9025 | 0.9047 ∗ |
∗ denotes the highest precision among all the approaches on the same k
The code-level accuracy @k of diagnosis prediction task
| Dataset | MLP | MLP + | RNN | RNN + | RNN | RNN | Dipole | Dipole + | RETAIN | RETAIN + | GRAM | GRAM + | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MIMIC-III | 5 | 0.3104 | 0.3181 | 0.2952 | 0.3193 | 0.2910 | 0.3162 | 0.2941 | 0.3155 | 0.3056 | 0.3198 ∗ | 0.3072 | 0.3183 |
| 10 | 0.5040 | 0.5138 | 0.4796 | 0.5111 | 0.4693 | 0.5085 | 0.4767 | 0.5086 | 0.4980 | 0.5160 ∗ | 0.5003 | 0.5138 | |
| 15 | 0.6286 | 0.6352 | 0.6019 | 0.6335 | 0.5889 | 0.6290 | 0.5971 | 0.6325 | 0.6258 | 0.6360 ∗ | 0.6267 | 0.6348 | |
| 20 | 0.7114 | 0.7239 ∗ | 0.6894 | 0.7198 | 0.6822 | 0.7144 | 0.6845 | 0.7168 | 0.7129 | 0.7202 | 0.7130 | 0.7196 | |
| 25 | 0.7754 | 0.7852 ∗ | 0.7545 | 0.7804 | 0.7491 | 0.7785 | 0.7501 | 0.7795 | 0.7735 | 0.7806 | 0.7728 | 0.7794 | |
| 30 | 0.8214 | 0.8294 ∗ | 0.8040 | 0.8279 | 0.7987 | 0.8269 | 0.7990 | 0.8280 | 0.8198 | 0.8286 | 0.8220 | 0.8283 | |
| Heart failure | 5 | 0.4580 | 0.5132 | 0.5599 | 0.5960 | 0.5699 | 0.5882 | 0.5687 | 0.5868 | 0.4085 | 0.5808 | 0.6152 | 0.6227 ∗ |
| 10 | 0.6266 | 0.6412 | 0.6835 | 0.7169 | 0.6920 | 0.7109 | 0.6953 | 0.7105 | 0.5460 | 0.7042 | 0.7393 | 0.7455 ∗ | |
| 15 | 0.7124 | 0.7254 | 0.7603 | 0.7876 | 0.7645 | 0.7845 | 0.7702 | 0.7841 | 0.6512 | 0.7765 | 0.8088 | 0.8130 ∗ | |
| 20 | 0.7717 | 0.7827 | 0.8132 | 0.8355 | 0.8153 | 0.8334 | 0.8209 | 0.8307 | 0.7162 | 0.8261 | 0.8544 | 0.8580 ∗ | |
| 25 | 0.8206 | 0.8283 | 0.8516 | 0.8698 | 0.8532 | 0.8673 | 0.8580 | 0.8655 | 0.7684 | 0.8622 | 0.8872 | 0.8902 ∗ | |
| 30 | 0.8572 | 0.8635 | 0.8812 | 0.8958 | 0.8825 | 0.8943 | 0.8860 | 0.8923 | 0.8100 | 0.8899 | 0.9113 | 0.9134 ∗ |
∗ denotes the highest accuracy among all the approaches on the same k
Fig. 2Code-Level Accuracy @20 of Diagnosis Prediction on the MIMIC-III Dataset. a 0-25. b 25-50. c 50-75. d 75-100
Fig. 3t-SNE Scatterplots of Medical Codes Learned by Predictive Models. a MLP. b MLP +. c RNN. d RNN +. e RNN . f RNN +. g Dipole. h Dipole +. i RETAIN. j RETAIN +. k GRAM. l GRAM +