| Literature DB >> 34330264 |
Weidong Bao1,2, Hongfei Lin2, Yijia Zhang3, Jian Wang2, Shaowu Zhang2.
Abstract
BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text.Entities:
Keywords: Capsule network; Clinical notes; Domain knowledge; Medical code prediction
Year: 2021 PMID: 34330264 PMCID: PMC8323200 DOI: 10.1186/s12911-021-01426-9
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Note events in MIMIC-III corpus
DIAGNOSES_ICD in MIMIC-III corpus
| ROW_ID | SUBJECT_ID | HADM_ID | SEQ_NUM | ICD9_CODE |
|---|---|---|---|---|
| 1297 | 109 | 172,335 | 1 | 40,301 |
| 1298 | 109 | 172,335 | 2 | 486 |
| 1299 | 109 | 172,335 | 3 | 58,281 |
| 1300 | 109 | 172,335 | 4 | 5855 |
| 1301 | 109 | 172,335 | 5 | 4254 |
| 1302 | 109 | 172,335 | 6 | 2762 |
| 1303 | 109 | 172,335 | 7 | 45,829 |
| 1304 | 109 | 172,335 | 8 | 2875 |
Example of 3-digit ICD9 description
| ICD code | Description |
|---|---|
| 001 | Cholera |
| 002 | Typhoid and paratyphoid fevers |
| 003 | Other salmonella inflections |
| 004 | Shigellosis |
| 005 | Other food poisoning (bacterial) |
| 006 | Amebiasis |
| 007 | Other protozoal intestinal diseases |
Fig. 2Schematic overview of our model
Fig. 3The architecture of CapsNet with Bi-LSTM
Fig. 4Common methods to text classification
Fig. 5Joint label embedding method
Fig. 6capsule network model architecture
The performance of our model and the baseline models on the MIMIC-III dataset
| Methods | Macro F1 (%) | Micro F1 (%) | Test loss (%) | Top-10 recall (%) |
|---|---|---|---|---|
| CNN | 21.4 | 62.6 | 4.0 | 75.3 |
| LSTM | 20.4 | 65.3 | 3.2 | 77.2 |
| CapsNet | 21.8 | 64.7 | 3.5 | 76.1 |
| BiCapsNetLE | 26.5 | 67.5 | 2.9 | 82.3 |
Fig. 7Macro AUC by label frequency groups for different models. X-axis represents the label frequency groups and y-axis represents the Macro AUC
Comparison with related works
| Methods | Macro F1 (%) | Micro F1 (%) | Test loss (%) | Top-10 recall (%) |
|---|---|---|---|---|
| LRKSI [ | 19.6 | 55.7 | 5.5 | 73.8 |
| RNNKSI [ | 24.4 | 66.2 | 3.0 | 79.8 |
| CNNKSI [ | 23.7 | 63.7 | 3.9 | 77.5 |
| CAML [ | 25.7 | 65.6 | 3.2 | 80.6 |
| LEAM [ | 24.9 | 64.9 | – | – |
| BiCapsNetLE | 26.5 | 67.5 | 2.9 | 82.3 |
Ablation studies for our model
| Methods | Macro F1 (%) | Micro F1 (%) | Test loss (%) | Top-10 recall (%) |
|---|---|---|---|---|
| CapsNet | 21.8 | 64.7 | 3.5 | 76.1 |
| BiCapsNet | 23.1 | 67.0 | 3.1 | 81.9 |
| BiCapsNetLE | 26.5 | 67.5 | 2.9 | 82.3 |
Partial clinical note
| Clinical note | |
|---|---|
WordCloud visualization
| Disorders of fluid electrolyte and acid–base balance (ICD 276) | Atrial fibrillation and flutter (ICD 427) |
|---|---|
|
|
|