| Literature DB >> 29695376 |
Tsendsuren Munkhdalai1, Feifan Liu1, Hong Yu2,3.
Abstract
BACKGROUND: Medication and adverse drug event (ADE) information extracted from electronic health record (EHR) notes can be a rich resource for drug safety surveillance. Existing observational studies have mainly relied on structured EHR data to obtain ADE information; however, ADEs are often buried in the EHR narratives and not recorded in structured data.Entities:
Keywords: drug-related side effects and adverse reactions; electronic health records; medical informatics applications; natural language processing; neural networks
Year: 2018 PMID: 29695376 PMCID: PMC5943628 DOI: 10.2196/publichealth.9361
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Clinical relation types in our corpus. Entity mentions forming relations are in italics.
| Relation | Description | Example | #relationsa |
| An attribute of a medication: the amount of the medication to be taken | She receives | 2643/336/409 | |
| An attribute of a medication: how the medication is administered | She receives | 1908/269/332 | |
| An attribute of a medication: frequency of the administration | She receives | 2691/351/451 | |
| An attribute of a medication | The patient was treated with | 493/95/110 | |
| A causal relation between a medication and indication: why the drug is taken | He later received | 2301/264/379 | |
| A causal relation between a medication and an injury: the consequence of a medication | Patient’s death was due to | 717/134/134 | |
| The attribute of an adverse event | He has | 1505/259/241 |
athe number of relations for each type (train/develop/test).
Figure 1The distribution of relation token distance.
Figure 2Experimental workflow for adverse drug event (ADE) detection. EHRs: electronic health records; SVM: support vector machines; AE: adverse events.
Results (%) of rule induction classifier on test set.
| Relation | Precision | Recall | F1-score |
| None | 100 | 94 | 97 |
| Dosage | 20 | 63 | 30 |
| Route | 7 | 31 | 11 |
| Frequency | 2 | 7 | 3 |
| Duration | 1 | 4 | 1 |
| Indication | 1 | 14 | 2 |
| Adverse | 1 | 24 | 1 |
| Severity | 0 | 0 | 0 |
| Overall | 4.57 | 20.42 | 7.47 |
Overall F1-scores (%) of support vector machines system. Keep rate for negative down-sampling is varied.
| Keep rate | Train | Development | Test |
| 0.1 | 99.99 | 99.97 | 82.46 |
| 0.3 | 99.96 | 99.93 | 87.84 |
| 0.5 | 99.94 | 99.86 | 89.0 |
| 0.8 | 99.89 | 99.8 |
aBest score on test data are highlighted in italics.
Results (%) of the best performing support vector machines model on test set. Keep rate=0.8.
| Relation | Precision | Recall | F1-score |
| None | 100 | 100 | 100 |
| Dosage | 85 | 91 | 88 |
| Route | 96 | 97 | 96 |
| Frequency | 93 | 97 | 95 |
| Duration | 89 | 93 | 91 |
| Indication | 72 | 77 | 75 |
| Adverse | 85 | 84 | 85 |
| Severity | 95 | 94 | 95 |
| Overall | 87.85 | 90.42 | 89.1 |
Overall F1-score of the long short-term memory (LSTM)–based model. Keep rate=0.1.
| Window size | Train | Development | Test |
| 5 | 24.05 | 14.09 | 14.58 |
| 10 | 23.92 | 14.85 | 14.56 |
| 30 | 37.40 | 21.77 | |
| 50 | 32.1 | 17.15 | 18.43 |
| 70 | 27.62 | 15.04 | 15.93 |
aBest score on test data are highlighted in italics.
Overall F1-score of the long short-term memory (LSTM)–based model. Keep rate for negative down-sampling is varied. Window size=10.
| Keep rate | Train | Development | Test |
| 0.1 | 23.92 | 14.85 | 14.56 |
| 0.3 | 38.91 | 35.18 | 37.21 |
| 0.5 | 51.25 | 39.02 | |
| 0.8 | 24.82 | 23.65 | 21.11 |
aBest score on test data are highlighted in italics.
Overall F1-score (%) of long short-term memory (LSTM) and attention-based models. Keep rate=0.5, window size=30.
| Model | Train | Development | Test |
| LSTMa | 54.47 | 41.43 | 42.32 |
| Bidirectional LSTM | 86.56 | 66.47 | 62.79 |
| LSTM + Attention | 68.69 | 52.71 | 54.21 |
| Bidirectional LSTM + Attention | 83.71 | 68.95 |
aLSTM: Long short-term memory.
bBest score on test data are highlighted in italics.
Results (%) of the best-performing neural model (Bidirectional long short-term memory [LSTM] + Attention) on test set. Keep rate=0.5, window size=30.
| Relation | Precision | Recall | F1-score |
| None | 100 | 100 | 100 |
| Dosage | 78 | 80 | 79 |
| Route | 67 | 78 | 72 |
| Frequency | 61 | 76 | 68 |
| Duration | 54 | 69 | 61 |
| Indication | 32 | 32 | 32 |
| Adverse | 78 | 46 | 58 |
| Severity | 77 | 93 | 84 |
| Overall | 63.85 | 67.71 | 65.72 |
Figure 3Test F1-score over relation distance. BiLSTM: bidirectional long short term memory; SVM: support vector machine.
Comparison of different models in terms of overall F1-score.
| Model | Train | Development | Test |
| Rule induction classifier | 8.33 | 8.74 | 7.47 |
| Bidirectional LSTMb | 83.71 | 66.47 | 62.79 |
| Bidirectional LSTM + Attention | 86.56 | 68.95 | 65.72 |
| Bidirectional LSTM + Attention + Features | 88.14 | 77.77 | 77.35 |
| SVMa + Features | 87.85 | 90.42 |
aLSTM: Long short-term memory
bSVM: support vector machines.
cBest score on test data are highlighted in italics.
Figure 4Test F1-score over varying training sample size. BiLSTM: bidirectional long short term memory; SVM: support vector machine.