| Literature DB >> 29121053 |
Ehtesham Iqbal1, Robbie Mallah2, Daniel Rhodes1, Honghan Wu1, Alvin Romero3, Nynn Chang3, Olubanke Dzahini2, Chandra Pandey1,4, Matthew Broadbent4, Robert Stewart4,5, Richard J B Dobson1,4,6,7, Zina M Ibrahim1,4,6,7.
Abstract
Adverse drug events (ADEs) are unintended responses to medical treatment. They can greatly affect a patient's quality of life and present a substantial burden on healthcare. Although Electronic health records (EHRs) document a wealth of information relating to ADEs, they are frequently stored in the unstructured or semi-structured free-text narrative requiring Natural Language Processing (NLP) techniques to mine the relevant information. Here we present a rule-based ADE detection and classification pipeline built and tested on a large Psychiatric corpus comprising 264k patients using the de-identified EHRs of four UK-based psychiatric hospitals. The pipeline uses characteristics specific to Psychiatric EHRs to guide the annotation process, and distinguishes: a) the temporal value associated with the ADE mention (whether it is historical or present), b) the categorical value of the ADE (whether it is assertive, hypothetical, retrospective or a general discussion) and c) the implicit contextual value where the status of the ADE is deduced from surrounding indicators, rather than explicitly stated. We manually created the rulebase in collaboration with clinicians and pharmacists by studying ADE mentions in various types of clinical notes. We evaluated the open-source Adverse Drug Event annotation Pipeline (ADEPt) using 19 ADEs specific to antipsychotics and antidepressants medication. The ADEs chosen vary in severity, regularity and persistence. The average F-measure and accuracy achieved by our tool across all tested ADEs were 0.83 and 0.83 respectively. In addition to annotation power, the ADEPT pipeline presents an improvement to the state of the art context-discerning algorithm, ConText.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29121053 PMCID: PMC5679515 DOI: 10.1371/journal.pone.0187121
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Annotation agreement between two clinical annotators.
Annotations were retained as the labelled dataset for predictions if the experts annotators agree on the classifications of their mentions.
| ADE | Agreement (%) | Cohen’s Kappa Score |
|---|---|---|
| Agitation | 88% | 0.65 |
| Akathisia | 90% | 0.75 |
| Arrhythmia | 89% | 0.73 |
| Cardiomyopathy | 89% | 0.76 |
| Constipation | 91% | 0.78 |
| Convulsions | 98% | 0.96 |
| Diarrhoea | 93% | 0.84 |
| Dizziness | 93% | 0.78 |
| Dry Mouth | 89% | 0.71 |
| Galactorrhea | 92% | 0.83 |
| Hypersalivation | 94% | 0.74 |
| Insomnia | 92% | 0.75 |
| Myocarditis | 89% | 0.71 |
| Nausea | 85% | 0.69 |
| Pneumonia | 93% | 0.82 |
| Sedation | 96% | 0.91 |
| SJS | 93% | 0.82 |
| Tachycardia | 94% | 0.83 |
| Weight Gain | 96% | 0.90 |
Fig 1The GATE NLP based ADEPt pipeline comprising four rule-based processing components.
The pipeline takes as input EHR clinical notes documents and a dictionary containing all annotation-related terms. The pipeline sequentially applies the four components accumulating new annotations for the target annotation (ADE). The output of the pipeline is a single ADE annotation with six features (ADE type, Experiencer, Negation, Temporality, Categorical_Value, Refinedment_Rule, ADE_status and clause).
Enrichment of the ConText algorithm trigger terms.
| Triggers Terms | ConText Algorithm | Terms Added | Current Terms |
|---|---|---|---|
| Experiencer | 29 | 46 | 75 |
| Negation | 197 | 216 | 413 |
| Possibility terms & phrases | 28 | 16 | 44 |
| Termination | 89 | 17 | 106 |
| Temporality & Hypothetical | 49 | 23 | 72 |
Fig 2The retention rules pattern used in the ADEPt pipeline.
Fig 3The removal rules pattern used in the GATE NLP based ADEPt pipeline.
Incremental results of akathisia, galactorrhea, nausea and myocarditis.
| ADE | Corpus Ref | Total | Precision | Sensitivity | Specificity | Accuracy | F-measure |
|---|---|---|---|---|---|---|---|
| Akathisia | Paragraph | 215 | 0.73 | 0.87 | 0.33 | 0.69 | 0.80 |
| Statement | 215 | 0.76 | 0.89 | 0.39 | 0.73 | 0.82 | |
| Original ConText Algorithm | 215 | 0.77 | 0.90 | 0.43 | 0.74 | 0.83 | |
| ConText With extra terms | 215 | 0.94 | 0.88 | 0.88 | 0.87 | 0.91 | |
| With Extra Terms and Refinement Rules | 215 | 0.96 | 0.90 | 0.93 | 0.91 | 0.93 | |
| Nausea | Paragraph | 369 | 0.84 | 0.82 | 0.52 | 0.74 | 0.83 |
| Statement | 369 | 0.86 | 0.84 | 0.56 | 0.77 | 0.85 | |
| Original ConText Algorithm | 369 | 0.89 | 0.87 | 0.67 | 0.82 | 0.88 | |
| ConText With extra terms | 369 | 0.93 | 0.87 | 0.80 | 0.85 | 0.90 | |
| With Extra Terms and Refinement Rules | 369 | 0.95 | 0.86 | 0.84 | 0.85 | 0.90 | |
| Galactorrhea | Paragraph | 139 | 0.59 | 0.72 | 0.41 | 0.57 | 0.65 |
| Statement | 139 | 0.62 | 0.77 | 0.45 | 0.62 | 0.69 | |
| Original ConText | 139 | 0.66 | 0.81 | 0.50 | 0.66 | 0.73 | |
| ConText With extra terms | 139 | 0.73 | 0.89 | 0.61 | 0.76 | 0.80 | |
| With Extra Terms and Refinement Rules | 139 | 0.83 | 0.91 | 0.78 | 0.84 | 0.87 | |
| Myocarditis | Paragraph | 188 | 0.28 | 0.70 | 0.30 | 0.41 | 0.40 |
| Statement | 188 | 0.29 | 0.72 | 0.32 | 0.43 | 0.42 | |
| Original ConText Algorithm | 188 | 0.30 | 0.74 | 0.34 | 0.45 | 0.43 | |
| ConText With extra terms | 188 | 0.40 | 0.60 | 0.64 | 0.63 | 0.48 | |
| With Extra Terms and Refinement Rules | 188 | 0.51 | 0.75 | 0.71 | 0.72 | 0.61 |
Fig 4Receiver operating characteristic curves representing the performance of the ADEPt pipeline in identifying akathisia, nausea galactorrhea and myocarditis ADEs from free text.
The increments in each graph correspond to 1) our previous work [30], 2) using paragraph boundaries, 3) using clause-boundaries, 4) using unrefined (off-the-shelf) ConText algorithm, 5) adding domain-specific vocabulary to ConText and 6) final refined ConText algorithm.
Results showing the performance of the ADEPt pipeline in identifying a selection of rare to common ADEs related to antipsychotics and antidepressants drugs.
| ADE | Total | TP | TN | Precision | Sensitivity | Specificity | Accuracy | F-measure |
|---|---|---|---|---|---|---|---|---|
| Agitation | 221 | 142 | 32 | 0.89 | 0.83 | 0.65 | 0.79 | 0.86 |
| Akathisia | 215 | 132 | 64 | 0.96 | 0.90 | 0.93 | 0.91 | 0.93 |
| Arrhythmia | 232 | 129 | 61 | 0.88 | 0.85 | 0.77 | 0.82 | 0.86 |
| Cardiomyopathy | 204 | 55 | 109 | 0.79 | 0.68 | 0.88 | 0.80 | 0.73 |
| Constipation | 475 | 315 | 99 | 0.91 | 0.91 | 0.76 | 0.87 | 0.91 |
| Convulsions | 148 | 84 | 37 | 0.92 | 0.81 | 0.84 | 0.82 | 0.86 |
| Diarrhoea | 221 | 140 | 55 | 0.93 | 0.90 | 0.83 | 0.88 | 0.92 |
| Dizziness | 234 | 130 | 96 | 0.94 | 0.83 | 0.90 | 0.85 | 0.88 |
| Dry Mouth | 211 | 124 | 56 | 0.91 | 0.87 | 0.82 | 0.85 | 0.89 |
| Galactorrhea | 139 | 68 | 50 | 0.83 | 0.91 | 0.78 | 0.85 | 0.87 |
| Hypersalivation | 193 | 161 | 18 | 0.97 | 0.95 | 0.78 | 0.93 | 0.96 |
| Insomnia | 189 | 119 | 38 | 0.84 | 0.93 | 0.62 | 0.83 | 0.88 |
| Myocarditis | 188 | 40 | 96 | 0.51 | 0.75 | 0.71 | 0.72 | 0.61 |
| Nausea | 369 | 241 | 75 | 0.95 | 0.86 | 0.88 | 0.90 | 0.92 |
| Pneumonia | 173 | 81 | 75 | 0.89 | 0.94 | 0.93 | 0.90 | 0.92 |
| Sedation | 189 | 108 | 54 | 0.89 | 0.89 | 0.81 | 0.86 | 0.89 |
| Stephen Johnson’s Syndrome | 333 | 68 | 205 | 0.60 | 0.88 | 0.82 | 0.82 | 0.72 |
| Tachycardia | 230 | 192 | 13 | 0.96 | 0.91 | 0.65 | 0.89 | 0.94 |
| Weight Gain | 209 | 108 | 51 | 0.92 | 0.87 | 0.82 | 0.86 | 0.90 |