| Literature DB >> 23703825 |
Robert Eriksson1, Peter Bjødstrup Jensen, Sune Frankild, Lars Juhl Jensen, Søren Brunak.
Abstract
OBJECTIVE: Drugs have tremendous potential to cure and relieve disease, but the risk of unintended effects is always present. Healthcare providers increasingly record data in electronic patient records (EPRs), in which we aim to identify possible adverse events (AEs) and, specifically, possible adverse drug events (ADEs).Entities:
Keywords: Adverse Drug Event; Adverse Drug Reaction Reporting Systems; Data Mining; Dictionary; Electronic Health Records
Mesh:
Year: 2013 PMID: 23703825 PMCID: PMC3756275 DOI: 10.1136/amiajnl-2013-001708
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1Method flowchart. (A) Adverse event descriptions were extracted from the summaries of product characteristics (SPCs), representing the baseline dictionary. (B) The lexemes in the baseline dictionary were assigned into seven dictionary groups, split in two groups according to whether they are involved in post-coordination or not. The first number below each group indicates the number of concepts and the second number the unique identifiers in the group. (C) Tagging using the baseline dictionary. (D) Tagging using the group dictionary and subsequent post-coordination of tagged lexemes. (E) The four filtering groups split according to whether they disqualify sentence subparts or the whole sentence in the filtering step. The number below each group indicates the number of concepts. (F) Filtering, where any disqualified possible ADE is removed. (G) The final output of the pipeline.
Figure 2Synonymous coordinated terms. Lexemes from the groups localized event and laboratory event are combined with location and laboratory values, respectively, to produce coordinated terms. The method identifies the equality of two different ways of writing the same ADE and synonymous coordinated terms are merged to one common term. Order and possible prepositions used during post-coordination were excluded.
Figure 3Location collapse. Locations are collapsed and merged into a single identifier.
Figure 4Dictionary group synonyms and synonymous coordinated terms. Synonyms, inflections and spelling variants were merged into a common concept, where ordering and prepositions were omitted.
The 10 most recognized concepts in the corpus
| Dictionary | Possible ADE | Recognized/identified concepts | Unique ways identified |
|---|---|---|---|
| Group dictionary | Anxiety | 128839 | 30 |
| Sedation | 99191 | 51 | |
| Pain | 89960 | 39 | |
| Anger | 75623 | 20 | |
| Unrest | 69322 | 12 | |
| Auditory hallucination | 65888 | 46 | |
| Psychosis | 59435 | 13 | |
| Paranoia | 41040 | 21 | |
| Depression | 36302 | 59 | |
| Irritation | 33673 | 20 | |
| Baseline dictionary | Anxiety | 108049 | – |
| Psychosis | 47725 | – | |
| Unrest | 34848 | – | |
| Pain | 32291 | – | |
| Paranoia | 29358 | – | |
| Suicidal thoughts | 23856 | – | |
| Adverse effect | 19589 | – | |
| Headache | 19487 | – | |
| Restless | 18968 | – | |
| Schizophrenia | 17962 | – |
For the group dictionary, the table shows the number of unique ways each possible ADE was identified.
ADE, adverse drug event.
Dictionary group contributions to the total of 1970731 recognized concepts
| Dictionary group | Identified concepts | Unique text strings | Unique identifiers |
|---|---|---|---|
| Independent event | 1449924 | 5108 | 1707 |
| Localized event and location | 464052 | 26774 | 9484 |
| Laboratory event and laboratory value | 48543 | 3783 | 452 |
| Abbreviations | 8212 | 37 | 33 |
Negative filtering disqualification and dictionary group contribution
| Dictionary group | Contribution (%) |
|---|---|
| Negation | 56.5 |
| Undesirable effects information | 39.3 |
| Other subject | 3.9 |
| Temporal trigger | 0.3 |