| Literature DB >> 23256479 |
Harsha Gurulingappa1, Abdul Mateen-Rajput, Luca Toldo.
Abstract
: The sheer amount of information about potential adverse drug events published in medical case reports pose major challenges for drug safety experts to perform timely monitoring. Efficient strategies for identification and extraction of information about potential adverse drug events from free-text resources are needed to support pharmacovigilance research and pharmaceutical decision making. Therefore, this work focusses on the adaptation of a machine learning-based system for the identification and extraction of potential adverse drug event relations from MEDLINE case reports. It relies on a high quality corpus that was manually annotated using an ontology-driven methodology. Qualitative evaluation of the system showed robust results. An experiment with large scale relation extraction from MEDLINE delivered under-identified potential adverse drug events not reported in drug monographs. Overall, this approach provides a scalable auto-assistance platform for drug safety professionals to automatically collect potential adverse drug events communicated as free-text data.Entities:
Year: 2012 PMID: 23256479 PMCID: PMC3599676 DOI: 10.1186/2041-1480-3-15
Source DB: PubMed Journal: J Biomed Semantics
Figure 1Example of an annotated sentence in the ADE corpus. Example of a sentence annotated with drug, conditions, and relations between them in the ADE corpus. True indicates presence of adverse effect relation and False indicates absence of adverse effect relation.
Counts of entities and relations in ADE‐EXT corpus subsets
| Documents | 1884 | 210 |
| Drugs | 6770 | 758 |
| Conditions (adverse effect) | 8539 | 978 |
| Sentences | 5333 | 606 |
| 6030 | 671 | |
| 4799 | 546 |
Figure 2Ontologies discussed in this work. Mappings between ADE, OAE, and CLEF ontologies have been shown. Identical entities are in boxes with same colours. Condition in the CLEF ontology is mapped to Process in the OAE.
Assessment of results of relation extraction
| Cross‐Validation | 0.87 | 0.86 | 0.87 |
| Final Test | 0.86 | 0.89 | 0.87 |
Impact of size of the training set on relation extraction
| 10 | 0.58 | 0.41 | 0.60 | 0.44 | 0.55 | 0.38 |
| 20 | 0.62 | 0.36 | 0.69 | 0.38 | 0.64 | 0.37 |
| 50 | 0.79 | 0.13 | 0.87 | 0.06 | 0.82 | 0.09 |
| 100 | 0.81 | 0.05 | 0.75 | 0.08 | 0.78 | 0.04 |
| 200 | 0.85 | 0.07 | 0.84 | 0.05 | 0.84 | 0.04 |
| 500 | 0.82 | 0.04 | 0.85 | 0.01 | 0.84 | 0.02 |
| 1000 | 0.83 | 0.02 | 0.87 | 0.02 | 0.86 | 0.01 |
| 2000 | 0.87 | 0.01 | 0.86 | 0.01 | 0.87 | 0.01 |
Impact of size of the training set on relation extraction was measured by independent cross‐validations over subsets of the ADE‐EXT‐TRAIN corpus. N indicates number of documents in the training set and SD indicates the standard deviation measured during the 10‐fold cross validation.
Potential adverse drug events extracted from MEDLINE not reported in drug leaflets until 2009 and later introduced in package leaflets
| Rituximab | Progressive multifocal leukoencephalopathy |
| Efalizumab | Progressive multifocal leukoencephalopathy |
| Natalizumab | Hypersensitivity |