| Literature DB >> 22541596 |
Paea Lependu1, Srinivasan V Iyer, Cédrick Fairon, Nigam H Shah.
Abstract
BACKGROUND: The electronic surveillance for adverse drug events is largely based upon the analysis of coded data from reporting systems. Yet, the vast majority of electronic health data lies embedded within the free text of clinical notes and is not gathered into centralized repositories. With the increasing access to large volumes of electronic medical data-in particular the clinical notes-it may be possible to computationally encode and to test drug safety signals in an active manner.Entities:
Year: 2012 PMID: 22541596 PMCID: PMC3337270 DOI: 10.1186/2041-1480-3-S1-S5
Source DB: PubMed Journal: J Biomed Semantics
Figure 1The NCBO Annotator Workflow extracts terms from the clinical notes of patients: (1) We obtain a lexicon of over 2.8-million terms from the NCBO BioPortal library. (2) We use the NCBO Annotator to rapidly find those terms in clinical notes—which we call annotations. (3) We apply NegEx trigger rules to separate negated terms. (4) We compile terms (both positive and negative) into a temporally ordered series of sets for each patient and combine them with coded and structured data when possible. (5) We reason over the structure of the ontologies to normalize and to aggregate terms for further analysis.
Figure 2The Vioxx risk pattern (top row) occurs when a patient with rheumatoid arthritis (RA) who takes Vioxx and suffers a myocardial infarction (MI)—these frequencies are entered into cell a of the 2x2 contingency matrix. Based on the various combinations of temporal orderings for the initial mentions of RA, Vioxx, and MI in the patient notes, other possible patterns contribute to the expected background distribution (cells b, c, d of the contingency table) that the odds ratio calculation requires.
Two-by-two contingency table for rofecoxib and myocardial infarction within the STRIDE dataset for patients with rheumatoid arthritis before 2005 using NCBO annotations.
| myocardial infarction | no myocardial infarction | |
|---|---|---|
| rofecoxib | a=339 | b=1221 |
| no rofecoxib | c=1488 | d=11031 |
Odds Ratio: For a given pair (x,y), the reporting odds ratio provides the “unexpectedness of y, given x” via the simple calculation (ad) ÷ (bc) as defined in the contingency table below. For example, the cell marked as a denotes the number of patients that took drug x (e.g., Vioxx), and experienced condition y (e.g., myocardial infarction). The other cells help to establish the likelihood.
| y | not y | |
|---|---|---|
| a | b | |
| c | d |
Two-by-two contingency table for rofecoxib and myocardial infarction within the STRIDE dataset for patients with rheumatoid arthritis before 2005 using mainly ICD9 coded data.
| myocardial infarction | no myocardial infarction | |
|---|---|---|
| rofecoxib | a=16 | b=487 |
| no rofecoxib | c=61 | d=2831 |
Ontologies chosen for Annotation Workflow configuration as well as overall term frequency counts per ontology.
| Current Procedural Terminology | UMLS | CPT | 17243153 |
| Human Disease Ontology | OBO | DO | 122035173 |
| International Classification of Disease (ICD-10) | UMLS | ICD10 | 55572189 |
| International Classification of Disease (ICD-9) | UMLS | ICD9 | 58334369 |
| Logical Observation Identifier Names and Codes | UMLS | LNC | 1208284117 |
| Medical Dictionary for Regulatory Activities | UMLS | MDR | 361398956 |
| Medical Subject Headings | UMLS | MSH | 643026014 |
| National Drug File | UMLS | NDFRT | 232557746 |
| NCI Thesaurus | UMLS | NCI | 2498591490 |
| Online Mendelian Inheritance in Man | UMLS | OMIM | 262747872 |
| Systematized Nomenclature of Medicine–Clinical Terms | UMLS | SNOMEDCT | 2369959351 |