| Literature DB >> 25632348 |
Rave Harpaz1, David Odgers1, Greg Gaskin1, William DuMouchel2, Rainer Winnenburg3, Olivier Bodenreider3, Anna Ripple3, Ana Szarfman4, Alfred Sorbello4, Eric Horvitz5, Ryen W White5, Nigam H Shah1.
Abstract
Undetected adverse drug reactions (ADRs) pose a major burden on the health system. Data mining methodologies designed to identify signals of novel ADRs are of deep importance for drug safety surveillance. The development and evaluation of these methodologies requires proper reference benchmarks. While progress has recently been made in developing such benchmarks, our understanding of the performance characteristics of the data mining methodologies is limited because existing benchmarks do not support prospective performance evaluations. We address this shortcoming by providing a reference standard to support prospective performance evaluations. The reference standard was systematically curated from drug labeling revisions, such as new warnings, which were issued and communicated by the US Food and Drug Administration in 2013. The reference standard includes 62 positive test cases and 75 negative controls, and covers 44 drugs and 38 events. We provide usage guidance and empirical support for the reference standard by applying it to analyze two data sources commonly mined for drug safety surveillance.Entities:
Mesh:
Year: 2014 PMID: 25632348 PMCID: PMC4306188 DOI: 10.1038/sdata.2014.43
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Illustration of the resources (from FDA’s MedWatch and Drugs@FDA) and the process used to extract, filter, and verify, candidate test cases for the reference standard.
(1) Initial candidate test cases are obtained from FDA’s MedWatch monthly summaries, which provides a table of drugs (e.g., Ampyra) whose labels have been revised to include new safety information. (2) The events associated with each drug whose label was revised (e.g., Anaphylaxis) are obtained from the ‘detailed view’ linked to each row of a monthly summary Table. (3) The labels and revision history of each drug is obtained from Drugs@FDA to verify the labeling revision, and ensure that the candidate test case is indeed new and not qualified by special conditions.
Filtering rules used to select candidate positive test cases based on labeling revisions.
| Filtering Rule, Rationale, and Examples |
|---|
|
|
|
|
Statistics related to the prospective signal detection performance of FAERS.
|
| |||||
|---|---|---|---|---|---|
| AUC, area under the receiver operating characteristic curve; EBGM+, median signal score for the set of positive test cases; EBGM−, median signal score for the set of negative test cases; N+, median number of reports for the set of positive test cases; N−, median number of reports for the set of positive negative cases. | |||||
| 2013Q3 | 0.81 | 17 | 12 | 1.65 | 0.59 |
| 2012 | 0.82 | 17 | 11 | 1.65 | 0.55 |
| 2011 | 0.77 | 19 | 10 | 1.68 | 0.54 |
| 2010 | 0.75 | 17 | 9 | 1.66 | 0.55 |
| 2009 | 0.72 | 14 | 11 | 1.40 | 0.60 |
| 2008 | 0.70 | 13 | 10 | 1.56 | 0.62 |
| 2007 | 0.70 | 11 | 9 | 1.32 | 0.56 |
Statistics related to the prospective signal detection performance of MEDLINE.
| AUC_N, area under the receiver operating characteristic curve for number of articles used as a signal score; AUC_OR, area under the receiver operating characteristic curve for the corrected odds ratio signal score; F@1, F-score using a cutoff of 1 article; F@3, F-score using a cutoff of 3 articles; M+, number of positive test cases containing information in MEDLINE; M−, number of negative test cases containing information in MEDLINE. | ||||||
|---|---|---|---|---|---|---|
| narrow 2013 | 0.67 | 0.73 | 0.34 | 0.65 | 33 | 6 |
| narrow 2012 | 0.66 | 0.71 | 0.34 | 0.63 | 31 | 6 |
| narrow 2011 | 0.66 | 0.71 | 0.34 | 0.63 | 31 | 6 |
| narrow 2010 | 0.66 | 0.70 | 0.34 | 0.61 | 30 | 6 |
| narrow 2009 | 0.63 | 0.68 | 0.29 | 0.57 | 27 | 5 |
| broad 2013 | 0.71 | 0.75 | 0.47 | 0.70 | 40 | 13 |
| broad 2012 | 0.71 | 0.74 | 0.47 | 0.68 | 38 | 12 |
| broad 2011 | 0.70 | 0.73 | 0.45 | 0.67 | 37 | 12 |
| broad 2010 | 0.70 | 0.74 | 0.44 | 0.67 | 37 | 11 |
| broad 2009 | 0.67 | 0.71 | 0.40 | 0.63 | 33 | 10 |