Literature DB >> 33215076

Adverse drug event presentation and tracking (ADEPT): semiautomated, high throughput pharmacovigilance using real-world data.

Alon Geva1,2,3, Jason P Stedman4, Shannon F Manzi1,5,6, Chen Lin1, Guergana K Savova1,6, Paul Avillach1,4, Kenneth D Mandl1,4,6.   

Abstract

OBJECTIVE: To advance use of real-world data (RWD) for pharmacovigilance, we sought to integrate a high-sensitivity natural language processing (NLP) pipeline for detecting potential adverse drug events (ADEs) with easily interpretable output for high-efficiency human review and adjudication of true ADEs.
MATERIALS AND METHODS: The adverse drug event presentation and tracking (ADEPT) system employs an open source NLP pipeline to identify in clinical notes mentions of medications and signs and symptoms potentially indicative of ADEs. ADEPT presents the output to human reviewers by highlighting these drug-event pairs within the context of the clinical note. To measure incidence of seizures associated with sildenafil, we applied ADEPT to 149 029 notes for 982 patients with pediatric pulmonary hypertension.
RESULTS: Of 416 patients identified as taking sildenafil, NLP found 72 [17%, 95% confidence interval (CI) 14-21] with seizures as a potential ADE. Upon human review and adjudication, only 4 (0.96%, 95% CI 0.37-2.4) patients with seizures were determined to have true ADEs. Reviewers using ADEPT required a median of 89 s (interquartile range 57-142 s) per patient to review potential ADEs. DISCUSSION: ADEPT combines high throughput NLP to increase sensitivity of ADE detection and human review, to increase specificity by differentiating true ADEs from signs and symptoms related to comorbidities, effects of other medications, or other confounders.
CONCLUSION: ADEPT is a promising tool for creating gold standard, patient-level labels for advancing NLP-based pharmacovigilance. ADEPT is a potentially time savings platform for computer-assisted pharmacovigilance based on RWD.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association.

Entities:  

Keywords:  adverse drug event; hypertension; natural language processing; pulmonary; software design

Year:  2020        PMID: 33215076      PMCID: PMC7660953          DOI: 10.1093/jamiaopen/ooaa031

Source DB:  PubMed          Journal:  JAMIA Open        ISSN: 2574-2531


INTRODUCTION

The 21st Century Cures Act calls for using real-world data (RWD) in the drug approval process, including in the identification of adverse drug events (ADEs). Generally, RWD refers to any data generated outside of clinical trials, but most often is used to designate data produced in the course of routine delivery of healthcare. Because patients exposed to drugs during routine care often differ from those who are not prescribed medications, causal inference—attributing a particular adverse event to a particular drug—is more challenging than it is in well-designed clinical trials. Much of the work on pharmacovigilance using RWD has focused on disproportionality statistics in structured databases. However, these systems are not designed for causal assessment of drug safety signals. For instance, disproportionality statistics do not differentiate when an apparent adverse effect of a drug may overlap with or be a manifestation or complication of the disease for which it is being used. The preponderance of clinical data necessary for distinguishing ADEs from signs and symptoms not caused by a drug are found in clinicians’ notes. Natural language processing (NLP) can be used to identify terms for medications and for signs and symptoms that may suggest ADEs. However, most implementations of NLP for pharmacovigilance focus on analysis of signs or symptoms that clinicians have explicitly attributed in their documentation as being adverse reactions due to medications. Furthermore, even with explicit documentation of potential causality, current top-performing NLP systems have relatively poor performance for the end-to-end task of identifying a relationship between drugs and adverse events. Thus, differentiating in electronic health record (EHR) data when co-occurrence of drugs and signs or symptoms are true ADEs and not effects of the underlying disease, comorbidities, or other confounding medications has to date largely remained a manual task. We developed the adverse drug event presentation and tracking (ADEPT) toolkit to assess causality of putative ADEs in EHR clinical notes. ADEPT serves as a complement to active ADE surveillance systems and ADE hypotheses generated from structured databases by partially automating review and annotation of potential ADEs identified in the unstructured clinical narrative. Manual review of notes to find relevant information is a time-consuming and laborious task, which has been made more difficult with the prevalence of copied text in EHR notes. ADEPT uses NLP to screen patients’ clinical notes for mentions of medications and signs or symptoms of interest, presents the identified terms graphically on a timeline that reviewers use to navigate relevant notes in a patient’s EHR, and allows users to review whether each patient experienced each potential ADE of interest. Thus, ADEPT is a tool for computer-assisted pharmacovigilance. Unlike most NLP annotation tasks, ADEPT was not designed for instance- or document-level annotation of entities. Instead, ADEPT was optimized for patient-level, across-document assessment of whether an identified sign or symptom was likely to be a true ADE caused directly by a medication. In this context, temporality of events plays a crucial role. Four of ten criteria in the Naranjo Scale, an instrument for ADE evaluation, consider change over time. An additional criterion—whether alternate causes may explain the sign or symptom—requires knowing the patient’s clinical status and comorbidities at the time the sign or symptom occurred. Recent work in RWD-based pharmacovigilance has focused on incorporating temporality of drug and sign/symptom patterns into signal detection. However, we have identified only one such project that has incorporated visualization of temporality in ADE signal detection, and that work was based on structured data only. By highlighting the sequence of multiple events across a patient’s clinical timeline, ADEPT addresses the problem of determining causality from free-form, narrative clinical text. To test ADEPT, we use it to estimate the proportion of patients with incident seizures, a known but rare ADE associated with use of sildenafil to treat pediatric pulmonary hypertension (PH).

MATERIALS AND METHODS

Subjects and data processing

We previously identified a cohort of patients with PH using a computable phenotype. The study was approved by the institutional review board at Boston Children’s Hospital with waiver of informed consent for review of EHR data. ADEPT used the Apache clinical Text Analysis Knowledge Extraction System (cTAKES) to identify mentions of medications and signs or symptoms of interest in plain-text clinical notes extracted from the local data warehouse. Because of ADEPT’s modular design, the output of any clinical NLP system capable of identifying clinical events of interest within individual notes could be substituted as input to ADEPT. We previously described the cTAKES pipeline used for extracting these potential ADEs of interest from EHR notes. cTAKES identified textual mentions with mappings to concept unique identifiers (CUIs) representing medications and signs or symptoms and assigned relevant attributes—negation, conditional status (eg, “if you experience headache”), and temporality—to each mention using open-source modules whose methods are described in detail elsewhere. We included only mentions that were not negated or conditional and whose temporality did not place them prior to the document creation time. We excluded certain terms that generated false-positive associations by subsuming them into a more specific concept (eg, specifying that “heparin flush” was not an instance of “flushing” as a sign or symptom), which we implemented using a custom cTAKES dictionary. As a postprocessing, rules-based step, we linked into potential relations medication and sign/symptom mentions when mentions occurred within 25 newline characters of one another. This proximity parameter was determined empirically during pipeline development to maximize F1 score. We excluded putative relations between medications and signs and symptoms that were temporally implausible (ie, if the sign or symptom overlapped the document creation time, but the medication term had a temporality after the document creation time, that could not be a temporally plausible relationship for an ADE). The pipeline was tuned to maximize recall. Evaluated at the document level against gold standard human annotations (interannotator agreement 0.88) for extracting temporally compatible medication and relevant sign/symptom pairs, the final pipeline had sensitivity 90%, positive predictive value 69%, and F1 score 0.78. The output of cTAKES was a comma-separated file containing identifiers for each patient and note, a timestamp for the note’s creation, the CUIs identified in that note and any potential relationships among them, and the specific character spans in each note of the text relating to each CUI (Figure 1). Using the high throughput cTAKES NLP pipeline, ADEPT processed 149 029 notes for the 982 patients with pediatric PH. We then developed ADEPT to increase precision for the integrated task of clinical pharmacovigilance using the NLP data.
Figure 1:

Schematic of data flow in ADEPT. Clinical notes extracted from the EHR are processed using a NLP pipeline implemented in cTAKES. cTAKES identifies terms for drugs and signs/symptoms as well as relations between them representing potential ADEs. The output of cTAKES is a comma-separated values (CSV) file that includes patient and note identifiers, a timestamp of document creation time, and character spans for each CUI of interest identified. These data are uploaded into the ADEPT database alongside the encrypted raw text notes. Finally, ADEPT presents each patient’s cumulative, temporal narrative data in an easily navigable format.

Schematic of data flow in ADEPT. Clinical notes extracted from the EHR are processed using a NLP pipeline implemented in cTAKES. cTAKES identifies terms for drugs and signs/symptoms as well as relations between them representing potential ADEs. The output of cTAKES is a comma-separated values (CSV) file that includes patient and note identifiers, a timestamp of document creation time, and character spans for each CUI of interest identified. These data are uploaded into the ADEPT database alongside the encrypted raw text notes. Finally, ADEPT presents each patient’s cumulative, temporal narrative data in an easily navigable format.

ADEPT technical description

Overview and front end

ADEPT is a Java- and JavaScript-based web application. Figure 2 presents an overview of users’ workflow within ADEPT. Users are first presented with a listing of all patients in the system and a set of filters to limit the list based on drugs or signs/symptoms representing potential ADEs (Figure 3). Applying filters limits the patient list to patients with mentions of the drugs and/or signs and symptoms identified by the NLP pipeline. Underneath each patient identifier are the (filtered) medications and potential ADEs identified for that particular patient.
Figure 2:

User workflow within ADEPT. The same workflow is repeated for each potential ADE.

Figure 3:

The patient list screen in ADEPT showing the functionality of filters to select patients with seizures as a potential ADE for sildenafil. Note that clicking on a patient number will select all potential ADEs for that patient for review, whereas clicking in the box “Sildenafil - Seizure” will select only sildenafil and seizure as the potential ADE for review.

User workflow within ADEPT. The same workflow is repeated for each potential ADE. The patient list screen in ADEPT showing the functionality of filters to select patients with seizures as a potential ADE for sildenafil. Note that clicking on a patient number will select all potential ADEs for that patient for review, whereas clicking in the box “Sildenafil - Seizure” will select only sildenafil and seizure as the potential ADE for review. The patient-level ADE evaluation screen in ADEPT contains a large area for displaying note text, a patient history timeline with indicators for medications and signs and symptoms, and an ADE selection and annotation pane (Figure 4). The patient history timeline is used to navigate among a patient’s notes. Each note in the history is shown as a translucent-white vertical bar. Notes containing any mention of a drug or sign or symptom involved in one of the selected potential ADEs contain a color-coded triangle, where drug mentions are represented using an upward-pointed triangle and signs or symptoms are represented using a downward pointed triangle. Translucency of the colored triangles and the bars indicating notes provide an indication of note and mention density, cueing users to zoom in on times of heavy documentation. Drug-candidate ADE pairings can be selected and deselected at any time (Figure 4). Notes in which the NLP pipeline identified a potential relationship between the drug and sign or symptom are highlighted in a translucent orange color.
Figure 4:

The patient-level annotation screen in ADEPT. The selected note text is in the top-left, showing several potential terms of interest highlighted with color-coded rectangles. Note the thick line connecting “seizure” and “sildenafil,” indicating a potential plausible connection between the two terms. At the bottom-left is the Patient History Timeline navigation pane, with triangles indicating notes that contain “sildenafil” and/or “seizures.” At right is the review/adjudication pane (shown in review mode), where annotators indicate whether the potential ADE represents a true ADE.

The patient-level annotation screen in ADEPT. The selected note text is in the top-left, showing several potential terms of interest highlighted with color-coded rectangles. Note the thick line connecting “seizure” and “sildenafil,” indicating a potential plausible connection between the two terms. At the bottom-left is the Patient History Timeline navigation pane, with triangles indicating notes that contain “sildenafil” and/or “seizures.” At right is the review/adjudication pane (shown in review mode), where annotators indicate whether the potential ADE represents a true ADE. When a user clicks on a note to navigate to it, the note displays in a large, scrollable pane in ADEPT. The visualization highlights mentions of any drugs or signs and symptoms using color-coded rectangles (Figure 4). The visualization also highlights mentions involved in potentially causal relationships identified by the NLP pipeline using a thick light-gray line. This color-coded visualization directs user attention to potential ADE descriptions within the clinical note. The visualization of NLP output is implemented by a transparent JavaScript Canvas element overlaid on top of the note text. Using the character offsets from the cTAKES output for each note, each mention of a drug or sign or symptom identified by cTAKES is wrapped by a Hypertext Markup Language (HTML) span element. An HTML class attribute is added to the span class identifying the Unified Medical Language System (UMLS) CUI of the drug, sign, or symptom. The UMLS CUI classes are used to add the color-coded borders to the span element highlighting the mentions of interest. In addition, a Group ID is added to the HTML class attribute. The Group ID classes are used to identify mentions of drugs and signs or symptoms among which the light-gray connection bars indicating potential causal ADEs are drawn. Once a user has completed review of the patient’s notes, the reviewer uses the right-hand pane to record whether each drug-sign/symptom pair is consistent with a true ADE for that patient (Figure 4). ADEPT supports both “review” and “adjudication” roles for users. In the review role, a user sees only his/her own annotations of whether drug-sign/symptom pairs are ADEs for each patient. In the adjudication role, a user can identify conflicting potential ADE evaluations and, after review of the evidence, make a final adjudication of whether the drug-sign/symptom pair for the patient is a true ADE.

Data structure and back end

To configure ADEPT for a particular task, the clinical note content and metadata (patient identifier, date/time of note creation, and term spans and relationships) are loaded into a Structured Query Language (SQL) database (Figure 1). The locations for NLP-identified terms are expressed in the cTAKES output as character offsets from the start of the text. Each of the NLP-identified mentions of a term are stored in the database in the CTakesHit table. Related groupings of CTakesHit entries are tied together by a many-to-one relationship with the CTakesGroup table. Each candidate ADE identified by the NLP pipeline is stored in the Candidate table and tied to a CTakesGroup record through the candidateId column. User validations of candidate ADEs are captured in the CandidateValidation table. Candidate records which have been adjudicated are marked by the isAdjudicated column and have the adjudicator’s userId and adjudicated score recorded directly in the Candidate table. Additional measures are taken to ensure security of the note text data, since clinical notes frequently contain personally identifiable, protected health information and other sensitive/confidential patient information. ADEPT uses Transport Layer Security (TLS)-based standards to encrypt data in transit. For encryption at rest, in addition to volume-level encryption, ADEPT encrypts the text of each note using the Java Cryptography Extension implementation of the Advanced Encryption Standard/Galois/Counter Mode/NoPadding encryption algorithm. ADEPT also utilizes typical secure coding practices, such as using stored procedures for database access, to prevent SQL injection attacks. The only flow of execution that yields a plain-text note through ADEPT is a specific service call, which is secured via a role-based access control model in addition to user-specific rate-limiting. This provides a required human intervention to validate that the user’s token has not fallen into the wrong hands. Every action pertaining to user management, retrieval of patient data, and annotation of candidate ADEs is logged to the application log. This log is then forwarded to a log aggregation and analysis server instance. For the current project, we used Splunk Enterprise (Splunk, Inc., San Francisco, CA, USA) for tracking ADEPT interactions.

Evaluation

To test application of ADEPT to a clinical problem, we focused on sildenafil, the most commonly prescribed PH-targeted therapy in children. As our potential ADE of interest, we chose seizures, since preliminary work suggested that seizures were a rare ADE in patients taking sildenafil but a commonly recorded event in our patients’ clinical notes. Two clinicians (A.G. and S.F.M.) independently used ADEPT to review clinical notes of all patients identified as potentially experiencing seizures while on sildenafil. To review potential ADEs, we initially intended to use the Naranjo scale. However, we noted several limitations of that assessment tool, particularly to critically ill patients, which describes many of the patients in the current study. Thus, we instead agreed on consensus annotation criteria that accounted for temporal plausibility of the ADE, alternative causes for the ADE, and objective evidence of the ADE, but did not assign specific scores to those components. Other components of the Naranjo scale, including readministration of the drug, placebo administration, testing for drug levels, and a dose–response relationship were rarely assessed in our patient population. For each patient, each reviewer responded to the question, “Is there a relation between sildenafil and seizure for this patient across the clinical history?” and responses (“Yes,” “No,” or “Cannot tell”) were recorded in ADEPT. Interannotator agreement for the ADE determination was calculated using Cohen’s weighted κ. Discrepant responses were resolved by discussion and consensus between the two reviewers and recorded as an adjudicated result. Prevalence of the ADE was calculated as the percent of patients exposed to sildenafil who experienced seizures, with 95% confidence intervals calculated using the binomial approximation. The number of notes reviewed, time spent on each note, and total time spent from accessing a patient’s history to recording a response to whether the seizure was consistent with a true ADE were determined from the server log data. Times greater than 10 min [N = 3 (1.8%) of 169 actions] were considered potentially spurious and were eliminated from further analysis. Differences in number of notes and time reviewing notes were compared using the Mann–Whitney U test. A P-value < 0.05 was considered statistically significant. Data analysis was performed using R version 3.5.0 (R Foundation for Statistical Computing, Vienna, Austria).

RESULTS

The NLP pipeline identified seizures in 72 [17%, 95% confidence interval (CI) 14–21] of 416 patients taking sildenafil [age (mean ± SD) 20 ± 25 years, 46% male]. Most documents (77%) with mentions of sildenafil were written between 2012 and 2015. While interannotator agreement was initially poor (65%, κ = 0.2), most (16 of 25, 64%) of the initially conflicting reviews were due to one of the evaluators marking that he or she could not tell whether the seizure was a true ADE and the other evaluator indicating “yes” or “no.” After adjudication, only 4 (0.96%, 95% CI 0.37–2.4) of the patients’ seizures were found to be consistent with true ADEs. In an additional 13 patients (3.1%, 95% CI 1.8–5.3), users were not able to determine whether the seizure was consistent with a true ADE. As compared to reaching consensus for discrepant annotations, single-reviewer determination of whether a sign/symptom represented an ADE required review of fewer notes and took less time in total and per note (Table 1).

Table 1. Usage statistics for ADEPT interactions

ReviewAdjudication P-value
Notes (N)8 (6–10)11 (9–19)<0.001
Unique notes (N)7 (5–9)10 (8–18)<0.001
Total time to result (s)89 (57–142)224 (141–317)<0.001
Time per note (s)9.6 (0.21–21)13 (3.8–24)0.003

Data shown as median (interquartile range).

Table 1. Usage statistics for ADEPT interactions Data shown as median (interquartile range).

DISCUSSION

ADEPT facilitates ascertainment of gold standard, human-reviewed ADEs. Our approach takes advantage the scalability of natural language processing and machine learning for high throughput to maximize sensitivity and guide reviewers, while presenting the computed data in a format that minimizes cognitive load and effort on the part of human annotators. In this sense, ADEPT is similar to systems that use preannotation to aid in annotating clinical text for other NLP tasks., ADEPT, however, differs from other NLP annotation tools, in that it facilitates clinical pharmacovigilance by providing an interface to review whether a sign or symptom represents an ADE across a patient’s entire clinical corpus, rather than in a single instance or document. This functionality facilitates developing gold standard ADE labels for ADE causality determination, in which expert clinicians review the totality of the clinical record to determine whether the incidence of a sign or symptom is caused by a medication, but does so in a way that is notably more efficient than traditional, manual review of clinical notes. We demonstrate, with a driving use case, that many potential ADEs that might be identified through a chart search actually represent signs and symptoms that cannot be attributed definitively to the patient’s being exposed to a medication, even when searching specifically for known ADEs. Such review would be laborious without the efficiency gains of ADEPT. Given the importance of temporality in determining whether a drug-sign/symptom pair represents an ADE (exposure to the medication must precede the sign or symptom), ADEPT’s utility was heavily influenced by its ability to discriminate when a patient was exposed to a medication. Studies have shown that as many as 82% of notes may contain as much as 20% copied material from previous notes. Thus, although our NLP pipeline filtered out temporally impossible relationships based on temporality of terms relative to the document creation time, within-note temporality was not sufficient to determine the overall clinical course of the patient. In contrast, the visual representation of medication and sign or symptom mentions in ADEPT allowed annotators to find quickly first and final mentions of these terms in a patient’s clinical history, determine whether these terms truly represent initiation of a medication or only first mention within our EHR, and determine when medications were discontinued in order to assess whether signs/symptoms occurred prior to initiation or after cessation of the drug. Determining whether a drug-sign/symptom pair is consistent with a true ADE remains a difficult task requiring some subjective assessment. Our ultimate interannotator agreement was poor (κ = 0.2), reflecting how challenging the task of determining causality in a complex, dense clinical record can be, but was similar to that reported in several other studies of ADEs that used varied scales and assessment tools. Moreover, disagreement largely stemmed from the challenge of reviewers determining definitively whether there was sufficient information in the clinical notes to rule in or out each potential ADE. Agreement was better when both reviewers found sufficient information to determine whether or not the seizure was consistent with a true ADE, although the causal relation between sildenafil and seizures was rarely explicitly recorded by the clinician. ADEPT addresses the challenge of interpretability in ADE ascertainment by highlighting the specific sentences in which drugs and signs and symptoms are mentioned and displaying that information within the full context of the patient’s clinical narrative. This contextual representation allows consideration of confounding from other medications and conditions during manual annotation of potential ADEs. ADEPT increased efficiency of this pharmacovigilance case study by reducing the time needed to review potential ADEs. Two recent studies report times for case completion of 15–23 min,, whereas using ADEPT chart review typically took annotators less than 4 min per patient to complete. This efficiency was gained despite the density of inpatient notes for a relatively ill patient population. By providing an interface that combined navigation across notes with visual indicators of the most relevant notes, annotators were able to rapidly peruse a patient’s clinical narrative. We found that few of the potential ADEs identified by our screening NLP pipeline were adjudicated to be consistent with true ADEs. This was expected given that our NLP pipeline was tuned to maximize sensitivity, at the expense of positive predictive value, as an initial screen for ADE-related terms. In the case of seizures, the rate of confirmed ADEs in this study was similar to the rate reported in the STARTS-2 trial of sildenafil for pediatric PH. Of note, seizures were not a reported ADE in two other small studies of sildenafil., Using scalable, computer-assisted methods, we were able to scan the retrospective records of 416 patients taking sildenafil, examining the rate of seizures in over twice as many patients taking sildenafil compared to prior studies. These findings highlight the importance of RWD in pharmacovigilance, particularly for drugs used in children, for whom limited randomized clinical trial and postmarketing follow-up data exist. While many studies have used NLP for ADE detection, state-of-the-art NLP pipelines continue to have limited performance for the full, integrated task of ADE identification and depend on clinicians’ explicit assertion that a sign/symptom is related to a medication. In contrast, ADEPT is built around the idea that the connections between medications and signs/symptoms that may represent ADEs are often represented vaguely and inconsistently in the clinical narrative. In medically complex patients, in particular, many potential ADEs may represent effects of underlying or comorbid diseases or adverse effects of other treatments for those diseases. ADEPT leverages the scalability of NLP and machine learning to screen notes for potential informativeness regarding ADEs, while facilitating human review of those potential ADE mentions. This two-step process allows for the underlying NLP pipeline to be tuned to greater sensitivity—capturing implicit rather than only explicit relations—while the human review and validation allows for maintaining adequate specificity for pharmacovigilance applications. Thus, even using an NLP pipeline with relatively low positive predictive value, ADEPT facilitated efficient human screening of potential ADEs and, after adjudication to resolve disagreements between two reviewers, allowed for rapid creation of gold standard ADE labels. This study has several limitations. Though we found that ADEPT was used effectively by one user (S.F.M.) without prior experience with the platform, further usability testing with more clinicians, epidemiologists, and researchers is needed. We were not able to directly compare ADEPT to manual chart review of potential ADEs, in which an annotators would have note-level NLP results but not the visual/navigation functions of ADEPT. However, without ADEPT, which highlighted which notes over a patient’s clinical timeline had mentions of medications alone, signs/symptoms alone, and medications and signs/symptoms in the same note with a putative relationship, annotators would have had to review an average of over 200 notes per patient with mention of either a drug or sign/symptom of interest to determine causality. In future studies, we will compare ADEPT and manual chart review to confirm the hypothesized performance gains. ADEPT is currently designed for research purposes, and further refinements to the user interface for general clinical use will be required. The NLP pipeline required significant tuning and may not perform as well at other institutions or with different patient populations. Nonetheless, the methods underlying ADEPT can be readily adapted to more robustly tuned input data. Future planned application enhancements include the ability to modify, tune, and run the NLP pipeline from within ADEPT in order to allow end-users to develop custom NLP-assisted pharmacovigilance pipelines. Finally, although the performance of the NLP pipeline was evaluated on a hold-out test set of patients and showed high sensitivity, the potential of false-negative patient-level annotations using the ADEPT system could not be evaluated since we did not have a gold standard against which to compare annotations. A strength of ADEPT is that it allowed annotators to review discrepant annotations together and record an adjudicated result.

CONCLUSION

ADEPT, a platform for NLP-based computer-assisted human annotation of potential ADEs, effectively identifies and validates rare events consistent with true ADEs in unstructured, real-world data. Future studies in NLP-based pharmacovigilance should emphasize causal inference between medications and signs or symptoms even when the signs or symptoms are not explicitly labeled as an ADE in the clinical text. Such studies should also address the relationships among signs or symptoms and medications across a patient’s clinical documents in addition to within individual documents. Systems such as ADEPT that address temporality and confounding by facilitating rapid navigation of a patient’s clinical timeline can be used to create gold standard labels and accelerate pharmacovigilance applications based on RWD. In future work, we plan to test ADEPT’s ability to facilitate validation of novel potential ADEs. The ADEPT source code is available at https://github.com/hms-dbmi/adept.

Contributors

K.D.M. obtained funding. A.G., J.P.S., and K.D.M. wrote the manuscript. A.G. and K.D.M. designed the research. A.G., J.P.S., S.F.M., G.S., and C.L. performed the research. A.G., S.F.M., P.A., and K.D.M. analyzed the data. J.P.S., C.L., G.S., and P.A. contributed new analytical tools. S.F.M., C.L., G.K.S., and P.A. critically revised the manuscript for important intellectual content. All the authors take responsibility for the final approval of the version to be published and are accountable for all aspects of the work.

FUNDING

This work was supported by the National Institutes of Health (NHLBI U01HL121518, NHLBI L40HL133929, NICHD T32HD040128, NICHD K12HD047349, NLM R01LM010090, and NCATS U01TR002623). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Conflict of interest statement. None declared.
  31 in total

1.  The Pediatric Research Equity Act Moves Into Adolescence.

Authors:  Florence T Bourgeois; Thomas J Hwang
Journal:  JAMA       Date:  2017-01-17       Impact factor: 56.272

2.  Adverse Drug Events Detection in Clinical Notes by Jointly Modeling Entities and Relations Using Neural Networks.

Authors:  Bharath Dandala; Venkata Joopudi; Murthy Devarakonda
Journal:  Drug Saf       Date:  2019-01       Impact factor: 5.606

3.  STARTS-2: long-term survival with oral sildenafil monotherapy in treatment-naive pediatric pulmonary arterial hypertension.

Authors:  Robyn J Barst; Maurice Beghetti; Tomas Pulido; Gary Layton; Irina Konourina; Min Zhang; D Dunbar Ivy
Journal:  Circulation       Date:  2014-03-17       Impact factor: 29.690

4.  A Computable Phenotype Improves Cohort Ascertainment in a Pediatric Pulmonary Hypertension Registry.

Authors:  Alon Geva; Jessica L Gronsbell; Tianxi Cai; Tianrun Cai; Shawn N Murphy; Jessica C Lyons; Michelle M Heinz; Marc D Natter; Nandan Patibandla; Jonathan Bickel; Mary P Mullen; Kenneth D Mandl
Journal:  J Pediatr       Date:  2017-06-16       Impact factor: 4.406

5.  A method for estimating the probability of adverse drug reactions.

Authors:  C A Naranjo; U Busto; E M Sellers; P Sandor; I Ruiz; E A Roberts; E Janecek; C Domecq; D J Greenblatt
Journal:  Clin Pharmacol Ther       Date:  1981-08       Impact factor: 6.875

6.  UArizona at the MADE1.0 NLP Challenge.

Authors:  Dongfang Xu; Vikas Yadav; Steven Bethard
Journal:  Proc Mach Learn Res       Date:  2018-05

7.  Prevalence of copied information by attendings and residents in critical care progress notes.

Authors:  J Daryl Thornton; Jesse D Schold; Lokesh Venkateshaiah; Bradley Lander
Journal:  Crit Care Med       Date:  2013-02       Impact factor: 7.598

8.  Reported sildenafil side effects in pediatric pulmonary hypertension patients.

Authors:  Stephanie L Siehr; Elisa K McCarthy; Michelle T Ogawa; Jeffrey A Feinstein
Journal:  Front Pediatr       Date:  2015-03-09       Impact factor: 3.418

9.  Longitudinal medical records as a complement to routine drug safety signal analysis.

Authors:  Kristina Star; Sarah Watson; Lovisa Sandberg; Jeanette Johansson; I Ralph Edwards
Journal:  Pharmacoepidemiol Drug Saf       Date:  2015-01-27       Impact factor: 2.890

10.  Adverse drug event rates in pediatric pulmonary hypertension: a comparison of real-world data sources.

Authors:  Alon Geva; Steven H Abman; Shannon F Manzi; Dunbar D Ivy; Mary P Mullen; John Griffin; Chen Lin; Guergana K Savova; Kenneth D Mandl
Journal:  J Am Med Inform Assoc       Date:  2020-02-01       Impact factor: 4.497

View more
  1 in total

1.  Artificial Intelligence in Pharmacovigilance: An Introduction to Terms, Concepts, Applications, and Limitations.

Authors:  Jeffrey K Aronson
Journal:  Drug Saf       Date:  2022-05-17       Impact factor: 5.606

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.