Literature DB >> 29860093

Evaluation of Natural Language Processing (NLP) systems to annotate drug product labeling with MedDRA terminology.

Thomas Ly1, Carol Pamer2, Oanh Dang2, Sonja Brajovic2, Shahrukh Haider3, Taxiarchis Botsis4, David Milward5, Andrew Winter5, Susan Lu2, Robert Ball2.   

Abstract

INTRODUCTION: The FDA Adverse Event Reporting System (FAERS) is a primary data source for identifying unlabeled adverse events (AEs) in a drug or biologic drug product's postmarketing phase. Many AE reports must be reviewed by drug safety experts to identify unlabeled AEs, even if the reported AEs are previously identified, labeled AEs. Integrating the labeling status of drug product AEs into FAERS could increase report triage and review efficiency. Medical Dictionary for Regulatory Activities (MedDRA) is the standard for coding AE terms in FAERS cases. However, drug manufacturers are not required to use MedDRA to describe AEs in product labels. We hypothesized that natural language processing (NLP) tools could assist in automating the extraction and MedDRA mapping of AE terms in drug product labels.
MATERIALS AND METHODS: We evaluated the performance of three NLP systems, (ETHER, I2E, MetaMap) for their ability to extract AE terms from drug labels and translate the terms to MedDRA Preferred Terms (PTs). Pharmacovigilance-based annotation guidelines for extracting AE terms from drug labels were developed for this study. We compared each system's output to MedDRA PT AE lists, manually mapped by FDA pharmacovigilance experts using the guidelines, for ten drug product labels known as the "gold standard AE list" (GSL) dataset. Strict time and configuration conditions were imposed in order to test each system's capabilities under conditions of no human intervention and minimal system configuration. Each NLP system's output was evaluated for precision, recall and F measure in comparison to the GSL. A qualitative error analysis (QEA) was conducted to categorize a random sample of each NLP system's false positive and false negative errors.
RESULTS: A total of 417, 278, and 250 false positive errors occurred in the ETHER, I2E, and MetaMap outputs, respectively. A total of 100, 80, and 187 false negative errors occurred in ETHER, I2E, and MetaMap outputs, respectively. Precision ranged from 64% to 77%, recall from 64% to 83% and F measure from 67% to 79%. I2E had the highest precision (77%), recall (83%) and F measure (79%). ETHER had the lowest precision (64%). MetaMap had the lowest recall (64%). The QEA found that the most prevalent false positive errors were context errors such as "Context error/General term", "Context error/Instructions or monitoring parameters", "Context error/Medical history preexisting condition underlying condition risk factor or contraindication", and "Context error/AE manifestations or secondary complication". The most prevalent false negative errors were in the "Incomplete or missed extraction" error category. Missing AE terms were typically due to long terms, or terms containing non-contiguous words which do not correspond exactly to MedDRA synonyms. MedDRA mapping errors were a minority of errors for ETHER and I2E but were the most prevalent false positive errors for MetaMap.
CONCLUSIONS: The results demonstrate that it may be feasible to use NLP tools to extract and map AE terms to MedDRA PTs. However, the NLP tools we tested would need to be modified or reconfigured to lower the error rates to support their use in a regulatory setting. Tools specific for extracting AE terms from drug labels and mapping the terms to MedDRA PTs may need to be developed to support pharmacovigilance. Conducting research using additional NLP systems on a larger, diverse GSL would also be informative. Published by Elsevier Inc.

Entities:  

Keywords:  Drug Safety; FDA; Labeling; MedDRA; Pharmacovigilance

Mesh:

Year:  2018        PMID: 29860093     DOI: 10.1016/j.jbi.2018.05.019

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

1.  "Artificial Intelligence" for Pharmacovigilance: Ready for Prime Time?

Authors:  Robert Ball; Gerald Dal Pan
Journal:  Drug Saf       Date:  2022-05-17       Impact factor: 5.228

2.  Large scale meta-analysis of preclinical toxicity data for target characterisation and hypotheses generation.

Authors:  Jordi Munoz-Muriedas
Journal:  PLoS One       Date:  2021-06-08       Impact factor: 3.240

3.  Implementation and comparison of two text mining methods with a standard pharmacovigilance method for signal detection of medication errors.

Authors:  Nadine Kadi Eskildsen; Robert Eriksson; Sten B Christensen; Tamilla Stine Aghassipour; Mikael Juul Bygsø; Søren Brunak; Suzanne Lisbet Hansen
Journal:  BMC Med Inform Decis Mak       Date:  2020-05-24       Impact factor: 2.796

4.  Measuring Adoption of Patient Priorities-Aligned Care Using Natural Language Processing of Electronic Health Records: Development and Validation of the Model.

Authors:  Javad Razjouyan; Jennifer Freytag; Lilian Dindo; Lea Kiefer; Edward Odom; Jaime Halaszynski; Jennifer W Silva; Aanand D Naik
Journal:  JMIR Med Inform       Date:  2021-02-19

5.  The Presentation, Pace, and Profile of Infection and Sepsis Patients Hospitalized Through the Emergency Department: An Exploratory Analysis.

Authors:  Vincent X Liu; Meghana Bhimarao; John D Greene; Raj N Manickam; Adriana Martinez; Alejandro Schuler; Fernando Barreda; Gabriel J Escobar
Journal:  Crit Care Explor       Date:  2021-02-24

6.  Pandemic tele-smart: a contactless tele-health system for efficient monitoring of remotely located COVID-19 quarantine wards in India using near-field communication and natural language processing system.

Authors:  Vishal Balasubramanian; Sapthagirivasan Vivekanandhan; Venkatesh Mahadevan
Journal:  Med Biol Eng Comput       Date:  2021-10-27       Impact factor: 2.602

7.  Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records.

Authors:  Matthew D Solomon; Grace Tabada; Amanda Allen; Sue Hee Sung; Alan S Go
Journal:  Cardiovasc Digit Health J       Date:  2021-03-18

8.  Comprehensibility of Contraindications in German, UK and US Summaries of Product Characteristics/Prescribing Information-A Comparative Qualitative and Quantitative Analysis.

Authors:  Melanie I Then; Wahram Andrikyan; Martin F Fromm; Renke Maas
Journal:  J Clin Med       Date:  2022-07-18       Impact factor: 4.964

9.  Artificial Intelligence, Real-World Automation and the Safety of Medicines.

Authors:  Andrew Bate; Steve F Hobbiger
Journal:  Drug Saf       Date:  2020-10-07       Impact factor: 5.606

10.  Broad-Spectrum Profiling of Drug Safety via Learning Complex Network.

Authors:  Ke Liu; Ruo-Fan Ding; Han Xu; Yang-Mei Qin; Qiu-Shun He; Fei Du; Yun Zhang; Li-Xia Yao; Pan You; Yan-Ping Xiang; Zhi-Liang Ji
Journal:  Clin Pharmacol Ther       Date:  2020-02-28       Impact factor: 6.875

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.