Feifan Liu1, Richeek Pradhan1, Emily Druhl2, Elaine Freund1, Weisong Liu3, Brian C Sauer4, Fran Cunningham5, Adam J Gordon6,7, Celena B Peters4,6, Hong Yu2,3,8,9. 1. Department of Population and Quantitative Health Sciences, University of Massachusetts Medical School, Worcester, Massachusetts, USA. 2. Bedford VA Medical Center, Bedford, Massachusetts, USA. 3. Department of Computer Science, University of Massachusetts Lowell, Lowell, Massachusetts, USA. 4. Departments of Internal Medicine and Biomedical Informatics, University of Utah, Salt Lake City, Utah, USA. 5. Department of Veterans Affairs Pharmacy Benefits Management Services, Hines, Illinois, USA. 6. Informatics, Decision-Enhancement, and Analytic Sciences Center (IDEAS 2.0), VA Salt Lake City Health Care System, Salt Lake City, Utah, USA. 7. Division of Epidemiology, Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, Utah, USA. 8. Department of Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, USA. 9. Department of Computer Science, University of Massachusetts Amherst, Amherst, Massachusetts, USA.
Abstract
OBJECTIVE: Identifying drug discontinuation (DDC) events and understanding their reasons are important for medication management and drug safety surveillance. Structured data resources are often incomplete and lack reason information. In this article, we assessed the ability of natural language processing (NLP) systems to unlock DDC information from clinical narratives automatically. MATERIALS AND METHODS: We collected 1867 de-identified providers' notes from the University of Massachusetts Medical School hospital electronic health record system. Then 2 human experts chart reviewed those clinical notes to annotate DDC events and their reasons. Using the annotated data, we developed and evaluated NLP systems to automatically identify drug discontinuations and reasons at the sentence level using a novel semantic enrichment-based vector representation (SEVR) method for enhanced feature representation. RESULTS: Our SEVR-based NLP system achieved the best performance of 0.785 (AUC-ROC) for detecting discontinuation events and 0.745 (AUC-ROC) for identifying reasons when testing this highly imbalanced data, outperforming 2 state-of-the-art non-SEVR-based models. Compared with a rule-based baseline system for discontinuation detection, our system improved the sensitivity significantly (57.75% vs 18.31%, absolute value) while retaining a high specificity of 99.25%, leading to a significant improvement in AUC-ROC by 32.83% (absolute value). CONCLUSION: Experiments have shown that a high-performance NLP system can be developed to automatically identify DDCs and their reasons from providers' notes. The SEVR model effectively improved the system performance showing better generalization and robustness on unseen test data. Our work is an important step toward identifying reasons for drug discontinuation that will inform drug safety surveillance and pharmacovigilance. Published by Oxford University Press on behalf of the American Medical Informatics Association 2019. This work is written by US Government employees and is in the public domain in the US.
OBJECTIVE: Identifying drug discontinuation (DDC) events and understanding their reasons are important for medication management and drug safety surveillance. Structured data resources are often incomplete and lack reason information. In this article, we assessed the ability of natural language processing (NLP) systems to unlock DDC information from clinical narratives automatically. MATERIALS AND METHODS: We collected 1867 de-identified providers' notes from the University of Massachusetts Medical School hospital electronic health record system. Then 2 human experts chart reviewed those clinical notes to annotate DDC events and their reasons. Using the annotated data, we developed and evaluated NLP systems to automatically identify drug discontinuations and reasons at the sentence level using a novel semantic enrichment-based vector representation (SEVR) method for enhanced feature representation. RESULTS: Our SEVR-based NLP system achieved the best performance of 0.785 (AUC-ROC) for detecting discontinuation events and 0.745 (AUC-ROC) for identifying reasons when testing this highly imbalanced data, outperforming 2 state-of-the-art non-SEVR-based models. Compared with a rule-based baseline system for discontinuation detection, our system improved the sensitivity significantly (57.75% vs 18.31%, absolute value) while retaining a high specificity of 99.25%, leading to a significant improvement in AUC-ROC by 32.83% (absolute value). CONCLUSION: Experiments have shown that a high-performance NLP system can be developed to automatically identify DDCs and their reasons from providers' notes. The SEVR model effectively improved the system performance showing better generalization and robustness on unseen test data. Our work is an important step toward identifying reasons for drug discontinuation that will inform drug safety surveillance and pharmacovigilance. Published by Oxford University Press on behalf of the American Medical Informatics Association 2019. This work is written by US Government employees and is in the public domain in the US.
Entities:
Keywords:
drug surveillance; electronic health records; knowledge representation; natural language processing; supervised machine learning
Authors: Kristine M Gleason; Jennifer M Groszek; Carol Sullivan; Denise Rooney; Cynthia Barnard; Gary A Noskin Journal: Am J Health Syst Pharm Date: 2004-08-15 Impact factor: 2.637
Authors: Alexander Turchin; Holly I Wheeler; Matthew Labreche; Julia T Chu; Merri L Pendergrass; Jonathan S Einbinder; Jonathan Seth Einbinder Journal: AMIA Annu Symp Proc Date: 2008-11-06
Authors: Isa Naina Mohamed; Peter J Helms; Colin R Simpson; Robert M Milne; James S McLay Journal: Br J Clin Pharmacol Date: 2011-02 Impact factor: 4.335
Authors: Michael H Kramer; Eugene Breydo; Maria Shubina; Kelly Babcock; Jonathan S Einbinder; Alexander Turchin Journal: BMC Health Serv Res Date: 2010-05-27 Impact factor: 2.655
Authors: Alexander Turchin; Maria Shubina; Eugene Breydo; Merri L Pendergrass; Jonathan S Einbinder Journal: J Am Med Inform Assoc Date: 2009-03-04 Impact factor: 4.497
Authors: Paola Cicconi; A Cozzi-Lepri; A Castagna; E M Trecarichi; A Antinori; F Gatti; G Cassola; L Sighinolfi; P Castelli; A d'Arminio Monforte Journal: HIV Med Date: 2009-09-01 Impact factor: 3.180
Authors: Su Golder; Davy Weissenbacher; Karen O'Connor; Sean Hennessy; Robert Gross; Graciela Gonzalez Hernandez Journal: Drug Saf Date: 2022-08-07 Impact factor: 5.228