Cosmin Adrian Bejan1, Wei-Qi Wei1, Joshua C Denny2. 1. Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA. 2. Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA Department of Medicine, Vanderbilt University, Nashville, Tennessee, USA.
Abstract
OBJECTIVE: To evaluate the contribution of the MEDication Indication (MEDI) resource and SemRep for identifying treatment relations in clinical text. MATERIALS AND METHODS: We first processed clinical documents with SemRep to extract the Unified Medical Language System (UMLS) concepts and the treatment relations between them. Then, we incorporated MEDI into a simple algorithm that identifies treatment relations between two concepts if they match a medication-indication pair in this resource. For a better coverage, we expanded MEDI using ontology relationships from RxNorm and UMLS Metathesaurus. We also developed two ensemble methods, which combined the predictions of SemRep and the MEDI algorithm. We evaluated our selected methods on two datasets, a Vanderbilt corpus of 6864 discharge summaries and the 2010 Informatics for Integrating Biology and the Bedside (i2b2)/Veteran's Affairs (VA) challenge dataset. RESULTS: The Vanderbilt dataset included 958 manually annotated treatment relations. A double annotation was performed on 25% of relations with high agreement (Cohen's κ = 0.86). The evaluation consisted of comparing the manual annotated relations with the relations identified by SemRep, the MEDI algorithm, and the two ensemble methods. On the first dataset, the best F1-measure results achieved by the MEDI algorithm and the union of the two resources (78.7 and 80, respectively) were significantly higher than the SemRep results (72.3). On the second dataset, the MEDI algorithm achieved better precision and significantly lower recall values than the best system in the i2b2 challenge. The two systems obtained comparable F1-measure values on the subset of i2b2 relations with both arguments in MEDI. CONCLUSIONS: Both SemRep and MEDI can be used to extract treatment relations from clinical text. Knowledge-based extraction with MEDI outperformed use of SemRep alone, but superior performance was achieved by integrating both systems. The integration of knowledge-based resources such as MEDI into information extraction systems such as SemRep and the i2b2 relation extractors may improve treatment relation extraction from clinical text.
OBJECTIVE: To evaluate the contribution of the MEDication Indication (MEDI) resource and SemRep for identifying treatment relations in clinical text. MATERIALS AND METHODS: We first processed clinical documents with SemRep to extract the Unified Medical Language System (UMLS) concepts and the treatment relations between them. Then, we incorporated MEDI into a simple algorithm that identifies treatment relations between two concepts if they match a medication-indication pair in this resource. For a better coverage, we expanded MEDI using ontology relationships from RxNorm and UMLS Metathesaurus. We also developed two ensemble methods, which combined the predictions of SemRep and the MEDI algorithm. We evaluated our selected methods on two datasets, a Vanderbilt corpus of 6864 discharge summaries and the 2010 Informatics for Integrating Biology and the Bedside (i2b2)/Veteran's Affairs (VA) challenge dataset. RESULTS: The Vanderbilt dataset included 958 manually annotated treatment relations. A double annotation was performed on 25% of relations with high agreement (Cohen's κ = 0.86). The evaluation consisted of comparing the manual annotated relations with the relations identified by SemRep, the MEDI algorithm, and the two ensemble methods. On the first dataset, the best F1-measure results achieved by the MEDI algorithm and the union of the two resources (78.7 and 80, respectively) were significantly higher than the SemRep results (72.3). On the second dataset, the MEDI algorithm achieved better precision and significantly lower recall values than the best system in the i2b2 challenge. The two systems obtained comparable F1-measure values on the subset of i2b2 relations with both arguments in MEDI. CONCLUSIONS: Both SemRep and MEDI can be used to extract treatment relations from clinical text. Knowledge-based extraction with MEDI outperformed use of SemRep alone, but superior performance was achieved by integrating both systems. The integration of knowledge-based resources such as MEDI into information extraction systems such as SemRep and the i2b2 relation extractors may improve treatment relation extraction from clinical text.
Authors: Bartłomiej Wilkowski; Marcelo Fiszman; Christopher M Miller; Dimitar Hristovski; Sivaram Arabandi; Graciela Rosemblat; Thomas C Rindflesch Journal: AMIA Annu Symp Proc Date: 2011-10-22
Authors: Anne-Lyse Minard; Anne-Laure Ligozat; Asma Ben Abacha; Delphine Bernhard; Bruno Cartoni; Louise Deléger; Brigitte Grau; Sophie Rosset; Pierre Zweigenbaum; Cyril Grouin Journal: J Am Med Inform Assoc Date: 2011-05-19 Impact factor: 4.497
Authors: Ying Liu; Robert Bill; Marcelo Fiszman; Thomas Rindflesch; Ted Pedersen; Genevieve B Melton; Serguei V Pakhomov Journal: AMIA Annu Symp Proc Date: 2012-11-03
Authors: Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2013-03-26 Impact factor: 4.497
Authors: Berry de Bruijn; Colin Cherry; Svetlana Kiritchenko; Joel Martin; Xiaodan Zhu Journal: J Am Med Inform Assoc Date: 2011-05-12 Impact factor: 4.497
Authors: Chen Lin; Elizabeth W Karlson; Helena Canhao; Timothy A Miller; Dmitriy Dligach; Pei Jun Chen; Raul Natanael Guzman Perez; Yuanyan Shen; Michael E Weinblatt; Nancy A Shadick; Robert M Plenge; Guergana K Savova Journal: PLoS One Date: 2013-08-16 Impact factor: 3.240
Authors: Neil S Zheng; V Eric Kerchberger; Victor A Borza; H Nur Eken; Joshua C Smith; Wei-Qi Wei Journal: Sci Rep Date: 2021-09-23 Impact factor: 4.996
Authors: Pedro L Teixeira; Wei-Qi Wei; Robert M Cronin; Huan Mo; Jacob P VanHouten; Robert J Carroll; Eric LaRose; Lisa A Bastarache; S Trent Rosenbloom; Todd L Edwards; Dan M Roden; Thomas A Lasko; Richard A Dart; Anne M Nikolai; Peggy L Peissig; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2016-08-07 Impact factor: 4.497
Authors: Elizabeth McNeer; Cole Beck; Hannah L Weeks; Michael L Williams; Nathan T James; Cosmin A Bejan; Leena Choi Journal: J Am Med Inform Assoc Date: 2021-03-18 Impact factor: 7.942
Authors: Cosmin A Bejan; Katherine N Cahill; Patrick J Staso; Leena Choi; Josh F Peterson; Elizabeth J Phillips Journal: Clin Pharmacol Ther Date: 2021-08-10 Impact factor: 6.903
Authors: Neil S Zheng; QiPing Feng; V Eric Kerchberger; Juan Zhao; Todd L Edwards; Nancy J Cox; C Michael Stein; Dan M Roden; Joshua C Denny; Wei-Qi Wei Journal: J Am Med Inform Assoc Date: 2020-11-01 Impact factor: 7.942