EliIE: An open-source information extraction system for clinical trial eligibility criteria.

Tian Kang, Shaodian Zhang, Youlan Tang, Gregory W Hruby, Alexander Rusanov, Noémie Elhadad, Chunhua Weng.

Abstract

OBJECTIVE: To develop an open-source information extraction system called Eligibility Criteria Information Extraction (EliIE) for parsing and formalizing free-text clinical research eligibility criteria (EC) following Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) version 5.0.
MATERIALS AND METHODS: EliIE parses EC in 4 steps: (1) clinical entity and attribute recognition, (2) negation detection, (3) relation extraction, and (4) concept normalization and output structuring. Informaticians and domain experts were recruited to design an annotation guideline and generate a training corpus of annotated EC for 230 Alzheimer's clinical trials, which were represented as queries against the OMOP CDM and included 8008 entities, 3550 attributes, and 3529 relations. A sequence labeling-based method was developed for automatic entity and attribute recognition. Negation detection was supported by NegEx and a set of predefined rules. Relation extraction was achieved by a support vector machine classifier. We further performed terminology-based concept normalization and output structuring.
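
As a rough illustration of step (2), a NegEx-style check looks for a negation trigger in a short window of tokens before an entity mention. The sketch below is not EliIE's implementation: the trigger list, the window size, and the function name `is_negated` are all assumptions chosen for illustration, and the actual NegEx lexicon and EliIE's predefined rules are considerably richer.

```python
import re

# Hypothetical trigger list for illustration; the real NegEx lexicon is much larger.
NEGATION_TRIGGERS = ("no", "not", "without", "denies", "absence of", "free of")
WINDOW = 5  # assumed number of tokens inspected before the entity mention


def is_negated(sentence: str, entity: str, window: int = WINDOW) -> bool:
    """Return True if a trigger occurs within `window` tokens before `entity`."""
    text = sentence.lower()
    start = text.find(entity.lower())
    if start == -1:
        raise ValueError(f"entity {entity!r} not found in sentence")
    # Tokens that precede the entity mention, limited to the window.
    preceding = re.findall(r"[a-z]+", text[:start])[-window:]
    context = " ".join(preceding)
    # Match triggers on word boundaries so "no" does not fire inside "known".
    return any(
        re.search(rf"\b{re.escape(trigger)}\b", context)
        for trigger in NEGATION_TRIGGERS
    )


if __name__ == "__main__":
    print(is_negated("No history of stroke", "stroke"))
    print(is_negated("Diagnosis of Alzheimer's disease", "Alzheimer's disease"))
```

A fixed look-back window is the simplest design choice here; it ignores post-entity triggers and scope terminators, which a fuller rule set would need to handle.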
RESULTS: In task-specific evaluations, the best F1 score for entity recognition was 0.79, and for relation extraction was 0.89. The accuracy of negation detection was 0.94. The overall accuracy for query formalization was 0.71 in an end-to-end evaluation.
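
For reference, an F1 score is the harmonic mean of precision and recall. The sketch below computes exact-match precision, recall, and F1 over sets of labeled spans. The `(start, end, label)` span representation, the label names, and the example spans are hypothetical; they are not taken from the EliIE corpus or its evaluation scripts, and the abstract does not state which matching criterion was used.

```python
from typing import Set, Tuple

# Assumed representation: (start offset, end offset, label).
Span = Tuple[int, int, str]


def precision_recall_f1(gold: Set[Span], predicted: Set[Span]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 between gold and predicted spans."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up spans: 4 gold entities, 3 predictions, 2 exact matches.
    gold = {(0, 6, "Condition"), (10, 18, "Drug"), (25, 30, "Condition"), (40, 47, "Drug")}
    predicted = {(0, 6, "Condition"), (10, 18, "Drug"), (26, 30, "Condition")}
    p, r, f1 = precision_recall_f1(gold, predicted)
    print(f"precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```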
CONCLUSIONS: This study presents EliIE, an OMOP CDM-based information extraction system for automatic structuring and formalization of free-text EC. According to our evaluation, the machine learning-based EliIE outperforms existing systems and shows potential for further improvement.

Published by Oxford University Press on behalf of the American Medical Informatics Association 2017. This work is written by US Government employees and is in the public domain in the United States.

Keywords:  clinical trials; common data model; machine learning; named entity recognition; natural language processing; patient selection

Year:  2017        PMID: 28379377      PMCID: PMC6259668          DOI: 10.1093/jamia/ocx019

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  26 in total

1.  Computable Eligibility Criteria through Ontology-driven Data Access: A Case Study of Hepatitis C Virus Trials.

Authors:  Hansi Zhang; Zhe He; Xing He; Yi Guo; David R Nelson; François Modave; Yonghui Wu; William Hogan; Mattia Prosperi; Jiang Bian
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Deep Learning Approach to Parse Eligibility Criteria in Dietary Supplements Clinical Trials Following OMOP Common Data Model.

Authors:  Anusha Bompelli; Jianfu Li; Yiqi Xu; Nan Wang; Yanshan Wang; Terrence Adam; Zhe He; Rui Zhang
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

3.  Transformer-Based Named Entity Recognition for Parsing Clinical Trial Eligibility Criteria.

Authors:  Shubo Tian; Arslan Erdengasileng; Xi Yang; Yi Guo; Yonghui Wu; Jinfeng Zhang; Jiang Bian; Zhe He
Journal:  ACM BCB       Date:  2021-08

4.  Computable Phenotype Implementation for a National, Multicenter Pragmatic Clinical Trial: Lessons Learned From ADAPTABLE.

Authors:  Faraz S Ahmad; Iben M Ricket; Bradley G Hammill; Lisa Eskenazi; Holly R Robertson; Lesley H Curtis; Cecilia D Dobi; Saket Girotra; Kevin Haynes; Jorge R Kizer; Sunil Kripalani; Mathew T Roe; Christianne L Roumie; Russ Waitman; W Schuyler Jones; Mark G Weiner
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2020-05-29

5.  CAS: corpus of clinical cases in French.

Authors:  Natalia Grabar; Clément Dalloux; Vincent Claveau
Journal:  J Biomed Semantics       Date:  2020-08-06

6.  An OMOP CDM-Based Relational Database of Clinical Research Eligibility Criteria.

Authors:  Yuqi Si; Chunhua Weng
Journal:  Stud Health Technol Inform       Date:  2017

7.  The Data Gap in the EHR for Clinical Research Eligibility Screening.

Authors:  Alex Butler; Wei Wei; Chi Yuan; Tian Kang; Yuqi Si; Chunhua Weng
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18

8.  Interactive Visual Displays for Interpreting the Results of Clinical Trials: Formative Evaluation With Case Vignettes.

Authors:  Jiantao Bian; Charlene Weir; Prasad Unni; Damian Borbolla; Thomas Reese; Yik-Ki Jacob Wan; Guilherme Del Fiol
Journal:  J Med Internet Res       Date:  2018-06-25       Impact factor: 5.428

9.  Automated classification of clinical trial eligibility criteria text based on ensemble learning and metric learning.

Authors:  Kun Zeng; Yibin Xu; Ge Lin; Likeng Liang; Tianyong Hao
Journal:  BMC Med Inform Decis Mak       Date:  2021-07-30       Impact factor: 2.796

10.  Chia, a large annotated corpus of clinical trial eligibility criteria.

Authors:  Fabrício Kury; Alex Butler; Chi Yuan; Li-Heng Fu; Yingcheng Sun; Hao Liu; Ida Sim; Simona Carini; Chunhua Weng
Journal:  Sci Data       Date:  2020-08-27       Impact factor: 6.444
