Literature DB >> 34414397

Transformer-Based Named Entity Recognition for Parsing Clinical Trial Eligibility Criteria.

Shubo Tian1, Arslan Erdengasileng1, Xi Yang2, Yi Guo2, Yonghui Wu2, Jinfeng Zhang1, Jiang Bian2, Zhe He1.   

Abstract

The rapid adoption of electronic health records (EHRs) systems has made clinical data available in electronic format for research and for many downstream applications. Electronic screening of potentially eligible patients using these clinical databases for clinical trials is a critical need to improve trial recruitment efficiency. Nevertheless, manually translating free-text eligibility criteria into database queries is labor intensive and inefficient. To facilitate automated screening, free-text eligibility criteria must be structured and coded into a computable format using controlled vocabularies. Named entity recognition (NER) is thus an important first step. In this study, we evaluate 4 state-of-the-art transformer-based NER models on two publicly available annotated corpora of eligibility criteria released by Columbia University (i.e., the Chia data) and Facebook Research (i.e.the FRD data). Four transformer-based models (i.e., BERT, ALBERT, RoBERTa, and ELECTRA) pretrained with general English domain corpora vs. those pretrained with PubMed citations, clinical notes from the MIMIC-III dataset and eligibility criteria extracted from all the clinical trials on ClinicalTrials.gov were compared. Experimental results show that RoBERTa pretrained with MIMIC-III clinical notes and eligibility criteria yielded the highest strict and relaxed F-scores in both the Chia data (i.e., 0.658/0.798) and the FRD data (i.e., 0.785/0.916). With promising NER results, further investigations on building a reliable natural language processing (NLP)-assisted pipeline for automated electronic screening are needed.

Entities:  

Keywords:  Clinical Trial; Computing methodologies → Information extraction; Eligibility Criteria Parsing; Named Entity Recognition; Transformer-Based Model

Year:  2021        PMID: 34414397      PMCID: PMC8373041          DOI: 10.1145/3459930.3469560

Source DB:  PubMed          Journal:  ACM BCB


  13 in total

1.  A practical method for transforming free-text eligibility criteria into computable criteria.

Authors:  Samson W Tu; Mor Peleg; Simona Carini; Michael Bobak; Jessica Ross; Daniel Rubin; Ida Sim
Journal:  J Biomed Inform       Date:  2010-09-17       Impact factor: 6.317

2.  HITECH Act Drove Large Gains In Hospital Electronic Health Record Adoption.

Authors:  Julia Adler-Milstein; Ashish K Jha
Journal:  Health Aff (Millwood)       Date:  2017-08-01       Impact factor: 6.301

3.  Valx: A System for Extracting and Structuring Numeric Lab Test Comparison Statements from Text.

Authors:  Tianyong Hao; Hongfang Liu; Chunhua Weng
Journal:  Methods Inf Med       Date:  2016-03-04       Impact factor: 2.176

Review 4.  Strategies addressing barriers to clinical trial enrollment of underrepresented populations: a systematic review.

Authors:  Caren Heller; Joyce E Balls-Berry; Jill Dumbauld Nery; Patricia J Erwin; Dawn Littleton; Mimi Kim; Winston P Kuo
Journal:  Contemp Clin Trials       Date:  2014-08-15       Impact factor: 2.226

5.  Electronic screening improves efficiency in clinical trial recruitment.

Authors:  Samir R Thadani; Chunhua Weng; J Thomas Bigger; John F Ennever; David Wajngurt
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

Review 6.  Clinical concept extraction: A methodology review.

Authors:  Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-06       Impact factor: 6.317

7.  Clinical concept extraction using transformers.

Authors:  Xi Yang; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  J Am Med Inform Assoc       Date:  2020-12-09       Impact factor: 4.497

8.  EliIE: An open-source information extraction system for clinical trial eligibility criteria.

Authors:  Tian Kang; Shaodian Zhang; Youlan Tang; Gregory W Hruby; Alexander Rusanov; Noémie Elhadad; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2017-11-01       Impact factor: 4.497

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

10.  Chia, a large annotated corpus of clinical trial eligibility criteria.

Authors:  Fabrício Kury; Alex Butler; Chi Yuan; Li-Heng Fu; Yingcheng Sun; Hao Liu; Ida Sim; Simona Carini; Chunhua Weng
Journal:  Sci Data       Date:  2020-08-27       Impact factor: 6.444

View more
  1 in total

1.  Combining human and machine intelligence for clinical trial eligibility querying.

Authors:  Yilu Fang; Betina Idnay; Yingcheng Sun; Hao Liu; Zhehuan Chen; Karen Marder; Hua Xu; Rebecca Schnall; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2022-06-14       Impact factor: 7.942

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.