Literature DB >> 34457154

A Comparison between Human and NLP-based Annotation of Clinical Trial Eligibility Criteria Text Using The OMOP Common Data Model.

Xinhang Li1,2, Hao Liu1,2, Fabrício Kury1, Chi Yuan1, Alex Butler1, Yingcheng Sun1, Anna Ostropolets1, Hua Xu3, Chunhua Weng1.   

Abstract

Human annotations are the established gold standard for evaluating natural language processing (NLP) methods. The goals of this study are to quantify and qualify the disagreement between human and NLP. We developed an NLP system for annotating clinical trial eligibility criteria text and constructed a manually annotated corpus, both following the OMOP Common Data Model (CDM). We analyzed the discrepancies between the human and NLP annotations and their causes (e.g., ambiguities in concept categorization and tacit decisions on inclusion of qualifiers and temporal attributes during concept annotation). This study initially reported complexities in clinical trial eligibility criteria text that complicate NLP and the limitations of the OMOP CDM. The disagreement between and human and NLP annotations may be generalizable. We discuss implications for NLP evaluation. ©2021 AMIA - All rights reserved.

Entities:  

Mesh:

Year:  2021        PMID: 34457154      PMCID: PMC8378608     

Source DB:  PubMed          Journal:  AMIA Jt Summits Transl Sci Proc


  11 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Semantic classification of biomedical concepts using distributional similarity.

Authors:  Jung-Wei Fan; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2007-04-25       Impact factor: 4.497

3.  SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  IEEE J Biomed Health Inform       Date:  2015-04-13       Impact factor: 5.772

4.  A graph-based method for reconstructing entities from coordination ellipsis in medical text.

Authors:  Chi Yuan; Yongli Wang; Ning Shang; Ziran Li; Ruxin Zhao; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

5.  A study of active learning methods for named entity recognition in clinical text.

Authors:  Yukun Chen; Thomas A Lasko; Qiaozhu Mei; Joshua C Denny; Hua Xu
Journal:  J Biomed Inform       Date:  2015-09-15       Impact factor: 6.317

6.  Facilitating cancer research using natural language processing of pathology reports.

Authors:  Hua Xu; Kristin Anderson; Victor R Grann; Carol Friedman
Journal:  Stud Health Technol Inform       Date:  2004

7.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Authors:  George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan
Journal:  Stud Health Technol Inform       Date:  2015

8.  Various criteria in the evaluation of biomedical named entity recognition.

Authors:  Richard Tzong-Han Tsai; Shih-Hung Wu; Wen-Chi Chou; Yu-Chun Lin; Ding He; Jieh Hsiang; Ting-Yi Sung; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2006-02-24       Impact factor: 3.169

9.  Criteria2Query: a natural language interface to clinical databases for cohort definition.

Authors:  Chi Yuan; Patrick B Ryan; Casey Ta; Yixuan Guo; Ziran Li; Jill Hardin; Rupa Makadia; Peng Jin; Ning Shang; Tian Kang; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2019-04-01       Impact factor: 4.497

10.  Chia, a large annotated corpus of clinical trial eligibility criteria.

Authors:  Fabrício Kury; Alex Butler; Chi Yuan; Li-Heng Fu; Yingcheng Sun; Hao Liu; Ida Sim; Simona Carini; Chunhua Weng
Journal:  Sci Data       Date:  2020-08-27       Impact factor: 6.444

View more
  2 in total

1.  A Framework for Systematic Assessment of Clinical Trial Population Representativeness Using Electronic Health Records Data.

Authors:  Yingcheng Sun; Alex Butler; Ibrahim Diallo; Jae Hyun Kim; Casey Ta; James R Rogers; Hao Liu; Chunhua Weng
Journal:  Appl Clin Inform       Date:  2021-09-08       Impact factor: 2.762

2.  Combining human and machine intelligence for clinical trial eligibility querying.

Authors:  Yilu Fang; Betina Idnay; Yingcheng Sun; Hao Liu; Zhehuan Chen; Karen Marder; Hua Xu; Rebecca Schnall; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2022-06-14       Impact factor: 7.942

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.