Cohort selection for clinical trials: n2c2 2018 shared task track 1.

Amber Stubbs, Michele Filannino, Ergin Soysal, Samuel Henry, Özlem Uzuner.

Abstract

OBJECTIVE: Track 1 of the 2018 National NLP Clinical Challenges shared tasks focused on identifying which patients in a corpus of longitudinal medical records meet and do not meet identified selection criteria.
MATERIALS AND METHODS: To address this challenge, we annotated American English clinical narratives for 288 patients according to whether they met these criteria. We chose criteria from existing clinical trials that represented a variety of natural language processing tasks, including concept extraction, temporal reasoning, and inference.
RESULTS: A total of 47 teams participated in this shared task, with 224 participants in total. The participants represented 18 countries, and the teams submitted 109 total system outputs. The best-performing system achieved a micro F1 score of 0.91 using a rule-based approach. The top 10 teams used rule-based and hybrid systems to approach the problems.
DISCUSSION: Clinical narratives are open to interpretation, particularly in cases where the selection criterion may be underspecified. This leaves room for annotators to use domain knowledge and intuition in selecting patients, which may lead to error in system outputs. However, teams who consulted medical professionals while building their systems were more likely to have high recall for patients, which is preferable for patient selection systems.
CONCLUSIONS: There is not yet a one-size-fits-all solution for natural language processing systems approaching this task. Future research in this area could examine criteria that require even more complex inference, temporal reasoning, and domain knowledge.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
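
The micro F1 score reported in the results pools true positives, false positives, and false negatives across all criteria before computing precision and recall. The following is a minimal sketch of that calculation, not the shared task's official evaluation script. It assumes each (patient, criterion) pair carries a "met" or "not met" label; the function name `micro_f1`, the criterion names, and the toy data are hypothetical and used only for illustration.

```python
from collections import Counter


def micro_f1(gold, pred, positive="met"):
    """Micro-averaged F1 for one label, pooling counts over all criteria.

    gold, pred: dicts mapping (patient_id, criterion) -> label.
    A key missing from pred counts as a miss for that gold label.
    """
    counts = Counter()
    for key, gold_label in gold.items():
        pred_label = pred.get(key)
        if pred_label == positive and gold_label == positive:
            counts["tp"] += 1
        elif pred_label == positive:
            counts["fp"] += 1
        elif gold_label == positive:
            counts["fn"] += 1

    tp, fp, fn = counts["tp"], counts["fp"], counts["fn"]
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: two patients, two made-up criteria.
gold = {
    ("p1", "CRIT-A"): "met",
    ("p1", "CRIT-B"): "not met",
    ("p2", "CRIT-A"): "met",
    ("p2", "CRIT-B"): "met",
}
pred = {
    ("p1", "CRIT-A"): "met",
    ("p1", "CRIT-B"): "met",
    ("p2", "CRIT-A"): "not met",
    ("p2", "CRIT-B"): "met",
}

score = micro_f1(gold, pred)  # 2 TP, 1 FP, 1 FN for the "met" label
print(f"micro F1 (met): {score:.3f}")
```

Because the counts are pooled before dividing, criteria with many labeled patients weigh more heavily than rare ones, which is what distinguishes micro averaging from a macro average of per-criterion scores.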

Keywords: clinical narratives; cohort selection; information extraction; machine learning; natural language processing

Year: 2019; PMID: 31562516; PMCID: PMC6798568; DOI: 10.1093/jamia/ocz163

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497

