Literature DB >> 31197354

Hybrid bag of approaches to characterize selection criteria for cohort identification.

V G Vinod Vydiswaran1,2, Asher Strayhorn1, Xinyan Zhao2, Phil Robinson2, Mahesh Agarwal3, Erin Bagazinski2, Madia Essiet2, Bradley E Iott2, Hyeon Joo2, PingJui Ko2, Dahee Lee2, Jin Xiu Lu2, Jinghui Liu2, Adharsh Murali2, Koki Sasagawa2, Tianshi Wang2, Nalingna Yuan2.   

Abstract

OBJECTIVE: The 2018 National NLP Clinical Challenge (2018 n2c2) focused on the task of cohort selection for clinical trials, where participating systems were tasked with analyzing longitudinal patient records to determine if the patients met or did not meet any of the 13 selection criteria. This article describes our participation in this shared task.
MATERIALS AND METHODS: We followed a hybrid approach combining pattern-based, knowledge-intensive, and feature weighting techniques. After preprocessing the notes using publicly available natural language processing tools, we developed individual criterion-specific components that relied on collecting knowledge resources relevant for these criteria and pattern-based and weighting approaches to identify "met" and "not met" cases.
RESULTS: As part of the 2018 n2c2 challenge, 3 runs were submitted. The overall micro-averaged F1 on the training set was 0.9444. On the test set, the micro-averaged F1 for the 3 submitted runs were 0.9075, 0.9065, and 0.9056. The best run was placed second in the overall challenge and all 3 runs were statistically similar to the top-ranked system. A reimplemented system achieved the best overall F1 of 0.9111 on the test set. DISCUSSION: We highlight the need for a focused resource-intensive effort to address the class imbalance in the cohort selection identification task.
CONCLUSION: Our hybrid approach was able to identify all selection criteria with high F1 performance on both training and test sets. Based on our participation in the 2018 n2c2 task, we conclude that there is merit in continuing a focused criterion-specific analysis and developing appropriate knowledge resources to build a quality cohort selection system.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  clinical trial selection criteria; cohort identification; information storage and retrieval [L01.313.500.750.280]; information systems [L01.313.500.750.300]; natural language processing (L01.224.065.580)

Year:  2019        PMID: 31197354      PMCID: PMC7647216          DOI: 10.1093/jamia/ocz079

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  37 in total

1.  A bootstrapping algorithm to improve cohort identification using structured data.

Authors:  Sasikiran Kandula; Qing Zeng-Treitler; Lingji Chen; William L Salomon; Bruce E Bray
Journal:  J Biomed Inform       Date:  2011-11-07       Impact factor: 6.317

2.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

3.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

4.  Classifying Clinical Trial Eligibility Criteria to Facilitate Phased Cohort Identification Using Clinical Data Repositories.

Authors:  Amy Y Wang; William J Lancaster; Matthew C Wyatt; Luke V Rasmussen; Daniel G Fort; James J Cimino
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

5.  Learning to predict post-hospitalization VTE risk from EHR data.

Authors:  Emily Kawaler; Alexander Cobian; Peggy Peissig; Deanna Cross; Steve Yale; Mark Craven
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

6.  A method and knowledge base for automated inference of patient problems from structured data in an electronic medical record.

Authors:  Adam Wright; Justine Pang; Joshua C Feblowitz; Francine L Maloney; Allison R Wilcox; Harley Z Ramelson; Louise I Schneider; David W Bates
Journal:  J Am Med Inform Assoc       Date:  2011-05-25       Impact factor: 4.497

7.  Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases.

Authors:  Hua Xu; Zhenming Fu; Anushi Shah; Yukun Chen; Neeraja B Peterson; Qingxia Chen; Subramani Mani; Mia A Levy; Qi Dai; Josh C Denny
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

8.  Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence.

Authors:  David S Carrell; Scott Halgrim; Diem-Thy Tran; Diana S M Buist; Jessica Chubak; Wendy W Chapman; Guergana Savova
Journal:  Am J Epidemiol       Date:  2014-01-30       Impact factor: 4.897

9.  Association between glycated hemoglobin A1c levels with age and gender in Chinese adults with no prior diagnosis of diabetes mellitus.

Authors:  Qinglin Ma; Houming Liu; Guangxin Xiang; Wanshui Shan; Wanli Xing
Journal:  Biomed Rep       Date:  2016-03-29

10.  EpiDEA: extracting structured epilepsy and seizure information from patient discharge summaries for cohort identification.

Authors:  Licong Cui; Alireza Bozorgi; Samden D Lhatoo; Guo-Qiang Zhang; Satya S Sahoo
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03
View more
  7 in total

1.  Cohort selection for clinical trials: n2c2 2018 shared task track 1.

Authors:  Amber Stubbs; Michele Filannino; Ergin Soysal; Samuel Henry; Özlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  A systematic review on natural language processing systems for eligibility prescreening in clinical research.

Authors:  Betina Idnay; Caitlin Dreisbach; Chunhua Weng; Rebecca Schnall
Journal:  J Am Med Inform Assoc       Date:  2021-12-28       Impact factor: 4.497

3.  Artificial Intelligence-Driven Structurization of Diagnostic Information in Free-Text Pathology Reports.

Authors:  Pericles S Giannaris; Zainab Al-Taie; Mikhail Kovalenko; Nattapon Thanintorn; Olha Kholod; Yulia Innokenteva; Emily Coberly; Shellaine Frazier; Katsiarina Laziuk; Mihail Popescu; Chi-Ren Shyu; Dong Xu; Richard D Hammer; Dmitriy Shin
Journal:  J Pathol Inform       Date:  2020-02-11

4.  Evaluation of Doc'EDS: a French semantic search tool to query health documents from a clinical data warehouse.

Authors:  Thibaut Pressat-Laffouilhère; Pierre Balayé; Badisse Dahamna; Romain Lelong; Kévin Billey; Stéfan J Darmoni; Julien Grosjean
Journal:  BMC Med Inform Decis Mak       Date:  2022-02-08       Impact factor: 3.298

5.  Combining human and machine intelligence for clinical trial eligibility querying.

Authors:  Yilu Fang; Betina Idnay; Yingcheng Sun; Hao Liu; Zhehuan Chen; Karen Marder; Hua Xu; Rebecca Schnall; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2022-06-14       Impact factor: 7.942

6.  Identifying Caregiver Availability Using Medical Notes With Rule-Based Natural Language Processing: Retrospective Cohort Study.

Authors:  Elham Mahmoudi; Wenbo Wu; Cyrus Najarian; James Aikens; Julie Bynum; V G Vinod Vydiswaran
Journal:  JMIR Aging       Date:  2022-09-22

7.  Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach.

Authors:  Irena Spasic; Dominik Krzeminski; Padraig Corcoran; Alexander Balinsky
Journal:  JMIR Med Inform       Date:  2019-10-31
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.