Literature DB >> 26306621

ReCAP: Feasibility and Accuracy of Extracting Cancer Stage Information From Narrative Electronic Health Record Data.

Jeremy L Warner1, Mia A Levy2, Michael N Neuss2, Jeremy L Warner1, Mia A Levy2, Michael N Neuss2.   

Abstract

PURPOSE: Cancer stage, one of the most important prognostic factors for cancer-specific survival, is often documented in narrative form in electronic health records (EHRs). Such documentation results in tedious and time-consuming abstraction efforts by tumor registrars and other secondary users. This information may be amenable to extraction by automated methods.
METHODS: We developed a natural language processing algorithm to extract stage statements from machine-readable EHR documents, including automated rules to choose the most likely stage when discordance was present in the EHR. These methods were developed in a training set of patients with lung cancer, independently validated in a test set of patients with lung cancer, and compared with the gold standard of Vanderbilt Cancer Registry–determined stage (when available).
RESULTS: In the combined data set of 2,323 patients (training set, n = 1,103; validation set, n = 1,220), 751,880 documents were analyzed. A stage statement was extracted from 2,239 (98.6%) patient EHRs (median, 24 documents per patient). Stage discordance was common, affecting 83.6% of these EHRs. Nevertheless, algorithmically derived stage accuracy was high in the validation set (κ = 0.906; 95% CI, 0.873 to 0.939), when including notes generated within 14 weeks from diagnosis.
CONCLUSION: Accurate stage determination can be achieved through automated methods applied to narrative text, despite the frequent presence of discordance in such data. Our results also indicate that stage can be automatically captured in a shorter timeframe than the 6-month window used by cancer registries, as early as 5 weeks from diagnosis. These methods may be generalizable to large narrative cancer data sets.
Copyright © 2015 by American Society of Clinical Oncology.

Entities:  

Mesh:

Year:  2015        PMID: 26306621     DOI: 10.1200/JOP.2015.004622

Source DB:  PubMed          Journal:  J Oncol Pract        ISSN: 1554-7477            Impact factor:   3.840


  21 in total

1.  Electronic Health Record (EHR) Abstraction.

Authors:  Amal A Alzu'bi; Valerie J M Watzlaf; Patty Sheridan
Journal:  Perspect Health Inf Manag       Date:  2021-03-15

Review 2.  Making Sense of Big Textual Data for Health Care: Findings from the Section on Clinical Natural Language Processing.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2017-09-11

3.  Identifying Cases of Metastatic Prostate Cancer Using Machine Learning on Electronic Health Records.

Authors:  Martin G Seneviratne; Juan M Banda; James D Brooks; Nigam H Shah; Tina M Hernandez-Boussard
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

4.  A clinical text classification paradigm using weak supervision and deep representation.

Authors:  Yanshan Wang; Sunghwan Sohn; Sijia Liu; Feichen Shen; Liwei Wang; Elizabeth J Atkinson; Shreyasee Amin; Hongfang Liu
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-07       Impact factor: 2.796

5.  Machine Learning Approaches for Extracting Stage from Pathology Reports in Prostate Cancer.

Authors:  Raphael Lenain; Martin G Seneviratne; Selen Bozkurt; Douglas W Blayney; James D Brooks; Tina Hernandez-Boussard
Journal:  Stud Health Technol Inform       Date:  2019-08-21

6.  Improving precision in concept normalization.

Authors:  Mayla Boguslav; K Bretonnel Cohen; William A Baumgartner; Lawrence E Hunter
Journal:  Pac Symp Biocomput       Date:  2018

7.  Automating the Determination of Prostate Cancer Risk Strata From Electronic Medical Records.

Authors:  Justin R Gregg; Maximilian Lang; Lucy L Wang; Matthew J Resnick; Sandeep K Jain; Jeremy L Warner; Daniel A Barocas
Journal:  JCO Clin Cancer Inform       Date:  2017-06-08

Review 8.  Clinical information extraction applications: A literature review.

Authors:  Yanshan Wang; Liwei Wang; Majid Rastegar-Mojarad; Sungrim Moon; Feichen Shen; Naveed Afzal; Sijia Liu; Yuqun Zeng; Saeed Mehrabi; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2017-11-21       Impact factor: 6.317

9.  Automated Extraction of Tumor Staging and Diagnosis Information From Surgical Pathology Reports.

Authors:  Sajjad Abedian; Evan T Sholle; Prakash M Adekkanattu; Marika M Cusick; Stephanie E Weiner; Jonathan E Shoag; Jim C Hu; Thomas R Campion
Journal:  JCO Clin Cancer Inform       Date:  2021-10

10.  CUSTOM-SEQ: a prototype for oncology rapid learning in a comprehensive EHR environment.

Authors:  Jeremy L Warner; Lucy Wang; William Pao; Jeffrey A Sosman; Ravi V Atreya; Pam Carney; Mia A Levy
Journal:  J Am Med Inform Assoc       Date:  2016-03-23       Impact factor: 7.942

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.