Literature DB >> 24488511

Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence.

David S Carrell, Scott Halgrim, Diem-Thy Tran, Diana S M Buist, Jessica Chubak, Wendy W Chapman, Guergana Savova.   

Abstract

The increasing availability of electronic health records (EHRs) creates opportunities for automated extraction of information from clinical text. We hypothesized that natural language processing (NLP) could substantially reduce the burden of manual abstraction in studies examining outcomes, like cancer recurrence, that are documented in unstructured clinical text, such as progress notes, radiology reports, and pathology reports. We developed an NLP-based system using open-source software to process electronic clinical notes from 1995 to 2012 for women with early-stage incident breast cancers to identify whether and when recurrences were diagnosed. We developed and evaluated the system using clinical notes from 1,472 patients receiving EHR-documented care in an integrated health care system in the Pacific Northwest. A separate study provided the patient-level reference standard for recurrence status and date. The NLP-based system correctly identified 92% of recurrences and estimated diagnosis dates within 30 days for 88% of these. Specificity was 96%. The NLP-based system overlooked 5 of 65 recurrences, 4 because electronic documents were unavailable. The NLP-based system identified 5 other recurrences incorrectly classified as nonrecurrent in the reference standard. If used in similar cohorts, NLP could reduce by 90% the number of EHR charts abstracted to identify confirmed breast cancer recurrence cases at a rate comparable to traditional abstraction.

Entities:  

Keywords:  breast cancer recurrence; chart abstraction; natural language processing

Mesh:

Year:  2014        PMID: 24488511      PMCID: PMC3939853          DOI: 10.1093/aje/kwt441

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


  33 in total

1.  Natural language processing and its future in medicine.

Authors:  C Friedman; G Hripcsak
Journal:  Acad Med       Date:  1999-08       Impact factor: 6.893

2.  caTIES: a grid based system for coding and retrieval of surgical pathology reports and tissue specimens in support of translational research.

Authors:  Rebecca S Crowley; Melissa Castine; Kevin Mitchell; Girish Chavan; Tara McSherry; Michael Feldman
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

Review 3.  Extracting information from textual documents in the electronic health record: a review of recent research.

Authors:  S M Meystre; G K Savova; K C Kipper-Schuler; J F Hurdle
Journal:  Yearb Med Inform       Date:  2008

4.  Enhanced identification of eligibility for depression research using an electronic medical record search engine.

Authors:  Lisa Seyfried; David A Hanauer; Donald Nease; Rashad Albeiruti; Janet Kavanagh; Helen C Kales
Journal:  Int J Med Inform       Date:  2009-06-27       Impact factor: 4.046

5.  Breast cancer treatment of older women in integrated health care settings.

Authors:  Shelley M Enger; Soe Soe Thwin; Diana S M Buist; Terry Field; Floyd Frost; Ann M Geiger; Timothy L Lash; Marianne Prout; Marianne Ulcickas Yood; Feifei Wei; Rebecca A Silliman
Journal:  J Clin Oncol       Date:  2006-09-20       Impact factor: 44.544

Review 6.  Review: use of electronic medical records for health outcomes research: a literature review.

Authors:  Bonnie B Dean; Jessica Lam; Jaime L Natoli; Qiana Butler; Daniel Aguilar; Robert J Nordyke
Journal:  Med Care Res Rev       Date:  2009-03-11       Impact factor: 3.929

7.  Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model.

Authors:  Anni Coden; Guergana Savova; Igor Sominsky; Michael Tanenblatt; James Masanz; Karin Schuler; James Cooper; Wei Guan; Piet C de Groen
Journal:  J Biomed Inform       Date:  2008-12-27       Impact factor: 6.317

Review 8.  Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.

Authors:  Lionel T E Cheng; Jiaping Zheng; Guergana K Savova; Bradley J Erickson
Journal:  J Digit Imaging       Date:  2009-05-30       Impact factor: 4.056

9.  Referral, receipt, and completion of chemotherapy in patients with early-stage breast cancer older than 65 years and at high risk of breast cancer recurrence.

Authors:  Diana S M Buist; Jessica Chubak; Marianne Prout; Marianne Ulcickas Yood; Jaclyn L F Bosco; Soe Soe Thwin; Heather Taffet Gold; Cynthia Owusu; Terry S Field; Virginia P Quinn; Feifei Wei; Rebecca A Silliman
Journal:  J Clin Oncol       Date:  2009-08-17       Impact factor: 44.544

10.  Breast cancer recurrence risk in relation to antidepressant use after diagnosis.

Authors:  Jessica Chubak; Diana S M Buist; Denise M Boudreau; Mary Anne Rossing; Thomas Lumley; Noel S Weiss
Journal:  Breast Cancer Res Treat       Date:  2007-12-06       Impact factor: 4.872

View more
  64 in total

Review 1.  Clinical Natural Language Processing in 2014: Foundational Methods Supporting Efficient Healthcare.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2015-08-13

Review 2.  Natural Language Processing Technologies in Radiology Research and Clinical Applications.

Authors:  Tianrun Cai; Andreas A Giannopoulos; Sheng Yu; Tatiana Kelil; Beth Ripley; Kanako K Kumamaru; Frank J Rybicki; Dimitrios Mitsouras
Journal:  Radiographics       Date:  2016 Jan-Feb       Impact factor: 5.333

3.  Diagnostic code agreement for electronic health records and claims data for tuberculosis.

Authors:  S A Iqbal; C J Isenhour; G Mazurek; B I Truman
Journal:  Int J Tuberc Lung Dis       Date:  2020-07-01       Impact factor: 2.373

4.  Using natural language processing to extract mammographic findings.

Authors:  Hongyuan Gao; Erin J Aiello Bowles; David Carrell; Diana S M Buist
Journal:  J Biomed Inform       Date:  2015-02-03       Impact factor: 6.317

5.  Contralateral Breast Cancer Event Detection Using Nature Language Processing.

Authors:  Zexian Zeng; Xiaoyu Li; Sasa Espino; Ankita Roy; Kristen Kitsch; Susan Clare; Seema Khan; Yuan Luo
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

6.  Use of emergency department electronic medical records for automated epidemiological surveillance of suicide attempts: a French pilot study.

Authors:  Marie-Hélène Metzger; Nastassia Tvardik; Quentin Gicquel; Côme Bouvry; Emmanuel Poulet; Véronique Potinet-Pagliaroli
Journal:  Int J Methods Psychiatr Res       Date:  2016-09-15       Impact factor: 4.035

7.  Building a self-measuring healthcare system with computable metrics, data fusion, and substitutable apps.

Authors:  Kenneth D Mandl; Joshua C Mandel
Journal:  BMJ Outcomes       Date:  2015-04

8.  Validity of Natural Language Processing for Ascertainment of EGFR and ALK Test Results in SEER Cases of Stage IV Non-Small-Cell Lung Cancer.

Authors:  Bernardo Haddock Lobo Goulart; Emily T Silgard; Christina S Baik; Aasthaa Bansal; Qin Sun; Eric B Durbin; Isaac Hands; Darshil Shah; Susanne M Arnold; Scott D Ramsey; Ramakanth Kavuluru; Stephen M Schwartz
Journal:  JCO Clin Cancer Inform       Date:  2019-05

9.  Effect of an Automated Tracking Registry on the Rate of Tracking Failure in Incidental Pulmonary Nodules.

Authors:  Jonathan Shelver; Chris H Wendt; Melissa McClure; Brian Bell; Angela E Fabbrini; Thomas Rector; Kathryn Rice
Journal:  J Am Coll Radiol       Date:  2017-04-21       Impact factor: 5.532

10.  A Machine Learning Approach to Identify NIH-Funded Applied Prevention Research.

Authors:  Jennifer Villani; Sheri D Schully; Payam Meyer; Ranell L Myles; Jocelyn A Lee; David M Murray; Ashley J Vargas
Journal:  Am J Prev Med       Date:  2018-10-25       Impact factor: 5.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.