PURPOSE: Cancer pathology findings are critical for many aspects of care but are often locked away as unstructured free text. Our objective was to develop a natural language processing (NLP) system to extract prostate pathology details from postoperative pathology reports and a parallel structured data entry process for use by urologists during routine documentation care and compare accuracy when compared with manual abstraction and concordance between NLP and clinician-entered approaches. MATERIALS AND METHODS: From February 2016, clinicians used note templates with custom structured data elements (SDEs) during routine clinical care for men with prostate cancer. We also developed an NLP algorithm to parse radical prostatectomy pathology reports and extract structured data. We compared accuracy of clinician-entered SDEs and NLP-parsed data to manual abstraction as a gold standard and compared concordance (Cohen's κ) between approaches assuming no gold standard. RESULTS: There were 523 patients with NLP-extracted data, 319 with SDE data, and 555 with manually abstracted data. For Gleason scores, NLP and clinician SDE accuracy was 95.6% and 95.8%, respectively, compared with manual abstraction, with concordance of 0.93 (95% CI, 0.89 to 0.98). For margin status, extracapsular extension, and seminal vesicle invasion, stage, and lymph node status, NLP accuracy was 94.8% to 100%, SDE accuracy was 87.7% to 100%, and concordance between NLP and SDE ranged from 0.92 to 1.0. CONCLUSION: We show that a real-world deployment of an NLP algorithm to extract pathology data and structured data entry by clinicians during routine clinical care in a busy clinical practice can generate accurate data when compared with manual abstraction for some, but not all, components of a prostate pathology report.
PURPOSE:Cancer pathology findings are critical for many aspects of care but are often locked away as unstructured free text. Our objective was to develop a natural language processing (NLP) system to extract prostate pathology details from postoperative pathology reports and a parallel structured data entry process for use by urologists during routine documentation care and compare accuracy when compared with manual abstraction and concordance between NLP and clinician-entered approaches. MATERIALS AND METHODS: From February 2016, clinicians used note templates with custom structured data elements (SDEs) during routine clinical care for men with prostate cancer. We also developed an NLP algorithm to parse radical prostatectomy pathology reports and extract structured data. We compared accuracy of clinician-entered SDEs and NLP-parsed data to manual abstraction as a gold standard and compared concordance (Cohen's κ) between approaches assuming no gold standard. RESULTS: There were 523 patients with NLP-extracted data, 319 with SDE data, and 555 with manually abstracted data. For Gleason scores, NLP and clinician SDE accuracy was 95.6% and 95.8%, respectively, compared with manual abstraction, with concordance of 0.93 (95% CI, 0.89 to 0.98). For margin status, extracapsular extension, and seminal vesicle invasion, stage, and lymph node status, NLP accuracy was 94.8% to 100%, SDE accuracy was 87.7% to 100%, and concordance between NLP and SDE ranged from 0.92 to 1.0. CONCLUSION: We show that a real-world deployment of an NLP algorithm to extract pathology data and structured data entry by clinicians during routine clinical care in a busy clinical practice can generate accurate data when compared with manual abstraction for some, but not all, components of a prostate pathology report.
Authors: Anil A Thomas; Chengyi Zheng; Howard Jung; Allen Chang; Brian Kim; Joy Gelfond; Jeff Slezak; Kim Porter; Steven J Jacobsen; Gary W Chien Journal: World J Urol Date: 2013-02-17 Impact factor: 4.226
Authors: Angus Roberts; Robert Gaizauskas; Mark Hepple; Neil Davis; George Demetriou; Yikun Guo; Jay Kola; Ian Roberts; Andrea Setzer; Archana Tapuria; Bill Wheeldin Journal: AMIA Annu Symp Proc Date: 2007-10-11
Authors: John Srigley; Sara Lankshear; James Brierley; Thomas McGowan; Dimitrios Divaris; Marta Yurcan; Robin Rossi; Tim Yardley; Mary Jane King; Jillian Ross; Jonathan Irish; Robin McLeod; Carol Sawka Journal: J Oncol Pract Date: 2013-07-02 Impact factor: 3.840
Authors: Brett R South; Shuying Shen; Makoto Jones; Jennifer Garvin; Matthew H Samore; Wendy W Chapman; Adi V Gundlapalli Journal: BMC Bioinformatics Date: 2009-09-17 Impact factor: 3.169
Authors: Emily Nash Smyth; Julie Beyrer; Kimberly R Saverno; Elizabeth Hadden; Hamed Abedtash; Angelo DeLuca; Garreth W Lawrence; Sarah Rybowski Journal: Drugs Real World Outcomes Date: 2022-09-12
Authors: Selen Bozkurt; Christopher J Magnani; Martin G Seneviratne; James D Brooks; Tina Hernandez-Boussard Journal: Front Digit Health Date: 2022-06-02
Authors: Youqing Mu; Hamid R Tizhoosh; Rohollah Moosavi Tayebi; Catherine Ross; Monalisa Sur; Brian Leber; Clinton J V Campbell Journal: Commun Med (Lond) Date: 2021-07-05
Authors: Anobel Y Odisho; Briton Park; Nicholas Altieri; John DeNero; Matthew R Cooperberg; Peter R Carroll; Bin Yu Journal: JAMIA Open Date: 2020-10-14