Literature DB >> 31002562

Natural Language Processing for Automated Quantification of Brain Metastases Reported in Free-Text Radiology Reports.

Joeky T Senders1, Aditya V Karhade1, David J Cote1, Alireza Mehrtash1, Nayan Lamba1, Aislyn DiRisio1, Ivo S Muskens1, William B Gormley1, Timothy R Smith1, Marike L D Broekman2, Omar Arnaout1.   

Abstract

PURPOSE: Although the bulk of patient-generated health data are increasing exponentially, their use is impeded because most data come in unstructured format, namely as free-text clinical reports. A variety of natural language processing (NLP) methods have emerged to automate the processing of free text ranging from statistical to deep learning-based models; however, the optimal approach for medical text analysis remains to be determined. The aim of this study was to provide a head-to-head comparison of novel NLP techniques and inform future studies about their utility for automated medical text analysis. PATIENTS AND METHODS: Magnetic resonance imaging reports of patients with brain metastases treated in two tertiary centers were retrieved and manually annotated using a binary classification (single metastasis v two or more metastases). Multiple bag-of-words and sequence-based NLP models were developed and compared after randomly splitting the annotated reports into training and test sets in an 80:20 ratio.
RESULTS: A total of 1,479 radiology reports of patients diagnosed with brain metastases were retrieved. The least absolute shrinkage and selection operator (LASSO) regression model demonstrated the best overall performance on the hold-out test set with an area under the receiver operating characteristic curve of 0.92 (95% CI, 0.89 to 0.94), accuracy of 83% (95% CI, 80% to 87%), calibration intercept of -0.06 (95% CI, -0.14 to 0.01), and calibration slope of 1.06 (95% CI, 0.95 to 1.17).
CONCLUSION: Among various NLP techniques, the bag-of-words approach combined with a LASSO regression model demonstrated the best overall performance in extracting binary outcomes from free-text clinical reports. This study provides a framework for the development of machine learning-based NLP models as well as a clinical vignette of patients diagnosed with brain metastases.

Entities:  

Mesh:

Year:  2019        PMID: 31002562      PMCID: PMC6873936          DOI: 10.1200/CCI.18.00138

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  30 in total

1.  Information extraction for tracking liver cancer patients' statuses: from mixture of clinical narrative report types.

Authors:  Xiao-Ou Ping; Yi-Ju Tseng; Yufang Chung; Ya-Lin Wu; Ching-Wei Hsu; Pei-Ming Yang; Guan-Tarn Huang; Feipei Lai; Ja-Der Liang
Journal:  Telemed J E Health       Date:  2013-07-20       Impact factor: 3.536

2.  Automated extraction of BI-RADS final assessment categories from radiology reports with natural language processing.

Authors:  Dorothy A Sippo; Graham I Warden; Katherine P Andriole; Ronilda Lacson; Ichiro Ikuta; Robyn L Birdwell; Ramin Khorasani
Journal:  J Digit Imaging       Date:  2013-10       Impact factor: 4.056

3.  Intelligent Word Embeddings of Free-Text Radiology Reports.

Authors:  Imon Banerjee; Sriraman Madhavan; Roger Eric Goldman; Daniel L Rubin
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

4.  Automated Extraction of Grade, Stage, and Quality Information From Transurethral Resection of Bladder Tumor Pathology Reports Using Natural Language Processing.

Authors:  Alexander P Glaser; Brian J Jordan; Jason Cohen; Anuj Desai; Philip Silberman; Joshua J Meeks
Journal:  JCO Clin Cancer Inform       Date:  2018-12

Review 5.  Machine Learning in Medicine.

Authors:  Rahul C Deo
Journal:  Circulation       Date:  2015-11-17       Impact factor: 29.690

6.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.

Authors:  Ziad Obermeyer; Ezekiel J Emanuel
Journal:  N Engl J Med       Date:  2016-09-29       Impact factor: 91.245

7.  Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing.

Authors:  Yvonne Sada; Jason Hou; Peter Richardson; Hashem El-Serag; Jessica Davila
Journal:  Med Care       Date:  2016-02       Impact factor: 2.983

8.  Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.

Authors:  Vijay Garla; Caroline Taylor; Cynthia Brandt
Journal:  J Biomed Inform       Date:  2013-07-08       Impact factor: 6.317

9.  The language of medicine.

Authors:  Henrik R Wulff
Journal:  J R Soc Med       Date:  2004-04       Impact factor: 18.000

10.  n-Gram-Based Text Compression.

Authors:  Vu H Nguyen; Hien T Nguyen; Hieu N Duong; Vaclav Snasel
Journal:  Comput Intell Neurosci       Date:  2016-11-14
View more
  8 in total

1.  Foundations of Machine Learning-Based Clinical Prediction Modeling: Part V-A Practical Approach to Regression Problems.

Authors:  Victor E Staartjes; Julius M Kernbach
Journal:  Acta Neurochir Suppl       Date:  2022

2.  Machine Intelligence in Clinical Neuroscience: Taming the Unchained Prometheus.

Authors:  Victor E Staartjes; Luca Regli; Carlo Serra
Journal:  Acta Neurochir Suppl       Date:  2022

3.  Foundations of Machine Learning-Based Clinical Prediction Modeling: Part IV-A Practical Approach to Binary Classification Problems.

Authors:  Victor E Staartjes; Julius M Kernbach
Journal:  Acta Neurochir Suppl       Date:  2022

Review 4.  Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors:  Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal:  Seizure       Date:  2021-01-13       Impact factor: 3.184

5.  Developing a Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports.

Authors:  Karen E Batch; Jianwei Yue; Alex Darcovich; Kaelan Lupton; Corinne C Liu; David P Woodlock; Mohammad Ali K El Amine; Pamela I Causa-Andrieu; Lior Gazit; Gary H Nguyen; Farhana Zulkernine; Richard K G Do; Amber L Simpson
Journal:  Front Artif Intell       Date:  2022-03-02

Review 6.  Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.

Authors:  Liwei Wang; Sunyang Fu; Andrew Wen; Xiaoyang Ruan; Huan He; Sijia Liu; Sungrim Moon; Michelle Mai; Irbaz B Riaz; Nan Wang; Ping Yang; Hua Xu; Jeremy L Warner; Hongfang Liu
Journal:  JCO Clin Cancer Inform       Date:  2022-07

7.  Patterns of Metastatic Disease in Patients with Cancer Derived from Natural Language Processing of Structured CT Radiology Reports over a 10-year Period.

Authors:  Richard K G Do; Kaelan Lupton; Pamela I Causa Andrieu; Anisha Luthra; Michio Taya; Karen Batch; Huy Nguyen; Prachi Rahurkar; Lior Gazit; Kevin Nicholas; Christopher J Fong; Natalie Gangai; Nikolaus Schultz; Farhana Zulkernine; Varadan Sevilimedu; Krishna Juluru; Amber Simpson; Hedvig Hricak
Journal:  Radiology       Date:  2021-08-03       Impact factor: 29.146

8.  Deep learning to automate the labelling of head MRI datasets for computer vision applications.

Authors:  David A Wood; Sina Kafiabadi; Aisha Al Busaidi; Emily L Guilhem; Jeremy Lynch; Matthew K Townend; Antanas Montvila; Martin Kiik; Juveria Siddiqui; Naveen Gadapa; Matthew D Benger; Asif Mazumder; Gareth Barker; Sebastian Ourselin; James H Cole; Thomas C Booth
Journal:  Eur Radiol       Date:  2021-07-20       Impact factor: 5.315

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.