Literature DB >> 25200472

A natural language processing pipeline for pairing measurements uniquely across free-text CT reports.

Merlijn Sevenster1, Jeffrey Bozeman2, Andrea Cowhy2, William Trost2.   

Abstract

OBJECTIVE: To standardize and objectivize treatment response assessment in oncology, guidelines have been proposed that are driven by radiological measurements, which are typically communicated in free-text reports defying automated processing. We study through inter-annotator agreement and natural language processing (NLP) algorithm development the task of pairing measurements that quantify the same finding across consecutive radiology reports, such that each measurement is paired with at most one other ("partial uniqueness"). METHODS AND MATERIALS: Ground truth is created based on 283 abdomen and 311 chest CT reports of 50 patients each. A pre-processing engine segments reports and extracts measurements. Thirteen features are developed based on volumetric similarity between measurements, semantic similarity between their respective narrative contexts and structural properties of their report positions. A Random Forest classifier (RF) integrates all features. A "mutual best match" (MBM) post-processor ensures partial uniqueness.
RESULTS: In an end-to-end evaluation, RF has precision 0.841, recall 0.807, F-measure 0.824 and AUC 0.971; with MBM, which performs above chance level (P<0.001), it has precision 0.899, recall 0.776, F-measure 0.833 and AUC 0.935. RF (RF+MBM) has error-free performance on 52.7% (57.4%) of report pairs. DISCUSSION: Inter-annotator agreement of three domain specialists with the ground truth (κ>0.960) indicates that the task is well defined. Domain properties and inter-section differences are discussed to explain superior performance in abdomen. Enforcing partial uniqueness has mixed but minor effects on performance.
CONCLUSION: A combined machine learning-filtering approach is proposed for pairing measurements, which can support prospective (supporting treatment response assessment) and retrospective purposes (data mining).
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Information correlation; Natural language processing; Oncologic measurement; RECIST; Radiology report

Mesh:

Year:  2014        PMID: 25200472     DOI: 10.1016/j.jbi.2014.08.015

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

1.  Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports.

Authors:  M Sevenster; J Buurman; P Liu; J F Peters; P J Chang
Journal:  Appl Clin Inform       Date:  2015-09-30       Impact factor: 2.342

2.  tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports.

Authors:  Margaret Mahan; Daniel Rafter; Hannah Casey; Marta Engelking; Tessneem Abdallah; Charles Truwit; Mark Oswood; Uzma Samadani
Journal:  PLoS One       Date:  2020-07-01       Impact factor: 3.240

3.  Tumor reference resolution and characteristic extraction in radiology reports for liver cancer stage prediction.

Authors:  Wen-Wai Yim; Sharon W Kwan; Meliha Yetisgen
Journal:  J Biomed Inform       Date:  2016-10-08       Impact factor: 6.317

4.  Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes.

Authors:  Yujia Bao; Zhengyi Deng; Yan Wang; Heeyoon Kim; Victor Diego Armengol; Francisco Acevedo; Nofal Ouardaoui; Cathy Wang; Giovanni Parmigiani; Regina Barzilay; Danielle Braun; Kevin S Hughes
Journal:  JCO Clin Cancer Inform       Date:  2019-09

5.  Deep Learning to Estimate RECIST in Patients with NSCLC Treated with PD-1 Blockade.

Authors:  Kathryn C Arbour; Anh Tuan Luu; Jia Luo; Justin F Gainor; Regina Barzilay; Matthew D Hellmann; Hira Rizvi; Andrew J Plodkowski; Mustafa Sakhi; Kevin B Huang; Subba R Digumarthy; Michelle S Ginsberg; Jeffrey Girshman; Mark G Kris; Gregory J Riely; Adam Yala
Journal:  Cancer Discov       Date:  2020-09-21       Impact factor: 39.397

6.  Using automatically extracted information from mammography reports for decision-support.

Authors:  Selen Bozkurt; Francisco Gimenez; Elizabeth S Burnside; Kemal H Gulkesen; Daniel L Rubin
Journal:  J Biomed Inform       Date:  2016-07-04       Impact factor: 6.317

7.  Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.

Authors:  Matthew S Alkaitis; Monica N Agrawal; Gregory J Riely; Pedram Razavi; David Sontag
Journal:  JCO Clin Cancer Inform       Date:  2021-05

8.  Introducing Explorer of Taxon Concepts with a case study on spider measurement matrix building.

Authors:  Hong Cui; Dongfang Xu; Steven S Chong; Martin Ramirez; Thomas Rodenhausen; James A Macklin; Bertram Ludäscher; Robert A Morris; Eduardo M Soto; Nicolás Mongiardino Koch
Journal:  BMC Bioinformatics       Date:  2016-11-17       Impact factor: 3.169

9.  Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm.

Authors:  Selen Bozkurt; Emel Alkim; Imon Banerjee; Daniel L Rubin
Journal:  J Digit Imaging       Date:  2019-08       Impact factor: 4.056

10.  A systematic review of natural language processing applied to radiology reports.

Authors:  Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex
Journal:  BMC Med Inform Decis Mak       Date:  2021-06-03       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.