A natural language processing pipeline for pairing measurements uniquely across free-text CT reports.

Merlijn Sevenster, Jeffrey Bozeman, Andrea Cowhy, William Trost

Abstract

OBJECTIVE: To standardize and objectivize treatment response assessment in oncology, guidelines have been proposed that are driven by radiological measurements, which are typically communicated in free-text reports defying automated processing. We study through inter-annotator agreement and natural language processing (NLP) algorithm development the task of pairing measurements that quantify the same finding across consecutive radiology reports, such that each measurement is paired with at most one other ("partial uniqueness").
METHODS AND MATERIALS: Ground truth is created based on 283 abdomen and 311 chest CT reports of 50 patients each. A pre-processing engine segments reports and extracts measurements. Thirteen features are developed based on volumetric similarity between measurements, semantic similarity between their respective narrative contexts and structural properties of their report positions. A Random Forest classifier (RF) integrates all features. A "mutual best match" (MBM) post-processor ensures partial uniqueness.
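
The abstract does not spell out how the "mutual best match" post-processor works, so the following is only a minimal sketch of one plausible reading: a candidate pair is kept when each measurement is the other's highest-scoring partner. The function name `mutual_best_match`, the score dictionary, the measurement identifiers and the 0.5 threshold are all hypothetical; the scores stand in for classifier outputs and are not data from the paper.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]  # (measurement in prior report, measurement in current report)


def mutual_best_match(scores: Dict[Pair, float], threshold: float = 0.5) -> List[Pair]:
    """Keep pairs in which each side is the other's top-scoring candidate.

    Assumption: `scores` maps candidate pairs to classifier confidences.
    Ties are resolved in favour of the pair seen first.
    """
    best_for_prior: Dict[str, str] = {}
    best_for_current: Dict[str, str] = {}
    for (prior, current), score in scores.items():
        if score < threshold:
            continue  # ignore candidates the classifier considers unlikely
        if prior not in best_for_prior or score > scores[(prior, best_for_prior[prior])]:
            best_for_prior[prior] = current
        if current not in best_for_current or score > scores[(best_for_current[current], current)]:
            best_for_current[current] = prior
    # A pair survives only if the preference is mutual, so every measurement
    # ends up in at most one pair (partial uniqueness).
    return sorted(
        (prior, current)
        for prior, current in best_for_prior.items()
        if best_for_current.get(current) == prior
    )


# Hypothetical scores for two prior and two current measurements.
example_scores = {
    ("p1", "c1"): 0.9,
    ("p1", "c2"): 0.6,
    ("p2", "c1"): 0.8,
    ("p2", "c2"): 0.3,
}
print(mutual_best_match(example_scores))
```

In this toy input, `p2` and `c2` end up unpaired because their preferred partners prefer each other. A filter of this kind can only remove pairs, which is one way to read the higher precision and lower recall reported with MBM.
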
RESULTS: In an end-to-end evaluation, RF has precision 0.841, recall 0.807, F-measure 0.824 and AUC 0.971; with MBM, which performs above chance level (P<0.001), it has precision 0.899, recall 0.776, F-measure 0.833 and AUC 0.935. RF (RF+MBM) has error-free performance on 52.7% (57.4%) of report pairs.
DISCUSSION: Inter-annotator agreement of three domain specialists with the ground truth (κ>0.960) indicates that the task is well defined. Domain properties and inter-section differences are discussed to explain superior performance in abdomen. Enforcing partial uniqueness has mixed but minor effects on performance.
CONCLUSION: A combined machine learning-filtering approach is proposed for pairing measurements, which can support prospective (supporting treatment response assessment) and retrospective purposes (data mining).
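
As a quick consistency check on the RESULTS figures, the reported F-measures can be recomputed from the reported precision and recall, assuming the F-measure is the balanced harmonic mean (F1); the abstract does not state the weighting. The helper name `f_measure` is illustrative only.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F-measure) as given in the abstract.
reported = {
    "RF": (0.841, 0.807, 0.824),
    "RF+MBM": (0.899, 0.776, 0.833),
}

for name, (precision, recall, reported_f) in reported.items():
    recomputed = round(f_measure(precision, recall), 3)
    print(f"{name}: recomputed {recomputed}, reported {reported_f}")
```

Both recomputed values agree with the reported ones to three decimals, which is consistent with the F1 assumption.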

Keywords: Information correlation; Natural language processing; Oncologic measurement; RECIST; Radiology report

Year: 2014 PMID: 25200472 DOI: 10.1016/j.jbi.2014.08.015

Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317

Cited

11 in total

1. Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports.

Authors: M Sevenster; J Buurman; P Liu; J F Peters; P J Chang
Journal: Appl Clin Inform Date: 2015-09-30 Impact factor: 2.342

2. tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports.

Authors: Margaret Mahan; Daniel Rafter; Hannah Casey; Marta Engelking; Tessneem Abdallah; Charles Truwit; Mark Oswood; Uzma Samadani
Journal: PLoS One Date: 2020-07-01 Impact factor: 3.240

3. Tumor reference resolution and characteristic extraction in radiology reports for liver cancer stage prediction.

Authors: Wen-Wai Yim; Sharon W Kwan; Meliha Yetisgen
Journal: J Biomed Inform Date: 2016-10-08 Impact factor: 6.317

4. Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes.

Authors: Yujia Bao; Zhengyi Deng; Yan Wang; Heeyoon Kim; Victor Diego Armengol; Francisco Acevedo; Nofal Ouardaoui; Cathy Wang; Giovanni Parmigiani; Regina Barzilay; Danielle Braun; Kevin S Hughes
Journal: JCO Clin Cancer Inform Date: 2019-09

5. Deep Learning to Estimate RECIST in Patients with NSCLC Treated with PD-1 Blockade.

Authors: Kathryn C Arbour; Anh Tuan Luu; Jia Luo; Justin F Gainor; Regina Barzilay; Matthew D Hellmann; Hira Rizvi; Andrew J Plodkowski; Mustafa Sakhi; Kevin B Huang; Subba R Digumarthy; Michelle S Ginsberg; Jeffrey Girshman; Mark G Kris; Gregory J Riely; Adam Yala
Journal: Cancer Discov Date: 2020-09-21 Impact factor: 39.397

6. Using automatically extracted information from mammography reports for decision-support.

Authors: Selen Bozkurt; Francisco Gimenez; Elizabeth S Burnside; Kemal H Gulkesen; Daniel L Rubin
Journal: J Biomed Inform Date: 2016-07-04 Impact factor: 6.317

7. Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.

Authors: Matthew S Alkaitis; Monica N Agrawal; Gregory J Riely; Pedram Razavi; David Sontag
Journal: JCO Clin Cancer Inform Date: 2021-05

8. Introducing Explorer of Taxon Concepts with a case study on spider measurement matrix building.

Authors: Hong Cui; Dongfang Xu; Steven S Chong; Martin Ramirez; Thomas Rodenhausen; James A Macklin; Bertram Ludäscher; Robert A Morris; Eduardo M Soto; Nicolás Mongiardino Koch
Journal: BMC Bioinformatics Date: 2016-11-17 Impact factor: 3.169

9. Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm.

Authors: Selen Bozkurt; Emel Alkim; Imon Banerjee; Daniel L Rubin
Journal: J Digit Imaging Date: 2019-08 Impact factor: 4.056

10. A systematic review of natural language processing applied to radiology reports.

Authors: Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex
Journal: BMC Med Inform Decis Mak Date: 2021-06-03 Impact factor: 2.796