Literature DB >> 34862122

A Comparison of Natural Language Processing Methods for the Classification of Lumbar Spine Imaging Findings Related to Lower Back Pain.

Chethan Jujjavarapu1, Vikas Pejaver1, Trevor A Cohen1, Sean D Mooney1, Patrick J Heagerty2, Jeffrey G Jarvik3.   

Abstract

RATIONALE AND
OBJECTIVES: The use of natural language processing (NLP) in radiology provides an opportunity to assist clinicians with phenotyping patients. However, the performance and generalizability of NLP across healthcare systems is uncertain. We assessed the performance within and generalizability across four healthcare systems of different NLP representational methods, coupled with elastic-net logistic regression to classify lower back pain-related findings from lumbar spine imaging reports.
MATERIALS AND METHODS: We used a dataset of 871 X-ray and magnetic resonance imaging reports sampled from a prospective study across four healthcare systems between October 2013 and September 2016. We annotated each report for 26 findings potentially related to lower back pain. Our framework applied four different NLP methods to convert text into feature sets (representations). For each representation, our framework used an elastic-net logistic regression model for each finding (i.e., 26 binary or "one-vs.-rest" classification models). For performance evaluation, we split data into training (80%, 697/871) and testing (20%, 174/871). In the training set, we used cross validation to identify the optimal hyperparameter value and then retrained on the full training set. We then assessed performance based on area under the curve (AUC) for the test set. We repeated this process 25 times with each repeat using a different random train/test split of the data, so that we could estimate 95% confidence intervals, and assess significant difference in performance between representations. For generalizability evaluation, we trained models on data from three healthcare systems with cross validation and then tested on the fourth. We repeated this process for each system, then calculated mean and standard deviation (SD) of AUC across the systems.
RESULTS: For individual representations, n-grams had the best average performance across all 26 findings (AUC: 0.960). For generalizability, document embeddings had the most consistent average performance across systems (SD: 0.010). Out of these 26 findings, we considered eight as potentially clinically important (any stenosis, central stenosis, lateral stenosis, foraminal stenosis, disc extrusion, nerve root displacement compression, endplate edema, and listhesis grade 2) since they have a relatively greater association with a history of lower back pain compared to the remaining 18 classes. We found a similar pattern for these eight in which n-grams and document embeddings had the best average performance (AUC: 0.954) and generalizability (SD: 0.007), respectively.
CONCLUSION: Based on performance assessment, we found that n-grams is the preferred method if classifier development and deployment occur at the same system. However, for deployment at multiple systems outside of the development system, or potentially if physician behavior changes within a system, one should consider document embeddings since embeddings appear to have the most consistent performance across systems.
Copyright © 2021 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Document embeddings; Evaluation; Lower back pain; Lumbar spine diagnostic imaging; Natural language processing

Mesh:

Year:  2021        PMID: 34862122      PMCID: PMC8917985          DOI: 10.1016/j.acra.2021.09.005

Source DB:  PubMed          Journal:  Acad Radiol        ISSN: 1076-6332            Impact factor:   3.173


  28 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

Review 4.  Diagnosis and treatment of low back pain.

Authors:  B W Koes; M W van Tulder; S Thomas
Journal:  BMJ       Date:  2006-06-17

5.  The Longitudinal Assessment of Imaging and Disability of the Back (LAIDBack) Study: baseline data.

Authors:  J J Jarvik; W Hollingworth; P Heagerty; D R Haynor; R A Deyo
Journal:  Spine (Phila Pa 1976)       Date:  2001-05-15       Impact factor: 3.468

6.  Corpus domain effects on distributional semantic modeling of medical terms.

Authors:  Serguei V S Pakhomov; Greg Finley; Reed McEwan; Yan Wang; Genevieve B Melton
Journal:  Bioinformatics       Date:  2016-08-16       Impact factor: 6.937

7.  Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.

Authors:  John Zech; Margaret Pain; Joseph Titano; Marcus Badgeley; Javin Schefflein; Andres Su; Anthony Costa; Joshua Bederson; Joseph Lehar; Eric Karl Oermann
Journal:  Radiology       Date:  2018-01-30       Impact factor: 11.105

8.  Natural Language Processing of Radiology Reports in Patients With Hepatocellular Carcinoma to Predict Radiology Resource Utilization.

Authors:  A D Brown; J R Kachura
Journal:  J Am Coll Radiol       Date:  2019-03-02       Impact factor: 5.532

9.  Lumbar Imaging With Reporting Of Epidemiology (LIRE)--Protocol for a pragmatic cluster randomized trial.

Authors:  Jeffrey G Jarvik; Bryan A Comstock; Kathryn T James; Andrew L Avins; Brian W Bresnahan; Richard A Deyo; Patrick H Luetmer; Janna L Friedly; Eric N Meier; Daniel C Cherkin; Laura S Gold; Sean D Rundell; Safwan S Halabi; David F Kallmes; Katherine W Tan; Judith A Turner; Larry G Kessler; Danielle C Lavallee; Kari A Stephens; Patrick J Heagerty
Journal:  Contemp Clin Trials       Date:  2015-10-19       Impact factor: 2.226

10.  Magnetic resonance findings of acute severe lower back pain.

Authors:  Seon-Yu Kim; In-Sik Lee; Bo-Ram Kim; Jeong-Hoon Lim; Jongmin Lee; Seong-Eun Koh; Seung Beom Kim; Seung Lee Park
Journal:  Ann Rehabil Med       Date:  2012-02-29
View more
  1 in total

1.  Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).

Authors:  Jia Li; Yucong Lin; Pengfei Zhao; Wenjuan Liu; Linkun Cai; Jing Sun; Lei Zhao; Zhenghan Yang; Hong Song; Han Lv; Zhenchang Wang
Journal:  BMC Med Inform Decis Mak       Date:  2022-07-30       Impact factor: 3.298

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.