Literature DB >> 34413062

Qualifying Certainty in Radiology Reports through Deep Learning-Based Natural Language Processing.

F Liu1,2, P Zhou2, S J Baccei2,3, M J Masciocchi2,3, N Amornsiripanitch2,3, C I Kiefe1, M P Rosen4,3.   

Abstract

BACKGROUND AND
PURPOSE: Communication gaps exist between radiologists and referring physicians in conveying diagnostic certainty. We aimed to explore deep learning-based bidirectional contextual language models for automatically assessing diagnostic certainty expressed in the radiology reports to facilitate the precision of communication.
MATERIALS AND METHODS: We randomly sampled 594 head MR imaging reports from an academic medical center. We asked 3 board-certified radiologists to read sentences from the Impression section and assign each sentence 1 of the 4 certainty categories: "Non-Definitive," "Definitive-Mild," "Definitive-Strong," "Other." Using the annotated 2352 sentences, we developed and validated a natural language-processing system based on the start-of-the-art bidirectional encoder representations from transformers (BERT), which can capture contextual uncertainty semantics beyond the lexicon level. Finally, we evaluated 3 BERT variant models and reported standard metrics including sensitivity, specificity, and area under the curve.
RESULTS: A κ score of 0.74 was achieved for interannotator agreement on uncertainty interpretations among 3 radiologists. For the 3 BERT variant models, the biomedical variant (BioBERT) achieved the best macro-average area under the curve of 0.931 (compared with 0.928 for the BERT-base and 0.925 for the clinical variant [ClinicalBERT]) on the validation data. All 3 models yielded high macro-average specificity (93.13%-93.65%), while the BERT-base obtained the highest macro-average sensitivity of 79.46% (compared with 79.08% for BioBERT and 78.52% for ClinicalBERT). The BioBERT model showed great generalizability on the heldout test data with a macro-average sensitivity of 77.29%, specificity of 92.89%, and area under the curve of 0.93.
CONCLUSIONS: A deep transfer learning model can be developed to reliably assess the level of uncertainty communicated in a radiology report.
© 2021 by American Journal of Neuroradiology.

Entities:  

Mesh:

Year:  2021        PMID: 34413062      PMCID: PMC8562739          DOI: 10.3174/ajnr.A7241

Source DB:  PubMed          Journal:  AJNR Am J Neuroradiol        ISSN: 0195-6108            Impact factor:   4.966


  20 in total

1.  Radiologic quality and safety: mapping value into radiology.

Authors:  Stephen J Swensen; C Daniel Johnson
Journal:  J Am Coll Radiol       Date:  2005-12       Impact factor: 5.532

2.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

3.  Radiology Reports: What YOU Think You're Saying and What THEY Think You're Saying.

Authors:  Bonmyong Lee; Matthew T Whitehead
Journal:  Curr Probl Diagn Radiol       Date:  2016-11-16

Review 4.  Natural Language Processing in Radiology: A Systematic Review.

Authors:  Ewoud Pons; Loes M M Braun; M G Myriam Hunink; Jan A Kors
Journal:  Radiology       Date:  2016-05       Impact factor: 11.105

5.  Differences in Perceptions Among Radiologists, Referring Physicians, and Patients Regarding Language for Incidental Findings Reporting.

Authors:  Andrew B Rosenkrantz
Journal:  AJR Am J Roentgenol       Date:  2016-09-22       Impact factor: 3.959

6.  Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora.

Authors:  Veronika Vincze; György Szarvas; György Móra; Tomoko Ohta; Richárd Farkas
Journal:  J Biomed Semantics       Date:  2011-10-06

7.  Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders: Deep Learning Approach.

Authors:  Rumeng Li; Baotian Hu; Feifan Liu; Weisong Liu; Francesca Cunningham; David D McManus; Hong Yu
Journal:  JMIR Med Inform       Date:  2019-02-08

8.  Multi-dimensional classification of biomedical text: toward automated, practical provision of high-utility text to diverse users.

Authors:  Hagit Shatkay; Fengxia Pan; Andrey Rzhetsky; W John Wilbur
Journal:  Bioinformatics       Date:  2008-08-20       Impact factor: 6.937

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

10.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more
  1 in total

1.  Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).

Authors:  Jia Li; Yucong Lin; Pengfei Zhao; Wenjuan Liu; Linkun Cai; Jing Sun; Lei Zhao; Zhenghan Yang; Hong Song; Han Lv; Zhenchang Wang
Journal:  BMC Med Inform Decis Mak       Date:  2022-07-30       Impact factor: 3.298

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.