Literature DB >> 35923373

Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports.

Gunvant R Chaudhari1, Tengxiao Liu1, Timothy L Chen1, Gabby B Joseph1, Maya Vella1, Yoo Jin Lee1, Thienkhai H Vu1, Youngho Seo1, Andreas M Rauschecker1, Charles E McCulloch1, Jae Ho Sohn1.   

Abstract

Purpose: To develop radiology domain-specific bidirectional encoder representations from transformers (BERT) models that can identify speech recognition (SR) errors and suggest corrections in radiology reports. Materials and
Methods: A pretrained BERT model, Clinical BioBERT, was further pretrained on a corpus of 114 008 radiology reports between April 2016 and August 2019 that were retrospectively collected from two hospitals. Next, the model was fine-tuned on a training dataset of generated insertion, deletion, and substitution errors, creating Radiology BERT. This model was retrospectively evaluated on an independent dataset of radiology reports with generated errors (n = 18 885) and on unaltered report sentences (n = 2000) and prospectively evaluated on true clinical SR errors (n = 92). Correction Radiology BERT was separately trained to suggest corrections for detected deletion and substitution errors. Area under the receiver operating characteristic curve (AUC) and bootstrapped 95% CIs were calculated for each evaluation dataset.
Results: Radiology-specific BERT had AUC values of >.99 (95% CI: >0.99, >0.99), 0.94 (95% CI: 0.93, 0.94), 0.98 (95% CI: 0.98, 0.98), and 0.97 (95% CI: 0.97, 0.97) for detecting insertion, deletion, substitution, and all errors, respectively, on the independently generated test set. Testing on unaltered report impressions revealed a sensitivity of 82% (28 of 34; 95% CI: 70%, 93%) and specificity of 88% (1521 of 1728; 95% CI: 87%, 90%). Testing on prospective SR errors showed an accuracy of 75% (69 of 92; 95% CI: 65%, 83%). Finally, the correct word was the top suggestion for 45.6% (475 of 1041; 95% CI: 42.5%, 49.3%) of errors.
Conclusion: Radiology-specific BERT models fine-tuned on generated errors were able to identify SR errors in radiology reports and suggest corrections.Keywords: Computer Applications, Technology Assessment Supplemental material is available for this article. © RSNA, 2022See also the commentary by Abajian and Cheung in this issue.
© 2022 by the Radiological Society of North America, Inc.

Entities:  

Keywords:  Computer Applications; Technology Assessment

Year:  2022        PMID: 35923373      PMCID: PMC9344210          DOI: 10.1148/ryai.210185

Source DB:  PubMed          Journal:  Radiol Artif Intell        ISSN: 2638-6100


  13 in total

1.  Syntactic and semantic errors in radiology reports associated with speech recognition software.

Authors:  Michael D Ringler; Brian C Goss; Brian J Bartholmai
Journal:  Health Informatics J       Date:  2016-07-26       Impact factor: 2.681

Review 2.  Speech recognition in the radiology department: a systematic review.

Authors:  Imane Hammana; Luigi Lepanto; Thomas Poder; Christian Bellemare; My-Sandra Ly
Journal:  Health Inf Manag       Date:  2015       Impact factor: 3.185

3.  Improving the utility of speech recognition through error detection.

Authors:  Kimberly Voll; Stella Atkins; Bruce Forster
Journal:  J Digit Imaging       Date:  2008-12       Impact factor: 4.056

4.  Improving Radiology Report Quality by Rapidly Notifying Radiologist of Report Errors.

Authors:  Matthew J Minn; Arash R Zandieh; Ross W Filice
Journal:  J Digit Imaging       Date:  2015-08       Impact factor: 4.056

5.  Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports.

Authors:  Keno K Bressem; Lisa C Adams; Robert A Gaudin; Daniel Tröltzsch; Bernd Hamm; Marcus R Makowski; Chan-Yong Schüle; Janis L Vahldiek; Stefan M Niehues
Journal:  Bioinformatics       Date:  2021-01-29       Impact factor: 6.937

6.  A Hybrid Deep Learning Approach for Spatial Trigger Extraction from Radiology Reports.

Authors:  Surabhi Datta; Kirk Roberts
Journal:  Proc Conf Empir Methods Nat Lang Process       Date:  2020-11

7.  Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports.

Authors:  Surabhi Datta; Morgan Ulinski; Jordan Godfrey-Stovall; Shekhar Khanpara; Roy F Riascos-Castaneda; Kirk Roberts
Journal:  LREC Int Conf Lang Resour Eval       Date:  2020-05

8.  Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models.

Authors:  John Zech; Jessica Forde; Joseph J Titano; Deepak Kaji; Anthony Costa; Eric Karl Oermann
Journal:  Ann Transl Med       Date:  2019-06

9.  Analysis of Errors in Dictated Clinical Documents Assisted by Speech Recognition Software and Professional Transcriptionists.

Authors:  Li Zhou; Suzanne V Blackley; Leigh Kowalski; Raymond Doan; Warren W Acker; Adam B Landman; Evgeni Kontrient; David Mack; Marie Meteer; David W Bates; Foster R Goss
Journal:  JAMA Netw Open       Date:  2018-07-06

10.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.