Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports.

Literature DB >> 35923373

Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports.

Gunvant R Chaudhari¹, Tengxiao Liu¹, Timothy L Chen¹, Gabby B Joseph¹, Maya Vella¹, Yoo Jin Lee¹, Thienkhai H Vu¹, Youngho Seo¹, Andreas M Rauschecker¹, Charles E McCulloch¹, Jae Ho Sohn¹.

Abstract

Purpose: To develop radiology domain-specific bidirectional encoder representations from transformers (BERT) models that can identify speech recognition (SR) errors and suggest corrections in radiology reports. Materials and
Methods: A pretrained BERT model, Clinical BioBERT, was further pretrained on a corpus of 114 008 radiology reports between April 2016 and August 2019 that were retrospectively collected from two hospitals. Next, the model was fine-tuned on a training dataset of generated insertion, deletion, and substitution errors, creating Radiology BERT. This model was retrospectively evaluated on an independent dataset of radiology reports with generated errors (n = 18 885) and on unaltered report sentences (n = 2000) and prospectively evaluated on true clinical SR errors (n = 92). Correction Radiology BERT was separately trained to suggest corrections for detected deletion and substitution errors. Area under the receiver operating characteristic curve (AUC) and bootstrapped 95% CIs were calculated for each evaluation dataset.
Results: Radiology-specific BERT had AUC values of >.99 (95% CI: >0.99, >0.99), 0.94 (95% CI: 0.93, 0.94), 0.98 (95% CI: 0.98, 0.98), and 0.97 (95% CI: 0.97, 0.97) for detecting insertion, deletion, substitution, and all errors, respectively, on the independently generated test set. Testing on unaltered report impressions revealed a sensitivity of 82% (28 of 34; 95% CI: 70%, 93%) and specificity of 88% (1521 of 1728; 95% CI: 87%, 90%). Testing on prospective SR errors showed an accuracy of 75% (69 of 92; 95% CI: 65%, 83%). Finally, the correct word was the top suggestion for 45.6% (475 of 1041; 95% CI: 42.5%, 49.3%) of errors.
Conclusion: Radiology-specific BERT models fine-tuned on generated errors were able to identify SR errors in radiology reports and suggest corrections.Keywords: Computer Applications, Technology Assessment Supplemental material is available for this article. © RSNA, 2022See also the commentary by Abajian and Cheung in this issue.

Entities: Chemical

Keywords: Computer Applications; Technology Assessment

Year: 2022 PMID： 35923373 PMCID： PMC9344210 DOI： 10.1148/ryai.210185

Source DB: PubMed Journal: Radiol Artif Intell ISSN： 2638-6100

Keyword Cloud
References

13 in total

1. Syntactic and semantic errors in radiology reports associated with speech recognition software.

Authors: Michael D Ringler; Brian C Goss; Brian J Bartholmai
Journal: Health Informatics J Date: 2016-07-26 Impact factor: 2.681

Review 2. Speech recognition in the radiology department: a systematic review.

Authors: Imane Hammana; Luigi Lepanto; Thomas Poder; Christian Bellemare; My-Sandra Ly
Journal: Health Inf Manag Date: 2015 Impact factor: 3.185

3. Improving the utility of speech recognition through error detection.

Authors: Kimberly Voll; Stella Atkins; Bruce Forster
Journal: J Digit Imaging Date: 2008-12 Impact factor: 4.056

4. Improving Radiology Report Quality by Rapidly Notifying Radiologist of Report Errors.

Authors: Matthew J Minn; Arash R Zandieh; Ross W Filice
Journal: J Digit Imaging Date: 2015-08 Impact factor: 4.056

5. Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports.

Authors: Keno K Bressem; Lisa C Adams; Robert A Gaudin; Daniel Tröltzsch; Bernd Hamm; Marcus R Makowski; Chan-Yong Schüle; Janis L Vahldiek; Stefan M Niehues
Journal: Bioinformatics Date: 2021-01-29 Impact factor: 6.937

6. A Hybrid Deep Learning Approach for Spatial Trigger Extraction from Radiology Reports.

Authors: Surabhi Datta; Kirk Roberts
Journal: Proc Conf Empir Methods Nat Lang Process Date: 2020-11

7. Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports.

Authors: Surabhi Datta; Morgan Ulinski; Jordan Godfrey-Stovall; Shekhar Khanpara; Roy F Riascos-Castaneda; Kirk Roberts
Journal: LREC Int Conf Lang Resour Eval Date: 2020-05

8. Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models.

Authors: John Zech; Jessica Forde; Joseph J Titano; Deepak Kaji; Anthony Costa; Eric Karl Oermann
Journal: Ann Transl Med Date: 2019-06

9. Analysis of Errors in Dictated Clinical Documents Assisted by Speech Recognition Software and Professional Transcriptionists.

Authors: Li Zhou; Suzanne V Blackley; Leigh Kowalski; Raymond Doan; Warren W Acker; Adam B Landman; Evgeni Kontrient; David Mack; Marie Meteer; David W Bates; Foster R Goss
Journal: JAMA Netw Open Date: 2018-07-06

10. BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors: Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal: Bioinformatics Date: 2020-02-15 Impact factor: 6.937