Literature DB >> 32702106

Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports.

Keno K Bressem1,2, Lisa C Adams1,2, Robert A Gaudin3, Daniel Tröltzsch3, Bernd Hamm1, Marcus R Makowski4, Chan-Yong Schüle1, Janis L Vahldiek1, Stefan M Niehues1.   

Abstract

MOTIVATION: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) led to an outperformance of several Natural Language Processing (NLP) benchmarks. Especially in radiology, large amounts of free-text data are generated in daily clinical workflow. These report texts could be of particular use for the generation of labels in machine learning, especially for image classification. However, as report texts are mostly unstructured, advanced NLP methods are needed to enable accurate text classification. While neural networks can be used for this purpose, they must first be trained on large amounts of manually labelled data to achieve good results. In contrast, BERT models can be pre-trained on unlabelled data and then only require fine tuning on a small amount of manually labelled data to achieve even better results.
RESULTS: Using BERT to identify the most important findings in intensive care chest radiograph reports, we achieve areas under the receiver operation characteristics curve of 0.98 for congestion, 0.97 for effusion, 0.97 for consolidation and 0.99 for pneumothorax, surpassing the accuracy of previous approaches with comparatively little annotation effort. Our approach could therefore help to improve information extraction from free-text medical reports. Availability  and implementationWe make the source code for fine-tuning the BERT-models freely available at https://github.com/fast-raidiology/bert-for-radiology. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 32702106     DOI: 10.1093/bioinformatics/btaa668

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Natural Language Processing of Radiology Text Reports: Interactive Text Classification.

Authors:  Walter F Wiggins; Felipe Kitamura; Igor Santos; Luciano M Prevedello
Journal:  Radiol Artif Intell       Date:  2021-05-12

2.  Automatic Diagnosis Labeling of Cardiovascular MRI by Using Semisupervised Natural Language Processing of Text Reports.

Authors:  Sameer Zaman; Camille Petri; Kavitha Vimalesvaran; James Howard; Anil Bharath; Darrel Francis; Nicholas Peters; Graham D Cole; Nick Linton
Journal:  Radiol Artif Intell       Date:  2021-11-24

3.  The h-ANN Model: Comprehensive Colonoscopy Concept Compilation Using Combined Contextual Embeddings.

Authors:  Shorabuddin Syed; Adam Jackson Angel; Hafsa Bareen Syeda; Carole France Jennings; Joseph VanScoy; Mahanazuddin Syed; Melody Greer; Sudeepa Bhattacharyya; Meredith Zozus; Benjamin Tharian; Fred Prior
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

4.  Performance of Multiple Pretrained BERT Models to Automate and Accelerate Data Annotation for Large Datasets.

Authors:  Ali S Tejani; Yee S Ng; Yin Xi; Julia R Fielding; Travis G Browning; Jesse C Rayan
Journal:  Radiol Artif Intell       Date:  2022-06-29

5.  Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports.

Authors:  Gunvant R Chaudhari; Tengxiao Liu; Timothy L Chen; Gabby B Joseph; Maya Vella; Yoo Jin Lee; Thienkhai H Vu; Youngho Seo; Andreas M Rauschecker; Charles E McCulloch; Jae Ho Sohn
Journal:  Radiol Artif Intell       Date:  2022-05-25

6.  Multi-objective data enhancement for deep learning-based ultrasound analysis.

Authors:  Chengkai Piao; Mengyue Lv; Shujie Wang; Rongyan Zhou; Yuchen Wang; Jinmao Wei; Jian Liu
Journal:  BMC Bioinformatics       Date:  2022-10-20       Impact factor: 3.307

7.  Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.

Authors:  A W Olthof; P M A van Ooijen; L J Cornelissen
Journal:  J Med Syst       Date:  2021-09-04       Impact factor: 4.460

8.  Automatic extraction of 12 cardiovascular concepts from German discharge letters using pre-trained language models.

Authors:  Phillip Richter-Pechanski; Nicolas A Geis; Christina Kiriakou; Dominic M Schwab; Christoph Dieterich
Journal:  Digit Health       Date:  2021-11-26
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.