Keno K Bressem1,2, Lisa C Adams1,2, Robert A Gaudin3, Daniel Tröltzsch3, Bernd Hamm1, Marcus R Makowski4, Chan-Yong Schüle1, Janis L Vahldiek1, Stefan M Niehues1. 1. Department of Radiology, Charité, Berlin 12203, Germany. 2. Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Berlin 10117, Germany. 3. Department of Oral- and Maxillofacial Surgery, Charité, Berlin 12203, Germany. 4. Department of Diagnostic and Interventional Radiology, Technical University of Munich, School of Medicine, Munich 81675, Germany.
Abstract
MOTIVATION: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) has led to new state-of-the-art results on several Natural Language Processing (NLP) benchmarks. In radiology in particular, large amounts of free-text data are generated in the daily clinical workflow. These report texts could be of particular use for the generation of labels in machine learning, especially for image classification. However, as report texts are mostly unstructured, advanced NLP methods are needed to enable accurate text classification. While neural networks can be used for this purpose, they must first be trained on large amounts of manually labelled data to achieve good results. In contrast, BERT models can be pre-trained on unlabelled data and then require fine-tuning on only a small amount of manually labelled data to achieve even better results. RESULTS: Using BERT to identify the most important findings in intensive care chest radiograph reports, we achieve areas under the receiver operating characteristic curve of 0.98 for congestion, 0.97 for effusion, 0.97 for consolidation and 0.99 for pneumothorax, surpassing the accuracy of previous approaches with comparatively little annotation effort. Our approach could therefore help to improve information extraction from free-text medical reports. AVAILABILITY AND IMPLEMENTATION: We make the source code for fine-tuning the BERT models freely available at https://github.com/fast-raidiology/bert-for-radiology. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
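The abstract describes identifying four findings per report, which implies a multi-label setup: one independent binary decision (sigmoid output) per finding rather than a single softmax. A minimal sketch of such an output head and its usual binary cross-entropy objective is shown below; the finding names match the abstract, but the function names and logit values are illustrative assumptions, not taken from the released code:

```python
# Sketch of a multi-label classification head as typically used when
# fine-tuning BERT for report labelling: one independent sigmoid per finding.
# Logit values here are made up for illustration.
import math

FINDINGS = ["congestion", "effusion", "consolidation", "pneumothorax"]

def sigmoid(x: float) -> float:
    """Logistic function mapping a raw logit to a probability."""
    return 1.0 / (1.0 + math.exp(-x))

def predict(logits):
    """Map the four raw logits from the head to per-finding probabilities."""
    return {name: sigmoid(z) for name, z in zip(FINDINGS, logits)}

def bce_loss(logits, labels):
    """Binary cross-entropy summed over findings (the standard multi-label objective)."""
    loss = 0.0
    for z, y in zip(logits, labels):
        p = sigmoid(z)
        loss += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return loss

# Example: probabilities for one report, thresholded at 0.5 per finding.
probs = predict([2.0, -1.0, 0.5, 3.0])
```

Because each finding gets its own sigmoid, a single report can be positive for several findings at once (e.g. effusion and consolidation together), which a single softmax over four classes could not express.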