Literature DB >> 29381109

Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.

John Zech1, Margaret Pain1, Joseph Titano1, Marcus Badgeley1, Javin Schefflein1, Andres Su1, Anthony Costa1, Joshua Bederson1, Joseph Lehar1, Eric Karl Oermann1.   

Abstract

Purpose To compare different methods for generating features from radiology reports and to develop a method to automatically identify findings in these reports. Materials and Methods In this study, 96 303 head computed tomography (CT) reports were obtained. The linguistic complexity of these reports was compared with that of alternative corpora. Head CT reports were preprocessed, and machine-analyzable features were constructed by using bag-of-words (BOW), word embedding, and Latent Dirichlet allocation-based approaches. Ultimately, 1004 head CT reports were manually labeled for findings of interest by physicians, and a subset of these were deemed critical findings. Lasso logistic regression was used to train models for physician-assigned labels on 602 of 1004 head CT reports (60%) using the constructed features, and the performance of these models was validated on a held-out 402 of 1004 reports (40%). Models were scored by area under the receiver operating characteristic curve (AUC), and aggregate AUC statistics were reported for (a) all labels, (b) critical labels, and (c) the presence of any critical finding in a report. Sensitivity, specificity, accuracy, and F1 score were reported for the best performing model's (a) predictions of all labels and (b) identification of reports containing critical findings. Results The best-performing model (BOW with unigrams, bigrams, and trigrams plus average word embeddings vector) had a held-out AUC of 0.966 for identifying the presence of any critical head CT finding and an average 0.957 AUC across all head CT findings. Sensitivity and specificity for identifying the presence of any critical finding were 92.59% (175 of 189) and 89.67% (191 of 213), respectively. Average sensitivity and specificity across all findings were 90.25% (1898 of 2103) and 91.72% (18 351 of 20 007), respectively. Simpler BOW methods achieved results competitive with those of more sophisticated approaches, with an average AUC for presence of any critical finding of 0.951 for unigram BOW versus 0.966 for the best-performing model. The Yule I of the head CT corpus was 34, markedly lower than that of the Reuters corpus (at 103) or I2B2 discharge summaries (at 271), indicating lower linguistic complexity. Conclusion Automated methods can be used to identify findings in radiology reports. The success of this approach benefits from the standardized language of these reports. With this method, a large labeled corpus can be generated for applications such as deep learning. © RSNA, 2018 Online supplemental material is available for this article.

Mesh:

Year:  2018        PMID: 29381109     DOI: 10.1148/radiol.2018171093

Source DB:  PubMed          Journal:  Radiology        ISSN: 0033-8419            Impact factor:   11.105


  33 in total

1.  Can Contrast-Enhanced Ultrasound Increase or Predict the Success Rate of Testicular Sperm Aspiration in Patients With Azoospermia?

Authors:  Heng Xue; Shou-Yang Wang; Li-Gang Cui; Kai Hong
Journal:  AJR Am J Roentgenol       Date:  2019-02-26       Impact factor: 3.959

Review 2.  Technical and clinical overview of deep learning in radiology.

Authors:  Daiju Ueda; Akitoshi Shimazaki; Yukio Miki
Journal:  Jpn J Radiol       Date:  2018-12-01       Impact factor: 2.374

3.  Automated Triaging of Adult Chest Radiographs with Deep Artificial Neural Networks.

Authors:  Mauro Annarumma; Samuel J Withey; Robert J Bakewell; Emanuele Pesce; Vicky Goh; Giovanni Montana
Journal:  Radiology       Date:  2019-01-22       Impact factor: 11.105

4.  Interactive NLP in Clinical Care: Identifying Incidental Findings in Radiology Reports.

Authors:  Gaurav Trivedi; Esmaeel R Dadashzadeh; Robert M Handzel; Wendy W Chapman; Shyam Visweswaran; Harry Hochheiser
Journal:  Appl Clin Inform       Date:  2019-09-04       Impact factor: 2.342

5.  Comparison of Natural Language Processing Rules-based and Machine-learning Systems to Identify Lumbar Spine Imaging Findings Related to Low Back Pain.

Authors:  W Katherine Tan; Saeed Hassanpour; Patrick J Heagerty; Sean D Rundell; Pradeep Suri; Hannu T Huhdanpaa; Kathryn James; David S Carrell; Curtis P Langlotz; Nancy L Organ; Eric N Meier; Karen J Sherman; David F Kallmes; Patrick H Luetmer; Brent Griffith; David R Nerenz; Jeffrey G Jarvik
Journal:  Acad Radiol       Date:  2018-03-28       Impact factor: 3.173

Review 6.  On the Interpretability of Artificial Intelligence in Radiology: Challenges and Opportunities.

Authors:  Mauricio Reyes; Raphael Meier; Sérgio Pereira; Carlos A Silva; Fried-Michael Dahlweid; Hendrik von Tengg-Kobligk; Ronald M Summers; Roland Wiest
Journal:  Radiol Artif Intell       Date:  2020-05-27

Review 7.  CAD and AI for breast cancer-recent development and challenges.

Authors:  Heang-Ping Chan; Ravi K Samala; Lubomir M Hadjiiski
Journal:  Br J Radiol       Date:  2019-12-16       Impact factor: 3.039

8.  Assisting radiologists with reporting urgent findings to referring physicians: A machine learning approach to identify cases for prompt communication.

Authors:  Xing Meng; Craig H Ganoe; Ryan T Sieberg; Yvonne Y Cheung; Saeed Hassanpour
Journal:  J Biomed Inform       Date:  2019-04-05       Impact factor: 6.317

Review 9.  Computer-aided diagnosis in the era of deep learning.

Authors:  Heang-Ping Chan; Lubomir M Hadjiiski; Ravi K Samala
Journal:  Med Phys       Date:  2020-06       Impact factor: 4.071

10.  Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models.

Authors:  John Zech; Jessica Forde; Joseph J Titano; Deepak Kaji; Anthony Costa; Eric Karl Oermann
Journal:  Ann Transl Med       Date:  2019-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.