Literature DB >> 34388050

Bag-of-Words Technique in Natural Language Processing: A Primer for Radiologists.

Krishna Juluru1, Hao-Hsin Shih1, Krishna Nand Keshava Murthy1, Pierre Elnajjar1.   

Abstract

Natural language processing (NLP) is a methodology designed to extract concepts and meaning from human-generated unstructured (free-form) text. It is intended to be implemented by using computer algorithms so that it can be run on a corpus of documents quickly and reliably. To enable machine learning (ML) techniques in NLP, free-form text must be converted to a numerical representation. After several stages of preprocessing including tokenization, removal of stop words, token normalization, and creation of a master dictionary, the bag-of-words (BOW) technique can be used to represent each remaining word as a feature of the document. The preprocessing steps simplify the documents but also potentially degrade meaning. The values of the features in BOW can be modified by using techniques such as term count, term frequency, and term frequency-inverse document frequency. Experience and experimentation will guide decisions on which specific techniques will optimize ML performance. These and other NLP techniques are being applied in radiology. Radiologists' understanding of the strengths and limitations of these techniques will help in communication with data scientists and in implementation for specific tasks. Online supplemental material is available for this article. ©RSNA, 2021.

Entities:  

Mesh:

Year:  2021        PMID: 34388050      PMCID: PMC8415041          DOI: 10.1148/rg.2021210025

Source DB:  PubMed          Journal:  Radiographics        ISSN: 0271-5333            Impact factor:   6.312


  5 in total

Review 1.  Natural Language Processing Technologies in Radiology Research and Clinical Applications.

Authors:  Tianrun Cai; Andreas A Giannopoulos; Sheng Yu; Tatiana Kelil; Beth Ripley; Kanako K Kumamaru; Frank J Rybicki; Dimitrios Mitsouras
Journal:  Radiographics       Date:  2016 Jan-Feb       Impact factor: 5.333

2.  Machine Learning for Automation of Radiology Protocols for Quality and Efficiency Improvement.

Authors:  Angad Kalra; Amit Chakraborty; Benjamin Fine; Joshua Reicher
Journal:  J Am Coll Radiol       Date:  2020-04-09       Impact factor: 5.532

3.  Deep Learning to Classify Radiology Free-Text Reports.

Authors:  Matthew C Chen; Robyn L Ball; Lingyao Yang; Nathaniel Moradzadeh; Brian E Chapman; David B Larson; Curtis P Langlotz; Timothy J Amrhein; Matthew P Lungren
Journal:  Radiology       Date:  2017-11-13       Impact factor: 11.105

4.  Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.

Authors:  John Zech; Margaret Pain; Joseph Titano; Marcus Badgeley; Javin Schefflein; Andres Su; Anthony Costa; Joshua Bederson; Joseph Lehar; Eric Karl Oermann
Journal:  Radiology       Date:  2018-01-30       Impact factor: 11.105

5.  Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

Authors:  Po-Hao Chen; Hanna Zafar; Maya Galperin-Aizenberg; Tessa Cook
Journal:  J Digit Imaging       Date:  2018-04       Impact factor: 4.056

  5 in total
  1 in total

1.  Natural Language Processing in Diagnostic Texts from Nephropathology.

Authors:  Maximilian Legnar; Philipp Daumke; Jürgen Hesser; Stefan Porubsky; Zoran Popovic; Jan Niklas Bindzus; Joern-Helge Heinrich Siemoneit; Cleo-Aron Weis
Journal:  Diagnostics (Basel)       Date:  2022-07-15
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.