Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Limitations of Transformers on Clinical Text Classification.

Literature DB >> 33635801

Limitations of Transformers on Clinical Text Classification.

Shang Gao, Mohammed Alawad, M Todd Young, John Gounley, Noah Schaefferkoetter, Hong Jun Yoon, Xiao-Cheng Wu, Eric B Durbin, Jennifer Doherty, Antoinette Stroup, Linda Coyle, Georgia Tourassi.

Abstract

Bidirectional Encoder Representations from Transformers (BERT) and BERT-based approaches are the current state-of-the-art in many natural language processing (NLP) tasks; however, their application to document classification on long clinical texts is limited. In this work, we introduce four methods to scale BERT, which by default can only handle input sequences up to approximately 400 words long, to perform document classification on clinical texts several thousand words long. We compare these methods against two much simpler architectures - a word-level convolutional neural network and a hierarchical self-attention network - and show that BERT often cannot beat these simpler baselines when classifying MIMIC-III discharge summaries and SEER cancer pathology reports. In our analysis, we show that two key components of BERT - pretraining and WordPiece tokenization - may actually be inhibiting BERT's performance on clinical text classification tasks where the input document is several thousand words long and where correctly identifying labels may depend more on identifying a few key words or phrases rather than understanding the contextual meaning of sequences of text.

Entities: Chemical

Mesh：

Year: 2021 PMID： 33635801 PMCID： PMC8387496 DOI： 10.1109/JBHI.2021.3062322

Source DB: PubMed Journal: IEEE J Biomed Health Inform ISSN： 2168-2194 Impact factor: 7.021

17 in total

1. Medical Text Classification Using Convolutional Neural Networks.

Authors: Mark Hughes; Irene Li; Spyros Kotoulas; Toyotaro Suzumura
Journal: Stud Health Technol Inform Date: 2017

2. Deep Learning for Automated Extraction of Primary Sites From Cancer Pathology Reports.

Authors: John X Qiu; Hong-Jun Yoon; Paul A Fearn; Georgia D Tourassi
Journal: IEEE J Biomed Health Inform Date: 2017-05-03 Impact factor: 5.772

Review 3. Deep learning in clinical natural language processing: a methodical review.

Authors: Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal: J Am Med Inform Assoc Date: 2020-03-01 Impact factor: 4.497

4. Classifying cancer pathology reports with hierarchical self-attention networks.

Authors: Shang Gao; John X Qiu; Mohammed Alawad; Jacob D Hinkle; Noah Schaefferkoetter; Hong-Jun Yoon; Blair Christian; Paul A Fearn; Lynne Penberthy; Xiao-Cheng Wu; Linda Coyle; Georgia Tourassi; Arvind Ramanathan
Journal: Artif Intell Med Date: 2019-10-15 Impact factor: 5.326

5. Using machine learning to parse breast pathology reports.

Authors: Adam Yala; Regina Barzilay; Laura Salama; Molly Griffin; Grace Sollender; Aditya Bardia; Constance Lehman; Julliette M Buckley; Suzanne B Coopey; Fernanda Polubriaginof; Judy E Garber; Barbara L Smith; Michele A Gadd; Michelle C Specht; Thomas M Gudewicz; Anthony J Guidi; Alphonse Taghian; Kevin S Hughes
Journal: Breast Cancer Res Treat Date: 2016-11-08 Impact factor: 4.872

6. Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives.

Authors: Sebastian Gehrmann; Franck Dernoncourt; Yeran Li; Eric T Carlson; Joy T Wu; Jonathan Welt; John Foote; Edward T Moseley; David W Grant; Patrick D Tyler; Leo A Celi
Journal: PLoS One Date: 2018-02-15 Impact factor: 3.240

7. BioWordVec, improving biomedical word embeddings with subword information and MeSH.

Authors: Yijia Zhang; Qingyu Chen; Zhihao Yang; Hongfei Lin; Zhiyong Lu
Journal: Sci Data Date: 2019-05-10 Impact factor: 6.444

Review 8. Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.

Authors: Seyedmostafa Sheikhalishahi; Riccardo Miotto; Joel T Dudley; Alberto Lavelli; Fabio Rinaldi; Venet Osmani
Journal: JMIR Med Inform Date: 2019-04-27

9. MIMIC-III, a freely accessible critical care database.

Authors: Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal: Sci Data Date: 2016-05-24 Impact factor: 6.444

10. Hierarchical attention networks for information extraction from cancer pathology reports.

Authors: Shang Gao; Michael T Young; John X Qiu; Hong-Jun Yoon; James B Christian; Paul A Fearn; Georgia D Tourassi; Arvind Ramanthan
Journal: J Am Med Inform Assoc Date: 2018-03-01 Impact factor: 4.497

11 in total

1. Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types.

Authors: Kevin De Angeli; Shang Gao; Ioana Danciu; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen Schwartz; Charles Wiggins; Mark Damesyn; Linda Coyle; Lynne Penberthy; Georgia D Tourassi; Hong-Jun Yoon
Journal: J Biomed Inform Date: 2021-11-22 Impact factor: 8.000

2. Development and multimodal validation of a substance misuse algorithm for referral to treatment using artificial intelligence (SMART-AI): a retrospective deep learning study.

Authors: Majid Afshar; Brihat Sharma; Dmitriy Dligach; Madeline Oguss; Randall Brown; Neeraj Chhabra; Hale M Thompson; Talar Markossian; Cara Joyce; Matthew M Churpek; Niranjan S Karnik
Journal: Lancet Digit Health Date: 2022-06

3. Do Words Matter? Detecting Social Isolation and Loneliness in Older Adults Using Natural Language Processing.

Authors: Varsha D Badal; Camille Nebeker; Kaoru Shinkawa; Yasunori Yamada; Kelly E Rentscher; Ho-Cheol Kim; Ellen E Lee
Journal: Front Psychiatry Date: 2021-11-16 Impact factor: 4.157

4. Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.

Authors: A W Olthof; P M A van Ooijen; L J Cornelissen
Journal: J Med Syst Date: 2021-09-04 Impact factor: 4.460

5. Predicting Clinical Events Based on Raw Text: From Bag-of-Words to Attention-Based Transformers.

Authors: Dmitri Roussinov; Andrew Conkie; Andrew Patterson; Christopher Sainsbury
Journal: Front Digit Health Date: 2022-02-21

6. Strategies to Address the Lack of Labeled Data for Supervised Machine Learning Training With Electronic Health Records: Case Study for the Extraction of Symptoms From Clinical Notes.

Authors: Marie Humbert-Droz; Pritam Mukherjee; Olivier Gevaert
Journal: JMIR Med Inform Date: 2022-03-14

7. Using Text Content From Coronary Catheterization Reports to Predict 5-Year Mortality Among Patients Undergoing Coronary Angiography: A Deep Learning Approach.

Authors: Yu-Hsuan Li; I-Te Lee; Yu-Wei Chen; Yow-Kuan Lin; Yu-Hsin Liu; Fei-Pei Lai
Journal: Front Cardiovasc Med Date: 2022-02-28

8. Transformer-Based High-Frequency Oscillation Signal Detection on Magnetoencephalography From Epileptic Patients.

Authors: Jiayang Guo; Naian Xiao; Hailong Li; Lili He; Qiyuan Li; Ting Wu; Xiaonan He; Peizhi Chen; Duo Chen; Jing Xiang; Xueping Peng
Journal: Front Mol Biosci Date: 2022-03-04

9. Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports.

Authors: Kevin De Angeli; Shang Gao; Andrew Blanchard; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen M Schwartz; Charles Wiggins; Linda Coyle; Lynne Penberthy; Georgia Tourassi; Hong-Jun Yoon
Journal: JAMIA Open Date: 2022-09-13

10. Automatic information extraction from childhood cancer pathology reports.

Authors: Hong-Jun Yoon; Alina Peluso; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen Schwartz; Charles Wiggins; Linda Coyle; Lynne Penberthy
Journal: JAMIA Open Date: 2022-06-16