Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Classifying cancer pathology reports with hierarchical self-attention networks.

Literature DB >> 31813492

Classifying cancer pathology reports with hierarchical self-attention networks.

Shang Gao¹, John X Qiu², Mohammed Alawad², Jacob D Hinkle², Noah Schaefferkoetter², Hong-Jun Yoon², Blair Christian², Paul A Fearn³, Lynne Penberthy³, Xiao-Cheng Wu⁴, Linda Coyle⁵, Georgia Tourassi⁶, Arvind Ramanathan⁷.

Abstract

We introduce a deep learning architecture, hierarchical self-attention networks (HiSANs), designed for classifying pathology reports and show how its unique architecture leads to a new state-of-the-art in accuracy, faster training, and clear interpretability. We evaluate performance on a corpus of 374,899 pathology reports obtained from the National Cancer Institute's (NCI) Surveillance, Epidemiology, and End Results (SEER) program. Each pathology report is associated with five clinical classification tasks - site, laterality, behavior, histology, and grade. We compare the performance of the HiSAN against other machine learning and deep learning approaches commonly used on medical text data - Naive Bayes, logistic regression, convolutional neural networks, and hierarchical attention networks (the previous state-of-the-art). We show that HiSANs are superior to other machine learning and deep learning text classifiers in both accuracy and macro F-score across all five classification tasks. Compared to the previous state-of-the-art, hierarchical attention networks, HiSANs not only are an order of magnitude faster to train, but also achieve about 1% better relative accuracy and 5% better relative macro F-score.

Entities: Disease

Keywords: Cancer pathology reports; Clinical reports; Deep learning; Natural language processing; Text classification

Year: 2019 PMID： 31813492 DOI： 10.1016/j.artmed.2019.101726

Source DB: PubMed Journal: Artif Intell Med ISSN： 0933-3657 Impact factor: 5.326

Keyword Cloud
Cited

13 in total

1. Privacy-Preserving Deep Learning NLP Models for Cancer Registries.

Authors: Mohammed Alawad; Hong-Jun Yoon; Shang Gao; Brent Mumphrey; Xiao-Cheng Wu; Eric B Durbin; Jong Cheol Jeong; Isaac Hands; David Rust; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal: IEEE Trans Emerg Top Comput Date: 2020-04-16 Impact factor: 6.595

2. Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types.

Authors: Kevin De Angeli; Shang Gao; Ioana Danciu; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen Schwartz; Charles Wiggins; Mark Damesyn; Linda Coyle; Lynne Penberthy; Georgia D Tourassi; Hong-Jun Yoon
Journal: J Biomed Inform Date: 2021-11-22 Impact factor: 8.000

3. Accelerated training of bootstrap aggregation-based deep information extraction systems from cancer pathology reports.

Authors: Hong-Jun Yoon; Hilda B Klasky; John P Gounley; Mohammed Alawad; Shang Gao; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Linda Coyle; Lynne Penberthy; J Blair Christian; Georgia D Tourassi
Journal: J Biomed Inform Date: 2020-09-09 Impact factor: 6.317

4. Limitations of Transformers on Clinical Text Classification.

Authors: Shang Gao; Mohammed Alawad; M Todd Young; John Gounley; Noah Schaefferkoetter; Hong Jun Yoon; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Georgia Tourassi
Journal: IEEE J Biomed Health Inform Date: 2021-09-03 Impact factor: 7.021

5. Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing.

Authors: Wee-Ming Tan; Kean-Hooi Teoh; Mogana Darshini Ganggayah; Nur Aishah Taib; Hana Salwani Zaini; Sarinder Kaur Dhillon
Journal: Diagnostics (Basel) Date: 2022-04-01

6. Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study.

Authors: Wenjie Qu; Qingqing Liu; Xinlin Jiao; Teng Zhang; Bingyu Wang; Ningfeng Li; Taotao Dong; Baoxia Cui
Journal: Front Oncol Date: 2021-02-18 Impact factor: 6.244

7. Deep active learning for classifying cancer pathology reports.

Authors: Kevin De Angeli; Shang Gao; Mohammed Alawad; Hong-Jun Yoon; Noah Schaefferkoetter; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal: BMC Bioinformatics Date: 2021-03-09 Impact factor: 3.169

8. Automatic Classification of Cancer Pathology Reports: A Systematic Review.

Authors: Thiago Santos; Amara Tariq; Judy Wawira Gichoya; Hari Trivedi; Imon Banerjee
Journal: J Pathol Inform Date: 2022-01-20

9. Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports.

Authors: Kevin De Angeli; Shang Gao; Andrew Blanchard; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen M Schwartz; Charles Wiggins; Linda Coyle; Lynne Penberthy; Georgia Tourassi; Hong-Jun Yoon
Journal: JAMIA Open Date: 2022-09-13

10. Using case-level context to classify cancer pathology reports.

Authors: Shang Gao; Mohammed Alawad; Noah Schaefferkoetter; Lynne Penberthy; Xiao-Cheng Wu; Eric B Durbin; Linda Coyle; Arvind Ramanathan; Georgia Tourassi
Journal: PLoS One Date: 2020-05-12 Impact factor: 3.240