Literature DB >> 31813492

Classifying cancer pathology reports with hierarchical self-attention networks.

Shang Gao1, John X Qiu2, Mohammed Alawad2, Jacob D Hinkle2, Noah Schaefferkoetter2, Hong-Jun Yoon2, Blair Christian2, Paul A Fearn3, Lynne Penberthy3, Xiao-Cheng Wu4, Linda Coyle5, Georgia Tourassi6, Arvind Ramanathan7.   

Abstract

We introduce a deep learning architecture, hierarchical self-attention networks (HiSANs), designed for classifying pathology reports and show how its unique architecture leads to a new state-of-the-art in accuracy, faster training, and clear interpretability. We evaluate performance on a corpus of 374,899 pathology reports obtained from the National Cancer Institute's (NCI) Surveillance, Epidemiology, and End Results (SEER) program. Each pathology report is associated with five clinical classification tasks - site, laterality, behavior, histology, and grade. We compare the performance of the HiSAN against other machine learning and deep learning approaches commonly used on medical text data - Naive Bayes, logistic regression, convolutional neural networks, and hierarchical attention networks (the previous state-of-the-art). We show that HiSANs are superior to other machine learning and deep learning text classifiers in both accuracy and macro F-score across all five classification tasks. Compared to the previous state-of-the-art, hierarchical attention networks, HiSANs not only are an order of magnitude faster to train, but also achieve about 1% better relative accuracy and 5% better relative macro F-score.
Copyright © 2019 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Cancer pathology reports; Clinical reports; Deep learning; Natural language processing; Text classification

Year:  2019        PMID: 31813492     DOI: 10.1016/j.artmed.2019.101726

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  13 in total

1.  Privacy-Preserving Deep Learning NLP Models for Cancer Registries.

Authors:  Mohammed Alawad; Hong-Jun Yoon; Shang Gao; Brent Mumphrey; Xiao-Cheng Wu; Eric B Durbin; Jong Cheol Jeong; Isaac Hands; David Rust; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal:  IEEE Trans Emerg Top Comput       Date:  2020-04-16       Impact factor: 6.595

2.  Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types.

Authors:  Kevin De Angeli; Shang Gao; Ioana Danciu; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen Schwartz; Charles Wiggins; Mark Damesyn; Linda Coyle; Lynne Penberthy; Georgia D Tourassi; Hong-Jun Yoon
Journal:  J Biomed Inform       Date:  2021-11-22       Impact factor: 8.000

3.  Accelerated training of bootstrap aggregation-based deep information extraction systems from cancer pathology reports.

Authors:  Hong-Jun Yoon; Hilda B Klasky; John P Gounley; Mohammed Alawad; Shang Gao; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Linda Coyle; Lynne Penberthy; J Blair Christian; Georgia D Tourassi
Journal:  J Biomed Inform       Date:  2020-09-09       Impact factor: 6.317

4.  Limitations of Transformers on Clinical Text Classification.

Authors:  Shang Gao; Mohammed Alawad; M Todd Young; John Gounley; Noah Schaefferkoetter; Hong Jun Yoon; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Georgia Tourassi
Journal:  IEEE J Biomed Health Inform       Date:  2021-09-03       Impact factor: 7.021

5.  Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing.

Authors:  Wee-Ming Tan; Kean-Hooi Teoh; Mogana Darshini Ganggayah; Nur Aishah Taib; Hana Salwani Zaini; Sarinder Kaur Dhillon
Journal:  Diagnostics (Basel)       Date:  2022-04-01

6.  Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study.

Authors:  Wenjie Qu; Qingqing Liu; Xinlin Jiao; Teng Zhang; Bingyu Wang; Ningfeng Li; Taotao Dong; Baoxia Cui
Journal:  Front Oncol       Date:  2021-02-18       Impact factor: 6.244

7.  Deep active learning for classifying cancer pathology reports.

Authors:  Kevin De Angeli; Shang Gao; Mohammed Alawad; Hong-Jun Yoon; Noah Schaefferkoetter; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal:  BMC Bioinformatics       Date:  2021-03-09       Impact factor: 3.169

8.  Automatic Classification of Cancer Pathology Reports: A Systematic Review.

Authors:  Thiago Santos; Amara Tariq; Judy Wawira Gichoya; Hari Trivedi; Imon Banerjee
Journal:  J Pathol Inform       Date:  2022-01-20

9.  Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports.

Authors:  Kevin De Angeli; Shang Gao; Andrew Blanchard; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Stephen M Schwartz; Charles Wiggins; Linda Coyle; Lynne Penberthy; Georgia Tourassi; Hong-Jun Yoon
Journal:  JAMIA Open       Date:  2022-09-13

10.  Using case-level context to classify cancer pathology reports.

Authors:  Shang Gao; Mohammed Alawad; Noah Schaefferkoetter; Lynne Penberthy; Xiao-Cheng Wu; Eric B Durbin; Linda Coyle; Arvind Ramanathan; Georgia Tourassi
Journal:  PLoS One       Date:  2020-05-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.