Literature DB >> 28475069

Deep Learning for Automated Extraction of Primary Sites From Cancer Pathology Reports.

John X Qiu, Hong-Jun Yoon, Paul A Fearn, Georgia D Tourassi.   

Abstract

Pathology reports are a primary source of information for cancer registries which process high volumes of free-text reports annually. Information extraction and coding is a manual, labor-intensive process. In this study, we investigated deep learning and a convolutional neural network (CNN), for extracting ICD-O-3 topographic codes from a corpus of breast and lung cancer pathology reports. We performed two experiments, using a CNN and a more conventional term frequency vector approach, to assess the effects of class prevalence and inter-class transfer learning. The experiments were based on a set of 942 pathology reports with human expert annotations as the gold standard. CNN performance was compared against a more conventional term frequency vector space approach. We observed that the deep learning models consistently outperformed the conventional approaches in the class prevalence experiment, resulting in micro- and macro-F score increases of up to 0.132 and 0.226, respectively, when class labels were well populated. Specifically, the best performing CNN achieved a micro-F score of 0.722 over 12 ICD-O-3 topography codes. Transfer learning provided a consistent but modest performance boost for the deep learning methods but trends were contingent on the CNN method and cancer site. These encouraging results demonstrate the potential of deep learning for automated abstraction of pathology reports.

Entities:  

Mesh:

Year:  2017        PMID: 28475069     DOI: 10.1109/JBHI.2017.2700722

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  37 in total

1.  Machine learning mortality classification in clinical documentation with increased accuracy in visual-based analyses.

Authors:  Susan M Slattery; Daniel C Knight; Debra E Weese-Mayer; William A Grobman; Doug C Downey; Karna Murthy
Journal:  Acta Paediatr       Date:  2019-12-10       Impact factor: 2.299

2.  Cross-registry neural domain adaptation to extract mutational test results from pathology reports.

Authors:  Anthony Rios; Eric B Durbin; Isaac Hands; Susanne M Arnold; Darshil Shah; Stephen M Schwartz; Bernardo H L Goulart; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2019-08-08       Impact factor: 6.317

Review 3.  Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records.

Authors:  Guergana K Savova; Ioana Danciu; Folami Alamudun; Timothy Miller; Chen Lin; Danielle S Bitterman; Georgia Tourassi; Jeremy L Warner
Journal:  Cancer Res       Date:  2019-08-08       Impact factor: 12.701

4.  Privacy-Preserving Deep Learning NLP Models for Cancer Registries.

Authors:  Mohammed Alawad; Hong-Jun Yoon; Shang Gao; Brent Mumphrey; Xiao-Cheng Wu; Eric B Durbin; Jong Cheol Jeong; Isaac Hands; David Rust; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal:  IEEE Trans Emerg Top Comput       Date:  2020-04-16       Impact factor: 6.595

5.  Deep Transfer Learning Across Cancer Registries for Information Extraction from Pathology Reports.

Authors:  Mohammed Alawad; Shang Gao; John Qiu; Noah Schaefferkoetter; Jacob D Hinkle; Hong-Jun Yoon; J Blair Christian; Xiao-Cheng Wu; Eric B Durbin; Jong Cheol Jeong; Isaac Hands; David Rust; Georgia Tourassi
Journal:  IEEE EMBS Int Conf Biomed Health Inform       Date:  2019-09-12

6.  Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes.

Authors:  Kenneth L Kehl; Wenxin Xu; Eva Lepisto; Haitham Elmarakeby; Michael J Hassett; Eliezer M Van Allen; Bruce E Johnson; Deborah Schrag
Journal:  JCO Clin Cancer Inform       Date:  2020-08

7.  Accelerated training of bootstrap aggregation-based deep information extraction systems from cancer pathology reports.

Authors:  Hong-Jun Yoon; Hilda B Klasky; John P Gounley; Mohammed Alawad; Shang Gao; Eric B Durbin; Xiao-Cheng Wu; Antoinette Stroup; Jennifer Doherty; Linda Coyle; Lynne Penberthy; J Blair Christian; Georgia D Tourassi
Journal:  J Biomed Inform       Date:  2020-09-09       Impact factor: 6.317

8.  Deep Learning Approaches Substantially Improve Automated Extraction of Information from Free-Text Medical Reports.

Authors:  Tiffany Ting Liu
Journal:  Radiol Artif Intell       Date:  2019-08-07

9.  Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.

Authors:  Matthew S Alkaitis; Monica N Agrawal; Gregory J Riely; Pedram Razavi; David Sontag
Journal:  JCO Clin Cancer Inform       Date:  2021-05

10.  Limitations of Transformers on Clinical Text Classification.

Authors:  Shang Gao; Mohammed Alawad; M Todd Young; John Gounley; Noah Schaefferkoetter; Hong Jun Yoon; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Georgia Tourassi
Journal:  IEEE J Biomed Health Inform       Date:  2021-09-03       Impact factor: 7.021

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.