Literature DB >> 33947927

Weakly supervised temporal model for prediction of breast cancer distant recurrence.

Daniel Rubin1, Imon Banerjee2, Josh Sanyal1, Amara Tariq3, Allison W Kurian4.   

Abstract

Efficient prediction of cancer recurrence in advance may help to recruit high risk breast cancer patients for clinical trial on-time and can guide a proper treatment plan. Several machine learning approaches have been developed for recurrence prediction in previous studies, but most of them use only structured electronic health records and only a small training dataset, with limited success in clinical application. While free-text clinic notes may offer the greatest nuance and detail about a patient's clinical status, they are largely excluded in previous predictive models due to the increase in processing complexity and need for a complex modeling framework. In this study, we developed a weak-supervision framework for breast cancer recurrence prediction in which we trained a deep learning model on a large sample of free-text clinic notes by utilizing a combination of manually curated labels and NLP-generated non-perfect recurrence labels. The model was trained jointly on manually curated data from 670 patients and NLP-curated data of 8062 patients. It was validated on manually annotated data from 224 patients with recurrence and achieved 0.94 AUROC. This weak supervision approach allowed us to learn from a larger dataset using imperfect labels and ultimately provided greater accuracy compared to a smaller hand-curated dataset, with less manual effort invested in curation.

Entities:  

Year:  2021        PMID: 33947927     DOI: 10.1038/s41598-021-89033-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  6 in total

1.  STRIDE--An integrated standards-based translational research informatics platform.

Authors:  Henry J Lowe; Todd A Ferris; Penni M Hernandez; Susan C Weber
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

2.  Oncoshare: lessons learned from building an integrated multi-institutional database for comparative effectiveness research.

Authors:  Susan C Weber; Tina Seto; Cliff Olson; Pragati Kenkare; Allison W Kurian; Amar K Das
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

3.  A Systematic Review of Estimating Breast Cancer Recurrence at the Population Level With Administrative Data.

Authors:  Hava Izci; Tim Tambuyzer; Krizia Tuand; Victoria Depoorter; Annouschka Laenen; Hans Wildiers; Ignace Vergote; Liesbet Van Eycken; Harlinde De Schutter; Freija Verdoodt; Patrick Neven
Journal:  J Natl Cancer Inst       Date:  2020-10-01       Impact factor: 13.506

4.  Image analysis and machine learning applied to breast cancer diagnosis and prognosis.

Authors:  W H Wolberg; W N Street; O L Mangasarian
Journal:  Anal Quant Cytol Histol       Date:  1995-04       Impact factor: 0.302

5.  Natural Language Processing Approaches to Detect the Timeline of Metastatic Recurrence of Breast Cancer.

Authors:  Imon Banerjee; Selen Bozkurt; Jennifer Lee Caswell-Jin; Allison W Kurian; Daniel L Rubin
Journal:  JCO Clin Cancer Inform       Date:  2019-10

6.  Identifying Metastases-related Information from Pathology Reports of Lung Cancer Patients.

Authors:  Ergin Soysal; Jeremy L Warner; Joshua C Denny; Hua Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.