Literature DB >> 31113820

Gaussian Mixture Models for Probabilistic Classification of Breast Cancer.

Indira Prabakaran1, Zhengdong Wu2, Changgun Lee3, Brian Tong1, Samantha Steeman1, Gabriel Koo1, Paul J Zhang4, Marina A Guvakova5.   

Abstract

In the era of omics-driven research, it remains a common dilemma to stratify individual patients based on the molecular characteristics of their tumors. To improve molecular stratification of patients with breast cancer, we developed the Gaussian mixture model (GMM)-based classifier. This probabilistic classifier was built on mRNA expression data from more than 300 clinical samples of breast cancer and healthy tissue and was validated on datasets of ESR1, PGR, and ERBB2, which encode standard clinical markers and therapeutic targets. To demonstrate how a GMM approach could be exploited for multiclass classification using data from a candidate marker, we analyzed the insulin-like growth factor I receptor (IGF1R), a promising target, but a marker of uncertain importance in breast cancer. The GMM defined subclasses with downregulated (40%), unchanged (39%), upregulated (19%), and overexpressed (2%) IGF1R levels; inter- and intrapatient analyses of IGF1R transcript and protein levels supported these predictions. Overexpressed IGF1R was observed in a small percentage of tumors. Samples with unchanged and upregulated IGF1R were differentiated tumors, and downregulation of IGF1R correlated with poorly differentiated, high-risk hormone receptor-negative and HER2-positive tumors. A similar correlation was found in the independent cohort of carcinoma in situ, suggesting that loss or low expression of IGF1R is a marker of aggressiveness in subsets of preinvasive and invasive breast cancer. These results demonstrate the importance of probabilistic modeling that delves deeper into molecular data and aims to improve diagnostic classification, prognostic assessment, and treatment selection. SIGNIFICANCE: A GMM classifier demonstrates potential use for clinical validation of markers and determination of target populations, particularly when availability of specimens for marker development is low. ©2019 American Association for Cancer Research.

Entities:  

Year:  2019        PMID: 31113820     DOI: 10.1158/0008-5472.CAN-19-0573

Source DB:  PubMed          Journal:  Cancer Res        ISSN: 0008-5472            Impact factor:   12.701


  3 in total

1.  Confidence limits for conformance proportions in normal mixture models.

Authors:  Shin-Fu Tsai; Tse-Le Huang
Journal:  J Appl Stat       Date:  2020-05-25       Impact factor: 1.416

2.  Identification of pyroptosis related subtypes and tumor microenvironment infiltration characteristics in breast cancer.

Authors:  Guo Huang; Jun Zhou; Juan Chen; Guowen Liu
Journal:  Sci Rep       Date:  2022-06-23       Impact factor: 4.996

3.  Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential.

Authors:  David Greenwood; Thomas Taverner; Nicola J Adderley; Malcolm James Price; Krishna Gokhale; Christopher Sainsbury; Suzy Gallier; Carly Welch; Elizabeth Sapey; Duncan Murray; Hilary Fanning; Simon Ball; Krishnarajah Nirantharakumar; Wayne Croft; Paul Moss
Journal:  iScience       Date:  2022-05-31
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.