Literature DB >> 24416092

Multiple Response Regression for Gaussian Mixture Models with Known Labels.

Wonyul Lee1, Ying Du1, Wei Sun1, D Neil Hayes1, Yufeng Liu1.   

Abstract

Multiple response regression is a useful regression technique to model multiple response variables using the same set of predictor variables. Most existing methods for multiple response regression are designed for modeling homogeneous data. In many applications, however, one may have heterogeneous data where the samples are divided into multiple groups. Our motivating example is a cancer dataset where the samples belong to multiple cancer subtypes. In this paper, we consider modeling the data coming from a mixture of several Gaussian distributions with known group labels. A naive approach is to split the data into several groups according to the labels and model each group separately. Although it is simple, this approach ignores potential common structures across different groups. We propose new penalized methods to model all groups jointly in which the common and unique structures can be identified. The proposed methods estimate the regression coefficient matrix, as well as the conditional inverse covariance matrix of response variables. Asymptotic properties of the proposed methods are explored. Through numerical examples, we demonstrate that both estimation and prediction can be improved by modeling all groups jointly using the proposed methods. An application to a glioblastoma cancer dataset reveals some interesting common and unique gene relationships across different cancer subtypes.

Entities:  

Keywords:  Covariance estimation; GLASSO; Hierarchical penalty; LASSO; Multiple response; Regression; Sparsity

Year:  2012        PMID: 24416092      PMCID: PMC3885347          DOI: 10.1002/sam.11158

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  15 in total

1.  Hierarchical organization of modularity in metabolic networks.

Authors:  E Ravasz; A L Somera; D A Mongru; Z N Oltvai; A L Barabási
Journal:  Science       Date:  2002-08-30       Impact factor: 47.728

2.  Downregulation of miR-21 inhibits EGFR pathway and suppresses the growth of human glioblastoma cells independent of PTEN status.

Authors:  Xuan Zhou; Yu Ren; Lynette Moore; Mei Mei; Yongping You; Peng Xu; Baoli Wang; Guangxiu Wang; Zhifan Jia; Peiyu Pu; Wei Zhang; Chunsheng Kang
Journal:  Lab Invest       Date:  2010-01-04       Impact factor: 5.662

3.  GLI2 transcription factor mediates cytokine cross-talk in the tumor microenvironment.

Authors:  Sherine F Elsawa; Luciana L Almada; Steven C Ziesmer; Anne J Novak; Thomas E Witzig; Stephen M Ansell; Martin E Fernandez-Zapico
Journal:  J Biol Chem       Date:  2011-03-18       Impact factor: 5.157

4.  Role of sonic hedgehog signaling in migration of cell lines established from CD133-positive malignant glioma cells.

Authors:  Hiroyuki Uchida; Kazunori Arita; Shunji Yunoue; Hajime Yonezawa; Yoshinari Shinsato; Hiroto Kawano; Hirofumi Hirano; Ryosuke Hanaya; Hiroshi Tokimura
Journal:  J Neurooncol       Date:  2011-03-05       Impact factor: 4.130

5.  MiR-21 is an EGFR-regulated anti-apoptotic factor in lung cancer in never-smokers.

Authors:  Masahiro Seike; Akiteru Goto; Tetsuya Okano; Elise D Bowman; Aaron J Schetter; Izumi Horikawa; Ewy A Mathe; Jin Jen; Ping Yang; Haruhiko Sugimura; Akihiko Gemma; Shoji Kudoh; Carlo M Croce; Curtis C Harris
Journal:  Proc Natl Acad Sci U S A       Date:  2009-07-13       Impact factor: 11.205

6.  Sparse Multivariate Regression With Covariance Estimation.

Authors:  Adam J Rothman; Elizaveta Levina; Ji Zhu
Journal:  J Comput Graph Stat       Date:  2010       Impact factor: 2.302

7.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

8.  Simultaneous Multiple Response Regression and Inverse Covariance Matrix Estimation via Penalized Gaussian Maximum Likelihood.

Authors:  Wonyul Lee; Yufeng Liu
Journal:  J Multivar Anal       Date:  2012-04-27       Impact factor: 1.473

9.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

10.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways.

Authors: 
Journal:  Nature       Date:  2008-09-04       Impact factor: 49.962

View more
  2 in total

1.  Joint Estimation of Multiple Precision Matrices with Common Structures.

Authors:  Wonyul Lee; Yufeng Liu
Journal:  J Mach Learn Res       Date:  2015       Impact factor: 3.654

2.  Double Sparsity Kernel Learning with Automatic Variable Selection and Data Extraction.

Authors:  Jingxiang Chen; Chong Zhang; Michael R Kosorok; Yufeng Liu
Journal:  Stat Interface       Date:  2018       Impact factor: 0.582

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.