Literature DB >> 28913902

Analysis of cancer gene expression data with an assisted robust marker identification approach.

Hao Chai1, Xingjie Shi2, Qingzhao Zhang3, Qing Zhao4, Yuan Huang1, Shuangge Ma1.   

Abstract

Gene expression (GE) studies have been playing a critical role in cancer research. Despite tremendous effort, the analysis results are still often unsatisfactory, because of the weak signals and high data dimensionality. Analysis is often further challenged by the long-tailed distributions of the outcome variables. In recent multidimensional studies, data have been collected on GEs as well as their regulators (e.g., copy number alterations (CNAs), methylation, and microRNAs), which can provide additional information on the associations between GEs and cancer outcomes. In this study, we develop an ARMI (assisted robust marker identification) approach for analyzing cancer studies with measurements on GEs as well as regulators. The proposed approach borrows information from regulators and can be more effective than analyzing GE data alone. A robust objective function is adopted to accommodate long-tailed distributions. Marker identification is effectively realized using penalization. The proposed approach has an intuitive formulation and is computationally much affordable. Simulation shows its satisfactory performance under a variety of settings. TCGA (The Cancer Genome Atlas) data on melanoma and lung cancer are analyzed, which leads to biologically plausible marker identification and superior prediction.
© 2017 WILEY PERIODICALS, INC.

Entities:  

Keywords:  assisted analysis; cancer; gene expression; robustness

Mesh:

Substances:

Year:  2017        PMID: 28913902      PMCID: PMC5912176          DOI: 10.1002/gepi.22066

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  17 in total

1.  Averaged gene expressions for regression.

Authors:  Mee Young Park; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2006-05-11       Impact factor: 5.899

2.  Matrix correlations for high-dimensional data: the modified RV-coefficient.

Authors:  A K Smilde; H A L Kiers; S Bijlsma; C M Rubingh; M J van Erk
Journal:  Bioinformatics       Date:  2008-12-10       Impact factor: 6.937

3.  Collaborative regression.

Authors:  Samuel M Gross; Robert Tibshirani
Journal:  Biostatistics       Date:  2014-11-17       Impact factor: 5.899

4.  Integrating multidimensional omics data for cancer outcome.

Authors:  Ruoqing Zhu; Qing Zhao; Hongyu Zhao; Shuangge Ma
Journal:  Biostatistics       Date:  2016-03-14       Impact factor: 5.899

5.  Integrative analysis of high-throughput cancer studies with contrasted penalization.

Authors:  Xingjie Shi; Jin Liu; Jian Huang; Yong Zhou; BenChang Shia; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2014-01-06       Impact factor: 2.135

6.  Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer.

Authors:  Jie Peng; Ji Zhu; Anna Bergamaschi; Wonshik Han; Dong-Young Noh; Jonathan R Pollack; Pei Wang
Journal:  Ann Appl Stat       Date:  2010-03       Impact factor: 2.083

7.  Integrated analysis of multidimensional omics data on cutaneous melanoma prognosis.

Authors:  Yu Jiang; Xingjie Shi; Qing Zhao; Michael Krauthammer; Bonnie E Gould Rothberg; Shuangge Ma
Journal:  Genomics       Date:  2016-04-30       Impact factor: 5.736

8.  Network-guided regression for detecting associations between DNA methylation and gene expression.

Authors:  Zi Wang; Edward Curry; Giovanni Montana
Journal:  Bioinformatics       Date:  2014-06-11       Impact factor: 6.937

Review 9.  A selective review of robust variable selection with applications in bioinformatics.

Authors:  Cen Wu; Shuangge Ma
Journal:  Brief Bioinform       Date:  2014-12-05       Impact factor: 13.994

10.  Combining multidimensional genomic measurements for predicting cancer prognosis: observations from TCGA.

Authors:  Qing Zhao; Xingjie Shi; Yang Xie; Jian Huang; BenChang Shia; Shuangge Ma
Journal:  Brief Bioinform       Date:  2014-03-13       Impact factor: 13.994

View more
  7 in total

1.  Assisted gene expression-based clustering with AWNCut.

Authors:  Yang Li; Ruofan Bie; Sebastian J Teran Hidalgo; Yichen Qin; Mengyun Wu; Shuangge Ma
Journal:  Stat Med       Date:  2018-08-09       Impact factor: 2.373

2.  Assisted graphical model for gene expression data analysis.

Authors:  Xinyan Fan; Kuangnan Fang; Shuangge Ma; Shuaichao Wang; Qingzhao Zhang
Journal:  Stat Med       Date:  2019-03-10       Impact factor: 2.373

3.  Identification of cancer omics commonality and difference via community fusion.

Authors:  Yifan Sun; Yu Jiang; Yang Li; Shuangge Ma
Journal:  Stat Med       Date:  2018-11-12       Impact factor: 2.373

4.  Integrative functional linear model for genome-wide association studies with multiple traits.

Authors:  Yang Li; Fan Wang; Mengyun Wu; Shuangge Ma
Journal:  Biostatistics       Date:  2022-04-13       Impact factor: 5.899

5.  Structured Analysis of the High-dimensional FMR Model.

Authors:  Mengque Liu; Qingzhao Zhang; Kuangnan Fang; Shuangge Ma
Journal:  Comput Stat Data Anal       Date:  2019-11-13       Impact factor: 1.681

6.  Integration of Proteomics and Other Omics Data.

Authors:  Mengyun Wu; Yu Jiang; Shuangge Ma
Journal:  Methods Mol Biol       Date:  2021

7.  Assisted differential network analysis for gene expression data.

Authors:  Huangdi Yi; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2021-06-26       Impact factor: 2.344

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.