Literature DB >> 18753155

Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes.

Xi Chen1, Lily Wang, Jonathan D Smith, Bing Zhang.   

Abstract

MOTIVATION: Gene set analysis allows formal testing of subtle but coordinated changes in a group of genes, such as those defined by Gene Ontology (GO) or KEGG Pathway databases. We propose a new method for gene set analysis that is based on principal component analysis (PCA) of genes expression values in the gene set. PCA is an effective method for reducing high dimensionality and capture variations in gene expression values. However, one limitation with PCA is that the latent variable identified by the first PC may be unrelated to outcome.
RESULTS: In the proposed supervised PCA (SPCA) model for gene set analysis, the PCs are estimated from a selected subset of genes that are associated with outcome. As outcome information is used in the gene selection step, this method is supervised, thus called the Supervised PCA model. Because of the gene selection step, test statistic in SPCA model can no longer be approximated well using t-distribution. We propose a two-component mixture distribution based on Gumbel exteme value distributions to account for the gene selection step. We show the proposed method compares favorably to currently available gene set analysis methods using simulated and real microarray data. SOFTWARE: The R code for the analysis used in this article are available upon request, we are currently working on implementing the proposed method in an R package.

Mesh:

Year:  2008        PMID: 18753155      PMCID: PMC2732277          DOI: 10.1093/bioinformatics/btn458

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Testing association of a pathway with survival using gene expression data.

Authors:  Jelle J Goeman; Jan Oosting; Anne-Marie Cleton-Jansen; Jakob K Anninga; Hans C van Houwelingen
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

3.  Extensions to gene set enrichment.

Authors:  Zhen Jiang; Robert Gentleman
Journal:  Bioinformatics       Date:  2006-11-24       Impact factor: 6.937

4.  Atherosclerosis susceptibility loci identified from a strain intercross of apolipoprotein E-deficient mice via a high-density genome scan.

Authors:  Jonathan D Smith; Jeffrey M Bhasin; Julie Baglione; Megan Settle; Yaomin Xu; John Barnard
Journal:  Arterioscler Thromb Vasc Biol       Date:  2005-12-22       Impact factor: 8.311

5.  A multivariate extension of the gene set enrichment analysis.

Authors:  Lev Klebanov; Galina Glazko; Peter Salzman; Andrei Yakovlev; Yuanhui Xiao
Journal:  J Bioinform Comput Biol       Date:  2007-10       Impact factor: 1.122

6.  Group testing for pathway analysis improves comparability of different microarray datasets.

Authors:  Theodora Manoli; Norbert Gretz; Hermann-Josef Gröne; Marc Kenzelmann; Roland Eils; Benedikt Brors
Journal:  Bioinformatics       Date:  2006-08-07       Impact factor: 6.937

7.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

8.  Generation of reactive oxygen species by the mitochondrial electron transport chain.

Authors:  Yuanbin Liu; Gary Fiskum; David Schubert
Journal:  J Neurochem       Date:  2002-03       Impact factor: 5.372

9.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

10.  PAGE: parametric analysis of gene set enrichment.

Authors:  Seon-Young Kim; David J Volsky
Journal:  BMC Bioinformatics       Date:  2005-06-08       Impact factor: 3.169

View more
  32 in total

1.  Identification of differential gene pathways with principal component analysis.

Authors:  Shuangge Ma; Michael R Kosorok
Journal:  Bioinformatics       Date:  2009-02-17       Impact factor: 6.937

2.  Integrating biological knowledge with gene expression profiles for survival prediction of cancer.

Authors:  Xi Chen; Lily Wang
Journal:  J Comput Biol       Date:  2009-02       Impact factor: 1.479

3.  A unified mixed effects model for gene set analysis of time course microarray experiments.

Authors:  Lily Wang; Xi Chen; Russell D Wolfinger; Jeffrey L Franklin; Robert J Coffey; Bing Zhang
Journal:  Stat Appl Genet Mol Biol       Date:  2009-11-07

4.  Statistical Analysis of Patient-Specific Pathway Activities via Mixed Models.

Authors:  Lily Wang; Xi Chen; Bing Zhang
Journal:  J Biom Biostat       Date:  2012

5.  Principal component analysis based methods in bioinformatics studies.

Authors:  Shuangge Ma; Ying Dai
Journal:  Brief Bioinform       Date:  2011-01-17       Impact factor: 11.622

6.  An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.

Authors:  Xi Chen; Lily Wang; Hemant Ishwaran
Journal:  Stat Probab Lett       Date:  2010-09-07       Impact factor: 0.870

7.  Pathway hunting by random survival forests.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

8.  Adaptive elastic-net sparse principal component analysis for pathway association testing.

Authors:  Xi Chen
Journal:  Stat Appl Genet Mol Biol       Date:  2011-10-24

9.  Analyzing LC/MS metabolic profiling data in the context of existing metabolic networks.

Authors:  Tianwei Yu; Yun Bai
Journal:  Curr Metabolomics       Date:  2013-01-01

10.  Gene expression profiles of the one-carbon metabolism pathway.

Authors:  Yin Leng Lee; Xinran Xu; Sylvan Wallenstein; Jia Chen
Journal:  J Genet Genomics       Date:  2009-05       Impact factor: 4.275

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.