Literature DB >> 23044539

Gene set analysis for self-contained tests: complex null and specific alternative hypotheses.

Y Rahmatallah1, F Emmert-Streib, G Glazko.   

Abstract

MOTIVATION: The analysis of differentially expressed gene sets became a routine in the analyses of gene expression data. There is a multitude of tests available, ranging from aggregation tests that summarize gene-level statistics for a gene set to true multivariate tests, accounting for intergene correlations. Most of them detect complex departures from the null hypothesis but when the null hypothesis is rejected, the specific alternative leading to the rejection is not easily identifiable.
RESULTS: In this article we compare the power and Type I error rates of minimum-spanning tree (MST)-based non-parametric multivariate tests with several multivariate and aggregation tests, which are frequently used for pathway analyses. In our simulation study, we demonstrate that MST-based tests have power that is for many settings comparable with the power of conventional approaches, but outperform them in specific regions of the parameter space corresponding to biologically relevant configurations. Further, we find for simulated and for gene expression data that MST-based tests discriminate well against shift and scale alternatives. As a general result, we suggest a two-step practical analysis strategy that may increase the interpretability of experimental data: first, apply the most powerful multivariate test to find the subset of pathways for which the null hypothesis is rejected and second, apply MST-based tests to these pathways to select those that support specific alternative hypotheses. CONTACT: gvglazko@uams.edu or yrahmatallah@uams.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2012        PMID: 23044539      PMCID: PMC3509490          DOI: 10.1093/bioinformatics/bts579

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

3.  Extensions to gene set enrichment.

Authors:  Zhen Jiang; Robert Gentleman
Journal:  Bioinformatics       Date:  2006-11-24       Impact factor: 6.937

4.  Analyzing gene expression data in terms of gene sets: methodological issues.

Authors:  Jelle J Goeman; Peter Bühlmann
Journal:  Bioinformatics       Date:  2007-02-15       Impact factor: 6.937

5.  A multivariate extension of the gene set enrichment analysis.

Authors:  Lev Klebanov; Galina Glazko; Peter Salzman; Andrei Yakovlev; Yuanhui Xiao
Journal:  J Bioinform Comput Biol       Date:  2007-10       Impact factor: 1.122

6.  Linear combination test for hierarchical gene set analysis.

Authors:  Xiaoming Wang; Irina Dinu; Wei Liu; Yutaka Yasui
Journal:  Stat Appl Genet Mol Biol       Date:  2011

7.  Pathway analysis of expression data: deciphering functional building blocks of complex diseases.

Authors:  Frank Emmert-Streib; Galina V Glazko
Journal:  PLoS Comput Biol       Date:  2011-05-26       Impact factor: 4.475

8.  Assessment method for a power analysis to identify differentially expressed pathways.

Authors:  Shailesh Tripathi; Frank Emmert-Streib
Journal:  PLoS One       Date:  2012-05-18       Impact factor: 3.240

9.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nucleic Acids Res       Date:  2008-11-25       Impact factor: 16.971

10.  Comparative evaluation of gene-set analysis methods.

Authors:  Qi Liu; Irina Dinu; Adeniyi J Adewale; John D Potter; Yutaka Yasui
Journal:  BMC Bioinformatics       Date:  2007-11-07       Impact factor: 3.169

View more
  8 in total

1.  Extracting the Strongest Signals from Omics Data: Differentially Expressed Pathways and Beyond.

Authors:  Galina Glazko; Yasir Rahmatallah; Boris Zybailov; Frank Emmert-Streib
Journal:  Methods Mol Biol       Date:  2017

2.  Carcinogenic effects of oil dispersants: A KEGG pathway-based RNA-seq study of human airway epithelial cells.

Authors:  Yao-Zhong Liu; Lei Zhang; Astrid M Roy-Engel; Shigeki Saito; Joseph A Lasky; Guangdi Wang; He Wang
Journal:  Gene       Date:  2016-11-16       Impact factor: 3.688

3.  Comparative evaluation of gene set analysis approaches for RNA-Seq data.

Authors:  Yasir Rahmatallah; Frank Emmert-Streib; Galina Glazko
Journal:  BMC Bioinformatics       Date:  2014-12-05       Impact factor: 3.169

4.  Gene set analysis using sufficient dimension reduction.

Authors:  Huey-Miin Hsueh; Chen-An Tsai
Journal:  BMC Bioinformatics       Date:  2016-02-06       Impact factor: 3.169

5.  A knowledge-based T2-statistic to perform pathway analysis for quantitative proteomic data.

Authors:  En-Yu Lai; Yi-Hau Chen; Kun-Pin Wu
Journal:  PLoS Comput Biol       Date:  2017-06-16       Impact factor: 4.475

6.  GSAR: Bioconductor package for Gene Set analysis in R.

Authors:  Yasir Rahmatallah; Boris Zybailov; Frank Emmert-Streib; Galina Glazko
Journal:  BMC Bioinformatics       Date:  2017-01-24       Impact factor: 3.169

7.  Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets.

Authors:  Yasir Rahmatallah; Frank Emmert-Streib; Galina Glazko
Journal:  Bioinformatics       Date:  2013-11-30       Impact factor: 6.937

8.  Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline.

Authors:  Yasir Rahmatallah; Frank Emmert-Streib; Galina Glazko
Journal:  Brief Bioinform       Date:  2015-09-04       Impact factor: 11.622

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.