Literature DB >> 17553853

Significance analysis of groups of genes in expression profiling studies.

James J Chen1, Taewon Lee, Robert R Delongchamp, Tao Chen, Chen-An Tsai.   

Abstract

MOTIVATION: Gene class testing (GCT) is a statistical approach to determine whether some functionally predefined classes of genes express differently under two experimental conditions. GCT computes the P-value of each gene class based on the null distribution and the gene classes are ranked for importance in accordance with their P-values. Currently, two null hypotheses have been considered: the Q1 hypothesis tests the relative strength of association with the phenotypes among the gene classes, and the Q2 hypothesis assesses the statistical significance. These two hypotheses are related but not equivalent.
METHOD: We investigate three one-sided and two two-sided test statistics under Q1 and Q2. The null distributions of gene classes under Q1 are generated by permuting gene labels and the null distributions under Q2 are generated by permuting samples.
RESULTS: We applied the five statistics to a diabetes dataset with 143 gene classes and to a breast cancer dataset with 508 GO (Gene Ontology) terms. In each statistic, the null distributions of the gene classes under Q1 are different from those under Q2 in both datasets, and their rankings can be different too. We clarify the one-sided and two-sided hypotheses, and discuss some issues regarding the Q1 and Q2 hypotheses for gene class ranking in the GCT. Because Q1 does not deal with correlations among genes, we prefer test based on Q2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17553853     DOI: 10.1093/bioinformatics/btm310

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

Review 1.  Gene-set analysis and reduction.

Authors:  Irina Dinu; John D Potter; Thomas Mueller; Qi Liu; Adeniyi J Adewale; Gian S Jhangri; Gunilla Einecke; Konrad S Famulski; Philip Halloran; Yutaka Yasui
Journal:  Brief Bioinform       Date:  2008-10-04       Impact factor: 11.622

2.  Analysis of high dimensional data using pre-defined set and subset information, with applications to genomic data.

Authors:  Wenge Guo; Mingan Yang; Chuanhua Xing; Shyamal D Peddada
Journal:  BMC Bioinformatics       Date:  2012-07-24       Impact factor: 3.169

3.  Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset.

Authors:  Yoichi Yamada; Hiroki Sawada; Ken-Ichi Hirotani; Masanobu Oshima; Kenji Satou
Journal:  BMC Res Notes       Date:  2012-12-12

4.  sscMap: an extensible Java application for connecting small-molecule drugs using gene-expression signatures.

Authors:  Shu-Dong Zhang; Timothy W Gant
Journal:  BMC Bioinformatics       Date:  2009-07-31       Impact factor: 3.169

5.  Gene set enrichment analysis for non-monotone association and multiple experimental categories.

Authors:  Rongheng Lin; Shuangshuang Dai; Richard D Irwin; Alexandra N Heinloth; Gary A Boorman; Leping Li
Journal:  BMC Bioinformatics       Date:  2008-11-14       Impact factor: 3.169

6.  Choosing the right path: enhancement of biologically relevant sets of genes or proteins using pathway structure.

Authors:  Reuben Thomas; Julia M Gohlke; Geffrey F Stopper; Frederick M Parham; Christopher J Portier
Journal:  Genome Biol       Date:  2009-04-24       Impact factor: 13.583

7.  A general modular framework for gene set enrichment analysis.

Authors:  Marit Ackermann; Korbinian Strimmer
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

8.  Assessing statistical significance in microarray experiments using the distance between microarrays.

Authors:  Douglas Hayden; Peter Lazar; David Schoenfeld
Journal:  PLoS One       Date:  2009-06-16       Impact factor: 3.240

9.  A simple and robust method for connecting small-molecule drugs using gene-expression signatures.

Authors:  Shu-Dong Zhang; Timothy W Gant
Journal:  BMC Bioinformatics       Date:  2008-06-02       Impact factor: 3.169

10.  MAVTgsa: an R package for gene set (enrichment) analysis.

Authors:  Chih-Yi Chien; Ching-Wei Chang; Chen-An Tsai; James J Chen
Journal:  Biomed Res Int       Date:  2014-07-03       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.