Literature DB >> 19648135

Gene ranking and biomarker discovery under correlation.

Verena Zuber1, Korbinian Strimmer.   

Abstract

MOTIVATION: Biomarker discovery and gene ranking is a standard task in genomic high-throughput analysis. Typically, the ordering of markers is based on a stabilized variant of the t-score, such as the moderated t or the SAM statistic. However, these procedures ignore gene-gene correlations, which may have a profound impact on the gene orderings and on the power of the subsequent tests.
RESULTS: We propose a simple procedure that adjusts gene-wise t-statistics to take account of correlations among genes. The resulting correlation-adjusted t-scores ('cat' scores) are derived from a predictive perspective, i.e. as a score for variable selection to discriminate group membership in two-class linear discriminant analysis. In the absence of correlation the cat score reduces to the standard t-score. Moreover, using the cat score it is straightforward to evaluate groups of features (i.e. gene sets). For computation of the cat score from small sample data, we propose a shrinkage procedure. In a comparative study comprising six different synthetic and empirical correlation structures, we show that the cat score improves estimation of gene orderings and leads to higher power for fixed true discovery rate, and vice versa. Finally, we also illustrate the cat score by analyzing metabolomic data. AVAILABILITY: The shrinkage cat score is implemented in the R package 'st', which is freely available under the terms of the GNU General Public License (version 3 or later) from CRAN (http://cran.r-project.org/web/packages/st/).

Mesh:

Substances:

Year:  2009        PMID: 19648135     DOI: 10.1093/bioinformatics/btp460

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Inference with Transposable Data: Modeling the Effects of Row and Column Correlations.

Authors:  Genevera I Allen; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-03-16       Impact factor: 4.488

2.  The limitations of simple gene set enrichment analysis assuming gene independence.

Authors:  Pablo Tamayo; George Steinhardt; Arthur Liberzon; Jill P Mesirov
Journal:  Stat Methods Med Res       Date:  2012-10-14       Impact factor: 3.021

3.  High-Dimensional Structured Feature Screening Using Binary Markov Random Fields.

Authors:  Jie Liu; Peggy Peissig; Chunming Zhang; Elizabeth Burnside; Catherine McCarty; David Page
Journal:  JMLR Workshop Conf Proc       Date:  2012

4.  Specificity in ROS signaling and transcript signatures.

Authors:  Lauri Vaahtera; Mikael Brosché; Michael Wrzaczek; Jaakko Kangasjärvi
Journal:  Antioxid Redox Signal       Date:  2014-02-06       Impact factor: 8.401

5.  Metabolomic signatures in elite cyclists: differential characterization of a seeming normal endocrine status regarding three serum hormones.

Authors:  Boris Labrador; François-Xavier Lejeune; Alain Paris; Cécile Canlet; Jérôme Molina; Michel Guinot; Armand Mégret; Michel Rieu; Jean-Christophe Thalabard; Yves Le Bouc
Journal:  Metabolomics       Date:  2021-07-06       Impact factor: 4.290

6.  A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies.

Authors:  Verena Zuber; A Pedro Duarte Silva; Korbinian Strimmer
Journal:  BMC Bioinformatics       Date:  2012-10-31       Impact factor: 3.169

7.  Identification of single- and multiple-class specific signature genes from gene expression profiles by group marker index.

Authors:  Yu-Shuen Tsai; Kripamoy Aguan; Nikhil R Pal; I-Fang Chung
Journal:  PLoS One       Date:  2011-09-01       Impact factor: 3.240

8.  A benchmark for statistical microarray data analysis that preserves actual biological and technical variance.

Authors:  Benoît De Hertogh; Bertrand De Meulder; Fabrice Berger; Michael Pierre; Eric Bareke; Anthoula Gaigneaux; Eric Depiereux
Journal:  BMC Bioinformatics       Date:  2010-01-11       Impact factor: 3.169

9.  Distributional fold change test - a statistical approach for detecting differential expression in microarray experiments.

Authors:  Vadim Farztdinov; Fionnuala McDyer
Journal:  Algorithms Mol Biol       Date:  2012-11-02       Impact factor: 1.405

10.  Combining multiple hypothesis testing and affinity propagation clustering leads to accurate, robust and sample size independent classification on gene expression data.

Authors:  Argiris Sakellariou; Despina Sanoudou; George Spyrou
Journal:  BMC Bioinformatics       Date:  2012-10-17       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.