Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Gene ranking and biomarker discovery under correlation.

Literature DB >> 19648135

Gene ranking and biomarker discovery under correlation.

Abstract

MOTIVATION: Biomarker discovery and gene ranking is a standard task in genomic high-throughput analysis. Typically, the ordering of markers is based on a stabilized variant of the t-score, such as the moderated t or the SAM statistic. However, these procedures ignore gene-gene correlations, which may have a profound impact on the gene orderings and on the power of the subsequent tests.
RESULTS: We propose a simple procedure that adjusts gene-wise t-statistics to take account of correlations among genes. The resulting correlation-adjusted t-scores ('cat' scores) are derived from a predictive perspective, i.e. as a score for variable selection to discriminate group membership in two-class linear discriminant analysis. In the absence of correlation the cat score reduces to the standard t-score. Moreover, using the cat score it is straightforward to evaluate groups of features (i.e. gene sets). For computation of the cat score from small sample data, we propose a shrinkage procedure. In a comparative study comprising six different synthetic and empirical correlation structures, we show that the cat score improves estimation of gene orderings and leads to higher power for fixed true discovery rate, and vice versa. Finally, we also illustrate the cat score by analyzing metabolomic data. AVAILABILITY: The shrinkage cat score is implemented in the R package 'st', which is freely available under the terms of the GNU General Public License (version 3 or later) from CRAN (http://cran.r-project.org/web/packages/st/).

Mesh：

Substances：
Genetic Markers

Year: 2009 PMID： 19648135 DOI： 10.1093/bioinformatics/btp460

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

28 in total

1. Inference with Transposable Data: Modeling the Effects of Row and Column Correlations.

Authors: Genevera I Allen; Robert Tibshirani
Journal: J R Stat Soc Series B Stat Methodol Date: 2012-03-16 Impact factor: 4.488

2. The limitations of simple gene set enrichment analysis assuming gene independence.

Authors: Pablo Tamayo; George Steinhardt; Arthur Liberzon; Jill P Mesirov
Journal: Stat Methods Med Res Date: 2012-10-14 Impact factor: 3.021

3. High-Dimensional Structured Feature Screening Using Binary Markov Random Fields.

Authors: Jie Liu; Peggy Peissig; Chunming Zhang; Elizabeth Burnside; Catherine McCarty; David Page
Journal: JMLR Workshop Conf Proc Date: 2012

4. Specificity in ROS signaling and transcript signatures.

Authors: Lauri Vaahtera; Mikael Brosché; Michael Wrzaczek; Jaakko Kangasjärvi
Journal: Antioxid Redox Signal Date: 2014-02-06 Impact factor: 8.401

5. Metabolomic signatures in elite cyclists: differential characterization of a seeming normal endocrine status regarding three serum hormones.

Authors: Boris Labrador; François-Xavier Lejeune; Alain Paris; Cécile Canlet; Jérôme Molina; Michel Guinot; Armand Mégret; Michel Rieu; Jean-Christophe Thalabard; Yves Le Bouc
Journal: Metabolomics Date: 2021-07-06 Impact factor: 4.290

6. A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies.

Authors: Verena Zuber; A Pedro Duarte Silva; Korbinian Strimmer
Journal: BMC Bioinformatics Date: 2012-10-31 Impact factor: 3.169

7. Identification of single- and multiple-class specific signature genes from gene expression profiles by group marker index.

Authors: Yu-Shuen Tsai; Kripamoy Aguan; Nikhil R Pal; I-Fang Chung
Journal: PLoS One Date: 2011-09-01 Impact factor: 3.240

8. A benchmark for statistical microarray data analysis that preserves actual biological and technical variance.

Authors: Benoît De Hertogh; Bertrand De Meulder; Fabrice Berger; Michael Pierre; Eric Bareke; Anthoula Gaigneaux; Eric Depiereux
Journal: BMC Bioinformatics Date: 2010-01-11 Impact factor: 3.169

9. Distributional fold change test - a statistical approach for detecting differential expression in microarray experiments.

Authors: Vadim Farztdinov; Fionnuala McDyer
Journal: Algorithms Mol Biol Date: 2012-11-02 Impact factor: 1.405

10. Combining multiple hypothesis testing and affinity propagation clustering leads to accurate, robust and sample size independent classification on gene expression data.

Authors: Argiris Sakellariou; Despina Sanoudou; George Spyrou
Journal: BMC Bioinformatics Date: 2012-10-17 Impact factor: 3.169