Literature DB >> 16278953

Rank-based methods as a non-parametric alternative of the T-statistic for the analysis of biological microarray data.

Rainer Breitling1, Pawel Herzyk.   

Abstract

We have recently introduced a rank-based test statistic, RankProducts (RP), as a new non-parametric method for detecting differentially expressed genes in microarray experiments. It has been shown to generate surprisingly good results with biological datasets. The basis for this performance and the limits of the method are, however, little understood. Here we explore the performance of such rank-based approaches under a variety of conditions using simulated microarray data, and compare it with classical Wilcoxon rank sums and t-statistics, which form the basis of most alternative differential gene expression detection techniques. We show that for realistic simulated microarray datasets, RP is more powerful and accurate for sorting genes by differential expression than t-statistics or Wilcoxon rank sums - in particular for replicate numbers below 10, which are most commonly used in biological experiments. Its relative performance is particularly strong when the data are contaminated by non-normal random noise or when the samples are very inhomogenous, e.g. because they come from different time points or contain a mixture of affected and unaffected cells. However, RP assumes equal measurement variance for all genes and tends to give overly optimistic p-values when this assumption is violated. It is therefore essential that proper variance stabilizing normalization is performed on the data before calculating the RP values. Where this is impossible, another rank-based variant of RP (average ranks) provides a useful alternative with very similar overall performance. The Perl scripts implementing the simulation and evaluation are available upon request. Implementations of the RP method are available for download from the authors website (http://www.brc.dcs.gla.ac.uk/glama).

Mesh:

Year:  2005        PMID: 16278953     DOI: 10.1142/s0219720005001442

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  67 in total

1.  Crosstalk analysis of pathways in breast cancer using a network model based on overlapping differentially expressed genes.

Authors:  Yong Sun; Kai Yuan; Peng Zhang; Rong Ma; Qi-Wen Zhang; Xing-Song Tian
Journal:  Exp Ther Med       Date:  2015-05-27       Impact factor: 2.447

2.  Comments on the rank product method for analyzing replicated experiments.

Authors:  James A Koziol
Journal:  FEBS Lett       Date:  2010-01-20       Impact factor: 4.124

3.  Biomarker detection in the integration of multiple multi-class genomic studies.

Authors:  Shuya Lu; Jia Li; Chi Song; Kui Shen; George C Tseng
Journal:  Bioinformatics       Date:  2009-12-04       Impact factor: 6.937

4.  A genomic study on mammary gland acclimatization to tropical environment in the Holstein cattle.

Authors:  D Wetzel-Gastal; F Feitor; S van Harten; M Sebastiana; L M R Sousa; L A Cardoso
Journal:  Trop Anim Health Prod       Date:  2017-09-27       Impact factor: 1.559

5.  Gene array analysis reveals a common Runx transcriptional programme controlling cell adhesion and survival.

Authors:  S Wotton; A Terry; A Kilbey; A Jenkins; P Herzyk; E Cameron; J C Neil
Journal:  Oncogene       Date:  2008-06-16       Impact factor: 9.867

6.  Meta-analysis of glioblastoma multiforme versus anaplastic astrocytoma identifies robust gene markers.

Authors:  Jonathan M Dreyfuss; Mark D Johnson; Peter J Park
Journal:  Mol Cancer       Date:  2009-09-04       Impact factor: 27.401

7.  A gene signature for post-infectious chronic fatigue syndrome.

Authors:  John W Gow; Suzanne Hagan; Pawel Herzyk; Celia Cannon; Peter O Behan; Abhijit Chaudhuri
Journal:  BMC Med Genomics       Date:  2009-06-25       Impact factor: 3.063

8.  Discovering collectively informative descriptors from high-throughput experiments.

Authors:  Clark D Jeffries; William O Ward; Diana O Perkins; Fred A Wright
Journal:  BMC Bioinformatics       Date:  2009-12-18       Impact factor: 3.169

9.  Data perturbation independent diagnosis and validation of breast cancer subtypes using clustering and patterns.

Authors:  G Alexe; G S Dalgin; R Ramaswamy; C Delisi; G Bhanot
Journal:  Cancer Inform       Date:  2007-02-19

10.  Chronic exposure to arsenic in the drinking water alters the expression of immune response genes in mouse lung.

Authors:  Courtney D Kozul; Thomas H Hampton; Jennifer C Davey; Julie A Gosse; Athena P Nomikos; Phillip L Eisenhauer; Daniel J Weiss; Jessica E Thorpe; Michael A Ihnat; Joshua W Hamilton
Journal:  Environ Health Perspect       Date:  2009-03-04       Impact factor: 9.031

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.