Literature DB >> 15059035

At what scale should microarray data be analyzed?

Shuguang Huang1, Adeline A Yeo, Lawrence Gelbert, Xi Lin, Laura Nisenbaum, Kerry G Bemis.   

Abstract

INTRODUCTION: The hybridization intensities derived from microarray experiments, for example Affymetrix's MAS5 signals, are very often transformed in one way or another before statistical models are fitted. The motivation for performing transformation is usually to satisfy the model assumptions such as normality and homogeneity in variance. Generally speaking, two types of strategies are often applied to microarray data depending on the analysis need: correlation analysis where all the gene intensities on the array are considered simultaneously, and gene-by-gene ANOVA where each gene is analyzed individually. AIM: We investigate the distributional properties of the Affymetrix GeneChip signal data under the two scenarios, focusing on the impact of analyzing the data at an inappropriate scale.
METHODS: The Box-Cox type of transformation is first investigated for the strategy of pooling genes. The commonly used log-transformation is particularly applied for comparison purposes. For the scenario where analysis is on a gene-by-gene basis, the model assumptions such as normality are explored. The impact of using a wrong scale is illustrated by log-transformation and quartic-root transformation.
RESULTS: When all the genes on the array are considered together, the dependent relationship between the expression and its variation level can be satisfactorily removed by Box-Cox transformation. When genes are analyzed individually, the distributional properties of the intensities are shown to be gene dependent. Derivation and simulation show that some loss of power is incurred when a wrong scale is used, but due to the robustness of the t-test, the loss is acceptable when the fold-change is not very large.

Mesh:

Substances:

Year:  2004        PMID: 15059035     DOI: 10.2165/00129785-200404020-00007

Source DB:  PubMed          Journal:  Am J Pharmacogenomics        ISSN: 1175-2203


  4 in total

1.  Comparison of Statistical Tests and Power Analysis for Phosphoproteomics Data.

Authors:  Lei J Ding; Hannah M Schlüter; Matthew J Szucs; Rushdy Ahmad; Zheyang Wu; Weifeng Xu
Journal:  J Proteome Res       Date:  2019-12-26       Impact factor: 4.466

2.  Probe set algorithms: is there a rational best bet?

Authors:  Jinwook Seo; Eric P Hoffman
Journal:  BMC Bioinformatics       Date:  2006-08-30       Impact factor: 3.169

3.  Dissecting early differentially expressed genes in a mixture of differentiating embryonic stem cells.

Authors:  Feng Hong; Fang Fang; Xuming He; Xiaoyi Cao; Hiram Chipperfield; Dan Xie; Wing H Wong; Huck H Ng; Sheng Zhong
Journal:  PLoS Comput Biol       Date:  2009-12-18       Impact factor: 4.475

4.  Model-based variance-stabilizing transformation for Illumina microarray data.

Authors:  Simon M Lin; Pan Du; Wolfgang Huber; Warren A Kibbe
Journal:  Nucleic Acids Res       Date:  2008-01-04       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.