Literature DB >> 17105717

Interpretation of ANOVA models for microarray data using PCA.

J R de Haan1, R Wehrens, S Bauerschmidt, E Piek, R C van Schaik, L M C Buydens.   

Abstract

MOTIVATION: ANOVA is a technique, which is frequently used in the analysis of microarray data, e.g. to assess the significance of treatment effects, and to select interesting genes based on P-values. However, it does not give information about what exactly is causing the effect. Our purpose is to improve the interpretation of the results from ANOVA on large microarray datasets, by applying PCA on the individual variance components. Interaction effects can be visualized by biplots, showing genes and variables in one plot, providing insight in the effect of e.g. treatment or time on gene expression. Because ANOVA has removed uninteresting sources of variance, the results are much more interpretable than without ANOVA. Moreover, the combination of ANOVA and PCA provides a simple way to select genes, based on the interactions of interest.
RESULTS: It is shown that the components from an ANOVA model can be summarized and visualized with PCA, which improves the interpretability of the models. The method is applied to a real time-course gene expression dataset of mesenchymal stem cells. The dataset was designed to investigate the effect of different treatments on osteogenesis. The biplots generated with the algorithm give specific information about the effects of specific treatments on genes over time. These results are in agreement with the literature. The biological validation with GO annotation from the genes present in the selections shows that biologically relevant groups of genes are selected. AVAILABILITY: R code with the implementation of the method for this dataset is available from http://www.cac.science.ru.nl under the heading "Software".

Mesh:

Substances:

Year:  2006        PMID: 17105717     DOI: 10.1093/bioinformatics/btl572

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  Principal component model of multispectral data for near real-time skin chromophore mapping.

Authors:  Jana M Kainerstorfer; Martin Ehler; Franck Amyot; Moinuddin Hassan; Stavros G Demos; Victor Chernomordik; Christoph K Hitzenberger; Amir H Gandjbakhche; Jason D Riley
Journal:  J Biomed Opt       Date:  2010 Jul-Aug       Impact factor: 3.170

Review 2.  Bioinformatics and systems biology of the lipidome.

Authors:  Shankar Subramaniam; Eoin Fahy; Shakti Gupta; Manish Sud; Robert W Byrnes; Dawn Cotter; Ashok Reddy Dinasarapu; Mano Ram Maurya
Journal:  Chem Rev       Date:  2011-09-23       Impact factor: 60.622

3.  Integrating gene expression and GO classification for PCA by preclustering.

Authors:  Jorn R De Haan; Ester Piek; Rene C van Schaik; Jacob de Vlieg; Susanne Bauerschmidt; Lutgarde M C Buydens; Ron Wehrens
Journal:  BMC Bioinformatics       Date:  2010-03-26       Impact factor: 3.169

4.  Human Spermatozoa Quantitative Proteomic Signature Classifies Normo- and Asthenozoospermia.

Authors:  Mayank Saraswat; Sakari Joenväärä; Tushar Jain; Anil Kumar Tomar; Ashima Sinha; Sarman Singh; Savita Yadav; Risto Renkonen
Journal:  Mol Cell Proteomics       Date:  2016-11-28       Impact factor: 5.911

5.  High-fat diet leads to tissue-specific changes reflecting risk factors for diseases in DBA/2J mice.

Authors:  Rachael S Hageman; Asja Wagener; Claudia Hantschel; Karen L Svenson; Gary A Churchill; Gudrun A Brockmann
Journal:  Physiol Genomics       Date:  2010-03-09       Impact factor: 3.107

6.  Interpretation and visualization of non-linear data fusion in kernel space: study on metabolomic characterization of progression of multiple sclerosis.

Authors:  Agnieszka Smolinska; Lionel Blanchet; Leon Coulier; Kirsten A M Ampt; Theo Luider; Rogier Q Hintzen; Sybren S Wijmenga; Lutgarde M C Buydens
Journal:  PLoS One       Date:  2012-06-08       Impact factor: 3.240

7.  Ontology based molecular signatures for immune cell types via gene expression analysis.

Authors:  Terrence F Meehan; Nicole A Vasilevsky; Christopher J Mungall; David S Dougall; Melissa A Haendel; Judith A Blake; Alexander D Diehl
Journal:  BMC Bioinformatics       Date:  2013-08-30       Impact factor: 3.169

8.  Two-way AIC: detection of differentially expressed genes from large scale microarray meta-dataset.

Authors:  Koki Tsuyuzaki; Daisuke Tominaga; Yeondae Kwon; Satoru Miyazaki
Journal:  BMC Genomics       Date:  2013-02-15       Impact factor: 3.969

9.  Transmission of Vibrio cholerae is antagonized by lytic phage and entry into the aquatic environment.

Authors:  Eric J Nelson; Ashrafuzzaman Chowdhury; James Flynn; Stefan Schild; Lori Bourassa; Yue Shao; Regina C LaRocque; Stephen B Calderwood; Firdausi Qadri; Andrew Camilli
Journal:  PLoS Pathog       Date:  2008-10-24       Impact factor: 6.823

10.  Principal component analysis for designed experiments.

Authors:  Tomokazu Konishi
Journal:  BMC Bioinformatics       Date:  2015-12-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.