Literature DB >> 16091409

Accounting for probe-level noise in principal component analysis of microarray data.

Guido Sanguinetti1, Marta Milo, Magnus Rattray, Neil D Lawrence.   

Abstract

MOTIVATION: Principal Component Analysis (PCA) is one of the most popular dimensionality reduction techniques for the analysis of high-dimensional datasets. However, in its standard form, it does not take into account any error measures associated with the data points beyond a standard spherical noise. This indiscriminate nature provides one of its main weaknesses when applied to biological data with inherently large variability, such as expression levels measured with microarrays. Methods now exist for extracting credibility intervals from the probe-level analysis of cDNA and oligonucleotide microarray experiments. These credibility intervals are gene and experiment specific, and can be propagated through an appropriate probabilistic downstream analysis.
RESULTS: We propose a new model-based approach to PCA that takes into account the variances associated with each gene in each experiment. We develop an efficient EM-algorithm to estimate the parameters of our new model. The model provides significantly better results than standard PCA, while remaining computationally reasonable. We show how the model can be used to 'denoise' a microarray dataset leading to improved expression profiles and tighter clustering across profiles. The probabilistic nature of the model means that the correct number of principal components is automatically obtained.

Entities:  

Mesh:

Year:  2005        PMID: 16091409     DOI: 10.1093/bioinformatics/bti617

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

Review 1.  Matrix factorisation methods applied in microarray data analysis.

Authors:  Andrew V Kossenkov; Michael F Ochs
Journal:  Int J Data Min Bioinform       Date:  2010       Impact factor: 0.667

2.  Transcriptional profiling of bovine intervertebral disc cells: implications for identification of normal and degenerate human intervertebral disc cell phenotypes.

Authors:  Ben M Minogue; Stephen M Richardson; Leo Ah Zeef; Anthony J Freemont; Judith A Hoyland
Journal:  Arthritis Res Ther       Date:  2010-02-11       Impact factor: 5.156

3.  Study protocol: a randomised controlled trial investigating the effect of exercise training on peripheral blood gene expression in patients with stable angina.

Authors:  Liam Bourke; Garry A Tew; Marta Milo; David C Crossman; John M Saxton; Timothy J A Chico
Journal:  BMC Public Health       Date:  2010-10-18       Impact factor: 3.295

4.  Transcriptional and functional differences in stem cell populations isolated from extraocular and limb muscles.

Authors:  Eugenia C Pacheco-Pinedo; Murat T Budak; Ulrike Zeiger; Louise Helskov Jørgensen; Sasha Bogdanovich; Henrik Daa Schrøder; Neal A Rubinstein; Tejvir S Khurana
Journal:  Physiol Genomics       Date:  2008-12-30       Impact factor: 3.107

5.  LS-NMF: a modified non-negative matrix factorization algorithm utilizing uncertainty estimates.

Authors:  Guoli Wang; Andrew V Kossenkov; Michael F Ochs
Journal:  BMC Bioinformatics       Date:  2006-03-28       Impact factor: 3.169

6.  Intensity dependent confidence intervals on microarray measurements of differentially expressed genes: a case study of the effect of MK5, FKRP and TAF4 on the transcriptome.

Authors:  Werner Van Belle; Nancy Gerits; Kirsti Jakobsen; Vigdis Brox; Marijke Van Ghelue; Ugo Moens
Journal:  Gene Regul Syst Bio       Date:  2007-07-17

7.  puma: a Bioconductor package for propagating uncertainty in microarray analysis.

Authors:  Richard D Pearson; Xuejun Liu; Guido Sanguinetti; Marta Milo; Neil D Lawrence; Magnus Rattray
Journal:  BMC Bioinformatics       Date:  2009-07-09       Impact factor: 3.169

8.  A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability.

Authors:  Herman M J Sontrop; Perry D Moerland; René van den Ham; Marcel J T Reinders; Wim F J Verhaegh
Journal:  BMC Bioinformatics       Date:  2009-11-26       Impact factor: 3.169

9.  puma 3.0: improved uncertainty propagation methods for gene and transcript expression analysis.

Authors:  Xuejun Liu; Zhenzhu Gao; Li Zhang; Magnus Rattray
Journal:  BMC Bioinformatics       Date:  2013-02-05       Impact factor: 3.169

10.  Including probe-level uncertainty in model-based gene expression clustering.

Authors:  Xuejun Liu; Kevin K Lin; Bogi Andersen; Magnus Rattray
Journal:  BMC Bioinformatics       Date:  2007-03-21       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.