Robust PCA and classification in biosciences.

Mia Hubert, Sanne Engelen.

Abstract

MOTIVATION: Principal components analysis (PCA) is a very popular dimension reduction technique that is widely used as a first step in the analysis of high-dimensional microarray data. However, the classical approach that is based on the mean and the sample covariance matrix of the data is very sensitive to outliers. Also, classification methods based on this covariance matrix do not give good results in the presence of outlying measurements.
RESULTS: First, we propose a robust PCA (ROBPCA) method for high-dimensional data. It combines projection-pursuit ideas with robust estimation of low-dimensional data. We also propose a diagnostic plot to display and classify the outliers. This ROBPCA method is applied to several bio-chemical datasets. In one example, we also apply a robust discriminant method on the scores obtained with ROBPCA. We show that this combination of robust methods leads to better classifications than classical PCA and quadratic discriminant analysis.
AVAILABILITY: All the programs are part of the Matlab Toolbox for Robust Calibration, available at http://www.wis.kuleuven.ac.be/stat/robust.html.
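
The diagnostic plot mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Matlab code and does not reproduce the ROBPCA fitting step: it assumes a robust center, loading matrix, and eigenvalues have already been estimated, and only computes two distances per observation, one within the PCA subspace (score distance) and one to the subspace (orthogonal distance). The function names, the four labels, and the cutoff values are assumptions made for this illustration; the abstract does not specify them.

```python
import numpy as np


def outlier_map_distances(X, center, loadings, eigenvalues):
    """Return (score distance, orthogonal distance) for each row of X.

    center, loadings (p x k, orthonormal columns) and eigenvalues (k,)
    are assumed to come from some robust PCA fit done elsewhere.
    """
    Xc = X - center
    scores = Xc @ loadings                          # coordinates in the subspace
    sd = np.sqrt(np.sum(scores**2 / eigenvalues, axis=1))
    residual = Xc - scores @ loadings.T             # part outside the subspace
    od = np.linalg.norm(residual, axis=1)
    return sd, od


def classify(sd, od, sd_cut, od_cut):
    """Label each observation by which cutoffs it exceeds (hypothetical labels)."""
    labels = np.full(sd.shape, "regular", dtype=object)
    labels[(sd > sd_cut) & (od <= od_cut)] = "good leverage"
    labels[(sd <= sd_cut) & (od > od_cut)] = "orthogonal outlier"
    labels[(sd > sd_cut) & (od > od_cut)] = "bad leverage"
    return labels


# Toy data: a 2-dimensional subspace of R^3 spanned by the first two axes.
center = np.zeros(3)
loadings = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
eigenvalues = np.array([4.0, 1.0])
X = np.array([
    [1.0, 0.5, 0.0],    # close to the center, inside the subspace
    [10.0, 0.0, 0.0],   # far from the center, but inside the subspace
    [0.0, 0.0, 5.0],    # far from the subspace, projects onto the center
    [10.0, 0.0, 5.0],   # far from the subspace and from the center
])

# Assumed cutoffs: the 0.975 quantile of a chi-square with 2 degrees of
# freedom is -2*ln(0.025), so its square root bounds the score distance;
# the orthogonal-distance cutoff of 1.0 is an arbitrary illustrative value.
sd_cut = np.sqrt(-2.0 * np.log(0.025))
od_cut = 1.0

sd, od = outlier_map_distances(X, center, loadings, eigenvalues)
labels = classify(sd, od, sd_cut, od_cut)
print(list(zip(sd.round(3), od.round(3), labels)))
```

Plotting `od` against `sd` with the two cutoff lines would give a four-quadrant display of this kind; in practice the cutoffs and the subspace would come from the robust fit rather than being fixed by hand.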

Year:  2004        PMID: 14988110     DOI: 10.1093/bioinformatics/bth158

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

Review 1.  Cardiovascular genomics: a biomarker identification pipeline.

Authors:  John H Phan; Chang F Quo; May Dongmei Wang
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-05-16

2.  Influence of in vivo growth on human glioma cell line gene expression: convergent profiles under orthotopic conditions.

Authors:  Kevin Camphausen; Benjamin Purow; Mary Sproull; Tamalee Scott; Tomoko Ozawa; Dennis F Deen; Philip J Tofilon
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-31       Impact factor: 11.205

Review 3.  Multivariate data analysis for neuroimaging data: overview and application to Alzheimer's disease.

Authors:  Christian Habeck; Yaakov Stern
Journal:  Cell Biochem Biophys       Date:  2010-11       Impact factor: 2.194

4.  Francisella tularensis alters human neutrophil gene expression: insights into the molecular basis of delayed neutrophil apoptosis.

Authors:  Justin T Schwartz; Sarmistha Bandyopadhyay; Scott D Kobayashi; Jenna McCracken; Adeline R Whitney; Frank R Deleo; Lee-Ann H Allen
Journal:  J Innate Immun       Date:  2012-09-14       Impact factor: 7.349

5.  Fusion of metabolomics and proteomics data for biomarkers discovery: case study on the experimental autoimmune encephalomyelitis.

Authors:  Lionel Blanchet; Agnieszka Smolinska; Amos Attali; Marcel P Stoop; Kirsten A M Ampt; Hans van Aken; Ernst Suidgeest; Tinka Tuinstra; Sybren S Wijmenga; Theo Luider; Lutgarde M C Buydens
Journal:  BMC Bioinformatics       Date:  2011-06-22       Impact factor: 3.169

6.  Evaluating the impact of abrupt changes in forest policy and management practices on landscape dynamics: analysis of a Landsat image time series in the Atlantic Northern Forest.

Authors:  Kasey R Legaard; Steven A Sader; Erin M Simons-Legaard
Journal:  PLoS One       Date:  2015-06-24       Impact factor: 3.240

7.  Exploring neighborhoods in the metagenome universe.

Authors:  Kathrin P Aßhauer; Heiner Klingenberg; Thomas Lingner; Peter Meinicke
Journal:  Int J Mol Sci       Date:  2014-07-14       Impact factor: 5.923

8.  Serum metabolites and risk of myocardial infarction and ischemic stroke: a targeted metabolomic approach in two German prospective cohorts.

Authors:  Anna Floegel; Tilman Kühn; Disorn Sookthai; Theron Johnson; Cornelia Prehn; Ulrike Rolle-Kampczyk; Wolfgang Otto; Cornelia Weikert; Thomas Illig; Martin von Bergen; Jerzy Adamski; Heiner Boeing; Rudolf Kaaks; Tobias Pischon
Journal:  Eur J Epidemiol       Date:  2017-11-27       Impact factor: 8.082

9.  Application of wavelet-based neural network on DNA microarray data.

Authors:  Jack Lee; Benny Zee
Journal:  Bioinformation       Date:  2008-12-31

10.  QC metrics from CPTAC raw LC-MS/MS data interpreted through multivariate statistics.

Authors:  Xia Wang; Matthew C Chambers; Lorenzo J Vega-Montoto; David M Bunk; Stephen E Stein; David L Tabb
Journal:  Anal Chem       Date:  2014-02-17       Impact factor: 6.986
