Literature DB >> 20014472

Dimension reduction of microarray gene expression data: the accelerated failure time model.

Tuan S Nguyen1, Javier Rojo.   

Abstract

The construction of the components of Partial Least Squares (PLS) is based on the maximization of the covariance/correlation between linear combinations of the predictors and the response. However, the usual Pearson correlation is influenced by outliers in the response or in the predictors. To cope with outliers, we replace the Pearson correlation with the Spearman rank correlation in the optimization criteria of PLS. The rank-based method of PLS is insensitive to outlying values in both the predictors and response, and incorporates the censoring information by using an approach of Nguyen and Rocke (2004) and two approaches of reweighting and mean imputation of Datta et al. (2007). The performance of the rank-based approaches of PLS, denoted by Rank-based Modified Partial Least Squares (RMPLS), Rank-based Reweighted Partial Least Squares (RRWPLS), and Rank-based Mean-Imputation Partial Least Squares (RMIPLS), is investigated in a simulation study and on four real datasets, under an Accelerated Failure Time (AFT) model, against their un-ranked counterparts, and several other dimension reduction techniques. The results indicate that RMPLS is a better dimension reduction method than other variants of PLS as well as other considered methods in terms of the minimized cross-validation error of fit and the mean squared error of fit in the presence of outliers in the response, and is comparable to other variants of PLS in the absence of outliers. Supplementary Materials are available at http://www.worldscinet.com/jbcb/

Entities:  

Keywords:  censored response; dimension reduction; outliers; rank-based PLS

Mesh:

Year:  2009        PMID: 20014472      PMCID: PMC2796584          DOI: 10.1142/s0219720009004412

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  17 in total

1.  Iterative partial least squares with right-censored data analysis: a comparison to other dimension reduction techniques.

Authors:  Jie Huang; David Harrington
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

2.  Partial least squares dimension reduction for microarray gene expression data with a censored response.

Authors:  Danh V Nguyen
Journal:  Math Biosci       Date:  2005-01-22       Impact factor: 2.144

3.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

4.  Kernel Cox regression models for linking gene expression profiles to censored survival data.

Authors:  Hongzhe Li; Yihui Luan
Journal:  Pac Symp Biocomput       Date:  2003

5.  Linking gene expression data with patient survival times using partial least squares.

Authors:  Peter J Park; Lu Tian; Isaac S Kohane
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

6.  Partial least squares proportional hazard regression for application to DNA microarray survival data.

Authors:  Danh V Nguyen; David M Rocke
Journal:  Bioinformatics       Date:  2002-12       Impact factor: 6.937

7.  Dimension reduction of microarray data in the presence of a censored survival response: a simulation study.

Authors:  Tuan S Nguyen; Javier Rojo
Journal:  Stat Appl Genet Mol Biol       Date:  2009-01-21

8.  Gene-expression profiles predict survival of patients with lung adenocarcinoma.

Authors:  David G Beer; Sharon L R Kardia; Chiang-Ching Huang; Thomas J Giordano; Albert M Levin; David E Misek; Lin Lin; Guoan Chen; Tarek G Gharib; Dafydd G Thomas; Michelle L Lizyness; Rork Kuick; Satoru Hayasaka; Jeremy M G Taylor; Mark D Iannettoni; Mark B Orringer; Samir Hanash
Journal:  Nat Med       Date:  2002-07-15       Impact factor: 53.440

9.  Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO.

Authors:  Susmita Datta; Jennifer Le-Rademacher; Somnath Datta
Journal:  Biometrics       Date:  2007-03       Impact factor: 2.571

10.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.