Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO.

Susmita Datta, Jennifer Le-Rademacher, Somnath Datta.

Abstract

We consider the problem of predicting survival times of cancer patients from the gene expression profiles of their tumor samples via linear regression modeling of log-transformed failure times. The partial least squares (PLS) and least absolute shrinkage and selection operator (LASSO) methodologies are used for this purpose, where we first modify the data to account for censoring. Three approaches to handling right-censored data (reweighting, mean imputation, and multiple imputation) are considered. Their performances are examined in a detailed simulation study and compared with those of full-data PLS and LASSO, that is, the fits that would have been obtained had there been no censoring. A major objective of this article is to investigate the performances of PLS and LASSO in the context of microarray data, where the number of covariates is very large and there are extremely few samples. We demonstrate that LASSO outperforms PLS in terms of prediction error when the list of covariates includes a moderate to large percentage of useless or noise variables; otherwise, PLS may outperform LASSO. For a moderate sample size (100 with 10,000 covariates), LASSO performed better than a no-covariate model (or noise-based prediction). The mean imputation method appears to best track the performance of the full-data PLS or LASSO. The mean imputation scheme is applied to an existing data set on lung cancer. This reanalysis using the mean-imputed PLS and LASSO identifies a number of genes that were known from previous studies to be related to cancer or tumor activities.
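To make the mean imputation idea concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not give the imputation formula, so this assumes a common Kaplan-Meier-based version: each right-censored log failure time is replaced by the estimated conditional mean of log T given that T exceeds the censoring time. The function names `km_jumps` and `mean_impute_log_times` are hypothetical, and the convention of treating the largest observation as an event (so that the Kaplan-Meier masses sum to 1) is also an assumption.

```python
import math

def km_jumps(times, events):
    """Kaplan-Meier probability mass at each event time.

    times  : observed times (failure or censoring), all > 0
    events : 1 if the failure was observed, 0 if right-censored
    Assumption: the largest observed time is treated as an event,
    so the returned masses sum to 1.
    """
    order = sorted(range(len(times)), key=lambda i: times[i])
    n = len(times)
    surv = 1.0          # KM survival just before the current time
    jumps = []          # list of (event time, probability mass)
    i = 0
    while i < n:
        t = times[order[i]]
        j, d = i, 0
        while j < n and times[order[j]] == t:   # group tied times
            d += events[order[j]]
            j += 1
        at_risk = n - i
        if j == n:      # last time point: assign all remaining mass
            d = at_risk
        mass = surv * d / at_risk
        if d > 0:
            jumps.append((t, mass))
        surv -= mass
        i = j
    return jumps

def mean_impute_log_times(times, events):
    """Return log times with censored values replaced by an estimate
    of E[log T | T > c] computed from the Kaplan-Meier masses."""
    jumps = km_jumps(times, events)
    t_max = max(times)
    imputed = []
    for t, e in zip(times, events):
        if e == 1 or t >= t_max:
            imputed.append(math.log(t))          # observed (or last) time
        else:
            tail = [(s, m) for s, m in jumps if s > t]
            total = sum(m for _, m in tail)      # estimated P(T > t)
            imputed.append(sum(math.log(s) * m for s, m in tail) / total)
    return imputed

# Toy data: the second patient is censored at time 2.
y = mean_impute_log_times([1.0, 2.0, 3.0, 4.0], [1, 0, 1, 1])
```

The imputed responses `y` would then serve as the outcome in an ordinary PLS or LASSO fit on the expression matrix, which is the step the abstract describes as modifying the data before regression.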

Year:  2007        PMID: 17447952     DOI: 10.1111/j.1541-0420.2006.00660.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  31 in total

1.  Lasso regularization for left-censored Gaussian outcome and high-dimensional predictors.

Authors:  Perrine Soret; Marta Avalos; Linda Wittkop; Daniel Commenges; Rodolphe Thiébaut
Journal:  BMC Med Res Methodol       Date:  2018-12-04       Impact factor: 4.615

2.  Buckley-James boosting for survival analysis with high-dimensional biomarker data.

Authors:  Zhu Wang; C Y Wang
Journal:  Stat Appl Genet Mol Biol       Date:  2010-06-08

3.  Bayesian ensemble methods for survival prediction in gene expression data.

Authors:  Vinicius Bonato; Veerabhadran Baladandayuthapani; Bradley M Broom; Erik P Sulman; Kenneth D Aldape; Kim-Anh Do
Journal:  Bioinformatics       Date:  2010-12-08       Impact factor: 6.937

4.  Ranking prognosis markers in cancer genomic studies.

Authors:  Shuangge Ma; Xiao Song
Journal:  Brief Bioinform       Date:  2010-11-18       Impact factor: 11.622

5.  Identification of Breast Cancer Prognosis Markers via Integrative Analysis.

Authors:  Shuangge Ma; Ying Dai; Jian Huang; Yang Xie
Journal:  Comput Stat Data Anal       Date:  2012-09-01       Impact factor: 1.681

6.  Partial least squares Cox regression for genome-wide data.

Authors:  Ståle Nygård; Ornulf Borgan; Ole Christian Lingjaerde; Hege Leite Størvold
Journal:  Lifetime Data Anal       Date:  2008-06       Impact factor: 1.588

7.  Rank-based estimation in the ℓ1-regularized partly linear model for censored outcomes with application to integrated analyses of clinical predictors and gene expression data.

Authors:  Brent A Johnson
Journal:  Biostatistics       Date:  2009-06-24       Impact factor: 5.899

8.  Semiparametric prognosis models in genomic studies.

Authors:  Shuangge Ma; Jian Huang; Mingyu Shi; Yang Li; Ben-Chang Shia
Journal:  Brief Bioinform       Date:  2010-02-01       Impact factor: 11.622

9.  On path restoration for censored outcomes.

Authors:  Brent A Johnson; Qi Long; Matthias Chung
Journal:  Biometrics       Date:  2011-04-02       Impact factor: 2.571

10.  Dimension reduction of microarray gene expression data: the accelerated failure time model.

Authors:  Tuan S Nguyen; Javier Rojo
Journal:  J Bioinform Comput Biol       Date:  2009-12       Impact factor: 1.122
