Literature DB >> 15681279

Partial least squares dimension reduction for microarray gene expression data with a censored response.

Danh V Nguyen1.   

Abstract

An important application of DNA microarray technologies involves monitoring the global state of transcriptional program in tumor cells. One goal in cancer microarray studies is to compare the clinical outcome, such as relapse-free or overall survival, for subgroups of patients defined by global gene expression patterns. A method of comparing patient survival, as a function of gene expression, was recently proposed in [Bioinformatics 18 (2002) 1625] by Nguyen and Rocke. Due to the (a) high-dimensionality of microarray gene expression data and (b) censored survival times, a two-stage procedure was proposed to relate survival times to gene expression profiles. The first stage involves dimensionality reduction of the gene expression data by partial least squares (PLS) and the second stage involves prediction of survival probability using proportional hazard regression. In this paper, we provide a systematic assessment of the performance of this two-stage procedure. PLS dimension reduction involves complex non-linear functions of both the predictors and the response data, rendering exact analytical study intractable. Thus, we assess the methodology under a simulation model for gene expression data with a censored response variable. In particular, we compare the performance of PLS dimension reduction relative to dimension reduction via principal components analysis (PCA) and to a modified PLS (MPLS) approach. PLS performed substantially better relative to dimension reduction via PCA when the total predictor variance explained is low to moderate (e.g. 40%-60%). It performed similar to MPLS and slightly better in some cases. Additionally, we examine the effect of censoring on dimension reduction stage. The performance of all methods deteriorates for a high censoring rate, although PLS-PH performed relatively best overall.

Entities:  

Mesh:

Year:  2005        PMID: 15681279     DOI: 10.1016/j.mbs.2004.10.007

Source DB:  PubMed          Journal:  Math Biosci        ISSN: 0025-5564            Impact factor:   2.144


  5 in total

1.  A supervised approach for predicting patient survival with gene expression data.

Authors:  Karthik Devarajan; Yan Zhou; Neeraj Chachra; Nader Ebrahimi
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2010

2.  Whole Genome DNA and RNA Sequencing of Whole Blood Elucidates the Genetic Architecture of Gene Expression Underlying a Wide Range of Diseases.

Authors:  Chunyu Liu; Roby Joehanes; Jiantao Ma; Yuxuan Wang; Xianbang Sun; Amena Keshawarz; Meera Sooda; Tianxiao Huan; Shih-Jen Hwang; Helena Bui; Brandon Tejada; Peter J Munson; Demirkale Cumhur; Nancy L Heard-Costa; Achilleas N Pitsillides; Gina M Peloso; Michael Feolo; Nataliya Sharopova; Ramachandran S Vasan; Daniel Levy
Journal:  Res Sq       Date:  2022-05-31

3.  Dimension reduction of microarray gene expression data: the accelerated failure time model.

Authors:  Tuan S Nguyen; Javier Rojo
Journal:  J Bioinform Comput Biol       Date:  2009-12       Impact factor: 1.122

4.  Dimension reduction of microarray data in the presence of a censored survival response: a simulation study.

Authors:  Tuan S Nguyen; Javier Rojo
Journal:  Stat Appl Genet Mol Biol       Date:  2009-01-21

5.  Whole Genome DNA and RNA Sequencing of Whole Blood Elucidates the Genetic Architecture of Gene Expression Underlying a Wide Range of Diseases.

Authors:  Chunyu Liu; Roby Joehanes; Jiantao Ma; Yuxuan Wang; Xianbang Sun; Amena Keshawarz; Meera Sooda; Tianxiao Huan; Shih-Jen Hwang; Helena Bui; Brandon Tejada; Peter J Munson; Demirkale Cumhur; Nancy L Heard-Costa; Achilleas N Pitsillides; Gina M Peloso; Michael Feolo; Nataliya Sharopova; Ramachandran S Vasan; Daniel Levy
Journal:  medRxiv       Date:  2022-05-03
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.