Literature DB >> 32154833

Unified methods for feature selection in large-scale genomic studies with censored survival outcomes.

Lauren Spirko-Burns1, Karthik Devarajan2.   

Abstract

MOTIVATION: One of the major goals in large-scale genomic studies is to identify genes with a prognostic impact on time-to-event outcomes which provide insight into the disease process. With rapid developments in high-throughput genomic technologies in the past two decades, the scientific community is able to monitor the expression levels of tens of thousands of genes and proteins resulting in enormous datasets where the number of genomic features is far greater than the number of subjects. Methods based on univariate Cox regression are often used to select genomic features related to survival outcome; however, the Cox model assumes proportional hazards (PH), which is unlikely to hold for each feature. When applied to genomic features exhibiting some form of non-proportional hazards (NPH), these methods could lead to an under- or over-estimation of the effects. We propose a broad array of marginal screening techniques that aid in feature ranking and selection by accommodating various forms of NPH. First, we develop an approach based on Kullback-Leibler information divergence and the Yang-Prentice model that includes methods for the PH and proportional odds (PO) models as special cases. Next, we propose R2 measures for the PH and PO models that can be interpreted in terms of explained randomness. Lastly, we propose a generalized pseudo-R2 index that includes PH, PO, crossing hazards and crossing odds models as special cases and can be interpreted as the percentage of separability between subjects experiencing the event and not experiencing the event according to feature measurements.
RESULTS: We evaluate the performance of our measures using extensive simulation studies and publicly available datasets in cancer genomics. We demonstrate that the proposed methods successfully address the issue of NPH in genomic feature selection and outperform existing methods.
AVAILABILITY AND IMPLEMENTATION: R code for the proposed methods is available at github.com/lburns27/Feature-Selection. CONTACT: karthik.devarajan@fccc.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2020        PMID: 32154833      PMCID: PMC7267818          DOI: 10.1093/bioinformatics/btaa161

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Assessment and comparison of prognostic classification schemes for survival data.

Authors:  E Graf; C Schmoor; W Sauerbrei; M Schumacher
Journal:  Stat Med       Date:  1999 Sep 15-30       Impact factor: 2.373

2.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

Authors:  Rafael A Irizarry; Bridget Hobbs; Francois Collin; Yasmin D Beazer-Barclay; Kristen J Antonellis; Uwe Scherf; Terence P Speed
Journal:  Biostatistics       Date:  2003-04       Impact factor: 5.899

3.  Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors:  Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2004-02-12

4.  Consistent estimation of the expected Brier score in general survival models with right-censored event times.

Authors:  Thomas A Gerds; Martin Schumacher
Journal:  Biom J       Date:  2006-12       Impact factor: 2.207

5.  Index for rating diagnostic tests.

Authors:  W J YOUDEN
Journal:  Cancer       Date:  1950-01       Impact factor: 6.860

6.  Analysis of survival data by the proportional odds model.

Authors:  S Bennett
Journal:  Stat Med       Date:  1983 Apr-Jun       Impact factor: 2.373

7.  Gene expression profiling predicts the development of oral cancer.

Authors:  Pierre Saintigny; Li Zhang; You-Hong Fan; Adel K El-Naggar; Vassiliki A Papadimitrakopoulou; Lei Feng; J Jack Lee; Edward S Kim; Waun Ki Hong; Li Mao
Journal:  Cancer Prev Res (Phila)       Date:  2011-02

8.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

9.  Identifying common prognostic factors in genomic cancer studies: a novel index for censored outcomes.

Authors:  Sigrid Rouam; Thierry Moreau; Philippe Broët
Journal:  BMC Bioinformatics       Date:  2010-03-24       Impact factor: 3.169

10.  A pseudo-R2 measure for selecting genomic markers with crossing hazards functions.

Authors:  Sigrid Rouam; Thierry Moreau; Philippe Broët
Journal:  BMC Med Res Methodol       Date:  2011-03-15       Impact factor: 4.615

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.