Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Comparison of variable selection methods for high-dimensional survival data with competing events.

Literature DB >> 29078093

Comparison of variable selection methods for high-dimensional survival data with competing events.

Julia Gilhodes¹, Christophe Zemmour², Soufiane Ajana¹, Alejandra Martinez³, Jean-Pierre Delord⁴, Eve Leconte⁵, Jean-Marie Boher², Thomas Filleron⁶.

Abstract

BACKGROUND: In the era of personalized medicine, it's primordial to identify gene signatures for each event type in the context of competing risks in order to improve risk stratification and treatment strategy. Until recently, little attention was paid to the performance of high-dimensional selection in deriving molecular signatures in this context. In this paper, we investigate the performance of two selection methods developed in the framework of high-dimensional data and competing risks: Random survival forest and a boosting approach for fitting proportional subdistribution hazards models.
METHODS: Using data from bladder cancer patients (GSE5479) and simulated datasets, stability and prognosis performance of the two methods were evaluated using a resampling strategy. For each sample, the data set was split into 100 training and validation sets. Molecular signatures were developed in the training sets by the two selection methods and then applied on the corresponding validation sets.
RESULTS: Random survival forest and boosting approach have comparable performance for the prediction of survival data, with few selected genes in common. Nevertheless, many different sets of genes are identified by the resampling approach, with a very small frequency of genes occurrence among the signatures. Also, the smaller the training sample size, the lower is the stability of the signatures.
CONCLUSION: Random survival forest and boosting approach give good predictive performance but gene signatures are very unstable. Further works are needed to propose adequate strategies for the analysis of high-dimensional data in the context of competing risks.

Entities: Chemical Disease Species

Keywords: Boosting; Competing risks; High-dimensional data; Random survival forest; Stability; Variable selection

Mesh：

Year: 2017 PMID： 29078093 DOI： 10.1016/j.compbiomed.2017.10.021

Source DB: PubMed Journal: Comput Biol Med ISSN： 0010-4825 Impact factor: 4.589

Keyword Cloud
Cited

5 in total

5. Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data.

Authors: Leili Tapak; Michael R Kosorok; Majid Sadeghifar; Omid Hamidi; Saeid Afshar; Hassan Doosti
Journal: Comput Math Methods Med Date: 2021-09-19 Impact factor: 2.238

5 in total

Comparison of variable selection methods for high-dimensional survival data with competing events.

1. Prognostic gene expression signatures of breast cancer are lacking a sensible biological meaning.

2. Variable selection with Group LASSO approach: Application to Cox regression with frailty model.

3. Variable selection methods for predicting clinical outcomes following allogeneic hematopoietic cell transplantation.

4. Comparison of Variable Selection Methods for Time-to-Event Data in High-Dimensional Settings.

5. Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data.