Literature DB >> 20519286

Structure-based variable selection for survival data.

Vincenzo Lagani1, Ioannis Tsamardinos.   

Abstract

MOTIVATION: Variable selection is a typical approach used for molecular-signature and biomarker discovery; however, its application to survival data is often complicated by censored samples. We propose a new algorithm for variable selection suitable for the analysis of high-dimensional, right-censored data called Survival Max-Min Parents and Children (SMMPC). The algorithm is conceptually simple, scalable, based on the theory of Bayesian networks (BNs) and the Markov blanket and extends the corresponding algorithm (MMPC) for classification tasks. The selected variables have a structural interpretation: if T is the survival time (in general the time-to-event), SMMPC returns the variables adjacent to T in the BN representing the data distribution. The selected variables also have a causal interpretation that we discuss.
RESULTS: We conduct an extensive empirical analysis of prototypical and state-of-the-art variable selection algorithms for survival data that are applicable to high-dimensional biological data. SMMPC selects on average the smallest variable subsets (less than a dozen per dataset), while statistically significantly outperforming all of the methods in the study returning a manageable number of genes that could be inspected by a human expert. AVAILABILITY: Matlab and R code are freely available from http://www.mensxmachina.org

Entities:  

Mesh:

Year:  2010        PMID: 20519286     DOI: 10.1093/bioinformatics/btq261

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Multiple predictively equivalent risk models for handling missing data at time of prediction: With an application in severe hypoglycemia risk prediction for type 2 diabetes.

Authors:  Sisi Ma; Pamela J Schreiner; Elizabeth R Seaquist; Mehmet Ugurbil; Rachel Zmora; Lisa S Chow
Journal:  J Biomed Inform       Date:  2020-01-28       Impact factor: 6.317

2.  T-ReCS: stable selection of dynamically formed groups of features with application to prediction of clinical outcomes.

Authors:  Grace T Huang; Ioannis Tsamardinos; Vineet Raghu; Naftali Kaminski; Panayiotis V Benos
Journal:  Pac Symp Biocomput       Date:  2015

3.  An Artificial Neural Network Stratifies the Risks of Reintervention and Mortality after Endovascular Aneurysm Repair; a Retrospective Observational study.

Authors:  Alan Karthikesalingam; Omneya Attallah; Xianghong Ma; Sandeep Singh Bahia; Luke Thompson; Alberto Vidal-Diez; Edward C Choke; Matt J Bown; Robert D Sayers; Matt M Thompson; Peter J Holt
Journal:  PLoS One       Date:  2015-07-15       Impact factor: 3.240

4.  Biomarker signature identification in "omics" data with multi-class outcome.

Authors:  Vincenzo Lagani; George Kortas; Ioannis Tsamardinos
Journal:  Comput Struct Biotechnol J       Date:  2013-06-08       Impact factor: 7.271

5.  Hidden treasures in "ancient" microarrays: gene-expression portrays biology and potential resistance pathways of major lung cancer subtypes and normal tissue.

Authors:  Konstantinos Kerkentzes; Vincenzo Lagani; Ioannis Tsamardinos; Mogens Vyberg; Oluf Dimitri Røe
Journal:  Front Oncol       Date:  2014-09-29       Impact factor: 6.244

6.  Survival Prediction and Feature Selection in Patients with Breast Cancer Using Support Vector Regression.

Authors:  Shahrbanoo Goli; Hossein Mahjub; Javad Faradmal; Hoda Mashayekhi; Ali-Reza Soltanian
Journal:  Comput Math Methods Med       Date:  2016-11-01       Impact factor: 2.238

7.  Feature selection for high-dimensional temporal data.

Authors:  Michail Tsagris; Vincenzo Lagani; Ioannis Tsamardinos
Journal:  BMC Bioinformatics       Date:  2018-01-23       Impact factor: 3.169

8.  Feature selection with the R package MXM.

Authors:  Michail Tsagris; Ioannis Tsamardinos
Journal:  F1000Res       Date:  2018-09-20
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.