Literature DB >> 22547432

Gene selection using iterative feature elimination random forests for survival outcomes.

Herbert Pang1, Stephen L George, Ken Hui, Tiejun Tong.   

Abstract

Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis.Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes.

Entities:  

Mesh:

Year:  2012        PMID: 22547432      PMCID: PMC3495190          DOI: 10.1109/TCBB.2012.63

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  42 in total

1.  TGF-beta-induced apoptosis is mediated by the adapter protein Daxx that facilitates JNK activation.

Authors:  R Perlman; W P Schiemann; M W Brooks; H F Lodish; R A Weinberg
Journal:  Nat Cell Biol       Date:  2001-08       Impact factor: 28.824

2.  Recursive Mahalanobis separability measure for gene subset selection.

Authors:  K Z Mao; Wenyin Tang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2011 Jan-Mar       Impact factor: 3.710

3.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

4.  Data-driven approach to predict survival of cancer patients: estimation of microarray genes' prediction significance by Cox proportional hazard regression model.

Authors:  Efthimios Motakis; Anna V Ivshina; Vladimir A Kuznetsov
Journal:  IEEE Eng Med Biol Mag       Date:  2009 Jul-Aug

5.  Sparse kernel methods for high-dimensional survival data.

Authors:  Ludger Evers; Claudia-Martina Messow
Journal:  Bioinformatics       Date:  2008-05-30       Impact factor: 6.937

6.  Laplacian linear discriminant analysis approach to unsupervised feature selection.

Authors:  Satoshi Niijima; Yasushi Okuno
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2009 Oct-Dec       Impact factor: 3.710

7.  Proliferative response of mantle cell lymphoma cells stimulated by CD40 ligation and IL-4.

Authors:  R Castillo; J Mascarenhas; W Telford; A Chadburn; S M Friedman; E J Schattner
Journal:  Leukemia       Date:  2000-02       Impact factor: 11.528

8.  Distinct signatures of B-cell homeostatic and activation-dependent chemokine receptors in the development and progression of extragastric MALT lymphomas.

Authors:  A J A Deutsch; A Aigelsreiter; E Steinbauer; M Frühwirth; H Kerl; C Beham-Schmid; H Schaider; P Neumeister
Journal:  J Pathol       Date:  2008-08       Impact factor: 7.996

9.  Common gene variants in the tumor necrosis factor (TNF) and TNF receptor superfamilies and NF-kB transcription factors and non-Hodgkin lymphoma risk.

Authors:  Sophia S Wang; Mark P Purdue; James R Cerhan; Tongzhang Zheng; Idan Menashe; Bruce K Armstrong; Qing Lan; Patricia Hartge; Anne Kricker; Yawei Zhang; Lindsay M Morton; Claire M Vajdic; Theodore R Holford; Richard K Severson; Andrew Grulich; Brian P Leaderer; Scott Davis; Wendy Cozen; Meredith Yeager; Stephen J Chanock; Nilanjan Chatterjee; Nathaniel Rothman
Journal:  PLoS One       Date:  2009-04-24       Impact factor: 3.240

10.  Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models.

Authors:  Harald Binder; Martin Schumacher
Journal:  BMC Bioinformatics       Date:  2008-01-10       Impact factor: 3.169

View more
  9 in total

1.  Detecting biomarkers from microarray data using distributed correlation based gene selection.

Authors:  Alok Kumar Shukla; Diwakar Tripathi
Journal:  Genes Genomics       Date:  2020-02-10       Impact factor: 1.839

2.  A Selective Review on Random Survival Forests for High Dimensional Data.

Authors:  Hong Wang; Gang Li
Journal:  Quant Biosci       Date:  2017

3.  Statistical aspect of translational and correlative studies in clinical trials.

Authors:  Herbert Pang; Xiaofei Wang
Journal:  Chin Clin Oncol       Date:  2016-02

4.  Stratified pathway analysis to identify gene sets associated with oral contraceptive use and breast cancer.

Authors:  Herbert Pang; Hongyu Zhao
Journal:  Cancer Inform       Date:  2014-12-09

5.  RGIFE: a ranked guided iterative feature elimination heuristic for the identification of biomarkers.

Authors:  Nicola Lazzarini; Jaume Bacardit
Journal:  BMC Bioinformatics       Date:  2017-06-30       Impact factor: 3.169

6.  An Efficient Cancer Classification Model Using Microarray and High-Dimensional Data.

Authors:  Hanaa Fathi; Hussain AlSalman; Abdu Gumaei; Ibrahim I M Manhrawy; Abdelazim G Hussien; Passent El-Kafrawy
Journal:  Comput Intell Neurosci       Date:  2021-12-29

7.  Feature selection through validation and un-censoring of endovascular repair survival data for predicting the risk of re-intervention.

Authors:  Omneya Attallah; Alan Karthikesalingam; Peter J E Holt; Matthew M Thompson; Rob Sayers; Matthew J Bown; Eddie C Choke; Xianghong Ma
Journal:  BMC Med Inform Decis Mak       Date:  2017-08-03       Impact factor: 2.796

8.  Personalized prediction of delayed graft function for recipients of deceased donor kidney transplants with machine learning.

Authors:  Satoru Kawakita; Jennifer L Beaumont; Vadim Jucaud; Matthew J Everly
Journal:  Sci Rep       Date:  2020-10-27       Impact factor: 4.379

9.  Random survival forest model identifies novel biomarkers of event-free survival in high-risk pediatric acute lymphoblastic leukemia.

Authors:  Zachary S Bohannan; Frederick Coffman; Antonina Mitrofanova
Journal:  Comput Struct Biotechnol J       Date:  2022-01-06       Impact factor: 6.155

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.