Literature DB >> 19933158

Pathway analysis using random forests with bivariate node-split for survival outcomes.

Herbert Pang1, Debayan Datta, Hongyu Zhao.   

Abstract

MOTIVATION: There is great interest in pathway-based methods for genomics data analysis in the research community. Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, no study has assessed the abilities of these methods in incorporating pathway information for analyzing microarray data. In general, genes that are identified without incorporating biological knowledge are more difficult to interpret. Correlating pathway-based gene expression with survival outcomes may lead to biologically more meaningful prognosis biomarkers. Thus, a comprehensive study on how these methods perform in a pathway-based setting is warranted.
RESULTS: In this article, we describe a pathway-based method using random forests to correlate gene expression data with survival outcomes and introduce a novel bivariate node-splitting random survival forests. The proposed method allows researchers to identify important pathways for predicting patient prognosis and time to disease progression, and discover important genes within those pathways. We compared different implementations of random forests with different split criteria and found that bivariate node-splitting random survival forests with log-rank test is among the best. We also performed simulation studies that showed random forests outperforms several other machine learning algorithms and has comparable results with a newly developed component-wise Cox boosting model. Thus, pathway-based survival analysis using machine learning tools represents a promising approach in dissecting pathways and for generating new biological hypothesis from microarray studies. AVAILABILITY: R package Pwayrfsurvival is available from URL: http://www.duke.edu/~hp44/pwayrfsurvival.htm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19933158      PMCID: PMC2804301          DOI: 10.1093/bioinformatics/btp640

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  58 in total

1.  BRCA1 regulates the G2/M checkpoint by activating Chk1 kinase upon DNA damage.

Authors:  Ronit I Yarden; Sherly Pardo-Reoyo; Magda Sgagias; Kenneth H Cowan; Lawrence C Brody
Journal:  Nat Genet       Date:  2002-02-11       Impact factor: 38.330

Review 2.  Cell-cycle checkpoints and cancer.

Authors:  Michael B Kastan; Jiri Bartek
Journal:  Nature       Date:  2004-11-18       Impact factor: 49.962

3.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

4.  Do poor-prognosis breast tumours express membrane cofactor proteins (CD46)?

Authors:  Zahra Madjd; Lindy G Durrant; Sarah E Pinder; Ian O Ellis; John Ronan; Sarah Lewis; Neil K Rushmere; Ian Spendlove
Journal:  Cancer Immunol Immunother       Date:  2004-09-16       Impact factor: 6.968

5.  Genotoxic stress leads to centrosome amplification in breast cancer cell lines that have an inactive G1/S cell cycle checkpoint.

Authors:  Antonino B D'Assoro; Robert Busby; Kelly Suino; Emmanuella Delva; Gustavo J Almodovar-Mercado; Heidi Johnson; Christopher Folk; Daniel J Farrugia; Vlad Vasile; Franca Stivala; Jeffrey L Salisbury
Journal:  Oncogene       Date:  2004-05-20       Impact factor: 9.867

6.  Higher stromal expression of transforming growth factor-beta type II receptors is associated with poorer prognosis breast tumors.

Authors:  John Barlow; David Yandell; Donald Weaver; Theresa Casey; Karen Plaut
Journal:  Breast Cancer Res Treat       Date:  2003-05       Impact factor: 4.872

7.  The Hedgehog pathway is a possible therapeutic target for patients with estrogen receptor-negative breast cancer.

Authors:  Chizu Kameda; Haruo Tanaka; Akio Yamasaki; Masafumi Nakamura; Kenichiro Koga; Norihiro Sato; Makoto Kubo; Syoji Kuroki; Masao Tanaka; Mitsuo Katano
Journal:  Anticancer Res       Date:  2009-03       Impact factor: 2.480

8.  From genomics to chemical genomics: new developments in KEGG.

Authors:  Minoru Kanehisa; Susumu Goto; Masahiro Hattori; Kiyoko F Aoki-Kinoshita; Masumi Itoh; Shuichi Kawashima; Toshiaki Katayama; Michihiro Araki; Mika Hirakawa
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Low E2F1 transcript levels are a strong determinant of favorable breast cancer outcome.

Authors:  Vincent Vuaroqueaux; Patrick Urban; Martin Labuhn; Mauro Delorenzi; Pratyaksha Wirapati; Christopher C Benz; Renata Flury; Holger Dieterich; Frédérique Spyratos; Urs Eppenberger; Serenella Eppenberger-Castori
Journal:  Breast Cancer Res       Date:  2007       Impact factor: 6.466

10.  BAD: a good therapeutic target?

Authors:  Andrea B Motoyama; Nancy E Hynes
Journal:  Breast Cancer Res       Date:  2002-10-18       Impact factor: 6.466

View more
  20 in total

1.  Pathway-based identification of SNPs predictive of survival.

Authors:  Herbert Pang; Michael Hauser; Stéphane Minvielle
Journal:  Eur J Hum Genet       Date:  2011-02-02       Impact factor: 4.246

2.  Pathway hunting by random survival forests.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

3.  Radiomics analysis using stability selection supervised component analysis for right-censored survival data.

Authors:  Kang K Yan; Xiaofei Wang; Wendy W T Lam; Varut Vardhanabhuti; Anne W M Lee; Herbert H Pang
Journal:  Comput Biol Med       Date:  2020-08-06       Impact factor: 4.589

4.  Development and validation of a quantitative real-time polymerase chain reaction classifier for lung cancer prognosis.

Authors:  Guoan Chen; Sinae Kim; Jeremy M G Taylor; Zhuwen Wang; Oliver Lee; Nithya Ramnath; Rishindra M Reddy; Jules Lin; Andrew C Chang; Mark B Orringer; David G Beer
Journal:  J Thorac Oncol       Date:  2011-09       Impact factor: 15.609

Review 5.  Random forests for genomic data analysis.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Genomics       Date:  2012-04-21       Impact factor: 5.736

6.  Gene selection using iterative feature elimination random forests for survival outcomes.

Authors:  Herbert Pang; Stephen L George; Ken Hui; Tiejun Tong
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 Sep-Oct       Impact factor: 3.710

7.  Analysis of a large data set to identify predictors of blood transfusion in primary total hip and knee arthroplasty.

Authors:  ZeYu Huang; Cheng Huang; JinWei Xie; Jun Ma; GuoRui Cao; Qiang Huang; Bin Shen; Virginia Byers Kraus; FuXing Pei
Journal:  Transfusion       Date:  2018-08-25       Impact factor: 3.157

8.  Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes.

Authors:  Herbert Pang; Sin-Ho Jung
Journal:  Genet Epidemiol       Date:  2013-03-07       Impact factor: 2.135

9.  Statistical aspect of translational and correlative studies in clinical trials.

Authors:  Herbert Pang; Xiaofei Wang
Journal:  Chin Clin Oncol       Date:  2016-02

10.  Random Effects Model for Multiple Pathway Analysis with Applications to Type II Diabetes Microarray Data.

Authors:  Herbert Pang; Inyoung Kim; Hongyu Zhao
Journal:  Stat Biosci       Date:  2014-01-30
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.