Literature DB >> 16809386

Pathway analysis using random forests classification and regression.

Herbert Pang1, Aiping Lin, Matthew Holford, Bradley E Enerson, Bin Lu, Michael P Lawton, Eugenia Floyd, Hongyu Zhao.   

Abstract

MOTIVATION: Although numerous methods have been developed to better capture biological information from microarray data, commonly used single gene-based methods neglect interactions among genes and leave room for other novel approaches. For example, most classification and regression methods for microarray data are based on the whole set of genes and have not made use of pathway information. Pathway-based analysis in microarray studies may lead to more informative and relevant knowledge for biological researchers.
RESULTS: In this paper, we describe a pathway-based classification and regression method using Random Forests to analyze gene expression data. The proposed methods allow researchers to rank important pathways from externally available databases, discover important genes, find pathway-based outlying cases and make full use of a continuous outcome variable in the regression setting. We also compared Random Forests with other machine learning methods using several datasets and found that Random Forests classification error rates were either the lowest or the second-lowest. By combining pathway information and novel statistical methods, this procedure represents a promising computational strategy in dissecting pathways and can provide biological insight into the study of microarray data. AVAILABILITY: Source code written in R is available from http://bioinformatics.med.yale.edu/pathway-analysis/rf.htm.

Mesh:

Year:  2006        PMID: 16809386     DOI: 10.1093/bioinformatics/btl344

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  74 in total

1.  Pathway analysis using random forests with bivariate node-split for survival outcomes.

Authors:  Herbert Pang; Debayan Datta; Hongyu Zhao
Journal:  Bioinformatics       Date:  2009-11-18       Impact factor: 6.937

2.  Comments on Fifty Years of Classification and Regression Trees.

Authors:  Chi Song; Heping Zhang
Journal:  Int Stat Rev       Date:  2014-12-01       Impact factor: 2.217

Review 3.  On the Application of Multivariate Statistical and Data Mining Analyses to Data in Neuroscience.

Authors:  Paul F Smith
Journal:  J Undergrad Neurosci Educ       Date:  2018-06-15

4.  The effects of genetic variants related to insulin metabolism pathways and the interactions with lifestyles on colorectal cancer risk.

Authors:  Su Yon Jung; Zuo-Feng Zhang
Journal:  Menopause       Date:  2019-07       Impact factor: 2.953

Review 5.  Systems analysis of high-throughput data.

Authors:  Rosemary Braun
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

6.  Identifying dysfunctional crosstalk of pathways in various regions of Alzheimer's disease brains.

Authors:  Zhi-Ping Liu; Yong Wang; Xiang-Sun Zhang; Luonan Chen
Journal:  BMC Syst Biol       Date:  2010-09-13

7.  An integrative ChIP-chip and gene expression profiling to model SMAD regulatory modules.

Authors:  Huaxia Qin; Michael W Y Chan; Sandya Liyanarachchi; Curtis Balch; Dustin Potter; Irene J Souriraj; Alfred S L Cheng; Francisco J Agosto-Perez; Elena V Nikonova; Pearlly S Yan; Huey-Jen Lin; Kenneth P Nephew; Joel H Saltz; Louise C Showe; Tim H M Huang; Ramana V Davuluri
Journal:  BMC Syst Biol       Date:  2009-07-17

8.  Genome-wide mRNA expression analysis of hepatic adaptation to high-fat diets reveals switch from an inflammatory to steatotic transcriptional program.

Authors:  Marijana Radonjic; Jorn R de Haan; Marjan J van Erk; Ko Willems van Dijk; Sjoerd A A van den Berg; Philip J de Groot; Michael Müller; Ben van Ommen
Journal:  PLoS One       Date:  2009-08-14       Impact factor: 3.240

9.  A pathway analysis tool for analyzing microarray data of species with low physiological information.

Authors:  M F W Te Pas; S van Hemert; B Hulsegge; A J W Hoekman; M H Pool; J M J Rebel; M A Smits
Journal:  Adv Bioinformatics       Date:  2008-12-24

10.  A markov classification model for metabolic pathways.

Authors:  Timothy Hancock; Hiroshi Mamitsuka
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.