Literature DB >> 24535726

A system-level pathway-phenotype association analysis using synthetic feature random forest.

Qinxin Pan1, Ting Hu, James D Malley, Angeline S Andrew, Margaret R Karagas, Jason H Moore.   

Abstract

As the cost of genome-wide genotyping decreases, the number of genome-wide association studies (GWAS) has increased considerably. However, the transition from GWAS findings to the underlying biology of various phenotypes remains challenging. As a result, due to its system-level interpretability, pathway analysis has become a popular tool for gaining insights on the underlying biology from high-throughput genetic association data. In pathway analyses, gene sets representing particular biological processes are tested for significant associations with a given phenotype. Most existing pathway analysis approaches rely on single-marker statistics and assume that pathways are independent of each other. As biological systems are driven by complex biomolecular interactions, embracing the complex relationships between single-nucleotide polymorphisms (SNPs) and pathways needs to be addressed. To incorporate the complexity of gene-gene interactions and pathway-pathway relationships, we propose a system-level pathway analysis approach, synthetic feature random forest (SF-RF), which is designed to detect pathway-phenotype associations without making assumptions about the relationships among SNPs or pathways. In our approach, the genotypes of SNPs in a particular pathway are aggregated into a synthetic feature representing that pathway via Random Forest (RF). Multiple synthetic features are analyzed using RF simultaneously and the significance of a synthetic feature indicates the significance of the corresponding pathway. We further complement SF-RF with pathway-based Statistical Epistasis Network (SEN) analysis that evaluates interactions among pathways. By investigating the pathway SEN, we hope to gain additional insights into the genetic mechanisms contributing to the pathway-phenotype association. We apply SF-RF to a population-based genetic study of bladder cancer and further investigate the mechanisms that help explain the pathway-phenotype associations using SEN. The bladder cancer associated pathways we found are both consistent with existing biological knowledge and reveal novel and plausible hypotheses for future biological validations.
© 2014 WILEY PERIODICALS, INC.

Entities:  

Keywords:  epistasis; interactions; pathway analysis; statistical epistasis network (SEN); synthetic feature random forest (SF-RF)

Mesh:

Year:  2014        PMID: 24535726      PMCID: PMC4327826          DOI: 10.1002/gepi.21794

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  61 in total

1.  Network motifs: simple building blocks of complex networks.

Authors:  R Milo; S Shen-Orr; S Itzkovitz; N Kashtan; D Chklovskii; U Alon
Journal:  Science       Date:  2002-10-25       Impact factor: 47.728

Review 2.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

3.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

Review 4.  Molecular networks as sensors and drivers of common human diseases.

Authors:  Eric E Schadt
Journal:  Nature       Date:  2009-09-10       Impact factor: 49.962

Review 5.  Protein networks in disease.

Authors:  Trey Ideker; Roded Sharan
Journal:  Genome Res       Date:  2008-04       Impact factor: 9.043

6.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

7.  Effect of long-term hormone therapy on telomere length in postmenopausal women.

Authors:  Duk-Chul Lee; Jee-Aee Im; Jeong-Ho Kim; Hye-Ree Lee; Jae-Yong Shim
Journal:  Yonsei Med J       Date:  2005-08-31       Impact factor: 2.759

Review 8.  Epistasis and its implications for personal genetics.

Authors:  Jason H Moore; Scott M Williams
Journal:  Am J Hum Genet       Date:  2009-09       Impact factor: 11.025

9.  Telomerase activity in human bladder cancer.

Authors:  Y Lin; H Miyamoto; K Fujinami; H Uemura; M Hosaka; Y Iwasaki; Y Kubota
Journal:  Clin Cancer Res       Date:  1996-06       Impact factor: 12.531

10.  Screening large-scale association study data: exploiting interactions using random forests.

Authors:  Kathryn L Lunetta; L Brooke Hayward; Jonathan Segal; Paul Van Eerdewegh
Journal:  BMC Genet       Date:  2004-12-10       Impact factor: 2.797

View more
  8 in total

Review 1.  Functional and genomic context in pathway analysis of GWAS data.

Authors:  Michael A Mooney; Joel T Nigg; Shannon K McWeeney; Beth Wilmot
Journal:  Trends Genet       Date:  2014-08-22       Impact factor: 11.639

Review 2.  Gene set analysis: A step-by-step guide.

Authors:  Michael A Mooney; Beth Wilmot
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2015-06-08       Impact factor: 3.568

Review 3.  Towards the identification of the loci of adaptive evolution.

Authors:  Carolina Pardo-Diaz; Camilo Salazar; Chris D Jiggins
Journal:  Methods Ecol Evol       Date:  2015-02-12       Impact factor: 7.781

4.  Synthetic learning machines.

Authors:  Hemant Ishwaran; James D Malley
Journal:  BioData Min       Date:  2014-12-18       Impact factor: 2.522

5.  Ensemble learning for detecting gene-gene interactions in colorectal cancer.

Authors:  Faramarz Dorani; Ting Hu; Michael O Woods; Guangju Zhai
Journal:  PeerJ       Date:  2018-10-29       Impact factor: 2.984

6.  Identification and validation of genetic variants predictive of gait in standardbred horses.

Authors:  Annette M McCoy; Samantha K Beeson; Carl-Johan Rubin; Leif Andersson; Paul Caputo; Sigrid Lykkjen; Alison Moore; Richard J Piercy; James R Mickelson; Molly E McCue
Journal:  PLoS Genet       Date:  2019-05-28       Impact factor: 5.917

7.  The post GWAS era: strategies to identify gene-gene and gene-environment interactions in urinary bladder cancer.

Authors:  Silvia Selinski
Journal:  EXCLI J       Date:  2014-11-03       Impact factor: 4.068

8.  Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study.

Authors:  Stephan Seifert; Sven Gundlach; Olaf Junge; Silke Szymczak
Journal:  Bioinformatics       Date:  2020-08-01       Impact factor: 6.937

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.