Literature DB >> 28989562

A Bayesian approach to identify genes and gene-level SNP aggregates in a genetic analysis of cancer data.

Francesco C Stingo1, Michael D Swartz2, Marina Vannucci3.   

Abstract

Complex diseases, such as cancer, arise from complex etiologies consisting of multiple single-nucleotide polymorphisms (SNPs), each contributing a small amount to the overall risk of disease. Thus, many researchers have gone beyond single-SNPs analysis methods, focusing instead on groups of SNPs, for example by analysing haplotypes. More recently, pathway-based methods have been proposed that use prior biological knowledge on gene function to achieve a more powerful analysis of genome-wide association studies (GWAS) data. In this paper we propose a novel Bayesian modeling framework to identify molecular biomarkers for disease prediction. Our method combines pathway-based approaches with multiple SNP analyses of a specified region of interest. The model's development is motivated by SNP data from a lung cancer study. In our approach we define gene-level scores based on SNP allele frequencies and use a linear modeling setting to study the scores association to the observed phenotype. The basic idea behind the definition of gene-level scores is to weigh the SNPs within the gene according to their rarity, based on genotype frequencies expected under the Hardy-Weinberg equilibrium law. This results in scores giving more importance to the unusually low frequencies, i.e. to SNPs that might indicate peculiar genetic differences between subjects belonging to different groups. An additional feature of our approach is that we incorporate information on SNP-to-SNP associations into the model. In particular, we use network priors that model the linkage disequilibrium between SNPs. For posterior inference, we design a stochastic search method that identifies significant biomarkers (genes and SNPs) for disease prediction. We assess performances on simulated data and compare results to existing approaches. We then show the ability of the proposed methodology to detect relevant genes and associated SNPs in a lung cancer dataset.

Entities:  

Keywords:  Bayesian variable selection; Hardy-Weinberg equilibrium law; Linear models; Linkage disequilibrium; Markov random field; SNP data

Year:  2015        PMID: 28989562      PMCID: PMC5630184          DOI: 10.4310/SII.2015.v8.n2.a2

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  45 in total

1.  The mystery of missing heritability: Genetic interactions create phantom heritability.

Authors:  Or Zuk; Eliana Hechter; Shamil R Sunyaev; Eric S Lander
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-05       Impact factor: 11.205

2.  A pathway analysis method for genome-wide association studies.

Authors:  Babak Shahbaba; Catherine M Shachaf; Zhaoxia Yu
Journal:  Stat Med       Date:  2012-02-03       Impact factor: 2.373

3.  Incorporating model uncertainty in detecting rare variants: the Bayesian risk index.

Authors:  Melanie A Quintana; Jonine L Berstein; Duncan C Thomas; David V Conti
Journal:  Genet Epidemiol       Date:  2011-08-26       Impact factor: 2.135

4.  A unified framework for detecting genetic association with multiple SNPs in a candidate gene or region: contrasting genotype scores and LD patterns between cases and controls.

Authors:  Wei Pan
Journal:  Hum Hered       Date:  2009-10-02       Impact factor: 0.444

5.  Molecular identification of hydroxylysine kinase and of ammoniophospholyases acting on 5-phosphohydroxy-L-lysine and phosphoethanolamine.

Authors:  Maria Veiga-da-Cunha; Farah Hadi; Thomas Balligand; Vincent Stroobant; Emile Van Schaftingen
Journal:  J Biol Chem       Date:  2012-01-12       Impact factor: 5.157

6.  Pathway-based analysis for genome-wide association studies using supervised principal components.

Authors:  Xi Chen; Lily Wang; Bo Hu; Mingsheng Guo; John Barnard; Xiaofeng Zhu
Journal:  Genet Epidemiol       Date:  2010-11       Impact factor: 2.135

7.  Forward-time simulations of human populations with complex diseases.

Authors:  Bo Peng; Christopher I Amos; Marek Kimmel
Journal:  PLoS Genet       Date:  2007-02-15       Impact factor: 5.917

8.  Investigating multiple candidate genes and nutrients in the folate metabolism pathway to detect genetic and nutritional risk factors for lung cancer.

Authors:  Michael D Swartz; Christine B Peterson; Philip J Lupo; Xifeng Wu; Michele R Forman; Margaret R Spitz; Ladia M Hernandez; Marina Vannucci; Sanjay Shete
Journal:  PLoS One       Date:  2013-01-23       Impact factor: 3.240

9.  A novel similarity-measure for the analysis of genetic data in complex phenotypes.

Authors:  Vincenzo Lagani; Alberto Montesanto; Fausta Di Cianni; Victor Moreno; Stefano Landi; Domenico Conforti; Giuseppina Rose; Giuseppe Passarino
Journal:  BMC Bioinformatics       Date:  2009-06-16       Impact factor: 3.169

10.  Identifying biomarkers from mass spectrometry data with ordinal outcome.

Authors:  Deukwoo Kwon; Mahlet G Tadesse; Naijun Sha; Ruth M Pfeiffer; Marina Vannucci
Journal:  Cancer Inform       Date:  2007-02-05
View more
  1 in total

1.  A Bayesian Approach for Estimating Dynamic Functional Network Connectivity in fMRI Data.

Authors:  Ryan Warnick; Michele Guindani; Erik Erhardt; Elena Allen; Vince Calhoun; Marina Vannucci
Journal:  J Am Stat Assoc       Date:  2018-05-16       Impact factor: 5.033

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.