Literature DB >> 29399241

BAYESIAN LARGE-SCALE MULTIPLE REGRESSION WITH SUMMARY STATISTICS FROM GENOME-WIDE ASSOCIATION STUDIES.

Xiang Zhu1, Matthew Stephens1.   

Abstract

Bayesian methods for large-scale multiple regression provide attractive approaches to the analysis of genome-wide association studies (GWAS). For example, they can estimate heritability of complex traits, allowing for both polygenic and sparse models; and by incorporating external genomic data into the priors, they can increase power and yield new biological insights. However, these methods require access to individual genotypes and phenotypes, which are often not easily available. Here we provide a framework for performing these analyses without individual-level data. Specifically, we introduce a "Regression with Summary Statistics" (RSS) likelihood, which relates the multiple regression coefficients to univariate regression results that are often easily available. The RSS likelihood requires estimates of correlations among covariates (SNPs), which also can be obtained from public databases. We perform Bayesian multiple regression analysis by combining the RSS likelihood with previously proposed prior distributions, sampling posteriors by Markov chain Monte Carlo. In a wide range of simulations RSS performs similarly to analyses using the individual data, both for estimating heritability and detecting associations. We apply RSS to a GWAS of human height that contains 253,288 individuals typed at 1.06 million SNPs, for which analyses of individual-level data are practically impossible. Estimates of heritability (52%) are consistent with, but more precise, than previous results using subsets of these data. We also identify many previously unreported loci that show evidence for association with height in our analyses. Software is available at https://github.com/stephenslab/rss.

Entities:  

Keywords:  Bayesian regression; Markov chain Monte Carlo; Summary statistics; association study; explained variation; genome wide; heritability; multiple-SNP analysis; variable selection

Year:  2017        PMID: 29399241      PMCID: PMC5796536          DOI: 10.1214/17-aoas1046

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  57 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

2.  An efficient Monte Carlo approach to assessing statistical significance in genomic studies.

Authors:  D Y Lin
Journal:  Bioinformatics       Date:  2004-09-28       Impact factor: 6.937

Review 3.  Analysing biological pathways in genome-wide association studies.

Authors:  Kai Wang; Mingyao Li; Hakon Hakonarson
Journal:  Nat Rev Genet       Date:  2010-12       Impact factor: 53.242

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics.

Authors:  Wenan Chen; Beth R Larrabee; Inna G Ovsyannikova; Richard B Kennedy; Iana H Haralambieva; Gregory A Poland; Daniel J Schaid
Journal:  Genetics       Date:  2015-05-06       Impact factor: 4.562

6.  Identifying causal variants at loci with multiple signals of association.

Authors:  Farhad Hormozdiari; Emrah Kostem; Eun Yong Kang; Bogdan Pasaniuc; Eleazar Eskin
Journal:  Genetics       Date:  2014-08-07       Impact factor: 4.562

Review 7.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

8.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

9.  DIST: direct imputation of summary statistics for unmeasured SNPs.

Authors:  Donghyung Lee; T Bernard Bigdeli; Brien P Riley; Ayman H Fanous; Silviu-Alin Bacanu
Journal:  Bioinformatics       Date:  2013-08-28       Impact factor: 6.937

10.  JAM: A Scalable Bayesian Framework for Joint Analysis of Marginal SNP Effects.

Authors:  Paul J Newcombe; David V Conti; Sylvia Richardson
Journal:  Genet Epidemiol       Date:  2016-04       Impact factor: 2.135

View more
  48 in total

Review 1.  Genetic Risk Scores.

Authors:  Robert P Igo; Tyler G Kinzy; Jessica N Cooke Bailey
Journal:  Curr Protoc Hum Genet       Date:  2019-12

2.  ACAT: A Fast and Powerful p Value Combination Method for Rare-Variant Analysis in Sequencing Studies.

Authors:  Yaowu Liu; Sixing Chen; Zilin Li; Alanna C Morrison; Eric Boerwinkle; Xihong Lin
Journal:  Am J Hum Genet       Date:  2019-03-07       Impact factor: 11.025

3.  iFunMed: Integrative functional mediation analysis of GWAS and eQTL studies.

Authors:  Constanza Rojo; Qi Zhang; Sündüz Keleş
Journal:  Genet Epidemiol       Date:  2019-07-22       Impact factor: 2.135

4.  Estimating SNP-Based Heritability and Genetic Correlation in Case-Control Studies Directly and with Summary Statistics.

Authors:  Omer Weissbrod; Jonathan Flint; Saharon Rosset
Journal:  Am J Hum Genet       Date:  2018-07-05       Impact factor: 11.025

5.  Incorporating Functional Annotations for Fine-Mapping Causal Variants in a Bayesian Framework Using Summary Statistics.

Authors:  Wenan Chen; Shannon K McDonnell; Stephen N Thibodeau; Lori S Tillmans; Daniel J Schaid
Journal:  Genetics       Date:  2016-09-21       Impact factor: 4.562

6.  A powerful fine-mapping method for transcriptome-wide association studies.

Authors:  Chong Wu; Wei Pan
Journal:  Hum Genet       Date:  2019-12-16       Impact factor: 4.132

7.  Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.

Authors:  Maria Masotti; Bin Guo; Baolin Wu
Journal:  Biometrics       Date:  2019-08-02       Impact factor: 2.571

8.  Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics.

Authors:  Sung Chun; Maxim Imakaev; Daniel Hui; Nikolaos A Patsopoulos; Benjamin M Neale; Sekar Kathiresan; Nathan O Stitziel; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2020-05-28       Impact factor: 11.025

9.  The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities.

Authors:  Lauren J Beesley; Maxwell Salvatore; Lars G Fritsche; Anita Pandit; Arvind Rao; Chad Brummett; Cristen J Willer; Lynda D Lisabeth; Bhramar Mukherjee
Journal:  Stat Med       Date:  2019-12-20       Impact factor: 2.373

10.  IGREX for quantifying the impact of genetically regulated expression on phenotypes.

Authors:  Mingxuan Cai; Lin S Chen; Jin Liu; Can Yang
Journal:  NAR Genom Bioinform       Date:  2020-02-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.