Literature DB >> 31735295

A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank.

Wenjian Bi1, Zhangchen Zhao1, Rounak Dey2, Lars G Fritsche1, Bhramar Mukherjee3, Seunggeun Lee4.   

Abstract

The etiology of most complex diseases involves genetic variants, environmental factors, and gene-environment interaction (G × E) effects. Compared with marginal genetic association studies, G × E analysis requires more samples and detailed measure of environmental exposures, and this limits the possible discoveries. Large-scale population-based biobanks with detailed phenotypic and environmental information, such as UK-Biobank, can be ideal resources for identifying G × E effects. However, due to the large computation cost and the presence of case-control imbalance, existing methods often fail. Here we propose a scalable and accurate method, SPAGE (SaddlePoint Approximation implementation of G × E analysis), that is applicable for genome-wide scale phenome-wide G × E studies. SPAGE fits a genotype-independent logistic model only once across the genome-wide analysis in order to reduce computation cost, and SPAGE uses a saddlepoint approximation (SPA) to calibrate the test statistics for analysis of phenotypes with unbalanced case-control ratios. Simulation studies show that SPAGE is 33-79 times faster than the Wald test and 72-439 times faster than the Firth's test, and SPAGE can control type I error rates at the genome-wide significance level even when case-control ratios are extremely unbalanced. Through the analysis of UK-Biobank data of 344,341 white British European-ancestry samples, we show that SPAGE can efficiently analyze large samples while controlling for unbalanced case-control ratios.
Copyright © 2019. Published by Elsevier Inc.

Entities:  

Keywords:  UK Biobank; biobank data analysis; gene-environment interaction; saddlepoint approximation; unbalanced case-controlratios

Mesh:

Year:  2019        PMID: 31735295      PMCID: PMC6904814          DOI: 10.1016/j.ajhg.2019.10.008

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  44 in total

1.  Design considerations for genomic association studies: importance of gene-environment interactions.

Authors:  Loïc Le Marchand; Lynne R Wilkens
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2008-02       Impact factor: 4.254

Review 2.  Electronic health records: the next wave of complex disease genetics.

Authors:  Brooke N Wolford; Cristen J Willer; Ida Surakka
Journal:  Hum Mol Genet       Date:  2018-05-01       Impact factor: 6.150

3.  Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies.

Authors:  Minsun Song; William Wheeler; Neil E Caporaso; Maria Teresa Landi; Nilanjan Chatterjee
Journal:  Genet Epidemiol       Date:  2017-11-26       Impact factor: 2.135

4.  Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies.

Authors:  W W Piegorsch; C R Weinberg; J A Taylor
Journal:  Stat Med       Date:  1994-01-30       Impact factor: 2.373

5.  An exposure-weighted score test for genetic associations integrating environmental risk factors.

Authors:  Summer S Han; Philip S Rosenberg; Arpita Ghosh; Maria Teresa Landi; Neil E Caporaso; Nilanjan Chatterjee
Journal:  Biometrics       Date:  2015-07-01       Impact factor: 2.571

6.  Powerful cocktail methods for detecting genome-wide gene-environment interaction.

Authors:  Li Hsu; Shuo Jiao; James Y Dai; Carolyn Hutter; Ulrike Peters; Charles Kooperberg
Journal:  Genet Epidemiol       Date:  2012-04       Impact factor: 2.135

7.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations.

Authors:  Joshua C Denny; Marylyn D Ritchie; Melissa A Basford; Jill M Pulley; Lisa Bastarache; Kristin Brown-Gentry; Deede Wang; Dan R Masys; Dan M Roden; Dana C Crawford
Journal:  Bioinformatics       Date:  2010-03-24       Impact factor: 6.937

8.  Update on the State of the Science for Analytical Methods for Gene-Environment Interactions.

Authors:  W James Gauderman; Bhramar Mukherjee; Hugues Aschard; Li Hsu; Juan Pablo Lewinger; Chirag J Patel; John S Witte; Christopher Amos; Caroline G Tai; David Conti; Dara G Torgerson; Seunggeun Lee; Nilanjan Chatterjee
Journal:  Am J Epidemiol       Date:  2017-10-01       Impact factor: 5.363

9.  Efficient Bayesian mixed-model analysis increases association power in large cohorts.

Authors:  Po-Ru Loh; George Tucker; Brendan K Bulik-Sullivan; Bjarni J Vilhjálmsson; Hilary K Finucane; Rany M Salem; Daniel I Chasman; Paul M Ridker; Benjamin M Neale; Bonnie Berger; Nick Patterson; Alkes L Price
Journal:  Nat Genet       Date:  2015-02-02       Impact factor: 38.330

10.  Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases.

Authors:  Kimberly McAllister; Leah E Mechanic; Christopher Amos; Hugues Aschard; Ian A Blair; Nilanjan Chatterjee; David Conti; W James Gauderman; Li Hsu; Carolyn M Hutter; Marta M Jankowska; Jacqueline Kerr; Peter Kraft; Stephen B Montgomery; Bhramar Mukherjee; George J Papanicolaou; Chirag J Patel; Marylyn D Ritchie; Beate R Ritz; Duncan C Thomas; Peng Wei; John S Witte
Journal:  Am J Epidemiol       Date:  2017-10-01       Impact factor: 5.363

View more
  3 in total

1.  Efficient gene-environment interaction tests for large biobank-scale sequencing studies.

Authors:  Xinyu Wang; Elise Lim; Ching-Ti Liu; Yun Ju Sung; Dabeeru C Rao; Alanna C Morrison; Eric Boerwinkle; Alisa K Manning; Han Chen
Journal:  Genet Epidemiol       Date:  2020-08-30       Impact factor: 2.135

2.  Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes.

Authors:  Wenjian Bi; Wei Zhou; Rounak Dey; Bhramar Mukherjee; Joshua N Sampson; Seunggeun Lee
Journal:  Am J Hum Genet       Date:  2021-04-08       Impact factor: 11.043

3.  GEM: scalable and flexible gene-environment interaction analysis in millions of samples.

Authors:  Kenneth E Westerman; Duy T Pham; Liang Hong; Ye Chen; Magdalena Sevilla-González; Yun Ju Sung; Yan V Sun; Alanna C Morrison; Han Chen; Alisa K Manning
Journal:  Bioinformatics       Date:  2021-10-25       Impact factor: 6.931

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.