Literature DB >> 25583502

Integrative analysis of sequencing and array genotype data for discovering disease associations with rare mutations.

Yi-Juan Hu1, Yun Li2, Paul L Auer3, Dan-Yu Lin4.   

Abstract

In the large cohorts that have been used for genome-wide association studies (GWAS), it is prohibitively expensive to sequence all cohort members. A cost-effective strategy is to sequence subjects with extreme values of quantitative traits or those with specific diseases. By imputing the sequencing data from the GWAS data for the cohort members who are not selected for sequencing, one can dramatically increase the number of subjects with information on rare variants. However, ignoring the uncertainties of imputed rare variants in downstream association analysis will inflate the type I error when sequenced subjects are not a random subset of the GWAS subjects. In this article, we provide a valid and efficient approach to combining observed and imputed data on rare variants. We consider commonly used gene-level association tests, all of which are constructed from the score statistic for assessing the effects of individual variants on the trait of interest. We show that the score statistic based on the observed genotypes for sequenced subjects and the imputed genotypes for nonsequenced subjects is unbiased. We derive a robust variance estimator that reflects the true variability of the score statistic regardless of the sampling scheme and imputation quality, such that the corresponding association tests always have correct type I error. We demonstrate through extensive simulation studies that the proposed tests are substantially more powerful than the use of accurately imputed variants only and the use of sequencing data alone. We provide an application to the Women's Health Initiative. The relevant software is freely available.

Entities:  

Keywords:  data integration; gene-level association tests; genotype imputation; linkage disequilibrium; whole-exome sequencing

Mesh:

Year:  2015        PMID: 25583502      PMCID: PMC4313847          DOI: 10.1073/pnas.1406143112

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  29 in total

1.  Quantitative trait analysis in sequencing studies under trait-dependent sampling.

Authors:  Dan-Yu Lin; Donglin Zeng; Zheng-Zheng Tang
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-11       Impact factor: 11.205

2.  A general framework for detecting disease associations with rare variants in sequencing studies.

Authors:  Dan-Yu Lin; Zheng-Zheng Tang
Journal:  Am J Hum Genet       Date:  2011-09-01       Impact factor: 11.025

3.  Imputation of exome sequence variants into population- based samples and blood-cell-trait-associated loci in African Americans: NHLBI GO Exome Sequencing Project.

Authors:  Paul L Auer; Jill M Johnsen; Andrew D Johnson; Benjamin A Logsdon; Leslie A Lange; Michael A Nalls; Guosheng Zhang; Nora Franceschini; Keolu Fox; Ethan M Lange; Stephen S Rich; Christopher J O'Donnell; Rebecca D Jackson; Robert B Wallace; Zhao Chen; Timothy A Graubert; James G Wilson; Hua Tang; Guillaume Lettre; Alex P Reiner; Santhi K Ganesh; Yun Li
Journal:  Am J Hum Genet       Date:  2012-10-25       Impact factor: 11.025

4.  The use of imputed values in the meta-analysis of genome-wide association studies.

Authors:  Shuo Jiao; Li Hsu; Carolyn M Hutter; Ulrike Peters
Journal:  Genet Epidemiol       Date:  2011-07-18       Impact factor: 2.135

5.  Exome array analysis identifies new loci and low-frequency variants influencing insulin processing and secretion.

Authors:  Jeroen R Huyghe; Anne U Jackson; Marie P Fogarty; Martin L Buchkovich; Alena Stančáková; Heather M Stringham; Xueling Sim; Lingyao Yang; Christian Fuchsberger; Henna Cederberg; Peter S Chines; Tanya M Teslovich; Jane M Romm; Hua Ling; Ivy McMullen; Roxann Ingersoll; Elizabeth W Pugh; Kimberly F Doheny; Benjamin M Neale; Mark J Daly; Johanna Kuusisto; Laura J Scott; Hyun Min Kang; Francis S Collins; Gonçalo R Abecasis; Richard M Watanabe; Michael Boehnke; Markku Laakso; Karen L Mohlke
Journal:  Nat Genet       Date:  2012-12-23       Impact factor: 38.330

6.  Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative.

Authors:  Eric Yi Liu; Steven Buyske; Aaron K Aragaki; Ulrike Peters; Eric Boerwinkle; Chris Carlson; Cara Carty; Dana C Crawford; Jeff Haessler; Lucia A Hindorff; Loic Le Marchand; Teri A Manolio; Tara Matise; Wei Wang; Charles Kooperberg; Kari E North; Yun Li
Journal:  Genet Epidemiol       Date:  2012-02       Impact factor: 2.135

7.  Fast and accurate genotype imputation in genome-wide association studies through pre-phasing.

Authors:  Bryan Howie; Christian Fuchsberger; Matthew Stephens; Jonathan Marchini; Gonçalo R Abecasis
Journal:  Nat Genet       Date:  2012-07-22       Impact factor: 38.330

8.  Meta-analysis of gene-level associations for rare variants based on single-variant statistics.

Authors:  Yi-Juan Hu; Sonja I Berndt; Stefan Gustafsson; Andrea Ganna; Joel Hirschhorn; Kari E North; Erik Ingelsson; Dan-Yu Lin
Journal:  Am J Hum Genet       Date:  2013-07-25       Impact factor: 11.025

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits.

Authors:  Benjamin F Voight; Hyun Min Kang; Jun Ding; Cameron D Palmer; Carlo Sidore; Peter S Chines; Noël P Burtt; Christian Fuchsberger; Yanming Li; Jeanette Erdmann; Timothy M Frayling; Iris M Heid; Anne U Jackson; Toby Johnson; Tuomas O Kilpeläinen; Cecilia M Lindgren; Andrew P Morris; Inga Prokopenko; Joshua C Randall; Richa Saxena; Nicole Soranzo; Elizabeth K Speliotes; Tanya M Teslovich; Eleanor Wheeler; Jared Maguire; Melissa Parkin; Simon Potter; N William Rayner; Neil Robertson; Kathleen Stirrups; Wendy Winckler; Serena Sanna; Antonella Mulas; Ramaiah Nagaraja; Francesco Cucca; Inês Barroso; Panos Deloukas; Ruth J F Loos; Sekar Kathiresan; Patricia B Munroe; Christopher Newton-Cheh; Arne Pfeufer; Nilesh J Samani; Heribert Schunkert; Joel N Hirschhorn; David Altshuler; Mark I McCarthy; Gonçalo R Abecasis; Michael Boehnke
Journal:  PLoS Genet       Date:  2012-08-02       Impact factor: 5.917

View more
  11 in total

1.  Analysis in case-control sequencing association studies with different sequencing depths.

Authors:  Sixing Chen; Xihong Lin
Journal:  Biostatistics       Date:  2020-07-01       Impact factor: 5.899

2.  Protocols, Methods, and Tools for Genome-Wide Association Studies (GWAS) of Dental Traits.

Authors:  Cary S Agler; Dmitry Shungin; Andrea G Ferreira Zandoná; Paige Schmadeke; Patricia V Basta; Jason Luo; John Cantrell; Thomas D Pahel; Beau D Meyer; John R Shaffer; Arne S Schaefer; Kari E North; Kimon Divaris
Journal:  Methods Mol Biol       Date:  2019

3.  Robust Score Tests With Missing Data in Genomics Studies.

Authors:  Kin Yau Wong; Donglin Zeng; D Y Lin
Journal:  J Am Stat Assoc       Date:  2019-02-26       Impact factor: 5.033

4.  Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs.

Authors:  Zheng-Zheng Tang; Dan-Yu Lin
Journal:  Am J Hum Genet       Date:  2015-06-18       Impact factor: 11.025

5.  Elucidation of the complex metabolic profile of cerebrospinal fluid using an untargeted biochemical profiling assay.

Authors:  Adam D Kennedy; Kirk L Pappan; Taraka R Donti; Anne M Evans; Jacob E Wulff; Luke A D Miller; V Reid Sutton; Qin Sun; Marcus J Miller; Sarah H Elsea
Journal:  Mol Genet Metab       Date:  2017-04-09       Impact factor: 4.797

Review 6.  The Genomics of Neonatal Abstinence Syndrome.

Authors:  F Sessions Cole; Daniel J Wegner; Jonathan M Davis
Journal:  Front Pediatr       Date:  2017-08-22       Impact factor: 3.418

Review 7.  Genomics of periodontal disease and tooth morbidity.

Authors:  Thiago Morelli; Cary S Agler; Kimon Divaris
Journal:  Periodontol 2000       Date:  2020-02       Impact factor: 7.589

8.  An integrative approach to investigate the respective roles of single-nucleotide variants and copy-number variants in Attention-Deficit/Hyperactivity Disorder.

Authors:  Leandro de Araújo Lima; Ana Cecília Feio-dos-Santos; Sintia Iole Belangero; Ary Gadelha; Rodrigo Affonseca Bressan; Giovanni Abrahão Salum; Pedro Mario Pan; Tais Silveira Moriyama; Ana Soledade Graeff-Martins; Ana Carina Tamanaha; Pedro Alvarenga; Fernanda Valle Krieger; Bacy Fleitlich-Bilyk; Andrea Parolin Jackowski; Elisa Brietzke; João Ricardo Sato; Guilherme Vanoni Polanczyk; Jair de Jesus Mari; Gisele Gus Manfro; Maria Conceição do Rosário; Eurípedes Constantino Miguel; Renato David Puga; Ana Carolina Tahira; Viviane Neri Souza; Thais Chile; Gisele Rodrigues Gouveia; Sérgio Nery Simões; Xiao Chang; Renata Pellegrino; Lifeng Tian; Joseph T Glessner; Ronaldo Fumio Hashimoto; Luis Augusto Rohde; Patrick M A Sleiman; Hakon Hakonarson; Helena Brentani
Journal:  Sci Rep       Date:  2016-03-07       Impact factor: 4.379

9.  Estimation of kinship coefficient in structured and admixed populations using sparse sequencing data.

Authors:  Jinzhuang Dou; Baoluo Sun; Xueling Sim; Jason D Hughes; Dermot F Reilly; E Shyong Tai; Jianjun Liu; Chaolong Wang
Journal:  PLoS Genet       Date:  2017-09-29       Impact factor: 5.917

10.  Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.

Authors:  Corbin Quick; Pramod Anugu; Solomon Musani; Scott T Weiss; Esteban G Burchard; Marquitta J White; Kevin L Keys; Francesco Cucca; Carlo Sidore; Michael Boehnke; Christian Fuchsberger
Journal:  Genet Epidemiol       Date:  2020-06-09       Impact factor: 2.135

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.