Literature DB >> 25979475

Likelihood-based complex trait association testing for arbitrary depth sequencing data.

Song Yan1, Shuai Yuan2, Zheng Xu1, Baqun Zhang2, Bo Zhang2, Guolian Kang2, Andrea Byrnes2, Yun Li1.   

Abstract

UNLABELLED: In next generation sequencing (NGS)-based genetic studies, researchers typically perform genotype calling first and then apply standard genotype-based methods for association testing. However, such a two-step approach ignores genotype calling uncertainty in the association testing step and may incur power loss and/or inflated type-I error. In the recent literature, a few robust and efficient likelihood based methods including both likelihood ratio test (LRT) and score test have been proposed to carry out association testing without intermediate genotype calling. These methods take genotype calling uncertainty into account by directly incorporating genotype likelihood function (GLF) of NGS data into association analysis. However, existing LRT methods are computationally demanding or do not allow covariate adjustment; while existing score tests are not applicable to markers with low minor allele frequency (MAF). We provide an LRT allowing flexible covariate adjustment, develop a statistically more powerful score test and propose a combination strategy (UNC combo) to leverage the advantages of both tests. We have carried out extensive simulations to evaluate the performance of our proposed LRT and score test. Simulations and real data analysis demonstrate the advantages of our proposed combination strategy: it offers a satisfactory trade-off in terms of computational efficiency, applicability (accommodating both common variants and variants with low MAF) and statistical power, particularly for the analysis of quantitative trait where the power gain can be up to ∼60% when the causal variant is of low frequency (MAF < 0.01).
AVAILABILITY AND IMPLEMENTATION: UNC combo and the associated R files, including documentation, examples, are available at http://www.unc.edu/∼yunmli/UNCcombo/ CONTACT: yunli@med.unc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2015        PMID: 25979475      PMCID: PMC4668777          DOI: 10.1093/bioinformatics/btv307

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  35 in total

1.  Next generation analytic tools for large scale genetic epidemiology studies of complex diseases.

Authors:  Leah E Mechanic; Huann-Sheng Chen; Christopher I Amos; Nilanjan Chatterjee; Nancy J Cox; Rao L Divi; Ruzong Fan; Emily L Harris; Kevin Jacobs; Peter Kraft; Suzanne M Leal; Kimberly McAllister; Jason H Moore; Dina N Paltoo; Michael A Province; Erin M Ramos; Marylyn D Ritchie; Kathryn Roeder; Daniel J Schaid; Matthew Stephens; Duncan C Thomas; Clarice R Weinberg; John S Witte; Shunpu Zhang; Sebastian Zöllner; Eric J Feuer; Elizabeth M Gillanders
Journal:  Genet Epidemiol       Date:  2011-12-06       Impact factor: 2.135

2.  Resequencing candidate genes implicates rare variants in asthma susceptibility.

Authors:  Dara G Torgerson; Daniel Capurso; Rasika A Mathias; Penelope E Graves; Ryan D Hernandez; Terri H Beaty; Eugene R Bleecker; Benjamin A Raby; Deborah A Meyers; Kathleen C Barnes; Scott T Weiss; Fernando D Martinez; Dan L Nicolae; Carole Ober
Journal:  Am J Hum Genet       Date:  2012-02-10       Impact factor: 11.025

3.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

4.  Low-coverage sequencing: implications for design of complex trait association studies.

Authors:  Yun Li; Carlo Sidore; Hyun Min Kang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genome Res       Date:  2011-04-01       Impact factor: 9.043

5.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

6.  AbCD: arbitrary coverage design for sequencing-based genetic studies.

Authors:  Jian Kang; Kuan-Chieh Huang; Zheng Xu; Yunfei Wang; Gonçalo R Abecasis; Yun Li
Journal:  Bioinformatics       Date:  2013-01-28       Impact factor: 6.937

Review 7.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

8.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

Review 9.  Sequencing studies in human genetics: design and interpretation.

Authors:  David B Goldstein; Andrew Allen; Jonathan Keebler; Elliott H Margulies; Steven Petrou; Slavé Petrovski; Shamil Sunyaev
Journal:  Nat Rev Genet       Date:  2013-06-11       Impact factor: 53.242

10.  Discovery of rare variants via sequencing: implications for the design of complex trait association studies.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  PLoS Genet       Date:  2009-05-15       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.