Literature DB >> 22972696

Analysis and optimal design for association studies using next-generation sequencing with case-control pools.

Wei E Liang1, Duncan C Thomas, David V Conti.   

Abstract

With its potential to discover a much greater amount of genetic variation, next-generation sequencing is fast becoming an emergent tool for genetic association studies. However, the cost of sequencing all individuals in a large-scale population study is still high in comparison to most alternative genotyping options. While the ability to identify individual-level data is lost (without bar-coding), sequencing pooled samples can substantially lower costs without compromising the power to detect significant associations. We propose a hierarchical Bayesian model that estimates the association of each variant using pools of cases and controls, accounting for the variation in read depth across pools and sequencing error. To investigate the performance of our method across a range of number of pools, number of individuals within each pool, and average coverage, we undertook extensive simulations varying effect sizes, minor allele frequencies, and sequencing error rates. In general, the number of pools and pool size have dramatic effects on power while the total depth of coverage per pool has only a moderate impact. This information can guide the selection of a study design that maximizes power subject to cost, sample size, or other laboratory constraints. We provide an R package (hiPOD: hierarchical Pooled Optimal Design) to find the optimal design, allowing the user to specify a cost function, cost, and sample size limitations, and distributions of effect size, minor allele frequency, and sequencing error rate.
© 2012 Wiley Periodicals, Inc.

Entities:  

Keywords:  genetic association studies; rare variants; sequencing

Mesh:

Year:  2012        PMID: 22972696      PMCID: PMC4139478          DOI: 10.1002/gepi.21681

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  21 in total

1.  The next generation of molecular markers from massively parallel sequencing of pooled DNA samples.

Authors:  Andreas Futschik; Christian Schlötterer
Journal:  Genetics       Date:  2010-05-10       Impact factor: 4.562

2.  Pooled association tests for rare variants in exon-resequencing studies.

Authors:  Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2010-05-13       Impact factor: 11.025

3.  Incorporating model uncertainty in detecting rare variants: the Bayesian risk index.

Authors:  Melanie A Quintana; Jonine L Berstein; Duncan C Thomas; David V Conti
Journal:  Genet Epidemiol       Date:  2011-08-26       Impact factor: 2.135

4.  Overlapping pools for high-throughput targeted resequencing.

Authors:  Snehit Prabhu; Itsik Pe'er
Journal:  Genome Res       Date:  2009-05-15       Impact factor: 9.043

5.  Optimal DNA pooling-based two-stage designs in case-control association studies.

Authors:  Yihong Zhao; Shuang Wang
Journal:  Hum Hered       Date:  2008-10-17       Impact factor: 0.444

Review 6.  Human genetic variation and its contribution to complex traits.

Authors:  Kelly A Frazer; Sarah S Murray; Nicholas J Schork; Eric J Topol
Journal:  Nat Rev Genet       Date:  2009-04       Impact factor: 53.242

7.  Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples.

Authors:  Andrew M Smith; Lawrence E Heisler; Robert P St Onge; Eveline Farias-Hesson; Iain M Wallace; John Bodeau; Adam N Harris; Kathleen M Perry; Guri Giaever; Nader Pourmand; Corey Nislow
Journal:  Nucleic Acids Res       Date:  2010-05-11       Impact factor: 16.971

8.  On optimal pooling designs to identify rare variants through massive resequencing.

Authors:  Joon Sang Lee; Murim Choi; Xiting Yan; Richard P Lifton; Hongyu Zhao
Journal:  Genet Epidemiol       Date:  2011-01-19       Impact factor: 2.135

9.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

10.  An evaluation of statistical approaches to rare variant analysis in genetic association studies.

Authors:  Andrew P Morris; Eleftheria Zeggini
Journal:  Genet Epidemiol       Date:  2010-02       Impact factor: 2.135

View more
  5 in total

1.  On the design and analysis of next-generation sequencing genotyping for a cohort with haplotype-informative reads.

Authors:  Degui Zhi; Nianjun Liu; Kui Zhang
Journal:  Methods       Date:  2015-01-30       Impact factor: 3.608

2.  A stepwise likelihood ratio test procedure for rare variant selection in case-control studies.

Authors:  Anthony Y C Kuk; David J Nott; Yaning Yang
Journal:  J Hum Genet       Date:  2014-01-23       Impact factor: 3.172

3.  Two-stage family-based designs for sequencing studies.

Authors:  Zhao Yang; Duncan C Thomas
Journal:  BMC Proc       Date:  2014-06-17

4.  An EM algorithm based on an internal list for estimating haplotype distributions of rare variants from pooled genotype data.

Authors:  Anthony Y C Kuk; Xiang Li; Jinfeng Xu
Journal:  BMC Genet       Date:  2013-09-13       Impact factor: 2.797

Review 5.  Two-phase and family-based designs for next-generation sequencing studies.

Authors:  Duncan C Thomas; Zhao Yang; Fan Yang
Journal:  Front Genet       Date:  2013-12-13       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.