Literature DB >> 20812911

On the optimal design of genetic variant discovery studies.

Iuliana Ionita-Laza1, Nan M Laird.   

Abstract

The recent emergence of massively parallel sequencing technologies has enabled an increasing number of human genome re-sequencing studies, notable among them being the 1000 Genomes Project. The main aim of these studies is to identify the yet unknown genetic variants in a genomic region, mostly low frequency variants (frequency less than 5%). We propose here a set of statistical tools that address how to optimally design such studies in order to increase the number of genetic variants we expect to discover. Within this framework, the tradeoff between lower coverage for more individuals and higher coverage for fewer individuals can be naturally solved. The methods here are also useful for estimating the number of genetic variants missed in a discovery study performed at low coverage. We show applications to simulated data based on coalescent models and to sequence data from the ENCODE project. In particular, we show the extent to which combining data from multiple populations in a discovery study may increase the number of genetic variants identified relative to studies on single populations.

Entities:  

Mesh:

Year:  2010        PMID: 20812911      PMCID: PMC2942028          DOI: 10.2202/1544-6115.1581

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  10 in total

1.  The next generation of molecular markers from massively parallel sequencing of pooled DNA samples.

Authors:  Andreas Futschik; Christian Schlötterer
Journal:  Genetics       Date:  2010-05-10       Impact factor: 4.562

2.  The genetical structure of populations.

Authors:  S WRIGHT
Journal:  Ann Eugen       Date:  1951-03

3.  GENOME: a rapid coalescent-based whole genome simulator.

Authors:  Liming Liang; Sebastian Zöllner; Gonçalo R Abecasis
Journal:  Bioinformatics       Date:  2007-04-25       Impact factor: 6.937

4.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2008-08-07       Impact factor: 11.025

5.  Estimating the number of unseen variants in the human genome.

Authors:  Iuliana Ionita-Laza; Christoph Lange; Nan M Laird
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-10       Impact factor: 11.205

6.  Next-generation DNA sequencing.

Authors:  Jay Shendure; Hanlee Ji
Journal:  Nat Biotechnol       Date:  2008-10       Impact factor: 54.908

Review 7.  Massively parallel sequencing: the next big thing in genetic medicine.

Authors:  Tracy Tucker; Marco Marra; Jan M Friedman
Journal:  Am J Hum Genet       Date:  2009-08       Impact factor: 11.025

Review 8.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

Review 9.  Finding the missing heritability of complex diseases.

Authors:  Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

10.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

  10 in total
  7 in total

1.  BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing.

Authors:  Song Yan; Yun Li
Journal:  Bioinformatics       Date:  2013-12-12       Impact factor: 6.937

2.  EM vs MM: A Case Study.

Authors:  Hua Zhou; Yiwen Zhang
Journal:  Comput Stat Data Anal       Date:  2012-12       Impact factor: 1.681

3.  Two-stage design of sequencing studies for testing association with rare variants.

Authors:  Fan Yang; Duncan C Thomas
Journal:  Hum Hered       Date:  2011-07-02       Impact factor: 0.444

4.  Predicting discovery rates of genomic features.

Authors:  Simon Gravel
Journal:  Genetics       Date:  2014-03-17       Impact factor: 4.562

5.  Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data.

Authors:  Yun Li; Wei Chen; Eric Yi Liu; Yi-Hui Zhou
Journal:  Stat Biosci       Date:  2013-05

6.  Demographic history and rare allele sharing among human populations.

Authors:  Simon Gravel; Brenna M Henn; Ryan N Gutenkunst; Amit R Indap; Gabor T Marth; Andrew G Clark; Fuli Yu; Richard A Gibbs; Carlos D Bustamante
Journal:  Proc Natl Acad Sci U S A       Date:  2011-07-05       Impact factor: 11.205

7.  Quantifying unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects.

Authors:  James Zou; Gregory Valiant; Paul Valiant; Konrad Karczewski; Siu On Chan; Kaitlin Samocha; Monkol Lek; Shamil Sunyaev; Mark Daly; Daniel G MacArthur
Journal:  Nat Commun       Date:  2016-10-31       Impact factor: 14.919

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.