Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 On the optimal design of genetic variant discovery studies.

Literature DB >> 20812911

On the optimal design of genetic variant discovery studies.

Abstract

The recent emergence of massively parallel sequencing technologies has enabled an increasing number of human genome re-sequencing studies, notable among them being the 1000 Genomes Project. The main aim of these studies is to identify the yet unknown genetic variants in a genomic region, mostly low frequency variants (frequency less than 5%). We propose here a set of statistical tools that address how to optimally design such studies in order to increase the number of genetic variants we expect to discover. Within this framework, the tradeoff between lower coverage for more individuals and higher coverage for fewer individuals can be naturally solved. The methods here are also useful for estimating the number of genetic variants missed in a discovery study performed at low coverage. We show applications to simulated data based on coalescent models and to sequence data from the ENCODE project. In particular, we show the extent to which combining data from multiple populations in a discovery study may increase the number of genetic variants identified relative to studies on single populations.

Entities: Species

Mesh：

Year: 2010 PMID： 20812911 PMCID： PMC2942028 DOI： 10.2202/1544-6115.1581

Source DB: PubMed Journal: Stat Appl Genet Mol Biol ISSN： 1544-6115

10 in total

1. The next generation of molecular markers from massively parallel sequencing of pooled DNA samples.

Authors: Andreas Futschik; Christian Schlötterer
Journal: Genetics Date: 2010-05-10 Impact factor: 4.562

2. The genetical structure of populations.

Authors: S WRIGHT
Journal: Ann Eugen Date: 1951-03

3. GENOME: a rapid coalescent-based whole genome simulator.

Authors: Liming Liang; Sebastian Zöllner; Gonçalo R Abecasis
Journal: Bioinformatics Date: 2007-04-25 Impact factor: 6.937

4. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors: Bingshan Li; Suzanne M Leal
Journal: Am J Hum Genet Date: 2008-08-07 Impact factor: 11.025

5. Estimating the number of unseen variants in the human genome.

Authors: Iuliana Ionita-Laza; Christoph Lange; Nan M Laird
Journal: Proc Natl Acad Sci U S A Date: 2009-03-10 Impact factor: 11.205

6. Next-generation DNA sequencing.

Authors: Jay Shendure; Hanlee Ji
Journal: Nat Biotechnol Date: 2008-10 Impact factor: 54.908

Review 7. Massively parallel sequencing: the next big thing in genetic medicine.

Authors: Tracy Tucker; Marco Marra; Jan M Friedman
Journal: Am J Hum Genet Date: 2009-08 Impact factor: 11.025

Review 8. Sequencing technologies - the next generation.

Authors: Michael L Metzker
Journal: Nat Rev Genet Date: 2009-12-08 Impact factor: 53.242

Review 9. Finding the missing heritability of complex diseases.

Authors: Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal: Nature Date: 2009-10-08 Impact factor: 49.962

10. A groupwise association test for rare mutations using a weighted sum statistic.

Authors: Bo Eskerod Madsen; Sharon R Browning
Journal: PLoS Genet Date: 2009-02-13 Impact factor: 5.917

10 in total

7 in total

1. BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing.

Authors: Song Yan; Yun Li
Journal: Bioinformatics Date: 2013-12-12 Impact factor: 6.937

2. EM vs MM: A Case Study.

Authors: Hua Zhou; Yiwen Zhang
Journal: Comput Stat Data Anal Date: 2012-12 Impact factor: 1.681

3. Two-stage design of sequencing studies for testing association with rare variants.

Authors: Fan Yang; Duncan C Thomas
Journal: Hum Hered Date: 2011-07-02 Impact factor: 0.444

4. Predicting discovery rates of genomic features.

Authors: Simon Gravel
Journal: Genetics Date: 2014-03-17 Impact factor: 4.562

5. Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data.

Authors: Yun Li; Wei Chen; Eric Yi Liu; Yi-Hui Zhou
Journal: Stat Biosci Date: 2013-05

6. Demographic history and rare allele sharing among human populations.

Authors: Simon Gravel; Brenna M Henn; Ryan N Gutenkunst; Amit R Indap; Gabor T Marth; Andrew G Clark; Fuli Yu; Richard A Gibbs; Carlos D Bustamante
Journal: Proc Natl Acad Sci U S A Date: 2011-07-05 Impact factor: 11.205

7. Quantifying unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects.

Authors: James Zou; Gregory Valiant; Paul Valiant; Konrad Karczewski; Siu On Chan; Kaitlin Samocha; Monkol Lek; Shamil Sunyaev; Mark Daly; Daniel G MacArthur
Journal: Nat Commun Date: 2016-10-31 Impact factor: 14.919

7 in total