Literature DB >> 23730833

Estimation of population allele frequencies from next-generation sequencing data: pool-versus individual-based genotyping.

Mathieu Gautier1, Julien Foucaud, Karim Gharbi, Timothée Cézard, Maxime Galan, Anne Loiseau, Marian Thomson, Pierre Pudlo, Carole Kerdelhué, Arnaud Estoup.   

Abstract

Molecular markers produced by next-generation sequencing (NGS) technologies are revolutionizing genetic research. However, the costs of analysing large numbers of individual genomes remain prohibitive for most population genetics studies. Here, we present results based on mathematical derivations showing that, under many realistic experimental designs, NGS of DNA pools from diploid individuals allows to estimate the allele frequencies at single nucleotide polymorphisms (SNPs) with at least the same accuracy as individual-based analyses, for considerably lower library construction and sequencing efforts. These findings remain true when taking into account the possibility of substantially unequal contributions of each individual to the final pool of sequence reads. We propose the intuitive notion of effective pool size to account for unequal pooling and derive a Bayesian hierarchical model to estimate this parameter directly from the data. We provide a user-friendly application assessing the accuracy of allele frequency estimation from both pool- and individual-based NGS population data under various sampling, sequencing depth and experimental error designs. We illustrate our findings with theoretical examples and real data sets corresponding to SNP loci obtained using restriction site-associated DNA (RAD) sequencing in pool- and individual-based experiments carried out on the same population of the pine processionary moth (Thaumetopoea pityocampa). NGS of DNA pools might not be optimal for all types of studies but provides a cost-effective approach for estimating allele frequencies for very large numbers of SNPs. It thus allows comparison of genome-wide patterns of genetic variation for large numbers of individuals in multiple populations.
© 2013 John Wiley & Sons Ltd.

Entities:  

Keywords:  coverage; estimation of allele frequencies; molecular markers; next-generation sequencing; pooled samples; population genetics

Mesh:

Year:  2013        PMID: 23730833     DOI: 10.1111/mec.12360

Source DB:  PubMed          Journal:  Mol Ecol        ISSN: 0962-1083            Impact factor:   6.185


  76 in total

1.  Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates.

Authors:  Mathieu Gautier
Journal:  Genetics       Date:  2015-10-19       Impact factor: 4.562

2.  Introduction to Population Genomics Methods.

Authors:  Thibault Leroy; Quentin Rougemont
Journal:  Methods Mol Biol       Date:  2021

3.  Measuring Genetic Differentiation from Pool-seq Data.

Authors:  Valentin Hivert; Raphaël Leblois; Eric J Petit; Mathieu Gautier; Renaud Vitalis
Journal:  Genetics       Date:  2018-07-30       Impact factor: 4.562

4.  Phylogeographic model selection leads to insight into the evolutionary history of four-eyed frogs.

Authors:  Maria Tereza C Thomé; Bryan C Carstens
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-19       Impact factor: 11.205

5.  Habitat type and dispersal mode underlie the capacity for plant migration across an intermittent seaway.

Authors:  J R P Worth; B R Holland; N J Beeton; B Schönfeld; M Rossetto; R E Vaillancourt; G J Jordan
Journal:  Ann Bot       Date:  2017-10-17       Impact factor: 4.357

6.  Experimental drought reduces genetic diversity in the grassland foundation species Bouteloua eriopoda.

Authors:  Kenneth D Whitney; Joann Mudge; Donald O Natvig; Anitha Sundararajan; William T Pockman; Jennifer Bell; Scott L Collins; Jennifer A Rudgers
Journal:  Oecologia       Date:  2019-03-08       Impact factor: 3.225

Review 7.  Harnessing the power of RADseq for ecological and evolutionary genomics.

Authors:  Kimberly R Andrews; Jeffrey M Good; Michael R Miller; Gordon Luikart; Paul A Hohenlohe
Journal:  Nat Rev Genet       Date:  2016-01-05       Impact factor: 53.242

8.  The Genomic Architecture of Flowering Time Varies Across Space and Time in Mimulus guttatus.

Authors:  Patrick J Monnahan; John K Kelly
Journal:  Genetics       Date:  2017-04-28       Impact factor: 4.562

9.  SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis.

Authors:  Paula Arielle M R Valdisser; Georgios J Pappas; Ivandilson P P de Menezes; Bárbara S F Müller; Wendell J Pereira; Marcelo G Narciso; Claudio Brondani; Thiago L P O Souza; Tereza C O Borba; Rosana P Vianello
Journal:  Mol Genet Genomics       Date:  2016-03-01       Impact factor: 3.291

Review 10.  The impact of whole-genome sequencing on the reconstruction of human population history.

Authors:  Krishna R Veeramah; Michael F Hammer
Journal:  Nat Rev Genet       Date:  2014-02-04       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.