Literature DB >> 28239248

Efficient computation of the joint sample frequency spectra for multiple populations.

John A Kamm1, Jonathan Terhorst1, Yun S Song2.   

Abstract

A wide range of studies in population genetics have employed the sample frequency spectrum (SFS), a summary statistic which describes the distribution of mutant alleles at a polymorphic site in a sample of DNA sequences and provides a highly efficient dimensional reduction of large-scale population genomic variation data. Recently, there has been much interest in analyzing the joint SFS data from multiple populations to infer parameters of complex demographic histories, including variable population sizes, population split times, migration rates, admixture proportions, and so on. SFS-based inference methods require accurate computation of the expected SFS under a given demographic model. Although much methodological progress has been made, existing methods suffer from numerical instability and high computational complexity when multiple populations are involved and the sample size is large. In this paper, we present new analytic formulas and algorithms that enable accurate, efficient computation of the expected joint SFS for thousands of individuals sampled from hundreds of populations related by a complex demographic model with arbitrary population size histories (including piecewise-exponential growth). Our results are implemented in a new software package called momi (MOran Models for Inference). Through an empirical study we demonstrate our improvements to numerical stability and computational complexity.

Entities:  

Keywords:  coalescent; demographic inference; population genetics; sum-product algorithm

Year:  2017        PMID: 28239248      PMCID: PMC5319604          DOI: 10.1080/10618600.2016.1159212

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   2.302


  26 in total

1.  Estimation of population parameters and recombination rates from single nucleotide polymorphisms.

Authors:  R Nielsen
Journal:  Genetics       Date:  2000-02       Impact factor: 4.562

2.  Inferring species trees directly from biallelic genetic markers: bypassing gene trees in a full coalescent analysis.

Authors:  David Bryant; Remco Bouckaert; Joseph Felsenstein; Noah A Rosenberg; Arindam RoyChoudhury
Journal:  Mol Biol Evol       Date:  2012-03-14       Impact factor: 16.240

3.  Estimating ancestral population parameters.

Authors:  J Wakeley; J Hey
Journal:  Genetics       Date:  1997-03       Impact factor: 4.562

4.  APPROXIMATE SAMPLING FORMULAS FOR GENERAL FINITE-ALLELES MODELS OF MUTATION.

Authors:  Anand Bhaskar; John A Kamm; Yun S Song
Journal:  Adv Appl Probab       Date:  2012-06       Impact factor: 0.690

5.  The effect of recurrent mutation on the frequency spectrum of a segregating site and the age of an allele.

Authors:  Paul A Jenkins; Yun S Song
Journal:  Theor Popul Biol       Date:  2011-04-28       Impact factor: 1.570

6.  Deep resequencing reveals excess rare recent variants consistent with explosive population growth.

Authors:  Alex Coventry; Lara M Bull-Otterson; Xiaoming Liu; Andrew G Clark; Taylor J Maxwell; Jacy Crosby; James E Hixson; Thomas J Rea; Donna M Muzny; Lora R Lewis; David A Wheeler; Aniko Sabo; Christine Lusk; Kenneth G Weiss; Humeira Akbar; Andrew Cree; Alicia C Hawes; Irene Newsham; Robin T Varghese; Donna Villasana; Shannon Gross; Vandita Joshi; Jireh Santibanez; Margaret Morgan; Kyle Chang; Walker Hale Iv; Alan R Templeton; Eric Boerwinkle; Richard Gibbs; Charles F Sing
Journal:  Nat Commun       Date:  2010-11-30       Impact factor: 14.919

7.  scrm: efficiently simulating long sequences using the approximated coalescent with recombination.

Authors:  Paul R Staab; Sha Zhu; Dirk Metzler; Gerton Lunter
Journal:  Bioinformatics       Date:  2015-01-08       Impact factor: 6.937

8.  Linking great apes genome evolution across time scales using polymorphism-aware phylogenetic models.

Authors:  Nicola De Maio; Christian Schlötterer; Carolin Kosiol
Journal:  Mol Biol Evol       Date:  2013-08-01       Impact factor: 16.240

9.  Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data.

Authors:  Ryan N Gutenkunst; Ryan D Hernandez; Scott H Williamson; Carlos D Bustamante
Journal:  PLoS Genet       Date:  2009-10-23       Impact factor: 5.917

10.  Demographic inference using spectral methods on SNP data, with an analysis of the human out-of-Africa expansion.

Authors:  Sergio Lukic; Jody Hey
Journal:  Genetics       Date:  2012-08-03       Impact factor: 4.562

View more
  22 in total

1.  Computationally Efficient Composite Likelihood Statistics for Demographic Inference.

Authors:  Alec J Coffman; Ping Hsun Hsieh; Simon Gravel; Ryan N Gutenkunst
Journal:  Mol Biol Evol       Date:  2015-11-05       Impact factor: 16.240

2.  Inferring Demographic History Using Two-Locus Statistics.

Authors:  Aaron P Ragsdale; Ryan N Gutenkunst
Journal:  Genetics       Date:  2017-04-16       Impact factor: 4.562

3.  Inferring the Joint Demographic History of Multiple Populations: Beyond the Diffusion Approximation.

Authors:  Julien Jouganous; Will Long; Aaron P Ragsdale; Simon Gravel
Journal:  Genetics       Date:  2017-05-11       Impact factor: 4.562

4.  Geometry of the Sample Frequency Spectrum and the Perils of Demographic Inference.

Authors:  Zvi Rosen; Anand Bhaskar; Sebastien Roch; Yun S Song
Journal:  Genetics       Date:  2018-07-31       Impact factor: 4.562

5.  Computing the joint distribution of the total tree length across loci in populations with variable size.

Authors:  Alexey Miroshnikov; Matthias Steinrücken
Journal:  Theor Popul Biol       Date:  2017-09-21       Impact factor: 1.570

6.  Inference of complex population histories using whole-genome sequences from multiple populations.

Authors:  Matthias Steinrücken; Jack Kamm; Jeffrey P Spence; Yun S Song
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-06       Impact factor: 11.205

7.  Terminal Pleistocene Alaskan genome reveals first founding population of Native Americans.

Authors:  J Víctor Moreno-Mayar; Ben A Potter; Lasse Vinner; Matthias Steinrücken; Simon Rasmussen; Jonathan Terhorst; John A Kamm; Anders Albrechtsen; Anna-Sapfo Malaspinas; Martin Sikora; Joshua D Reuther; Joel D Irish; Ripan S Malhi; Ludovic Orlando; Yun S Song; Rasmus Nielsen; David J Meltzer; Eske Willerslev
Journal:  Nature       Date:  2018-01-03       Impact factor: 49.962

8.  Robust and scalable inference of population history from hundreds of unphased whole genomes.

Authors:  Jonathan Terhorst; John A Kamm; Yun S Song
Journal:  Nat Genet       Date:  2016-12-26       Impact factor: 38.330

Review 9.  Explosive genetic evidence for explosive human population growth.

Authors:  Feng Gao; Alon Keinan
Journal:  Curr Opin Genet Dev       Date:  2016-10-04       Impact factor: 5.578

10.  Efficiently inferring the demographic history of many populations with allele count data.

Authors:  Jack Kamm; Jonathan Terhorst; Richard Durbin; Yun S Song
Journal:  J Am Stat Assoc       Date:  2019-07-22       Impact factor: 5.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.