Literature DB >> 21900266

A general method for calculating likelihoods under the coalescent process.

K Lohse1, R J Harrison, N H Barton.   

Abstract

Analysis of genomic data requires an efficient way to calculate likelihoods across very large numbers of loci. We describe a general method for finding the distribution of genealogies: we allow migration between demes, splitting of demes [as in the isolation-with-migration (IM) model], and recombination between linked loci. These processes are described by a set of linear recursions for the generating function of branch lengths. Under the infinite-sites model, the probability of any configuration of mutations can be found by differentiating this generating function. Such calculations are feasible for small numbers of sampled genomes: as an example, we show how the generating function can be derived explicitly for three genes under the two-deme IM model. This derivation is done automatically, using Mathematica. Given data from a large number of unlinked and nonrecombining blocks of sequence, these results can be used to find maximum-likelihood estimates of model parameters by tabulating the probabilities of all relevant mutational configurations and then multiplying across loci. The feasibility of the method is demonstrated by applying it to simulated data and to a data set previously analyzed by Wang and Hey (2010) consisting of 26,141 loci sampled from Drosophila simulans and D. melanogaster. Our results suggest that such likelihood calculations are scalable to genomic data as long as the numbers of sampled individuals and mutations per sequence block are small.

Entities:  

Mesh:

Year:  2011        PMID: 21900266      PMCID: PMC3213358          DOI: 10.1534/genetics.111.129569

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  16 in total

1.  Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis.

Authors:  Jody Hey; Rasmus Nielsen
Journal:  Genetics       Date:  2004-06       Impact factor: 4.562

2.  Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates.

Authors:  J Felsenstein
Journal:  Genet Res       Date:  1992-04       Impact factor: 1.588

3.  Learning about modes of speciation by computational approaches.

Authors:  Céline Becquet; Molly Przeworski
Journal:  Evolution       Date:  2009-02-18       Impact factor: 3.694

4.  A Markov Chain Model of Coalescence with Recombination

Authors: 
Journal:  Theor Popul Biol       Date:  1997-08       Impact factor: 1.570

5.  Inferring the history of speciation from multilocus DNA sequence data: the case of Drosophila pseudoobscura and close relatives.

Authors:  Carlos A Machado; Richard M Kliman; Jeffrey A Markert; Jody Hey
Journal:  Mol Biol Evol       Date:  2002-04       Impact factor: 16.240

6.  The island model of population differentiation: a general solution.

Authors:  B D Latter
Journal:  Genetics       Date:  1973-01       Impact factor: 4.562

7.  Sampling theory for neutral alleles in a varying environment.

Authors:  R C Griffiths; S Tavaré
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1994-06-29       Impact factor: 6.237

8.  Estimating divergence parameters with small samples from a large number of loci.

Authors:  Yong Wang; Jody Hey
Journal:  Genetics       Date:  2009-11-16       Impact factor: 4.562

9.  Likelihood and Bayes estimation of ancestral population sizes in hominoids using data from multiple loci.

Authors:  Ziheng Yang
Journal:  Genetics       Date:  2002-12       Impact factor: 4.562

10.  Divergence time and population size in the lineage leading to modern humans.

Authors:  N Takahata; Y Satta; J Klein
Journal:  Theor Popul Biol       Date:  1995-10       Impact factor: 1.570

View more
  39 in total

Review 1.  Population genetic studies in the genomic sequencing era.

Authors:  Hua Chen
Journal:  Dongwuxue Yanjiu       Date:  2015-07-18

Review 2.  Understanding the origin of species with genome-scale data: modelling gene flow.

Authors:  Vitor Sousa; Jody Hey
Journal:  Nat Rev Genet       Date:  2013-05-09       Impact factor: 53.242

3.  Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution.

Authors:  Tomotaka Matsumoto; Hiroshi Akashi; Ziheng Yang
Journal:  Genetics       Date:  2015-05-06       Impact factor: 4.562

4.  The Effects of Demography and Genetics on the Neutral Distribution of Quantitative Traits.

Authors:  Evan M Koch
Journal:  Genetics       Date:  2019-02-19       Impact factor: 4.562

Review 5.  Challenges in Species Tree Estimation Under the Multispecies Coalescent Model.

Authors:  Bo Xu; Ziheng Yang
Journal:  Genetics       Date:  2016-12       Impact factor: 4.562

6.  The Timing and Direction of Introgression Under the Multispecies Network Coalescent.

Authors:  Mark S Hibbins; Matthew W Hahn
Journal:  Genetics       Date:  2019-01-22       Impact factor: 4.562

7.  Efficient computation in the IM model.

Authors:  Lars Nørvang Andersen; Thomas Mailund; Asger Hobolth
Journal:  J Math Biol       Date:  2013-04-16       Impact factor: 2.259

8.  A renewal theory approach to IBD sharing.

Authors:  Shai Carmi; Peter R Wilton; John Wakeley; Itsik Pe'er
Journal:  Theor Popul Biol       Date:  2014-08-18       Impact factor: 1.570

9.  The structure of allelic diversity in the presence of purifying selection.

Authors:  Michael M Desai; Lauren E Nicolaisen; Aleksandra M Walczak; Joshua B Plotkin
Journal:  Theor Popul Biol       Date:  2011-12-16       Impact factor: 1.570

10.  Neandertal admixture in Eurasia confirmed by maximum-likelihood analysis of three genomes.

Authors:  Konrad Lohse; Laurent A F Frantz
Journal:  Genetics       Date:  2014-02-13       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.