Literature DB >> 24634516

APPROXIMATE SAMPLING FORMULAS FOR GENERAL FINITE-ALLELES MODELS OF MUTATION.

Anand Bhaskar1, John A Kamm1, Yun S Song1.   

Abstract

Many applications in genetic analyses utilize sampling distributions, which describe the probability of observing a sample of DNA sequences randomly drawn from a population. In the one-locus case with special models of mutation such as the infinite-alleles model or the finite-alleles parent-independent mutation model, closed-form sampling distributions under the coalescent have been known for many decades. However, no exact formula is currently known for more general models of mutation that are of biological interest. In this paper, models with finitely-many alleles are considered, and an urn construction related to the coalescent is used to derive approximate closed-form sampling formulas for an arbitrary irreducible recurrent mutation model or for a reversible recurrent mutation model, depending on whether the number of distinct observed allele types is at most three or four, respectively. It is demonstrated empirically that the formulas derived here are highly accurate when the per-base mutation rate is low, which holds for many biological organisms.

Entities:  

Keywords:  Sampling probability; coalescent theory; martingale; urn models

Year:  2012        PMID: 24634516      PMCID: PMC3953561          DOI: 10.1239/aap/1339878718

Source DB:  PubMed          Journal:  Adv Appl Probab        ISSN: 0001-8678            Impact factor:   0.690


  12 in total

1.  Estimate of the mutation rate per nucleotide in humans.

Authors:  M W Nachman; S L Crowell
Journal:  Genetics       Date:  2000-09       Impact factor: 4.562

2.  AN ASYMPTOTIC SAMPLING FORMULA FOR THE COALESCENT WITH RECOMBINATION.

Authors:  Paul A Jenkins; Yun S Song
Journal:  Ann Appl Probab       Date:  2010-06       Impact factor: 1.872

3.  Ewens' sampling formula and related formulae: combinatorial proofs, extensions to variable population size and applications to ages of alleles.

Authors:  Robert C Griffiths; Sabin Lessard
Journal:  Theor Popul Biol       Date:  2005-11       Impact factor: 1.570

4.  The frequency spectrum of a mutation, and its age, in a general diffusion model.

Authors:  R C Griffiths
Journal:  Theor Popul Biol       Date:  2003-09       Impact factor: 1.570

5.  Closed-form two-locus sampling distributions: accuracy and universality.

Authors:  Paul A Jenkins; Yun S Song
Journal:  Genetics       Date:  2009-09-07       Impact factor: 4.562

6.  CLOSED-FORM ASYMPTOTIC SAMPLING DISTRIBUTIONS UNDER THE COALESCENT WITH RECOMBINATION FOR AN ARBITRARY NUMBER OF LOCI.

Authors:  Anand Bhaskar; Yun S Song
Journal:  Adv Appl Probab       Date:  2012-06       Impact factor: 0.690

7.  The sampling theory of selectively neutral alleles.

Authors:  W J Ewens
Journal:  Theor Popul Biol       Date:  1972-03       Impact factor: 1.570

8.  Sampling theory for neutral alleles in a varying environment.

Authors:  R C Griffiths; S Tavaré
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1994-06-29       Impact factor: 6.237

9.  Estimating the pattern of nucleotide substitution.

Authors:  Z Yang
Journal:  J Mol Evol       Date:  1994-07       Impact factor: 2.395

10.  The effect of recurrent mutation on the frequency spectrum of a segregating site and the age of an allele.

Authors:  Paul A Jenkins; Yun S Song
Journal:  Theor Popul Biol       Date:  2011-04-28       Impact factor: 1.570

View more
  7 in total

1.  General triallelic frequency spectrum under demographic models with variable population size.

Authors:  Paul A Jenkins; Jonas W Mueller; Yun S Song
Journal:  Genetics       Date:  2013-11-08       Impact factor: 4.562

2.  The stationary distribution of a sample from the Wright-Fisher diffusion model with general small mutation rates.

Authors:  Conrad J Burden; Robert C Griffiths
Journal:  J Math Biol       Date:  2018-11-13       Impact factor: 2.259

3.  Efficient computation of the joint sample frequency spectra for multiple populations.

Authors:  John A Kamm; Jonathan Terhorst; Yun S Song
Journal:  J Comput Graph Stat       Date:  2017-02-16       Impact factor: 2.302

4.  TRACTABLE DIFFUSION AND COALESCENT PROCESSES FOR WEAKLY CORRELATED LOCI.

Authors:  Paul A Jenkins; Paul Fearnhead; Yun S Song
Journal:  Electron J Probab       Date:  2016-06-04       Impact factor: 1.151

5.  Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans.

Authors:  Arbel Harpak; Anand Bhaskar; Jonathan K Pritchard
Journal:  PLoS Genet       Date:  2016-12-15       Impact factor: 5.917

6.  Genome-wide fine-scale recombination rate variation in Drosophila melanogaster.

Authors:  Andrew H Chan; Paul A Jenkins; Yun S Song
Journal:  PLoS Genet       Date:  2012-12-20       Impact factor: 5.917

7.  The effect of single recombination events on coalescent tree height and shape.

Authors:  Luca Ferretti; Filippo Disanto; Thomas Wiehe
Journal:  PLoS One       Date:  2013-04-08       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.