
Importance sampling for the infinite sites model.

Asger Hobolth, Marcy K Uyenoyama, Carsten Wiuf.

Abstract

Importance sampling or Markov chain Monte Carlo sampling is required for state-of-the-art statistical analysis of population genetics data. The applicability of these sampling-based inference techniques depends crucially on the proposal distribution. In this paper, we discuss importance sampling for the infinite sites model. The infinite sites assumption is attractive because it constrains the number of possible genealogies, thereby allowing for the analysis of larger data sets. We recall the Griffiths-Tavaré and Stephens-Donnelly proposals and emphasize the relation between the latter proposal and exact sampling from the infinite alleles model. We also introduce a new proposal that takes knowledge of the ancestral state into account. The new proposal is derived from a new result on exact sampling from a single site. The methods are illustrated on simulated data sets and the data considered in Griffiths and Tavaré (1994).
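
As a generic illustration of why the proposal distribution matters in importance sampling, the following minimal sketch estimates a small tail probability under a standard normal target with two different proposals. It is a toy example and not the method of the paper: the Gaussian target, the threshold of 4, and the names `importance_estimate` and `shift` are all assumptions introduced here, and no genealogies are involved.

```python
import math
import random


def importance_estimate(n, shift, seed=0):
    """Estimate P(X > 4) for X ~ N(0, 1) using draws from the proposal N(shift, 1).

    Toy illustration only (hypothetical helper, not taken from the paper).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        y = rng.gauss(shift, 1.0)  # draw from the proposal
        if y > 4.0:
            # Importance weight = target density / proposal density
            #                   = exp(-shift * y + shift**2 / 2)
            total += math.exp(-shift * y + 0.5 * shift * shift)
    return total / n


# Exact tail probability of the standard normal, for comparison.
exact = 0.5 * math.erfc(4.0 / math.sqrt(2.0))

# shift=0.0: proposal equals the target, so almost no draw lands in the tail.
naive = importance_estimate(20000, shift=0.0)

# shift=4.0: proposal centred on the rare region, so many draws contribute.
tilted = importance_estimate(20000, shift=4.0)

print(f"exact={exact:.3e}  naive={naive:.3e}  tilted={tilted:.3e}")
```

Both estimators are unbiased, but with the same number of draws the second has far lower variance because its proposal places mass where the integrand is non-zero. The same consideration, in a much larger state space of genealogies, is what motivates the careful design of proposals discussed in the abstract.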

Year:  2008        PMID: 18976228      PMCID: PMC2832804          DOI: 10.2202/1544-6115.1400

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115



1.  Generating samples under a Wright-Fisher neutral model of genetic variation.

Authors:  Richard R Hudson
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

2.  Ewens' sampling formula and related formulae: combinatorial proofs, extensions to variable population size and applications to ages of alleles.

Authors:  Robert C Griffiths; Sabin Lessard
Journal:  Theor Popul Biol       Date:  2005-11       Impact factor: 1.570

3.  Partition structures, Polya urns, the Ewens sampling formula, and the ages of alleles.

Authors:  P Donnelly
Journal:  Theor Popul Biol       Date:  1986-10       Impact factor: 1.570

4.  The sampling theory of neutral alleles and an urn model in population genetics.

Authors:  F M Hoppe
Journal:  J Math Biol       Date:  1987       Impact factor: 2.259

5.  The sampling theory of selectively neutral alleles.

Authors:  W J Ewens
Journal:  Theor Popul Biol       Date:  1972-03       Impact factor: 1.570

6.  Addendum to a paper of W. Ewens.

Authors:  S Karlin; J McGregor
Journal:  Theor Popul Biol       Date:  1972-03       Impact factor: 1.570

7.  Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.

Authors:  M K Kuhner; J Yamato; J Felsenstein
Journal:  Genetics       Date:  1995-08       Impact factor: 4.562

8.  Statistical properties of segregating sites.

Authors:  Y X Fu
Journal:  Theor Popul Biol       Date:  1995-10       Impact factor: 1.570

9.  Statistical properties of the number of recombination events in the history of a sample of DNA sequences.

Authors:  R R Hudson; N L Kaplan
Journal:  Genetics       Date:  1985-09       Impact factor: 4.562

10.  Counting all possible ancestral configurations of sample sequences in population genetics.

Authors:  Yun S Song; Rune Lyngsø; Jotun Hein
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2006 Jul-Sep       Impact factor: 3.710


1.  Topologies of the conditional ancestral trees and full-likelihood-based inference in the general coalescent tree framework.

Authors:  Ori Sargsyan
Journal:  Genetics       Date:  2010-05-17       Impact factor: 4.562

2.  An analytical framework in the general coalescent tree setting for analyzing polymorphisms created by two mutations.

Authors:  Ori Sargsyan
Journal:  J Math Biol       Date:  2014-04-24       Impact factor: 2.259

3.  Two-Locus Likelihoods Under Variable Population Size and Fine-Scale Recombination Rate Estimation.

Authors:  John A Kamm; Jeffrey P Spence; Jeffrey Chan; Yun S Song
Journal:  Genetics       Date:  2016-05-10       Impact factor: 4.562

4.  Importance sampling for Lambda-coalescents in the infinitely many sites model.

Authors:  Matthias Birkner; Jochen Blath; Matthias Steinrücken
Journal:  Theor Popul Biol       Date:  2011-02-04       Impact factor: 1.570

5.  Stopping-time resampling and population genetic inference under coalescent models.

Authors:  Paul A Jenkins
Journal:  Stat Appl Genet Mol Biol       Date:  2012-01-06

6.  Site frequency spectra from genomic SNP surveys.

Authors:  Ganeshkumar Ganapathy; Marcy K Uyenoyama
Journal:  Theor Popul Biol       Date:  2009-04-14       Impact factor: 1.570

7.  Allele frequency spectra in structured populations: Novel-allele probabilities under the labelled coalescent.

Authors:  Marcy K Uyenoyama; Naoki Takebayashi; Seiji Kumagai
Journal:  Theor Popul Biol       Date:  2020-03-03       Impact factor: 1.570

8.  Coalescent: an open-source and scalable framework for exact calculations in coalescent theory.

Authors:  Susanta Tewari; John L Spouge
Journal:  BMC Bioinformatics       Date:  2012-10-03       Impact factor: 3.169

9.  Coalescent: an open-science framework for importance sampling in coalescent theory.

Authors:  Susanta Tewari; John L Spouge
Journal:  PeerJ       Date:  2015-08-18       Impact factor: 2.984

10.  Bayesian Estimation of Population Size Changes by Sampling Tajima's Trees.

Authors:  Julia A Palacios; Amandine Véber; Lorenzo Cappello; Zhangyuan Wang; John Wakeley; Sohini Ramachandran
Journal:  Genetics       Date:  2019-09-11       Impact factor: 4.562

