Literature DB >> 22697240

Efficient simulation and likelihood methods for non-neutral multi-allele models.

Paul Joyce1, Alan Genz, Erkan Ozge Buzbas.   

Abstract

Throughout the 1980s, Simon Tavaré made numerous significant contributions to population genetics theory. As genetic data, in particular DNA sequence, became more readily available, a need to connect population-genetic models to data became the central issue. The seminal work of Griffiths and Tavaré (1994a , 1994b , 1994c) was among the first to develop a likelihood method to estimate the population-genetic parameters using full DNA sequences. Now, we are in the genomics era where methods need to scale-up to handle massive data sets, and Tavaré has led the way to new approaches. However, performing statistical inference under non-neutral models has proved elusive. In tribute to Simon Tavaré, we present an article in spirit of his work that provides a computationally tractable method for simulating and analyzing data under a class of non-neutral population-genetic models. Computational methods for approximating likelihood functions and generating samples under a class of allele-frequency based non-neutral parent-independent mutation models were proposed by Donnelly, Nordborg, and Joyce (DNJ) (Donnelly et al., 2001). DNJ (2001) simulated samples of allele frequencies from non-neutral models using neutral models as auxiliary distribution in a rejection algorithm. However, patterns of allele frequencies produced by neutral models are dissimilar to patterns of allele frequencies produced by non-neutral models, making the rejection method inefficient. For example, in some cases the methods in DNJ (2001) require 10(9) rejections before a sample from the non-neutral model is accepted. Our method simulates samples directly from the distribution of non-neutral models, making simulation methods a practical tool to study the behavior of the likelihood and to perform inference on the strength of selection.

Mesh:

Year:  2012        PMID: 22697240      PMCID: PMC3375650          DOI: 10.1089/cmb.2012.0033

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  13 in total

1.  Population growth of human Y chromosomes: a study of Y chromosome microsatellites.

Authors:  J K Pritchard; M T Seielstad; A Perez-Lezaun; M W Feldman
Journal:  Mol Biol Evol       Date:  1999-12       Impact factor: 16.240

2.  Two-locus sampling distributions and their application.

Authors:  R R Hudson
Journal:  Genetics       Date:  2001-12       Impact factor: 4.562

3.  Perfect simulation from population genetic models with selection.

Authors:  P Fearnhead
Journal:  Theor Popul Biol       Date:  2001-06       Impact factor: 1.570

4.  Likelihoods and simulation methods for a class of nonneutral population genetics models.

Authors:  P Donnelly; M Nordborg; P Joyce
Journal:  Genetics       Date:  2001-10       Impact factor: 4.562

5.  Ancestral processes for non-neutral models of complex diseases.

Authors:  Paul Fearnhead
Journal:  Theor Popul Biol       Date:  2003-03       Impact factor: 1.570

6.  Inference on the strength of balancing selection for epistatically interacting loci.

Authors:  Erkan Ozge Buzbas; Paul Joyce; Noah A Rosenberg
Journal:  Theor Popul Biol       Date:  2011-01-26       Impact factor: 1.570

7.  Approximate Bayesian computation in population genetics.

Authors:  Mark A Beaumont; Wenyang Zhang; David J Balding
Journal:  Genetics       Date:  2002-12       Impact factor: 4.562

8.  The effects of human population structure on large genetic association studies.

Authors:  Jonathan Marchini; Lon R Cardon; Michael S Phillips; Peter Donnelly
Journal:  Nat Genet       Date:  2004-03-28       Impact factor: 38.330

9.  Heterosis or neutrality?

Authors:  G A Watterson
Journal:  Genetics       Date:  1977-04       Impact factor: 4.562

10.  Sampling theory for neutral alleles in a varying environment.

Authors:  R C Griffiths; S Tavaré
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1994-06-29       Impact factor: 6.237

View more
  3 in total

1.  AABC: approximate approximate Bayesian computation for inference in population-genetic models.

Authors:  Erkan O Buzbas; Noah A Rosenberg
Journal:  Theor Popul Biol       Date:  2014-09-26       Impact factor: 1.570

2.  Inference from the stationary distribution of allele frequencies in a family of Wright-Fisher models with two levels of genetic variability.

Authors:  Jake M Ferguson; Erkan Ozge Buzbas
Journal:  Theor Popul Biol       Date:  2018-03-21       Impact factor: 1.570

3.  Generalization of the Ewens sampling formula to arbitrary fitness landscapes.

Authors:  Pavel Khromov; Constantin D Malliaris; Alexandre V Morozov
Journal:  PLoS One       Date:  2018-01-11       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.