Literature DB >> 30734077

A general framework for moment-based analysis of genetic data.

Maria Simonsen Speed1,2, David Joseph Balding3, Asger Hobolth4.   

Abstract

In population genetics, the Dirichlet (also called the Balding-Nichols) model has for 20 years been considered the key model to approximate the distribution of allele fractions within populations in a multi-allelic setting. It has often been noted that the Dirichlet assumption is approximate because positive correlations among alleles cannot be accommodated under the Dirichlet model. However, the validity of the Dirichlet distribution has never been systematically investigated in a general framework. This paper attempts to address this problem by providing a general overview of how allele fraction data under the most common multi-allelic mutational structures should be modeled. The Dirichlet and alternative models are investigated by simulating allele fractions from a diffusion approximation of the multi-allelic Wright-Fisher process with mutation, and applying a moment-based analysis method. The study shows that the optimal modeling strategy for the distribution of allele fractions depends on the specific mutation process. The Dirichlet model is only an exceptionally good approximation for the pure drift, Jukes-Cantor and parent-independent mutation processes with small mutation rates. Alternative models are required and proposed for the other mutation processes, such as a Beta-Dirichlet model for the infinite alleles mutation process, and a Hierarchical Beta model for the Kimura, Hasegawa-Kishino-Yano and Tamura-Nei processes. Finally, a novel Hierarchical Beta approximation is developed, a Pyramidal Hierarchical Beta model, for the generalized time-reversible and single-step mutation processes.

Keywords:  Allele fraction; Beta–Dirichlet; Diffusion; Dirichlet; Distribution of allele fractions; Evolutionary history; Hierarchical Beta; Moments; Multi-allelic Wright–Fisher; Mutation processes; Pyramid

Mesh:

Year:  2019        PMID: 30734077     DOI: 10.1007/s00285-018-01325-0

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.259


  18 in total

1.  Reconstructing population histories from single nucleotide polymorphism data.

Authors:  Jukka Sirén; Pekka Marttinen; Jukka Corander
Journal:  Mol Biol Evol       Date:  2010-09-06       Impact factor: 16.240

2.  Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation.

Authors:  Paula Tataru; Thomas Bataillon; Asger Hobolth
Journal:  Genetics       Date:  2015-08-26       Impact factor: 4.562

3.  Inferring population histories using genome-wide allele frequency data.

Authors:  Mathieu Gautier; Renaud Vitalis
Journal:  Mol Biol Evol       Date:  2012-11-15       Impact factor: 16.240

4.  Stepwise mutation model and distribution of allelic frequencies in a finite population.

Authors:  M Kimura; T Ohta
Journal:  Proc Natl Acad Sci U S A       Date:  1978-06       Impact factor: 11.205

5.  A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity.

Authors:  D J Balding; R A Nichols
Journal:  Genetica       Date:  1995       Impact factor: 1.082

Review 6.  Statistical Inference in the Wright-Fisher Model Using Allele Frequency Data.

Authors:  Paula Tataru; Maria Simonsen; Thomas Bataillon; Asger Hobolth
Journal:  Syst Biol       Date:  2017-01-01       Impact factor: 15.683

7.  Human triallelic sites: evidence for a new mutational mechanism?

Authors:  Alan Hodgkinson; Adam Eyre-Walker
Journal:  Genetics       Date:  2009-11-02       Impact factor: 4.562

8.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

9.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  PoMo: An Allele Frequency-Based Approach for Species Tree Estimation.

Authors:  Nicola De Maio; Dominik Schrempf; Carolin Kosiol
Journal:  Syst Biol       Date:  2015-07-23       Impact factor: 9.160

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.