| Literature DB >> 32769100 |
Zhangyi He1, Xiaoyang Dai2, Mark Beaumont2, Feng Yu3.
Abstract
Temporally spaced genetic data allow for more accurate inference of population genetic parameters and hypothesis testing on the recent action of natural selection. In this work, we develop a novel likelihood-based method for jointly estimating selection coefficient and allele age from time series data of allele frequencies. Our approach is based on a hidden Markov model where the underlying process is a Wright-Fisher diffusion conditioned to survive until the time of the most recent sample. This formulation circumvents the assumption required in existing methods that the allele is created by mutation at a certain low frequency. We calculate the likelihood by numerically solving the resulting Kolmogorov backward equation backward in time while reweighting the solution with the emission probabilities of the observation at each sampling time point. This procedure reduces the two-dimensional numerical search for the maximum of the likelihood surface, for both the selection coefficient and the allele age, to a one-dimensional search over the selection coefficient only. We illustrate through extensive simulations that our method can produce accurate estimates of the selection coefficient and the allele age under both constant and nonconstant demographic histories. We apply our approach to reanalyze ancient DNA data associated with horse base coat colors. We find that ignoring demographic histories or grouping raw samples can significantly bias the inference results.Entities:
Keywords: allele age; conditioned Wright-Fisher diffusion; hidden Markov model; maximum likelihood estimation; natural selection
Mesh:
Substances:
Year: 2020 PMID: 32769100 PMCID: PMC7536852 DOI: 10.1534/genetics.120.303400
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562