| Literature DB >> 27098912 |
Peter D Keightley1, José L Campos2, Tom R Booker2, Brian Charlesworth2.
Abstract
Many approaches for inferring adaptive molecular evolution analyze the unfolded site frequency spectrum (SFS), a vector of counts of sites with different numbers of copies of derived alleles in a sample of alleles from a population. Accurate inference of the high-copy-number elements of the SFS is difficult, however, because of misassignment of alleles as derived vs. ancestral. This is a known problem with parsimony using outgroup species. Here we show that the problem is particularly serious if there is variation in the substitution rate among sites brought about by variation in selective constraint levels. We present a new method for inferring the SFS using one or two outgroups that attempts to overcome the problem of misassignment. We show that two outgroups are required for accurate estimation of the SFS if there is substantial variation in selective constraints, which is expected to be the case for nonsynonymous sites in protein-coding genes. We apply the method to estimate unfolded SFSs for synonymous and nonsynonymous sites in a population of Drosophila melanogaster from phase 2 of the Drosophila Population Genomics Project. We use the unfolded spectra to estimate the frequency and strength of advantageous and deleterious mutations and estimate that ∼50% of amino acid substitutions are positively selected but that <0.5% of new amino acid mutations are beneficial, with a scaled selection strength of Nes ≈ 12.Entities:
Keywords: Drosophila; adaptation; distribution of fitness effects; site frequency spectrum (SFS)
Mesh:
Substances:
Year: 2016 PMID: 27098912 PMCID: PMC4896206 DOI: 10.1534/genetics.116.188102
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562