| Literature DB >> 25829516 |
Thomas Bataillon1, Jinjie Duan1, Christina Hvilsom2, Xin Jin3, Yingrui Li4, Laurits Skov5, Sylvain Glemin6, Kasper Munch5, Tao Jiang4, Yu Qian5, Asger Hobolth5, Jun Wang7, Thomas Mailund5, Hans R Siegismund8, Mikkel H Schierup1.
Abstract
We study genome-wide nucleotide diversity in three subspecies of extant chimpanzees using exome capture. After strict filtering, Single Nucleotide Polymorphisms and indels were called and genotyped for greater than 50% of exons at a mean coverage of 35× per individual. Central chimpanzees (Pan troglodytes troglodytes) are the most polymorphic (nucleotide diversity, θw = 0.0023 per site) followed by Eastern (P. t. schweinfurthii) chimpanzees (θw = 0.0016) and Western (P. t. verus) chimpanzees (θw = 0.0008). A demographic scenario of divergence without gene flow fits the patterns of autosomal synonymous nucleotide diversity well except for a signal of recent gene flow from Western into Eastern chimpanzees. The striking contrast in X-linked versus autosomal polymorphism and divergence previously reported in Central chimpanzees is also found in Eastern and Western chimpanzees. We show that the direction of selection statistic exhibits a strong nonmonotonic relationship with the strength of purifying selection S, making it inappropriate for estimating S. We instead use counts in synonymous versus nonsynonymous frequency classes to infer the distribution of S coefficients acting on nonsynonymous mutations in each subspecies. The strength of purifying selection we infer is congruent with the differences in effective sizes of each subspecies: Central chimpanzees are undergoing the strongest purifying selection followed by Eastern and Western chimpanzees. Coding indels show stronger selection against indels changing the reading frame than observed in human populations.Entities:
Keywords: effective size; fitness effect; mutation; selection
Mesh:
Year: 2015 PMID: 25829516 PMCID: PMC4419804 DOI: 10.1093/gbe/evv058
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FTheoretical expectation for DoS as a function of the strength of purifying selection against a mutation, S. Expectations are derived by numerical (exact) calculation based on the diffusion approximation of the Wright–Fisher model to calculate sojourn times and the total number of synonymous nonsynonymous polymorphic and divergent sites (see supplementary Mathematica notebook, Supplementary Material online, for details). Each S value on the x axis corresponds to a mean strength of selection that is also the mean of the assumed Γ distribution for S when computing the expected value of DoS. Different curves correspond each to a different shape parameter assumed for modeling the Γ distribution of S: Blue (shape of α = 0.05), magenta (α = 0.25), red (α = 0.5). In gray, we use for illustration the shape parameter inferred from the full SFS data in the Central subspecies (α = 0.12).
F(a) Venn diagrams for the autosomes and the X chromosomes are shown (left and right, respectively). The red numbers are nonsynonymous alleles and the green are synonymous alleles. The numbers in the brackets for nonsynonymous and synonymous alleles are the number of singletons for all three subpopulation that are present in the respective species. The blue are fixed nonsynonymous alleles and the black are fixed synonymous alleles. Notice that these fixed alleles are not part of the Venn diagram; they are just a measure of how many alleles that are fixed in each subspecies. The allele can be both polymorphic and fixed in the other species. The numbers in the brackets for the fixed synonymous and nonsynonymous alleles are fixed alleles that are different from the reference allele (here the human allele), which only occurs in the respective subspecies, meaning that the two other species both have the reference allele in this position with no variation. (b) Venn diagrams for the autosomes and the X chromosomes are shown (left and right, respectively). The red numbers show the number of deletions and the green number shows the number of insertions. The numbers in the brackets are how many deletions or insertions that are a multiple of 3.
FPrincipal component analysis. (A) Synonymous SNP frequencies. (B) Indels.
FOverview of the demographic model inferred from autosomal synonymous SNPs. Estimates reported here are assuming a constant population sizes within each subspecies and a pure divergence model without gene flow. Population size and divergence time estimates are based on ABC and coalescent simulations assuming a generation time of 20 years and a mutation rate of 0.6 × 10−9 per year per site.
FDerived SFS of polymorphism segregating in each subspecies. (A) SFS for autosomes. (B) SFS for X-linked exons.
FDistribution of indel polymorphism and indel size. (A) SFS of indel in autosomes. (B) Distribution of indel variants size.
FEmpirical distribution of NI and DoS on autosomal and X-linked regions. Divergence is computed using exclusively the chimpanzee branch from human–chimpanzee ancestor. (A) DoS in each subspecies. (B) NI per subspecies (on a log10 scale).
FDistribution of fitness effect of nonsynonymous and indel mutations in each subspecies. (A) Distribution of S inferred from the synonymous and nonsynonymous SFS autosomal data in each subspecies. Distributions are discretized by reporting the proportion of the mutation in four classes of purifying selection. Error bars denote SE around estimated proportions. (B) Distribution of S inferred from the synonymous and nonsynonymous SFS on X-linked data in each subspecies. (C) Distribution of S inferred from the multiple of three and nonmultiple of three SFS of autosomal indel data in each subspecies.
Summary of Synonymous and Nonsynonymous Variation in Exon Regions
| Watterson’s Theta (θw) | Nucleotide Diversity (π) | |||||
|---|---|---|---|---|---|---|
| East | Central | West | East | Central | West | |
| Autosomal synonymous | 0.0016 | 0.0023 | 0.0008 | 0.0015 | 0.0017 | 0.0008 |
| Autosomal nonsynonymous | 0.0004 | 0.0005 | 0.0002 | 0.0003 | 0.0003 | 0.0002 |
| Ratio | 0.23 | 0.22 | 0.25 | 0.21 | 0.20 | 0.24 |
| X synonymous | 0.0009 | 0.0011 | 0.0004 | 0.0007 | 0.0008 | 0.0004 |
| X nonsynonymous | 0.0002 | 0.0002 | 0.0001 | 0.0001 | 0.0002 | 0.0001 |
| Ratio | 0.20 | 0.20 | 0.22 | 0.18 | 0.18 | 0.22 |
| X/Autosomal (syn) | 0.53 | 0.49 | 0.50 | 0.47 | 0.49 | 0.48 |
| X/Autosomal (nonsyn) | 0.46 | 0.45 | 0.45 | 0.41 | 0.44 | 0.44 |