| Literature DB >> 27678122 |
Susanne P Pfeifer1,2,3, Jeffrey D Jensen4,2,3.
Abstract
Levels of nucleotide diversity vary greatly across the genomes of most species owing to multiple factors. These include variation in the underlying mutation rates, as well as the effects of both direct and linked selection. Fundamental to interpreting the relative importance of these forces is the common observation of a strong positive correlation between nucleotide diversity and recombination rate. While indeed observed in humans, the interpretation of this pattern has been difficult in the absence of high-quality polymorphism data and recombination maps in closely related species. Here, we characterize genetic features driving nucleotide diversity in Western chimpanzees using a recently generated whole genome polymorphism data set. Our results suggest that recombination rate is the primary predictor of nucleotide variation with a strongly positive correlation. In addition, telomeric distance, regional GC-content, and regional CpG-island content are strongly negatively correlated with variation. These results are compared with humans, with both similarities and differences interpreted in the light of the estimated effective population sizes of the two species as well as their strongly differing recent demographic histories.Entities:
Keywords: chimpanzee; nucleotide diversity; selection
Mesh:
Year: 2016 PMID: 27678122 PMCID: PMC5174744 DOI: 10.1093/gbe/evw240
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Nucleotide diversity levels π in Western chimpanzees across chromosomes (estimated in 1 Mb windows with at least 80% accessibility after filtering). π = 0 (dark blue): no data available.
. 2.—Distribution of estimates for nucleotide diversity π (blue) and Watterson's estimate of θ (green) by chromosomes. Calculated using 1 Mb windows with at least 80% accessibility after filtering.
. 3.—Relative importance of each significant regressor in the model for the nucleotide diversity estimate π in (a) Western chimpanzees (R2= 45.85%) and (b) humans (R2= 55.69%). The LMG metric (Lindemann et al. 1980) was used to calculate the relative importance of each predictor by partitioning R2 by averaging over orders (computed using Gnu R's “rela.impo” package). Metrics were normalized to sum to 100%. Significance levels: *** 0; ** 0.001; * 0.01.
Pairwise Correlation Between Different Predictors Used in the Regression Model for Western Chimpanzees (Significant Correlations with P < 0.01 are Highlighted in Bold)
| Recombination rate | GC content | Distance to centromere | Distance to telomere | Size | Exon content | CpG-Island content | Simple repeat content | |
|---|---|---|---|---|---|---|---|---|
| π | ||||||||
| Recombination rate | ||||||||
| GC content | 0.046 | |||||||
| Distance to centromere | 0.002 | |||||||
| Distance to telomere | ||||||||
| Size | ||||||||
| Exon content | ||||||||
| CpG | 0.037 |