| Literature DB >> 26721855 |
Jing Wang1, Nathaniel R Street2, Douglas G Scofield3, Pär K Ingvarsson4.
Abstract
A central aim of evolutionary genomics is to identify the relative roles that various evolutionary forces have played in generating and shaping genetic variation within and among species. Here we use whole-genome resequencing data to characterize and compare genome-wide patterns of nucleotide polymorphism, site frequency spectrum, and population-scaled recombination rates in three species of Populus: Populus tremula, P. tremuloides, and P. trichocarpa. We find that P. tremuloides has the highest level of genome-wide variation, skewed allele frequencies, and population-scaled recombination rates, whereas P. trichocarpa harbors the lowest. Our findings highlight multiple lines of evidence suggesting that natural selection, due to both purifying and positive selection, has widely shaped patterns of nucleotide polymorphism at linked neutral sites in all three species. Differences in effective population sizes and rates of recombination largely explain the disparate magnitudes and signatures of linked selection that we observe among species. The present work provides the first phylogenetic comparative study on a genome-wide scale in forest trees. This information will also improve our ability to understand how various evolutionary forces have interacted to influence genome evolution among related species.Entities:
Keywords: Populus; natural selection; nucleotide polymorphism; recombination; whole-genome resequencing
Mesh:
Substances:
Year: 2015 PMID: 26721855 PMCID: PMC4788117 DOI: 10.1534/genetics.115.183152
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Figure 1Genome-wide patterns of polymorphism among three Populus species. Nucleotide diversity (Θπ) was calculated over 100-kbp nonoverlapping windows in P. tremula (orange line), P. tremuloides (blue line), and P. trichocarpa (green line) along the 19 chromosomes.
Figure 2Distribution and correlations of (A) polymorphism (Θπ), (B) Tajima’s D, and (C) population-scaled recombination rate (ρ) between pairwise comparisons of P. tremula, P. tremuloides, and P. trichocarpa over 100-kbp nonoverlapping windows. The red-to-yellow-to-blue gradient indicates decreased density of observed events at a given location in the graph. Spearman’s rank correlation coefficient (rho) and the P-value are shown in each subplot. (***P < 2.2 × 10−16, **P < 0.001). The dashed gray line in each subplot indicates a simple linear regression line with intercept being zero and slope being one.
Figure 3Estimates of purifying and positive selection at 0-fold nonsynonymous sites in three Populus species. (A) The distribution of fitness effects of new amino acid mutations, (B) the proportion of adaptive substitution (α), and (C) the rate of adaptive nonsynonymous-to-synonymous substitutions (ω) in P. tremula (orange bar), P. tremuloides (blue bar), and P. trichocarpa (green bar). Error bars represent 95% bootstrap confidence intervals.
Summary of the correlation coefficients (Spearman’s rank correlation coefficient) between levels of neutral polymorphism (Θ), divergence (d), and recombination rate (ρ) in genic and intergenic regions among all three Populus species
| Data set | Species | Pairwise | Partial | Pairwise | Partial | ||
|---|---|---|---|---|---|---|---|
| 100 kbp | 0.339 | 0.309 | 0.043 | 0.062 | 0.142 | −0.077 | |
| 0.310 | 0.284 | 0.061 | −0.037 | 0.100 | −0.029 | ||
| 0.011 | −0.024 | 0.053 | −0.080 | −0.002 | −0.015 | ||
| 1 Mb | 0.647 | 0.573 | −0.070 | 0.201 | 0.348 | −0.209 | |
| 0.400 | 0.363 | −0.033 | 0.032 | 0.320 | −0.127 | ||
| 0.227 | 0.151 | −0.027 | −0.072 | 0.165 | −0.120 | ||
Partial correlation controls for GC content, gene density, divergence of fourfold synonymous sites between aspen and P. trichocarpa, and coverage (the number of fourfold synonymous bases covered by sequencing data).
Partial correlation controls for GC content, gene density, divergence of intergenic sites between aspen and P. trichocarpa, and coverage (the number of intergenic bases covered by sequencing data).
P < 0.05.
P < 0.001.
P < 2.2 × 10−16
Figure 4Correlations of estimates between neutral genetic diversity (Θfourfold) (left), neutral genetic divergence (dfourfold) (right), and population-scaled recombination rates (ρ) over 1-Mb nonoverlapping windows. Linear regression lines are colored according to species: (A) P. tremula (orange line), (B) P. tremuloides (blue line), and (C) P. trichocarpa (green line).
Summary of the correlation coefficients (Spearman’s rank correlation coefficient) between recombination rate (ρ) and the ratio of nonsynonymous to synonymous polymorphism (θ0-fold/θfourfold) and divergence (d0-fold/dfourfold)
| Data set | Species | Pairwise | Partial | Pairwise | Partial |
|---|---|---|---|---|---|
| 100 kbp | −0.057 | −0.075 | −0.012 | −0.005 | |
| −0.118 | −0.122 | −0.003 | −0.002 | ||
| −0.004 | −0.002 | −0.026 | −0.020 | ||
| 1 Mb | −0.063 | −0.045 | −0.007 | 0.017 | |
| −0.142 | −0.092 | 0.014 | 0.020 | ||
| 0.035 | −0.002 | 0.030 | 0.036 | ||
Partial correlation controls for GC content, gene density, and the number of fourfold synonymous and 0-fold nonsynonymous bases covered by sequencing data.
P < 0.05.
P < 0.001.
Figure 5Correlations of estimates between (A) population-scaled recombination rates (ρ), (B) genic genetic diversity (Θfourfold), (C) intergenic genetic diversity (ΘIntergenic), and gene density over 1-Mb nonoverlapping windows in P. tremula (left), P. tremuloides (middle), and P. trichocarpa (right). Gray points represent the statistics computed over 1-Mb nonoverlapping windows. Colored lines denote the lowess curves fit to the two analyzed variables in each species.
Summary of the correlation coefficients (Spearman’s rank correlation coefficient) between gene density and population recombination rate (ρ), neutral polymorphism in genic (Θfourfold), and intergenic regions (Θintergenic) over 1-Mb nonoverlapping windows in three Populus species
| Gene density | Gene density | Gene density | |||||
|---|---|---|---|---|---|---|---|
| Species | Correlation type | Low | High | Low | High | Low | High |
| Pairwise | 0.674 | −0.112 | 0.601 | −0.180 | 0.431 | −0.605 | |
| Partial | 0.516 | 0.263 | 0.191 | 0.110 | 0.263 | −0.438 | |
| Pairwise | 0.527 | 0.006 | 0.576 | −0.077 | 0.419 | −0.600 | |
| Partial | 0.315 | 0.048 | 0.407 | 0.280 | 0.363 | −0.444 | |
| Pairwise | 0.609 | 0.168 | 0.417 | −0.033 | 0.529 | −0.513 | |
| Partial | 0.477 | 0.193 | 0.242 | 0.263 | 0.432 | −0.273 | |
Partial correlation controls for GC content and the number of bases covered by the data.
Partial correlation controls for GC content, population recombination rate, divergence of fourfold synonymous sites between aspen and P. trichocarpa, and coverage (the number of fourfold synonymous bases covered by sequencing data).
Partial correlation controls for GC content, population recombination rate, divergence of intergenic sites between aspen and P. trichocarpa, and coverage (the number of intergenic bases covered by sequencing data).
P < 0.05.
P < 0.001.
P < 2.2 × 10−16
Summary of the correlation coefficients (Spearman’s rank correlation coefficient) between levels of synonymous diversity (Θfourfold) and nonsynonymous divergence (d0-fold) at different physical scales in three Populus species
| Data set | Species | Pairwise | Partial |
|---|---|---|---|
| 100 kbp | −0.029 | −0.032 | |
| −0.021 | −0.025 | ||
| −0.053 | −0.051 | ||
| 1 Mb | −0.049 | 0.043 | |
| −0.069 | −0.008 | ||
| −0.086 | −0.006 | ||
| Single genes | −0.087 | −0.185 | |
| −0.087 | −0.192 | ||
| −0.148 | −0.218 | ||
Partial means partial correlation controls for GC content, gene density, population recombination rate, divergence of fourfold synonymous sites between aspen and P. trichocarpa, the number of fourfold synonymous bases and 0-fold nonsynonymous bases covered by sequencing data.
Partial means correlation between d0-fold and θfourfold/dfourfold.
P < 0.05.
P < 2.2×10−16