| Literature DB >> 28333233 |
Amanda R De La Torre1,2, Zhen Li3,4, Yves Van de Peer3,4,5, Pär K Ingvarsson2,6.
Abstract
The majority of variation in rates of molecular evolution among seed plants remains both unexplored and unexplained. Although some attention has been given to flowering plants, reports of molecular evolutionary rates for their sister plant clade (gymnosperms) are scarce, and to our knowledge differences in molecular evolution among seed plant clades have never been tested in a phylogenetic framework. Angiosperms and gymnosperms differ in a number of features, of which contrasting reproductive biology, life spans, and population sizes are the most prominent. The highly conserved morphology of gymnosperms evidenced by similarity of extant species to fossil records and the high levels of macrosynteny at the genomic level have led scientists to believe that gymnosperms are slow-evolving plants, although some studies have offered contradictory results. Here, we used 31,968 nucleotide sites obtained from orthologous genes across a wide taxonomic sampling that includes representatives of most conifers, cycads, ginkgo, and many angiosperms with a sequenced genome. Our results suggest that angiosperms and gymnosperms differ considerably in their rates of molecular evolution per unit time, with gymnosperm rates being, on average, seven times lower than angiosperm species. Longer generation times and larger genome sizes are some of the factors explaining the slow rates of molecular evolution found in gymnosperms. In contrast to their slow rates of molecular evolution, gymnosperms possess higher substitution rate ratios than angiosperm taxa. Finally, our study suggests stronger and more efficient purifying and diversifying selection in gymnosperm than in angiosperm species, probably in relation to larger effective population sizes.Entities:
Keywords: angiosperms; gymnosperms; life-history traits; mutation; selection; substitution rates
Mesh:
Year: 2017 PMID: 28333233 PMCID: PMC5435085 DOI: 10.1093/molbev/msx069
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FPhylogenetic tree obtained from a concatenated alignment of 42 single-copy genes and 66 species distributed between two main seed plant lineages (angiosperms in green, gymnosperms in blue, and outgroup in black) inferred by RAxML. Stem branches for angiosperms and gymnosperms were used for model testing using the branch models implemented in codeml. Results of the model testing can be found in table 2.
Absolute Rates of Silent-Site Divergence (μ) for Each Taxonomic Family or Subfamily Based on 42 Single-Copy Nuclear Genes in 31 Gymnosperms and 34 Angiosperms Species.
| Plant Clade | Taxa | Subtaxa | dS | ||
|---|---|---|---|---|---|
| Gymnosperms | Pinaceae | 0.102 | 7.28067E–10 | ||
| Pinus | 0.070 | 84500000 | 4.16479E–10 | ||
| Cedrus_Abies | 0.128 | 129700000 | 4.9387E–10 | ||
| Larix_Pseudotsuga | 0.148 | 61600000 | 1.19939E–09 | ||
| Picea | 0.061 | 38300000 | 8.02526E–10 | ||
| Cupressaceae | 0.189 | 159200000 | 5.92216E–10 | ||
| Taxaceae | 0.225 | 153000000 | 7.34573E–10 | ||
| Podocarpaceae | 0.170 | 146100000 | 5.81248E–10 | ||
| Cycadales | 0.333 | 248100000 | 6.70325E–10 | ||
| Gnetophytes | 0.073 | 25000000 | 1.45547E–09 | ||
| Angiosperms | Brassicaceae | 0.352 | 27000000 | 6.51648E–09 | |
| Malvaceae | 0.373 | 40900000 | 4.56E–09 | ||
| Euphorbiaceae | 0.463 | 51000000 | 4.54382E–09 | ||
| Fabaceae | 0.502 | 53100000 | 4.72809E–09 | ||
| Cucurbitaceae | 0.239 | 20000000 | 5.9795E–09 | ||
| Rosaceae | 0.421 | 52300000 | 4.02113E–09 | ||
| Solanaceae | 0.084 | 7300000 | 5.75068E–09 | ||
| Poaceae | 0.372 | 6.0277E–09 | |||
| Poaceae I | 0.457 | 39700000 | 5.76174E–09 | ||
| Poaceae II | 0.287 | 22800000 | 6.29366E–09 | ||
Note.—Divergence times (T) are based on fossil calibration data and published studies (see supplementary table S2, Supplementary Material Online).
Parameter Estimates under Models of Variable Omega (ω) among Branches from the Gymnosperm and Angiosperm Lineages Based on a Concatenated Alignment of 31,737 Nucleotide Sites in 61 Species.
| Hypothesis | Model | No. of Parameters | Background | Foreground | Parameter Estimates | ln | AIC |
|---|---|---|---|---|---|---|---|
| H0 | M0 | 1 | –621777.74 | 1243557.48 | |||
| H1 | M2 | 2 | –621644.6499 | 1243293.3 | |||
| H2 | M2 | 2 | –619928.0209 | 1239860.042 | |||
| H3 | M2 | 3 | –619863.362 | 1239732.724 | |||
| H4 | M1 | 120 | –617598.8386 | 1235437.677 |
Note.—We tested the following hypotheses: H0: Homogeneous selective pressure in both clades (ω0 = ωG = ωA); H1: Selective Pressure in the Angiosperm clade (ω0 = ωG, ωA); H2: selective pressure in the gymnosperm clade (ω0 = ωA, ωG); H3: long-term shift in selective pressure resulting in both clades having different omegas (ω0, ωG, ωA); and H4: all terminal branches in both clades have different omegas. For hypotheses H1 to H3, ωG and ωA identify estimated omegas on stem branches of the gymnosperm and angiosperm lineages, according to phylogenetic tree in figure 1; ω0 represents the estimated omega in unselected branches (background). Parameter estimates of model M1 (hypothesis H4) are detailed in supplementary table S7, Supplementary Material Online.
Results of the Likelihood Ratio Tests for Evidence of Positive Selection in Each of the Taxonomic Families Studied.
| Taxa | M1a–M2a (2ΔL) | M7–M8 (2ΔL) | ||
|---|---|---|---|---|
| Brassicaceae | 0 | NS | 2.577926 | 0.2766 |
| Malvaceae | 0 | NS | 33.063102 | 0.00000007 |
| Fabaceae | 0 | NS | 22.744606 | 0.00001151 |
| Rosaceae | 0 | NS | 6.429714 | 0.04016 |
| Poaceae | 0 | NS | 15.428744 | 0.0004463 |
| Pinus | 511.4601 | 0 | 549.377428 | 0 |
| Picea | 116.785146 | 0 | 116.646654 | 0 |
| Cupressaceae | 0 | NS | 15.11157 | 0.000523 |
| Cycadales | 0 | NS | 6.535688 | 0.03808 |
Note.—Comparisons between site models M1a and M2a, and M7 and M8 for each taxonomic family were evaluated.
F(A) Boxplots showing differences in number of synonymous substitutions (dS), nonsynonymous substitutions (dN), absolute rate of silent-site divergence (μ), and substitution rate ratio (ω), among life forms defined as angiosperms herbs (green), angiosperms shrub/trees (light green), and gymnosperms (blue). Results of the statistical tests of comparisons among groups can be found in supplementary table S6, Supplementary Material Online. (B) Proportion of sites under negative selection, when all sites are evaluated (top left), when S < –15 (top right), and when S > –15 (bottom left); and proportion of sites under positive selection (bottom right). Green boxes represent angiosperm species, and blue boxes, gymnosperms. (C) Distribution of the selection coefficient (f(S)) of new mutations. It was calculated for all species in each taxonomic family studied, assuming a gamma distribution of ω among sites. Parameter estimates α and β were obtained from running the site model 5 in codeml (PAML) using a concatenated alignment of 29,000–31,000 sites per taxonomic family. Selection coefficients were then obtained replacing α and β in equation 1. Green dotted vertical lines are used to show the distribution of f(S) when S ≈ 0 and ω = 1 (neutrality). Taxonomic families containing less than 5 species were not analyzed due to difficulties in constructing the phylogenetic trees in RaxML.
Correlation among Number of Synonymous (dS) and Nonsynonymous Substitutions (dN), Nonsynonymous/synonymous Rate Ratio (ω), and the Absolute Rate of Silent-Site Divergence (μ), with Life-History Traits Using the Phylogenetically Independent Contrast Method (PIC).
| Species Correlations | Taxonomic Family Correlations | |||||
|---|---|---|---|---|---|---|
| Parameters | Generation Time (years) | Genome Size (1C) | Generation Time (years) | Genome Size (1C) | Species Richness | |
| dS | –0.172 | –0.135 | –0.153 | –0.311 | 0.703 | |
| 0.223 | 0.341 | 0.558 | 0.225 | 0.002 | ||
| df | 50 | 50 | 15 | 15 | 15 | |
| dN | 0.005 | 0.041 | 0.189 | –0.026 | 0.485 | |
| 0.973 | 0.771 | 0.467 | 0.922 | 0.048 | ||
| df | 50 | 50 | 15 | 15 | 15 | |
| Μ | –0.183 | –0.037 | –0.781 | –0.670 | 0.057 | |
| 0.194 | 0.797 | 0.000 | 0.003 | 0.829 | ||
| df | 50 | 50 | 15 | 15 | 15 | |
| Ω | 0.212 | 0.567 | 0.482 | 0.527 | –0.224 | |
| 0.132 | 0.000 | 0.050 | 0.030 | 0.387 | ||
| df | 50 | 50 | 15 | 15 | 15 | |
Note.—Substitution rates and life-history traits were estimated by species pairs (species correlations table); and averaged within taxonomic family (taxonomic family correlations table). We reject a null hypothesis of correlation equal to zero when P < 0.05 (shaded areas).