| Literature DB >> 19390686 |
Bojian Zhong1, Takahiro Yonezawa, Yang Zhong, Masami Hasegawa.
Abstract
BACKGROUND: It has been suggested that the chloroplast genomes of the grass family, Poaceae, have undergone an elevated evolutionary rate compared to most other angiosperms, yet the details of this phenomenon have remained obscure. To know how the rate change occurred during evolution, estimation of the time-scale with reliable calibrations is needed. The recent finding of 65 Ma grass phytoliths in Cretaceous dinosaur coprolites places the diversification of the grasses to the Cretaceous period, and provides a reliable calibration in studying the tempo and mode of grass chloroplast evolution. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2009 PMID: 19390686 PMCID: PMC2669172 DOI: 10.1371/journal.pone.0005297
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The phylogenetic tree of chloroplast genomes for the 31 species.
The tree topology in Fig. 3 of ref.[6] was used, and the branch lengths were estimated by the ML with the codon-substitution model [21], [22] (CODEML in PAML [19]). The root was arbitrarily placed between Gymnosperm and Angiosperm. Non-synonymous (d N) and synonymous (d S) distances of Poales from Musa were estimated by CODEML.
Figure 3Posterior estimates of divergence times of a whole Angiosperm tree.
Estimations were done by using MCMCTREE [19] with the IR model [15]. The >65 Ma constraint to the Zea/Oryza separation was applied.
Figure 2Posterior estimates of divergence times and rate change during evolution.
Estimations were done by using MCMCTREE in PAML [19] with the IR model [15] for the rate change along lineages. Shape and scale parameters, α and β, in the gamma prior for parameter σ2 were 1.0 and 10.0, respectively. Only Poales+Musa part of the whole tree is shown, and a numbering of a node follows that of the whole tree in Fig. 3. The upper lines of the colored area trace the estimated rates along the lineage from the root to Oryza (the lineage indicated by colored lines in the tree) where 95% highest posterior density (HPD) is shown by a vertical line segment with two short horizontal line segments at boundaries. (A) >65 Ma constraint and (B) no constraint to the Zea/Oryza separation (for other calibrations, see Materials and Methods).
Posterior estimates of divergence times with the >65 Ma constraint to the Zea/Oryza separation.
| Node | Independent-rates (IR) model | Correlated-rates (CR) model | ||
| Time (Ma) | Rate | Time (Ma) | Rate | |
| 1 ( | 62.3 (44.4, 84.4) | 0.033 (0.009, 0.090) | 58.4 (45.5, 76.2) | 0.030 (0.014, 0.058) |
| 2 ( |
|
|
|
|
| 3 ( | 114.1 (91.4, 144.5) | 0.069 (0.029, 0.157) | 150.7 (123.2, 179.5) | 0.062 (0.033, 0.104) |
| 4 ( | 134.3 (111.9, 165.8) | 0.040 (0.012, 0.106) | 171.0 (145.1, 197.7) | 0.048 (0.027, 0.079) |
| 5 ( | 141.0 (117.0, 173.4) | 0.046 (0.015, 0.116) | 175.6 (149.3, 202.1) | 0.065 (0.038, 0.104) |
| 6 ( | 152.2 (125.7, 185.4) | 0.041 (0.012, 0.109) | 182.0 (156.0, 208.0) | 0.047 (0.028, 0.076) |
| 7 ( | 159.6 (131.8, 193.6) | 0.083 (0.035, 0.185) | 187.0 (160.7, 212.9) | 0.070 (0.043, 0.107) |
| 8 ( | 187.8 (153.1, 224.4) | 0.057 (0.023, 0.136) | 216.3 (188.8, 241.5) | 0.053 (0.032, 0.085) |
| 9 (Eudicots) | 212.5 (175.0, 247.5) | 0.035 (0.009, 0.096) | 239.1 (211.5, 262.3) | 0.055 (0.033, 0.086) |
| 10( | 216.2 (178.0, 251.4) | 0.033 (0.008, 0.091) | 240.9 (213.4, 263.8) | 0.050 (0.029, 0.079) |
| 11( | 218.6 (180.1, 254.0) | 0.061 (0.024, 0.145) | 242.1 (214.6, 264.9) | 0.055 (0.034, 0.086) |
| 12( | 240.8 (199.7, 274.7) | 0.053 (0.020, 0.125) | 263.0 (235.3, 283.9) | 0.056 (0.032, 0.091) |
| 13( | 262.4 (219.8, 293.3) | 0.043 (0.014, 0.108) | 280.8 (253.3, 299.5) | 0.057 (0.032, 0.092) |
| 14( | 276.9 (233.3, 305.1) | 0.054 (0.011, 0.174) | 289.8 (262.2, 307.8) | 0.059 (0.035, 0.096) |
| 15(Gymnosperm) | 297.2 (280.5, 310.3) | – | 299.5 (281.4, 310.6) | – |
| 16 | 179.4 (142.1, 218.8) | 0.057 (0.024, 0.135) | 196.7 (166.5, 230.0) | 0.039 (0.025, 0.071) |
| Terminal branch to | 0 | 0.038 (0.027, 0.052) | 0 | 0.040 (0.030, 0.051) |
MCMCTREE in PAML [19] was used with the GTR+Γ5 model. Shape and scale parameters, α and β, in the gamma prior for parameter σ2 were 1.0 and 10.0, respectively. 95% highest posterior density (HPD) is shown in parentheses. Rate (×10−8 substitutions/ nucleotide/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.
Posterior estimates of divergence times without constraint to the Zea/Oryza separation.
| Node | Independent-rates (IR) model | Correlated-rates (CR) model | ||
| Time (Ma) | Rate | Time (Ma) | Rate | |
| 1 ( | 47.2 (28.4, 71.5) | 0.040 (0.012, 0.105) | 32.4 (21.6, 49.6) | 0.059 (0.028, 0.103) |
| 2 ( | 55.0 (35.2, 80.4) | 0.212 (0.124, 0.377) | 36.9 (24.6, 56.7) | 0.119 (0.087, 0.159) |
| 3 ( | 104.2 (78.7, 135.5) | 0.063 (0.026, 0.144) | 119.8 (90.9, 156.5) | 0.059 (0.034, 0.096) |
| 4 ( | 126.8 (105.7, 158.6) | 0.040 (0.012, 0.105) | 140.8 (113.0, 176.2) | 0.043 (0.025, 0.069) |
| 5 ( | 133.6 (111.1, 166.1) | 0.044 (0.014, 0.112) | 145.8 (117.4, 181.1) | 0.057 (0.033, 0.091) |
| 6 ( | 145.3 (120.2, 179.3) | 0.041 (0.012, 0.108) | 153.2 (124.5, 187.7) | 0.043 (0.026, 0.066) |
| 7 ( | 152.6 (125.8, 187.8) | 0.080 (0.034, 0.179) | 158.7 (129.4, 193.2) | 0.059 (0.035, 0.093) |
| 8 ( | 181.7 (147.1, 220.3) | 0.054 (0.022, 0.128) | 193.7 (159.4, 226.9) | 0.046 (0.027, 0.073) |
| 9 (Eudicots) | 207.7 (169.2, 245.1) | 0.036 (0.010, 0.098) | 220.4 (184.1, 251.4) | 0.047 (0.029, 0.072) |
| 10 ( | 211.3 (172.3, 249.1) | 0.034 (0.009, 0.094) | 222.6 (186.2, 253.2) | 0.044 (0.026, 0.070) |
| 11 ( | 213.6 (174.3, 251.5) | 0.060 (0.023, 0.141) | 223.9 (187.4, 254.5) | 0.047 (0.029, 0.072) |
| 12 ( | 236.2 (193.9, 272.9) | 0.053 (0.020, 0.128) | 248.6 (210.3, 277.3) | 0.048 (0.027, 0.079) |
| 13 ( | 258.2 (213.6, 291.8) | 0.042 (0.014, 0.107) | 269.2 (229.4, 296.1) | 0.048 (0.027, 0.078) |
| 14 ( | 272.8 (227.0, 304.2) | 0.056 (0.012, 0.177) | 279.7 (239.6, 305.8) | 0.053 (0.031, 0.086) |
| 15 (Gymnosperm) | 296.6 (280.3, 310.2) | – | 297.2 (280.4, 310.3) | – |
| 16 | 176.9 (138.5, 217.4) | 0.061 (0.026, 0.139) | 178.5 (141.1, 215.1 | 0.040 (0.024, 0.068) |
| Terminal branch to | 0 | 0.051 (0.032, 0.080) | 0 | 0.074 (0.046, 0.106) |
MCMCTREE in PAML [19] was used with the GTR+Γ5 model. Shape and scale parameters, α and β, in the gamma prior for parameter σ2 were 1.0 and 10.0, respectively. 95% HPD is shown in parentheses. Rate (×10−8 substitutions/ nucleotide/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.
Posterior estimates of divergence times excluding Poaceae.
| Node | Independent-rates (IR) model | Correlated-rates (CR) model | ||
| Time (Ma) | Rate | Time (Ma) | Rate | |
| 4 ( | 115.5 (96.3, 136.7) | 0.033 (0.011, 0.083) | 110.6 (91.6, 128.6) | 0.031 (0.016, 0.052) |
| 5 ( | 125.1 (103.9, 149.2) | 0.033 (0.012, 0.081) | 119.2 (99.0, 138.7) | 0.034 (0.022, 0.053) |
| 6 ( | 139.5 (115.5, 167.3) | 0.034 (0.010, 0.086) | 130.8 (109.9, 150.3) | 0.034 (0.021, 0.052) |
| 7 ( | 147.7 (122.0, 177.2) | 0.060 (0.029, 0.128) | 137.3 (115.4, 158.1) | 0.037 (0.026, 0.056) |
| 8 ( | 184.6 (151.4, 219.8) | 0.041 (0.019, 0.090) | 190.8 (162.0, 218.1) | 0.037 (0.022, 0.060) |
| 9 (Eudicots) | 216.5 (181.5, 248.9) | 0.033 (0.010, 0.085) | 223.2 (194.0, 246.9) | 0.041 (0.027, 0.060) |
| 10 ( | 220.1 (184.7, 252.6) | 0.031 (0.009, 0.081) | 225.5 (196.2, 248.9) | 0.041 (0.026, 0.062) |
| 11 ( | 222.2 (186.6, 254.9) | 0.057 (0.024, 0.128) | 226.7 (197.4, 250.2) | 0.041 (0.028, 0.060) |
| 12 ( | 244.7 (206.9, 275.8) | 0.050 (0.020, 0.114) | 253.8 (222.5, 276.8) | 0.045 (0.028, 0.070) |
| 13 ( | 266.3 (227.3, 294.5) | 0.040 (0.014, 0.096) | 274. 8 (241.9, 297.0) | 0.044 (0.028, 0.068) |
| 14 ( | 280.2 (240.4, 305.9) | 0.047 (0.011, 0.139) | 285.2 (251.9, 307.0) | 0.049 (0.031, 0.076) |
| 15 (Gymnosperms) | 298.3 (280.8, 310.4) | – | 300.3 (282.0, 310.7) | – |
| 16 | 190.4 (157.5, 223.6) | 0.067 (0.031, 0.141) | 188.6 (161.2, 213.2) | 0.045 (0.030, 0.066) |
MCMCTREE in PAML [19]] was used with the GTR+Γ5 model. Shape and scale parameters, α and β, in the gamma prior for parameter σ2 were 1.0 and 10.0, respectively. 95% HPD is shown in parentheses. Rate (×10−8 substitutions/ nucleotide/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.
Estimation of non-synonymous/synonymous rate ratio (ω) under different models by using CODEML in PAML [19].
| Model | ω0 | ω1 | ω2 | Ln L | LRT with 1ω-model | AIC |
| 1ω | 0.1518 | – | – | −115,741.5 | – | 231,485.1 |
| Simple 2ω | 0.1265 | 0.1617 | – | −115,729.5 | 9.63×10−7 | 231,463.0 |
| 3ω | 0.1255 | 0.2189 | 0.1246 | −115,669.4 | 4.95×10−32 | 231,344.8 |
| Reverted 2ω | 0.1250 | 0.2189 | – | −115,669.4 | 3.29×10−33 |
|
1ω-model: (Musa#ω0, Typha#ω0, (crown Poaceae #ω0) #ω0).
Simple 2ω-model: (Musa#ω0, Typha#ω0, (crown Poaceae #ω1) #ω1).
3ω-model: (Musa#ω0, Typha#ω0, (crown Poaceae #ω2) #ω1).
Reverted 2ω-model: (Musa#ω0, Typha#ω0, (crown Poaceae #ω0) #ω1).
“Crown Poaceae” includes all Poaceae branches in our tree except for the common ancestral branch (stem Poaceae). The codon-substitution model with the F61 codon frequency was used. Minimum AIC (Reverted 2ω-model) is shown in bold italic.
LRT of 1ω-model vs reverted 2ω-model for individual genes.
| Gene | Ancestral Poaceae | Other branches |
|
|
|
|
|
| 0.3469 | 0.1013 | −8546.12 | −8574.82 | 57.41 | 3.53E-14 |
|
| 0.7406 | 0.0538 | −1126.87 | −1144.90 | 36.06 | 1.91E-09 |
|
| 0.5943 | 0.1374 | −1767.99 | −1778.73 | 21.47 | 3.95E-06 |
|
| 0.9642 | 0.1309 | −1132.80 | −1143.33 | 21.05 | 4.48E-06 |
|
| 0.5767 | 0.1273 | −1958.45 | −1968.07 | 19.24 | 1.15E-05 |
|
| 1.4337 | 0.3047 | −2214.75 | −2224.29 | 19.09 | 1.25E-05 |
|
| 0.6319 | 0.1139 | −1067.47 | −1075.53 | 16.11 | 5.97E-05 |
|
| 0.2385 | 0.1294 | −5702.89 | −5708.31 | 10.85 | 0.000989 |
|
| 0.2296 | 0.1102 | −4201.36 | −4206.20 | 9.68 | 0.001860 |
| PS13 | 0.0540 | 0.1663 | −3542.69 | −3547.23 | 9.08 | 0.002578 |
|
| 0.3623 | 0.1288 | −1886.14 | −1890.56 | 8.83 | 0.002960 |
|
| 0.3364 | 0.2179 | −10553.23 | −10557.46 | 8.45 | 0.003642 |
|
| 0.0001 | 0.0780 | −603.22 | −607.35 | 8.26 | 0.004047 |
|
| 0.0344 | 0.0937 | −3805.48 | −3809.48 | 8.00 | 0.004669 |
|
| 0.1405 | 0.0592 | −3144.67 | −3148.58 | 7.81 | 0.005202 |
|
| 1.2673 | 0.2151 | −865.87 | −869.76 | 7.78 | 0.005297 |
|
| 0.6567 | 0.1220 | −697.24 | −700.69 | 6.91 | 0.008589 |
|
| 0.6328 | 0.1836 | −727.58 | −730.62 | 6.08 | 0.013646 |
|
| 0.0268 | 0.1060 | −1131.04 | −1134.08 | 6.07 | 0.013739 |
|
| 0.2858 | 0.0995 | −972.76 | −975.00 | 4.47 | 0.034457 |
and refer to the estimates of non-synonymous/synonymous rate ratios based on the reverted 2ω-model. Only 20 genes rejecting the 1ω-model with P<0.05 are listed. In this analysis, 11 monocot species (6 Poacea species, Typha, Musa, Elaeis, Phalaenopsis, and Yucca) were used. 75 genes were analyzed, but genes with related functions were concatenated if the lengths are shorter than 180 nucleotides, and therefore the number of tests was 61. PS13 refers to the concatenated sequences of petG+petL+petN+psaI+psaJ+psbF+psbI+psbJ+psbK+psbL+psbM+psbN+psbΤ.
refers to a gene which remains significant after the Bonferroni correction.
refers to a gene with ω1<ω0.
Branch-site test of positive selection.
| gene | LRT | Positively selected sites |
|
| 0.0484 | 2T→K, 17S→C, 41A→N, |
|
| 0.0021 | 55N→R, 76Y→K, 161W→F, 190I→F, 204I→C |
|
| 0.0081 | 26R→V, 48V→T, 86F→T, 112I→P, 134E→R, 182T→D |
|
| 0.0352 | 90R→F, 338G→K, 1026G→N |
|
| 0.0082 |
|
The numberings of amino acids are those of Zea mays [45]. Positively selected sites were inferred at P b = 95% with those reaching 99% shown in bold italic. The analyses were carried out for the 16 genes selected in Table 5, and only genes with positively selected sites and with P<0.05 (LRT) are listed.