| Literature DB >> 27168719 |
Rim Chriki-Adeeb1, Ali Chriki1.
Abstract
Accurate estimation of divergence times of soil bacteria that form nitrogen-fixing associations with most leguminous plants is challenging because of a limited fossil record and complexities associated with molecular clocks and phylogenetic diversity of root nodule bacteria, collectively called rhizobia. To overcome the lack of fossil record in bacteria, divergence times of host legumes were used to calibrate molecular clocks and perform phylogenetic analyses in rhizobia. The 16S rRNA gene and intergenic spacer region remain among the favored molecular markers to reconstruct the timescale of rhizobia. We evaluate the performance of the random local clock model and the classical uncorrelated lognormal relaxed clock model, in combination with four tree models (coalescent constant size, birth-death, birth-death incomplete sampling, and Yule processes) on rhizobial divergence time estimates. Bayes factor tests based on the marginal likelihoods estimated from the stepping-stone sampling analyses strongly favored the random local clock model in combination with Yule process. Our results on the divergence time estimation from 16S rRNA gene and intergenic spacer region sequences are compatible with age estimates based on the conserved core genes but significantly older than those obtained from symbiotic genes, such as nodIJ genes. This difference may be due to the accelerated evolutionary rates of symbiotic genes compared to those of other genomic regions not directly implicated in nodulation processes.Entities:
Keywords: 16S rRNA gene; Bayes factors; ITS; nodIJ genes; random local clock; rhizobia; uncorrelated lognormal clock
Year: 2016 PMID: 27168719 PMCID: PMC4856229 DOI: 10.4137/EBO.S39070
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
List of bacterial strains isolated from root nodules of three species of the genus Sulla (Fabaceae).
| STRAIN | GENBANK ID (16S/ITS) | HOST PLANT | CLOSEST SPECIES | GENBANK ID |
|---|---|---|---|---|
| Hc01 | JN944178/JN944179 | KP762553 | ||
| Hc04 | JN944189/JN944182 | AM237359 | ||
| Hsc01 | JN944190/JN944183 | Y10170 | ||
| Hsc02 | JN944191/JN944184 | NR_113608 | ||
| Hsc03 | JN966893/JN966897 | NR_113608 | ||
| Hcar01 | JN944192/JN944185 | KF876889 | ||
| Hcar02 | JN944193/JQ081302 | HM038120 |
Note:
For 16S.
List of reference strains used in this study.
| GENUS | SPECIES | STRAIN | ACCESSION NUMBER (16S/ITS) |
|---|---|---|---|
| LMG 2299 | EF016361/KC756967 | ||
| 8201 | FJ870554/FJ870554 | ||
| ATCC 14482 | EF141340/EF141341 | ||
| CCBAU 33202 | DQ835306/FJ392873 | ||
| A4 | AB247607/AB247607 | ||
| OS-49.b | AM237359/AF345270 | ||
| CCBAU 05104 | HM070174/EU418348 | ||
| LMG 6214 | X67226/AF345265 | ||
| H 152 | U86344/AF345268 | ||
| CCBAU 85040 | EU256415/EU288740 | ||
| I66 | U71078/AF321872 | ||
| CCBAU 25295 | EU399715/EU418365 | ||
| CCBAU 25010 | NR_043548/EU120729 | ||
| CCBAU 25217 | EF070130/EU120730 | ||
| CCBAU 85013 | EU256408/EU288729 | ||
| CCBAU 05122 | EU399697/EU418349 | ||
| USDA 194 | AB433352/EU152398 | ||
| CCBAU 05183 | EU399710/EU418357 | ||
| CCBAU 83306 | EF035058/EF050771 | ||
| LMG 6125 | X67229/AF345260 | ||
| USDA 110 | AF363150/AB100749 | ||
| C8-1780 | AB513452/AB513476 |
Note:
Strain LMG 18254 for ITS.
Figure 1ML-inferred tree for rhizobia based on the concatenated dataset of 16S rRNA gene and ITS region sequences. Bootstrap probabilities based on 500 replicates are indicated at each node. The scale bar represents the number of nucleotide substitutions per site.
Ages in million years ago for rhizobial group divergences estimated with BEAST under different combinations of clock/tree models (Brady: Bradyrhizobium; Meso: Mesorhizobium; Sino: Sinorhizobium; Agro: Agrobacterium; Neorhizo: Neorhizobium).
| CLOCK MODEL | TREE MODEL | ROOT | BRADY | MESO | SINO | AGRO | NEORHIZO |
|---|---|---|---|---|---|---|---|
| RLC | Birth-Death | 639.72 | 424.73 | 372.00 | 195.17 | 142.42 | 53.28 |
| Yule | 603.46 | 384.62 | 343.96 | 200.87 | 149.07 | 54.50 | |
| Birth-Death | 892.97 | 497.03 | 429.90 | 198.83 | 144.95 | 52.64 | |
| Constant Size | 1166.42 | 618.06 | 505.17 | 221.86 | 160.87 | 56.54 | |
| Average | 825.64 | 481.11 | 412.76 | 204.18 | 149.33 | 54.24 | |
| UCLN | Birth-Death | 522.11 | 264.81 | 214.58 | 140.98 | 104.19 | 76.94 |
| Birth-Death | 529.85 | 269.87 | 217.60 | 141.95 | 104.68 | 76.47 | |
| Constant Size | 774.76 | 332.17 | 258.43 | 160.58 | 117.29 | 86.02 | |
| Yule | 462.85 | 255.70 | 211.20 | 143.06 | 105.46 | 78.21 | |
| Average | 572.39 | 280.64 | 225.45 | 146.64 | 107.91 | 79.41 |
Note:
Birth–death (incomplete sampling).
Abbreviations: RLC, random local clock; UCLN, uncorrelated lognormal.
Mean substitution rates estimated by BEAST under RLC (random local clock) and UCLN (uncorrelated lognormal) clock combined with four tree models.
| CLOCK MODEL | TREE MODEL | MEAN RATE | CONFIDENCE INTERVAL |
|---|---|---|---|
| RLC | Birth-Death | 0.0018 | (0.0008, 0.0028) |
| Yule | 0.0018 | (0.0009, 0.0028) | |
| Birth-Death | 0.0014 | (0.0007, 0.0021) | |
| Constant Size | 0.0012 | (0.0005, 0.0019) | |
| Average | 0.0015 | ||
| UCLN | Birth-Death | 0.0023 | (0.0009, 0.0041) |
| Birth-Death | 0.0023 | (0.0010, 0.0038) | |
| Constant Size | 0.0018 | (0.0008, 0.0029) | |
| Yule | 0.0025 | (0.0009, 0.0044) | |
| Average | 0.0022 |
Notes:
(Mean) substitution rate unit: substitutions/site/million years (My).
Birth–death (incomplete sampling).
(Log)marginal likelihood (ml) estimations performed with BEAST using the path sampling (PS) and stepping-stone sampling (SS) methods under different combinations of clock/tree models.
| CLOCK MODELa | TREE MODEL | LOG ML (PS) | LN ML (SS) |
|---|---|---|---|
| UCLN | Constant size | −22988.18 | −22988.35 |
| Birth-death | −22989.07 | −22989.28 | |
| Birth-death | −22349.77 | −22349.70 | |
| Yule | −22992.10 | −22992.18 | |
| RLC | Constant size | −21089.77 | −21089.87 |
| Birth-death | −20265.45 | −20265.47 | |
| Birth-death | −19934.40 | −19934.44 | |
| Yule | −19574.18 | −19574.14 |
Note:
Birth–death (incomplete sampling).
Abbreviations: UCLN, uncorrelated lognormal; RLC, random local clock.
Model comparisons using Bayes factors calculated from marginal likelihoods (mls) in BEAST.
| MODELa | UCLN (CS) | UCL (BD) | UCLN (BDis) | UCLN (Y) | RLC (CS) | RLC (BD) | RLC (BDis) | RLC (Y) |
|---|---|---|---|---|---|---|---|---|
| UCLN (CS) | – | 0.89 | −638.41 | 3.92 | −1898.41 | −2722.73 | −3053.78 | −3414.0 |
| UCLN (BD) | −0.89 | – | −639.30 | 3.03 | −1899.30 | −2723.62 | −3054.67 | −3414.89 |
| UCLN (BDis) | 638.41 | 639.30 | – | 642.33 | −1260.0 | −2084.32 | −2415.37 | −2775.59 |
| UCLN (Y) | −3.92 | −3.03 | −642.33 | – | −1902.33 | −2726.65 | −3057.70 | −3417.92 |
| RLC (CS) | 1898.41 | 1899.30 | 1260.0 | 1902.33 | – | −824.32 | −1155.37 | −1515.59 |
| RLC (BD) | 2722.73 | 2723.62 | 2084.32 | 2726.65 | 824.32 | – | −331.05 | −691.27 |
| RLC (BDis) | 3053.78 | 3054.67 | 2415.37 | 3057.70 | 1155.37 | 331.05 | – | −360.22 |
| RLC (Y) | 3414.0 | 3414.89 | 2775.59 | 3417.92 | 1515.59 | 691.27 | 360.22 | – |
Notes: (Log)Bayes factors (BF) were calculated from (log)mls estimated from stepping-stone sampling (SS) only because those estimated from path sampling (PS) were similar (Table 5). Asterisks after the (log10)BF indicate their interpretation according to Raftery50:
BF value between 0 and 1 is not worth more than a bare mention;
BF value between 3 and 5 is considered to give strong evidence favoring the model with the higher (log)ml; and
BF values higher than 5 are considered to give very strong evidence in favor of the model with the higher log(ml) (see Table 5 for ml values).
Abbreviations: RLC, random local clock; UCLN, uncorrelated lognormal; CS, constant size; BD, birth–death; BDis, birth–death (incomplete sampling); Y, Yule.
Figure 2BEAST divergence time estimates from combined 16S rRNA gene and ITS region sequences under RLC model and Yule tree process. Divergence time of R. sullae (29.3 ± 3.0 Mya) was used to calibrate the clock. The scale axis represents age estimates in Mya.
Dates (Mya) and 95% confidence intervals (CIs) for rhizobial groups estimated using Bayesian (BEAST) and ML (RelTime) methods.
| NODE | BEAST | 95% CI | RELTIME | 95% CI |
|---|---|---|---|---|
| Root | 603.46 | (280.92, 998.22) | 874.78 | (153.59, 1596.0) |
| 384.62 | (174.00, 661.31) | 347.71 | (61.02, 634.40) | |
| 343.96 | (159.78, 598.13) | 285.19 | (49.98, 520.40) | |
| 200.87 | (109.04, 319.75) | 140.10 | (24.41, 255.78) | |
| 149.07 | (78.11, 236.36) | 100.23 | (17.12, 183.34) | |
| 54.50 | (35.25, 79.53) | 71.38 | (12.26, 130.50) |
Note:
The RelTime and BEAST estimates from the combined 16S rRNA gene and ITS region sequence analyses were converted into absolute times by using one calibration: R. sullae (29.3 ± 3.0 Mya).
Divergence times estimated in rhizobia under random local clock (RLC) model in combination with Yule process from different molecular markers and calibration points.
| NODE | MOLECULAR MARKER | ||
|---|---|---|---|
| 16S-ITS | 16S | ||
| 384.62 | 441.92 | 75.32 | |
| 343.96 | 283.69 | 63.23 | |
| 200.87 | 198.36 | 42.67 | |
| Mean rate | 0.0018 | 0.0002 | 0.0032 |
Notes: Calibration points: split of R. sullae at 29.3 ± 3.0 Mya for concatenated 16S–ITS marker and divergence of R. leguminosarum at 26.5 ± 2.6 Mya for 16S rRNA and nodIJ genes.
Concatenated 16S rRNA gene and ITS region sequences (this study).
16S rRNA and concatenated nodI and nodJ gene sequences from taxa used by Aoki et al.29
Substitutions per site per million years.