| Literature DB >> 29653534 |
Frédéric Delsuc1, Hervé Philippe2,3, Georgia Tsagkogeorga4,5, Paul Simion4, Marie-Ka Tilak4, Xavier Turon6, Susanna López-Legentil7, Jacques Piette8, Patrick Lemaire8, Emmanuel J P Douzery4.
Abstract
BACKGROUND: Tunicates are the closest relatives of vertebrates and are widely used as models to study the evolutionary developmental biology of chordates. Their phylogeny, however, remains poorly understood, and to date, only the 18S rRNA nuclear gene and mitogenomes have been used to delineate the major groups of tunicates. To resolve their evolutionary relationships and provide a first estimate of their divergence times, we used a transcriptomic approach to build a phylogenomic dataset including all major tunicate lineages, consisting of 258 evolutionarily conserved orthologous genes from representative species.Entities:
Keywords: Evo-devo; Molecular dating; Phylogenomics; Thaliacea; Transcriptomes; Tunicata
Mesh:
Substances:
Year: 2018 PMID: 29653534 PMCID: PMC5899321 DOI: 10.1186/s12915-018-0499-2
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Phylogenetic relationships of 63 chordates highlighting the major tunicate groups inferred from 66,593 amino acid sites of 258 proteins. The Bayesian consensus phylogram has been inferred by PhyloBayes_MPI under the CAT-GTR + Γ4 mixture model. Values at nodes indicate Bayesian posterior probabilities (PPCAT-GTR) obtained under CAT-GTR + Γ4, and jackknife support (JS) percentages, respectively. Circles at nodes pinpoint branches with maximal support from both methods. Species with newly obtained data are indicated in bold. The branch leading to the fast-evolving Oikopleura dioica has been halved for graphical purposes
Fig. 2Evolutionary rate variation across sampled species. The bar plots represent average rate estimates (in number of substitutions per site per million years) obtained for the 63 terminal taxa regrouped by taxonomy. The rates were calculated using a rate-autocorrelated log-normal (LN) relaxed molecular clock model under the CAT-GTR + Γ4 mixture model with a birth-death prior on the diversification process and 13 soft calibration constraints. Data points are plotted as open circles with n = 10, 1, 18, 34 sample points in each taxonomic categories. Centre lines show the medians, crosses represent sample means, and box limits indicate the 25th and 75th percentiles with whiskers extending 1.5 times the interquartile range from the 25th and 75th percentiles. The width of the boxes is proportional to the square root of the sample size. This figure was made with BoxPlotR [81]
Molecular estimates of divergence dates (in Mya)
| Nodes | LN CAT-GTR + Γ4 | UGAM CAT-GTR + Γ4 | ||
|---|---|---|---|---|
| Mean ± SD | 95% Cred. int. | Mean ± SD | 95% Cred. int. | |
| #63 Deuterostomia | 599 ± 11 | [621–579] | 671 ± 108 | [985–576] |
| #64 Xenambulacraria | 588 ± 16 | [616–555] | 600 ± 89 | [849–467] |
| #65 Ambulacraria | 551 ± 16 | [578–516] | 517 ± 72 | [677–403] |
| #66 Hemichordata | 404 ± 34 | [458–326] | 206 ± 101 | [427–63] |
| #67 Echinodermata | 431 ± 21 | [469–388] | 403 ± 47 | [507–323] |
| #68 | 406 ± 21 | [442–363] | 284 ± 83 | [433–121] |
| #69 | 408 ± 20 | [443–368] | 360 ± 42 | [450–287] |
| #70 | 158 ± 22 | [210–112] | 117 ± 51 | [249–42] |
| #71 Echinoideaa | 260 ± 18 | [303–229] | 266 ± 28 | [342–222] |
| #72 | 89 ± 15 | [121–61] | 85 ± 49 | [195–20] |
| #73 Chordataa | 578 ± 6 | [586–563] | 575 ± 7 | [586–558] |
| #74 Olfactoresa | 547 ± 6 | [557–532] | 545 ± 11 | [564–523] |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| #92 Vertebrata | 490 ± 7 | [504–476] | 481 ± 13 | [510–460] |
| #93 Cyclostomata | 434 ± 8 | [449–418] | 277 ± 94 | [430–101] |
| #94 Gnathostomataa | 443 ± 4 | [452–435] | 437 ± 9 | [459–424] |
| #95 Chondrichthyes | 363 ± 11 | [380–338] | 192 ± 96 | [394–62] |
| #96 | 249 ± 22 | [277–192] | 88 ± 59 | [261–23] |
| #97 Osteichthyesa | 418 ± 2 | [422–416] | 419 ± 2 | [422–416] |
| #98 Clupeocephalaa | 159 ± 4 | [165–150] | 157 ± 5 | [165–150] |
| #99 | 391 ± 3 | [397–386] | 393 ± 15 | [415–360] |
| #100 | 377 ± 3 | [383–371] | 374 ± 16 | [405–346] |
| #101 Tetrapodaa | 349 ± 2 | [351–345] | 341 ± 6 | [350–330] |
| #102 Amphibia | 326 ± 3 | [332–320] | 246 ± 30 | [299–200] |
| #103 | 180 ± 26 | [232–132] | 71 ± 51 | [190–14] |
| #104 Batrachiaa | 232 ± 21 | [268–190] | 123 ± 48 | [225–47] |
| #105 | 118 ± 29 | [174–68] | 42 ± 32 | [132–9] |
| #106 | 182 ± 25 | [224–136] | 71 ± 37 | [160–21] |
| #107 Amniotaa | 312 ± 1 | [315–310] | 319 ± 5 | [329–312] |
| #108 Mammaliaa | 186 ± 5 | [192–172] | 176 ± 8 | [191–163] |
| #109 Theriaa | 146 ± 9 | [163–127] | 143 ± 14 | [170–123] |
| #110 | 69 ± 13 | [96–47] | 47 ± 35 | [128–8] |
| #111 | 62 ± 11 | [86–43] | 55 ± 31 | [127–15] |
| #112 | 52 ± 10 | [] | 35 ± 24 | [100–9] |
| #113 Diapsidaa | 271 ± 6 | [282–259] | 278 ± 14 | [300–256] |
| #114 Lepidosauria | 243 ± 10 | [261–224] | 166 ± 69 | [283–57] |
| #115 | 168 ± 14 | [189–138] | 88 ± 47 | [203–28] |
| #116 | 139 ± 14 | [162–107] | 55 ± 35 | [152–16] |
| #117 | 252 ± 9 | [269–233] | 154 ± 69 | [280–55] |
| #118 Testudines | 180 ± 19 | [220–146] | 59 ± 41 | [173–16] |
| #119 | 163 ± 19 | [204–128] | 38 ± 28 | [120–11] |
| #120 | 96 ± 18 | [136–62] | 15 ± 14 | [] |
| #121 Archosauria | 218 ± 16 | [249–186] | 102 ± 51 | [238–37] |
| #122 | 81 ± 29 | [150–38] | 23 ± 22 | [85–4] |
| #123 Aves | 111 ± 27 | [170–67] | 45 ± 28 | [120–14] |
| #124 Crocodylia | 86 ± 24 | [142–46] | 23 ± 18 | [] |
aCalibration constraints
The reported values represent mean divergence dates and associated standard deviations and 95% credibility intervals obtained from a Bayesian relaxed molecular clock under the LN and UGAM models coupled with a CAT-GTR + Γ4 mixture model. Values in bold refer to tunicates
Fig. 3A molecular timescale for tunicates within chordates. The Bayesian chronogram has been obtained using a rate-autocorrelated log-normal (LN) relaxed molecular clock model using PhyloBayes under the CAT-GTR + Γ4 mixture model, with a birth-death prior on the diversification process and 13 soft calibration constraints. Node bars indicate the uncertainty around mean age estimates based on 95% credibility intervals. Plain node bars indicate nodes used as a priori calibration constraints. Numbers at nodes refer to Table 1
Parallel divergences between model tunicates and vertebrates
| Nodes | Tunicates | Nodes | Vertebrates | ||
|---|---|---|---|---|---|
| Mean date ± SD (Mya) | Sequence similarity (aa) | Mean date ± SD (Mya) | Sequence similarity (aa) | ||
| 447 ± 20 | 64.3% | 443 ± 4 | 88.7% | ||
| 389 ± 32 | 79.5% | 391 ± 3 | 90.9% | ||
| 350 ± 36 | 80.3% | 349 ± 2 | 91.3% | ||
| 285 ± 37 | 85.7% | 271 ± 6 | 93.6% | ||
| 277 ± 35 | 88.5% | 271 ± 6 | 93.6% | ||
| 238 ± 44 | 80.5% | 243 ± 10 | 93.5% | ||
| 219 ± 35 | 86.3% | 218 ± 16 | 95.3% | ||
| 176 ± 32 | 88.1% | 180 ± 19 | 98.0% | ||
| 122 ± 33 | 92.5% | 140 ± 14 | 95.2% | ||
Mya: million years ago, aa: amino acids
The reported values indicate mean divergence dates and associated standard deviations obtained from a Bayesian relaxed molecular clock under the CAT-GTR + Γ4 model and the percentage of amino acid sequence identity for each couple