| Literature DB >> 30668767 |
Jürgen F H Strassert1, Mahwash Jamy1, Alexander P Mylnikov2, Denis V Tikhonenkov2, Fabien Burki1,3.
Abstract
The resolution of the broad-scale tree of eukaryotes is constantly improving, but the evolutionary origin of several major groups remains unknown. Resolving the phylogenetic position of these "orphan" groups is important, especially those that originated early in evolution, because they represent missing evolutionary links between established groups. Telonemia is one such orphan taxon for which little is known. The group is composed of molecularly diverse biflagellated protists, often prevalent although not abundant in aquatic environments. Telonemia has been hypothesized to represent a deeply diverging eukaryotic phylum but no consensus exists as to where it is placed in the tree. Here, we established cultures and report the phylogenomic analyses of three new transcriptome data sets for divergent telonemid lineages. All our phylogenetic reconstructions, based on 248 genes and using site-heterogeneous mixture models, robustly resolve the evolutionary origin of Telonemia as sister to the Sar supergroup. This grouping remains well supported when as few as 60% of the genes are randomly subsampled, thus is not sensitive to the sets of genes used but requires a minimal alignment length to recover enough phylogenetic signal. Telonemia occupies a crucial position in the tree to examine the origin of Sar, one of the most lineage-rich eukaryote supergroups. We propose the moniker "TSAR" to accommodate this new mega-assemblage in the phylogeny of eukaryotes.Entities:
Keywords: TSAR; Telonemia; eukaryotes; phylogenomics; protists; tree of life
Mesh:
Year: 2019 PMID: 30668767 PMCID: PMC6844682 DOI: 10.1093/molbev/msz012
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.ML tree showing the phylogenetic position of Telonemia within eukaryotes. The tree corresponds to the best ML tree inferred from a concatenated alignment of 248 protein-coding genes using the LG+C60+G+F-PMSF model. Support at nodes is derived from 100 BP replicates and Bayesian PPs. Black circles represent maximum support in both analyses. A number appended to a black circle indicates full support in only one of the two analyses (i.e., BP/PP). Nodes with dashes (–) indicate an alternative branching in the Phylobayes tree (see supplementary figs. S2 and S3, Supplementary Material online). The ambiguous position of Haptista with this data set is highlighted with an exclamation mark (see text for more information).
. 2.Gene subsampling and fast-evolving sites removal analyses. (A) UFBoot approximation for selected nodes of interest using the full supermatrix (100%) and subsets of randomly sampled genes (20–80%; see Materials and Methods). The variability of support values is shown by Box-and-Whisker plots. (B) UFBoot approximation UFBoot for the same nodes as in (A) using the full supermatrix (0 sites removed) and subsets from which fast-evolving sites were removed in 5,000 increments (see Materials and Methods). The gray box indicates the reduced data set which recovered a different position for Haptista. Arch, Archaeplastida; Crypt, Cryptista; Glau, Glaucophyta; Hapt, Haptista; Telo, Telonemia; Viri, Viridiplantae.
. 3.Simplified Phylobayes consensus tree inferred from a reduced concatenated alignment (25,000 fast-evolving sites removed) of the 248 protein-coding genes under the CAT+F81+Γ4 model (for a full version showing all taxa, see supplementary fig. S4, Supplementary Material online). Node support is given by Bayesian PPs and UFBoot approximation inferred from a ML analysis (LG+C60+G+F model) using the same subalignment (i.e., 25,000 sites removed; see text and fig. 2). Nodes with black circles are fully supported in both analyses. A number appended to a black circle indicates full support in only one of the two analyses (i.e., PP/UFBoot). Note the complete congruence between the Bayesian and ML analyses (all nodes are recovered by both methods), including the position of Haptista.