| Literature DB >> 22319147 |
Sen Zhao1, Fabien Burki, Jon Bråte, Patrick J Keeling, Dag Klaveness, Kamran Shalchian-Tabrizi.
Abstract
The current consensus for the eukaryote tree of life consists of several large assemblages (supergroups) that are hypothesized to describe the existing diversity. Phylogenomic analyses have shed light on the evolutionary relationships within and between supergroups as well as placed newly sequenced enigmatic species close to known lineages. Yet, a few eukaryote species remain of unknown origin and could represent key evolutionary forms for inferring ancient genomic and cellular characteristics of eukaryotes. Here, we investigate the evolutionary origin of the poorly studied protist Collodictyon (subphylum Diphyllatia) by sequencing a cDNA library as well as the 18S and 28S ribosomal DNA (rDNA) genes. Phylogenomic trees inferred from 124 genes placed Collodictyon close to the bifurcation of the "unikont" and "bikont" groups, either alone or as sister to the potentially contentious excavate Malawimonas. Phylogenies based on rDNA genes confirmed that Collodictyon is closely related to another genus, Diphylleia, and revealed a very low diversity in environmental DNA samples. The early and distinct origin of Collodictyon suggests that it constitutes a new lineage in the global eukaryote phylogeny. Collodictyon shares cellular characteristics with Excavata and Amoebozoa, such as ventral feeding groove supported by microtubular structures and the ability to form thin and broad pseudopods. These may therefore be ancient morphological features among eukaryotes. Overall, this shows that Collodictyon is a key lineage to understand early eukaryote evolution.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22319147 PMCID: PMC3351787 DOI: 10.1093/molbev/mss001
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
F18S rDNA phylogeny of the Diphyllatia species Collodictyon triciliatum (highlighted by black box) and Diphylleia rotans. The topology was reconstructed by MrBayes v3.1.2 under the GTR + GAMMA + I + covarion model. Posterior probabilities (PP) and ML bootstrap supports (BP, inferred by RAxML v7.1.2 under GTR + GAMMA + I model) are shown at the nodes. Thick lines indicate PP > 0.90 and BP > 80%. Dashes “-” indicate PP < 0.5 or BP < 50%. A few long branches are shortened by 50% (/) or 75% (//).
F18S + 28S rDNA phylogeny of Collodictyon triciliatum (highlighted by black box) reconstructed with MrBayes v3.1.2 under the GTR + GAMMA+I + covarion model. Numbers at nodes are PP and ML bootstrap values (BP, inferred by RAxML v7.2.6 under the GTR + GAMMA + I model). Thick lines show PP > 0.9 and BP > 80%. Nodes marked with symbol “-” indicate BP < 50% or PP < 0.5. Some branches are shortened by half in order to save space (marked with “/”).
Maximum likelihood bootstrap values (ML) and bayesian posterior probabilities (Bayes) of the Eukaryote Supergroups in the Phylogenomic Trees.
| 79 Taxa | 74 Taxa | 77 Taxa | 72 Taxa | ||||||||
| All Sites | 20% Removed | All Sites | All Sites | 20% Removed | All Sites | ||||||
| Node | Groups | ML | Bayes | ML | Bayes | ML | ML | Bayes | ML | Bayes | ML |
| A | Opisthokonta | 100 | 1.00 | 100 | 1.00 | 100 | 100 | 1.00 | 100 | 1.00 | 100 |
| B | Unikonts | 79 | 0.99 | 99 | 1.00 | 87 | 57 | 0.99 | 96 | 1.00 | 58 |
| C | Amoebozoa | 86 | 1.00 | 100 | 1.00 | 100 | 84 | 1.00 | 100 | 1.00 | 100 |
| D | 86 | 0.79 | 98 | 0.63 | 94 | NA | NA | NA | NA | NA | |
| E | Excavata | 100 | 1.00 | 100 | 1.00 | 100 | 100 | 1.00 | 100 | 1.00 | 100 |
| F | Bikonts | 98 | 1.00 | 98 | 1.00 | 95 | 98 | 1.00 | 100 | 1.00 | 93 |
| G | Archaeplastida | - | 0.98 | 63 | 0.84 | - | - | 0.99 | 71 | 0.95 | - |
| H | Archaeplastida + CCTH + SAR | - | 1.00 | 81 | 1.00 | - | - | 1.00 | 88 | 1.00 | - |
| I | CCTH | - | * | 54 | * | - | 50 | * | 60 | * | - |
| J | SAR | 98 | 1.00 | 100 | 1.00 | 96 | 99 | 1.00 | 100 | 1.00 | 96 |
NOTE.—“-” indicate bootstrap values < 50% or PP < 0.5; “*” indicate that CCTH (Cryptophyta, Centrohelida, Telonemia, and Haptophyta) is not monophyletic.
Five taxa (Leishmania, Trypanosoma, Sawyeria, Entamoeba, and Breviata) were removed.
Two Malawimonas taxa were removed.
Two Malawimonas taxa and five taxa (Leishmania, Trypanosoma, Sawyeria, Entamoeba, and Breviata) were removed.
Removal of the 20% fastest evolving sites from the alignment.
The capital letters correspond to supergroups marked in figure 3.
Breviata is sister to Opisthokonta (fig 3).
FPhylogenomic position of Collodictyon inferred from 124 genes under the CAT mixture model in PhyloBayes v3.2. Branches that received 1.00 PP are marked by filled circles. The branch length of Entamoeba is shortened by 50% to save space. (A) Tree topology constructed with 79 taxa from the saved 18,000 trees after discarding the first 6,000 cycles as burn-in (maxdiff = 0.137). Missing data for each taxon is shown as a color barplot (left bar: missing number of genes; right bar: missing percentage of characters). Bars marked by “*” indicate the missing percentage of characters is over 60% of the full-length alignment. (B) Tree topology constructed with 77 taxa (i.e., two Malawimonas excluded) from the saved 16,000 trees after discarding first 8,000 cycles as burn-in (maxdiff = 0.083). CCTH is the abbreviation of Cryptophyta, Centrohelida, Telonemia, and Haptophyta. Additional statistical support values for the main nodes in the tree marked by capital letters in boxes are listed in table 1.
FChanges in bootstrap support for key nodes in the inferred trees as fast-evolving sites were removed. Site rates were estimated from an alignment without two Malawimonas and Collodictyon species (76 taxa). Sites were then removed in 5% increments from alignments consisting of (A) 79 taxa (including Collodictyon and Malawimonas) and (B) 77 taxa (including Collodictyon). ML Bootstrap values (BP) for Collodictyon + Malawimonas, unikonts, bikonts, and Opisthokonta (used as a reference) were calculated under the PROTCATLGF model in RAxML v7.2.6. BP values shaded by gray rectangles are listed in table 1 and supplementary figure S6 (Supplementary Material online).
FBayesian phylogeny of Collodictyon constructed from 124 genes after removal of the fastest evolving sites. The consensus topology was calculated under the CAT model from 18,000 saved trees after discarding the first 6,000 cycles as burn-in. Branches showing 1.00 PP are marked by filled circles. The branch length of Entamoeba is shortened by 50% to save space. (A) Tree topology inferred from the trimmed alignment with the 20% fastest evolving sites removed (marked by gray rectangles in fig. 4). Chains were considered to have converged (maxdiff = 0.104). (B) Tree topology inferred from the trimmed alignment (i.e., two Malawimonas excluded) with the 20% fastest evolving sites removed (marked by gray rectangles in fig. 4). Chains were considered to have converged (maxdiff = 0.065). Numbers at the nodes in (B) indicate PP/bootstrap values calculated from from 100 pseudoreplicates with Phylobayes under CAT mixture model. Dashes “-” indicate bootstrap supports < 50%. CCTH is the abbreviation of Cryptophyta, Centrohelida, Telonemia, and Haptophyta. Additional statistical support values for the supergroups are shown in table 1.
AU Test of Tree Topologies.
| Rank | Tree Topology Based on a Sample of 79 Taxa | All | 20% |
| 1 | (((Opst,Amoe),(Mala, | ||
| 2 | (((Opst,Amoe), | 0.045 | |
| 3 | (((Amoe,(Mala, | 0.006 | |
| 4 | (((Opst,(Mala, | 0.009 | 0.005 |
| 5 | (((Opst,Mala),( | 0.007 | 0.010 |
| 6 | (((Opst, | 0.006 | 0.006 |
| 7 | ((((Opst,Amoe), | 0.005 | 0.014 |
| 8 | (((Opst,( | 0.003 | 2 × 10−04 |
| 9 | ((((Opst, | 0.002 | 0.001 |
| 10 | (((Opst,Amoe),Mala),((Exca, | 3 × 10−04 | 7 × 10−05 |
| 11 | (((Opst,Amoe),Mala),(Exca,((Plan, | 2 × 10−04 | 8 × 10−05 |
| 12 | (((Opst,Amoe),Mala),(Exca,(Plan,(SAR,((Cryp,Hapt),( | 1 × 10−04 | 0.001 |
| 13 | (((Opst,Amoe),Mala),(Exca,(Plan,(SAR,(((Cryp, | 8 × 10−05 | 8 × 10−06 |
| 14 | (((Opst,Amoe),Mala),(Exca,(Plan,(SAR,((Cryp,( | 2 × 10−05 | 6 × 10−07 |
| 15 | ((((Opst,Amoe), | 1 × 10−05 | 2 × 10−07 |
| 16 | ((Opst,Amoe),((Exca,(Mala, | 6 × 10−15 | 5 × 10−14 |
| 17 | ((Opst,Amoe),((Exca,Mala), | 4 × 10−12 | 3 × 10−11 |
| Rank | Tree Topology Based on a Sample of 77 Taxa | All | 20% |
| 1 | (((Opst,Amoe), | ||
| 2 | (((Opst,Amoe), | ||
| 3 | (((Opst, | 0.046 | |
| 4 | (((Opst, | 0.045 | |
| 5 | ((Opst,(Amoe, | 0.040 | |
| 6 | ((Opst,(Amoe, | 0.037 | |
| 7 | ((Opst,Amoe),((Exca, | 0.001 | 0.001 |
| 8 | ((Opst,Amoe),((Exca, | 0.003 | 2 × 10−04 |
| 9 | ((Opst,Amoe),(Exca,((Plan, | 2 × 10−06 | 4 × 10−07 |
| 10 | ((Opst,Amoe),(Exca,((SAR, | 7 × 10−06 | 3 × 10−08 |
| 11 | ((Opst,Amoe),(Exca,(Plan,(SAR,(((Cryp, | 6 × 10−07 | 1 × 10−04 |
| 12 | ((Opst,Amoe),(Exca,(SAR,(Plan,(((Cryp, | 2 × 10−05 | 6 × 10−05 |
| 13 | ((Opst,Amoe),(Exca,(Plan,(SAR,((Cryp,( | 6 × 10−09 | 7 × 10−39 |
| 14 | ((Opst,Amoe),(Exca,(SAR,(Plan,((Cryp,( | 8 × 10−05 | 2 × 10−47 |
| 15 | ((Opst,Amoe),(Exca,(Plan,(SAR,((Cryp,Hapt),( | 1 × 10−74 | 5 × 10−51 |
| 16 | ((Opst,Amoe),(Exca,(SAR,(Plan,((Cryp,Hapt),( | 1 × 10−69 | 8 × 10−54 |
| 17 | ((Opst,Amoe),(Exca,(Plan,((SAR, | 2 × 10−63 | 7 × 10−40 |
| 18 | ((Opst,Amoe),(Exca,(SAR,((Plan, | 2 × 10−51 | 2 × 10−43 |
The abbreviation of major groups: Opst, Opisthokonta; Amoe, Amoebozoa; Exca, Excavata; Plan, Archaeplastida; SAR, Stramenopila + Alveolata + Rhizaria; Cryp, Guillardia + Plagioselmis; Hapt, Haptophyta; TelRap, Telonemia + Raphidiophrys; Mala, Malawimonas; and Coll, Collodictyon.
P values in which the topologies cannot be rejected at the 5% level of significance were underlined.
P values were calculated from the original alignment (i.e., no sites removed).
P values were calculated from the trimmed alignment with removal of the 20% fastest evolving sites (marked by gray rectangles in fig. 4).
P values were calculated from the trimmed alignment (i.e., two Malawimonas excluded) with removal of the 20% fastest evolving sites (marked by gray rectangles in fig. 4).