| Literature DB >> 25945739 |
Anja Spang1, Jimmy H Saw1, Steffen L Jørgensen2, Katarzyna Zaremba-Niedzwiedzka1, Joran Martijn1, Anders E Lind1, Roel van Eijk1, Christa Schleper2,3, Lionel Guy1,4, Thijs J G Ettema1.
Abstract
The origin of the eukaryotic cell remains one of the most contentious puzzles in modern biology. Recent studies have provided support for the emergence of the eukaryotic host cell from within the archaeal domain of life, but the identity and nature of the putative archaeal ancestor remain a subject of debate. Here we describe the discovery of 'Lokiarchaeota', a novel candidate archaeal phylum, which forms a monophyletic group with eukaryotes in phylogenomic analyses, and whose genomes encode an expanded repertoire of eukaryotic signature proteins that are suggestive of sophisticated membrane remodelling capabilities. Our results provide strong support for hypotheses in which the eukaryotic host evolved from a bona fide archaeon, and demonstrate that many components that underpin eukaryote-specific features were already present in that ancestor. This provided the host with a rich genomic 'starter-kit' to support the increase in the cellular and genomic complexity that is characteristic of eukaryotes.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25945739 PMCID: PMC4444528 DOI: 10.1038/nature14447
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Figure 1Identification of a novel archaeal lineage
a, Bathymetric map of the sampling site (GC14; red circle) at the Arctic Mid-Ocean Spreading Ridge, located 15 km from Loki’s Castle active vent site. b, 16S rRNA amplicon-based assessment of microbial diversity in GC14. Bars on the left represent the fraction of the respective prokaryotic taxa and bars on the right depict archaeal diversity. Numbers refer to operational taxonomic units for each group. MHVG, Marine Hydrothermal Vent Group; DHVEG-6, Deep-sea Hydrothermal Vent Euryarchaeota Group 6; MBG-A and -B, Marine Benthic Group A and B. c, Maximum-likelihood phylogeny of the archaeal 16S rRNA reads (see b), revealing that DSAG sequences cluster deeply in the TACK superphylum. Numbers between brackets indicate relative abundance (%) of each group relative to total and archaeal reads, respectively. MCG, Miscellaneous Crenarchaeota Group; MHVG, Marine Hydrothermal Vent Group. d, Maximum-likelihood phylogeny of 16S rRNA gene sequences indicating that the DSAG operational taxonomic unit (red font) belongs to the DSAG γ cluster. Bootstrap support values above 50 are shown. c, d, Scale indicates the number of substitutions per site.
Figure 2Metagenomic reconstruction and phylogenetic analysis of Lokiarchaeum
a, Schematic overview of the metagenomics approach. BI, Bayesian inference; ML, maximum likelihood. b, Bayesian phylogeny of concatenated alignments comprising 36 conserved phylogenetic marker proteins using sophisticated models of protein evolution (Methods), showing eukaryotes branching within Lokiarchaeota. Numbers above and below branches refer to Bayesian posterior probability and maximum-likelihood bootstrap support values, respectively. Posterior probability values above 0.7 and bootstrap support values above 70 are shown. Scale indicates the number of substitutions per site. c, Phylogenetic breakdown of the Lokiarchaeum proteome, in comparison with proteomes of Korarchaeota, Aigarchaeota (Caldiarchaeum) and Miscellaneous Crenarchaeota Group (MCG) archaea. Category ‘Other’ contains proteins assigned to the root of cellular organisms, to viruses and to unclassified proteins.
Overview of Lokiarchaeal ESPs
| Suggested function | Product | Locus tag | IPR-domains | Comment |
|---|---|---|---|---|
| Putative ESCRT-III proteins | Vps2/24/46-like protein |
| IPR005024 Snf7 | More distant homologs also present in several other members of the TACK superphylum. |
| Vps20/32/60-like protein |
| IPR005024 Snf7 | ||
| Putative ESCRT-II proteins | EAP30 domain protein (Vps22/36-like) |
| IPR007286 EAP30 | Previously not found in Archaea. |
| Vps25-like protein |
| IPR014041 ESCRT-II complex, Vps25 subunit, N-terminal Winged helix; IPR008570 ESCRT-II complex, Vps25 subunit; IPR011991 Winged helix-turnhelix DNA-binding domain | ||
| Putative ESCRT-I protein | Hypothetical protein with Vps28-like domain | Lokiarch_10170 | IPR007143 Vacuolar protein sortingassociated, Vps28 | Vps28 is part of ESCRT-I, potential interacting protein Lokiarch_16740 (see |
| Putative ESCRT-associated protein | Vps4 ATPase |
| IPR003959 ATPase, AAA-type, core; IPR027417 P-loop containing nucleoside triphosphate hydrolase; IPR003593 AAA+ ATPase domain; IPR007330 MIT-domain | Also present in other members of the Archaea. |
| Putative vesicular trafficking machinery associated proteins | Hypothetical proteins vacuolar fusion domain MON1 | Lokiarch_21780 | IPR004353 Vacuolar fusion protein MON1 | Previously not found in other prokaryotic organisms (see |
| Hypothetical proteins with longin-like domains | Lokiarch_01890 | IPR011012 Longin-like domain; IPR010908 Longin domain | ||
| BAR/IMD domain-like superfamily protein |
| IPR004148 BAR domain; IPR009602 FAM92 protein | Includes various protein families that bind membranes and detect membrane curvature. | |
| Cell division/cytoskeleton related proteins | Actin and related proteins |
| IPR004000, Actin-related protein; IPR020902 Actin/actin-like conserved site | Some Cren- Kor- and Aigarchaeota encode crenactins [ |
| Hypothetical proteins with gelsolin-like domains |
| IPR007122 Villin/Gelsolin; IPR029006 ADF-H/Gelsolin-like domain; IPR007123 Gelsolin-like domain | Previously not found in Archaea. Serve as candidates for potential actin-binding proteins. | |
| Small GTP-binding domain proteins with Ran-/Ras- /Rab-/Rho- and Arf-domain signatures |
| IPR001806 Small GTPase superfamily; IPR003579 Small GTPase superfamily, Rab type; IPR027417 P-loop containing nucleoside triphosphate hydrolase; IPR020849 Small GTPase superfamily, Ras type; IPR002041 Ran GTPase; IPR003578 Small GTPase superfamily, Rho type; IPR005225 Small GTP-binding protein domain; IPR024156 Small GTPase superfamily, ARF type | Extreme proliferation of small GTP-binding proteins in Lokiarchaeum (92 proteins in composite genome, see | |
| Ubiquitin modifier system related proteins | Ubiquitin-like proteins |
| IPR029071 Ubiquitin-related domain; IPR000626 Ubiquitin-like | Ubiquitin modifier system was previously identified in Aigarchaeota[ |
| Putative E1-like ubiquitin activating protein | Lokiarch_15900 | IPR023280 Ubiquitin-like 1 activating enzyme, catalytic cysteine domain; IPR019572 Ubiquitin-activating enzyme (see SOM for more details) | ||
| Putative E2-like ubiquitin conjugating protein |
| IPR016135 Ubiquitin-conjugating enzyme/RWD-like; IPR000608 Ubiquitin-conjugating enzyme, E2 | ||
| Hypothetical proteins with JAB1/MPN/MOV34 metalloenzyme domain |
| IPR000555 JAB1/MPN/MOV34 metalloenzyme domain | ||
| Eukaryotic ribosomal protein | Putative homolog of eukaryotic ribosomal protein L22e |
| - | Previously not found in Archaea. Best blast hit: gb|EPR78232.1| 60S ribosomal protein L22 [ |
| Oligosaccharyl transferase complex proteins | Ribophorin 1 superfamily protein |
| IPR007676 Ribophorin I | Previously not found in Archaea |
| Putative oligosaccharyl transferase complex, subunit OST3/OST6 |
| IPR021149 Oligosaccharyl transferase complex, subunit OST3/OST6 | Previously not found in Archaea. | |
| Putative oligosaccharyl transferase STT3 subunit |
| IPR003674 Oligosaccharyl transferase, STT3 subunit | Homologs also present in some other Archaea. |
Locus tags that are highlighted in bold indicate a significant top blast hit of the respective protein of Lokiarchaeum to a eukaryotic sequence (see Supplementary Table 6 for further details).
Phylogenetic analyses have been performed.
Alignments shown in Supplementary figures.
Protein domain assignments for these proteins listed in Supplementary Table 10.
While most small GTPases encoded by Lokiarchaeum have highest similarity to eukaryotic homologues, approximately 10% are most similar to Archaea and/or Bacteria (see Supplementary Table 6 for more details).
Figure 3Identification and phylogeny of small GTPases and actin orthologues
a, Maximum-likelihood phylogeny of 378 aligned amino acid residues of actin homologues identified in Lokiarchaeum and in the LCGC14AMP metagenome, including eukaryotic actins, ARP1–3 homologues and crenactins[25]. Consecutive numbers in brackets refer to the number of sequences in a respective clade from LCGC14AMP and Lokiarchaeum, respectively. b, Relative amount of small GTPases (assigned to IPR006689 and IPR001806) in the Lokiarchaeum genome in comparison with other eukaryotic, archaeal and bacterial species. Numbers refer to total amount of small GTPases per predicted proteome. c, Maximum-likelihood phylogeny of 150 aligned amino acid residues of small Ras- and Arf-type GTPases (IPR006689 and IPR001806) in all domains of life. Numbers in brackets refer to the number of sequences in the respective clades. a, c, Sequence clusters comprising Lokiarchaeum and/or LCGC14AMP sequences (red), eukaryotes (blue) and Bacteria/Archaea (grey) have been collapsed. Bootstrap values above 50 are shown. Scale indicates the number of substitutions per site.
Figure 4Identification of ESCRT components in the Lokiarchaeum genome
a, Schematic overview of ESCRT gene clusters identified in Lokiarchaeum and Loki2/3. Intensity of shading between homologous sequences is correlated with BLAST bit score. b, Maximum-likelihood phylogeny of 207 aligned amino acid residues of ESCRT-III homologues identified in Lokiarchaeum, LCGC14AMP and other archaeal lineages. Eukaryotic homologues include the two distantly related families Vps2/24/46 and Vps20/32/60. Bootstrap support values above 50 are shown. c, Maximum-likelihood phylogeny of 388 aligned amino acid residues of AAA-type Vps4 ATPases including representatives for each of the four major eukaryotic sub-groups (membrane scaffold protein (MSP), katanin, spastin/fidgetin and Vps4) as well as homologues identified in the Lokiarchaeum genome, in LCGC14AMP and in sequenced archaeal genomes. Bootstrap support values below 45 are not shown. b, c, Scale indicates the number of substitutions per site. Numbers in brackets refer to the number of sequences in the respective clades.
Figure 5The complex archaeal ancestry of eukaryotes
Schematic overview of the distribution of ESPs in major archaeal lineages across the tree of life. Each ESP is depicted as a coloured circle and losses are indicated with a cross. Patchy distribution and absence of a particular ESP in archaeal phyla is indicated by half-shaded and white circles, respectively. aWhile eukaryotes and Lokiarchaeota contain bona fide actins, other archaea encode the more distantly related Crenactins. bOnly few members of the Thaumarchaeota contain distantly related homologs of tubulins (ar-tubulins). cThaum-, Aig- and some Crenarchaeota contain distant homologues of ESCRT-III (SNF7 domain proteins).