| Literature DB >> 34140936 |
Cindy J Castelle1, Raphaël Méheust1,2,3, Alexander L Jaffe4, Kiley Seitz5, Xianzhe Gong5,6, Brett J Baker5, Jillian F Banfield1,2,7.
Abstract
DPANN are small-celled archaea that are generally predicted to be symbionts, and in some cases are known episymbionts of other archaea. As the monophyly of the DPANN remains uncertain, we hypothesized that proteome content could reveal relationships among DPANN lineages, constrain genetic overlap with bacteria, and illustrate how organisms with hybrid bacterial and archaeal protein sets might function. We tested this hypothesis using protein family content that was defined in part using 3,197 genomes including 569 newly reconstructed genomes. Protein family content clearly separates the final set of 390 DPANN genomes from other archaea, paralleling the separation of Candidate Phyla Radiation (CPR) bacteria from all other bacteria. This separation is partly driven by hypothetical proteins, some of which may be symbiosis-related. Pacearchaeota with the most limited predicted metabolic capacities have Form II/III and III-like Rubisco, suggesting metabolisms based on scavenged nucleotides. Intriguingly, the Pacearchaeota and Woesearchaeota with the smallest genomes also tend to encode large extracellular murein-like lytic transglycosylase domain proteins that may bind and degrade components of bacterial cell walls, indicating that some might be episymbionts of bacteria. The pathway for biosynthesis of bacterial isoprenoids is widespread in Woesearchaeota genomes and is encoded in proximity to genes involved in bacterial fatty acids synthesis. Surprisingly, in some DPANN genomes we identified a pathway for synthesis of queuosine, an unusual nucleotide in tRNAs of bacteria. Other bacterial systems are predicted to be involved in protein refolding. For example, many DPANN have the complete bacterial DnaK-DnaJ-GrpE system and many Woesearchaeota and Pacearchaeota possess bacterial group I chaperones. Thus, many DPANN appear to have mechanisms to ensure efficient protein folding of both archaeal and laterally acquired bacterial proteins.Entities:
Keywords: DPANN; archaea; bacterial genes; phylogeny; protein family
Year: 2021 PMID: 34140936 PMCID: PMC8204110 DOI: 10.3389/fmicb.2021.660052
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Taxonomic assessment and distribution of the newly DPANN genomes with metabolic similarities and distribution among DPANN lineages. Maximum-likelihood phylogeny of the DPANN lineages using Altiarchaeota as an outgroup and based on a 14-ribosomal-protein concatenated alignment (3,677 amino acids, LG + R10 model). The presence/absence of a subset of targeted metabolic traits is indicated as concentric rings. The scale bar indicates the mean number of substitutions per site. A fully annotated tree with all included lineages, more metabolic features as well as the bootstrap values are available in Supplementary Figure 1.
FIGURE 2Distribution of 17 modules (1143 Protein families) in DPANN genomes. The distribution of 1143 protein families (columns) in representative genomes (rows) from Archaea. Data are clustered based on the presence (black) and absence (white) profiles (Jaccard distance, complete linkage). Pathway or function for each protein family is indicated by a colored circle.
List of the eleven modules that are assigned to a single DPANN lineage and the module 98 that is taxonomically assigned to both the Woesearchaeota and the Pacearchaeota within the genomes of DPANN archaea.
FIGURE 3Bacterial protein families found in DPANN genomes. In the top left plot, “multiple” refers to protein families that are found in more than one metabolic pathway. Colored dots indicate those instances in DPANN. For example, translation genes in the DPANN (pink dots) are genes involved in queosine biosynthesis (see key).
FIGURE 4Bacterial queuosine pathway in DPANN genomes and some Euryarchaeota (Methanosarcinales). (A) Phylogenetic affiliation and distribution of the identified QueA, QueH and TGT in archaea selected for this study. (B) The three last steps of the queuosine biosynthesis pathway in Bacteria. (C) Selected examples of genomic context and gene synteny around the genes QueA, QueH (DUF208) and TGT. QueA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; QueH (DUF208), epoxyqueuosine reductase; TGT, queuine tRNA-ribosyltransferase. A full gene synteny and genomic context of all the genes involved in the queuosine pathway in this study is available in Supplementary Table 5.
FIGURE 5Phylogenetic tree of DNA polymerase III, a bacterial polymerase not typically found in Archaea. The tree includes sequences from phages and some bacteria (light cream shading). Asterisks indicate groups of sequences found in DPANN genomes. The scale bar indicates the mean number of substitutions per site.