Kira S Makarova, Michael Y Galperin, Eugene V Koonin.
Abstract
A systematic comparative genomic analysis of all archaeal membrane proteins that have been projected to the last archaeal common ancestor gene set led to the identification of several novel components of predicted secretion, membrane remodeling, and protein glycosylation systems. Among other findings, most crenarchaea have been shown to encode highly diverged orthologs of the membrane insertase YidC, which is nearly universal in bacteria, eukaryotes, and euryarchaea. We also identified a vast family of archaeal proteins, including the C-terminal domain of N-glycosylation protein AglD, as membrane flippases homologous to the flippase domain of bacterial multipeptide resistance factor MprF, a bifunctional lysylphosphatidylglycerol synthase and flippase. Additionally, several proteins were predicted to function as membrane transporters. The results of this work, combined with our previous analyses, reveal an unexpected diversity of putative archaeal membrane-associated functional systems that remain to be functionally characterized. A more general conclusion from this work is that the currently available collection of archaeal (and bacterial) genomes could be sufficient to identify (almost) all widespread functional modules and develop experimentally testable predictions of their functions. Published by Elsevier B.V.
Keywords: Archaeal genomes; Gene neighborhoods; Membrane proteins; arCOGs
Year: 2015 PMID: 25583072 PMCID: PMC5898192 DOI: 10.1016/j.biochi.2015.01.004
Source DB: PubMed Journal: Biochimie ISSN: 0300-9084 Impact factor: 4.079
Uncharacterized ancestral arCOGs that include predicted membrane proteins.
| arCOG | Related | Number | Presence/absence pattern in major lineages | | | | | | | | | | | | | Prediction/comment |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | Desulfu- | Sulfo- | Thermo- | Thaumar- | Korar- | Archaeog- | Halo- | Methano- | Methan- | Methano- | Thermo- | Thermo- | Nanoar- | |
| 02673 | 07287 | 3,4 | p | 1 | – | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | Sec system component, YidC ortholog (see |
| 01994 | 01995 | 3 or 6 | p | 1 | p | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | p | 1 | Putative membrane remodeling system component (see |
| 02177 | 02178 | 7 | – | – | – | – | – | 1 | 1 | p | 1 | 1 | 1 | – | 1 | Putative membrane remodeling system component (see |
| 01314 | | 2 | p | p | p | – | 1 | 1 | p | p | p | p | 1 | – | – | Putative secretion system component (see |
| 02884 | 02886 | 2 or 4 | p | p | p | – | – | p | p | p | p | p | P | p | – | Putative secretion system component (see |
| 00899 | 00901 | 7–9 | p | – | p | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | p | – | Flippase involved in the protein glycosylation pathways; many archaea have multiple paralogs |
| 02245 | | 6–14 | – | – | 1 | – | 1 | 1 | 1 | 1 | 1 | 1 | 1 | p | – | Cytidylyltransferase family enzyme involved in lipid metabolism |
| 04002 | 08353 | 2,3 | – | – | p | p | 1 | p | – | – | – | p | P | p | – | Cytochrome |
| 02078 | | 2 | – | – | – | – | 1 | – | – | 1 | 1 | 1 | P | – | – | Possibly involved in electron transfer as a component of redox complexes; |
| 03427 | 03426 | 4,5 | p | – | – | – | – | – | p | – | 1 | – | P | p | – | Transporter; expansion in Thermococci |
| 02008 | | 4 | 1 | 1 | 1 | – | 1 | – | p | p | – | – | 1 | 1 | – | Transporter component |
| 04354 | | 12 | p | – | p | – | 1 | 1 | – | – | – | – | P | – | – | Transporter |
| 04469 | | 11 | – | – | – | – | – | p | 1 | 1 | 1 | 1 | 1 | p | 1 | Transporter |
| 02159 | | 7 | p | 1 | p | – | – | 1 | – | – | – | 1 | p | p | – | No prediction; expansion in Methanomicrobia |
| 03206 | 03207 | 8–10 | – | – | – | 1 | – | – | p | 1 | – | 1 | – | – | Lipopolysaccharide biosynthesis | |
Complete phyletic patterns are provided in Supplementary Table 2.
The patterns are abbreviated as follows: –, absent in the vast majority of the respective genomes; p, partially present; 1, present in the vast majority of genomes.
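The abbreviation scheme above can be illustrated with a minimal sketch. The source does not state numeric cutoffs for "vast majority", so the `low` and `high` thresholds, the function name `abbreviate_presence`, and the example counts below are all hypothetical and serve only to show how per-lineage counts could be collapsed into the three symbols.

```python
def abbreviate_presence(n_present: int, n_genomes: int,
                        low: float = 0.1, high: float = 0.9) -> str:
    """Map the fraction of genomes in a lineage that carry an arCOG to a symbol.

    The cutoffs `low` and `high` are assumptions; the source only says
    "vast majority" and gives no numeric thresholds.
    """
    if n_genomes <= 0:
        raise ValueError("n_genomes must be positive")
    fraction = n_present / n_genomes
    if fraction >= high:
        return "1"   # present in the vast majority of genomes
    if fraction <= low:
        return "–"   # absent in the vast majority of genomes
    return "p"       # partially present


# Hypothetical counts: (genomes carrying the arCOG, genomes in the lineage).
example_counts = {
    "lineage_A": (19, 20),
    "lineage_B": (7, 20),
    "lineage_C": (0, 20),
}
pattern = {name: abbreviate_presence(*counts)
           for name, counts in example_counts.items()}
```

With real data, the counts would come from the complete phyletic patterns in the supplementary table, and the cutoffs would need to match whatever definition the authors actually used.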
Fig. 1. Comparative genomic analysis of the YidC family in crenarchaea. A. Gene neighborhoods of predicted yidC genes in archaea. For each arCOG, gene neighborhoods for representative organisms are shown. Genes are shown by block arrows with the length roughly proportional to the size of the corresponding gene. Homologous genes are indicated by the same color. The annotated arCOGs are indicated above the respective arrows. The arCOG05556 gene is represented by a white arrow to show that the evidence for it being the YidC subunit is weak. Abbreviations: RP S5 – ribosomal protein S5, RP L30 – ribosomal protein L30, RP L15 – ribosomal protein L15, SecY – preprotein translocase subunit SecY, RP L34 – ribosomal protein L34E, AdkA – archaeal adenylate kinase, PolB – DNA polymerase elongation subunit (family B). B. Phyletic pattern of predicted YidC subfamilies in different archaeal lineages. Phyletic patterns for the indicated arCOG families (filled circles show presence and empty circles show absence of the respective COG members) are superimposed over the phylogenetic tree of crenarchaea. The tree topology is based on the phylogeny of concatenated ribosomal proteins [81].
Fig. 2. Neighborhood analysis and a model of a putative novel secretion system associated with arCOG01314. A. Gene neighborhoods of arCOG01314 for several representative organisms. Designations are as in Fig. 1. The details for each family are described in Table 2. Specific arCOG numbers present in each neighborhood are provided in Supplementary Table 3. B. A hypothetical model of the putative novel secretion system. The cartoon is based on the features of each family described in Table 2. The green shape represents a membrane-anchored protein of arCOG01314. Oval yellow, orange and pale orange shapes represent three distinct families of proteins with an Ig-like region. Purple shapes represent extracellular proteins containing vWA and Ig-like domains. Blue and sky blue shapes represent two distinct families of membrane proteins. The stoichiometry and positions of the subunits are not known and should be the subject of further experimental investigation.
Putative novel secretion system associated with arCOG01314.
| arCOG and its homologs | Domain architecture and predicted activity |
|---|---|
| arCOG01314 | Membrane-anchored protein with extracellular flavodoxin-like domain |
| arCOG00435 | AAA+ ATPase of MoxR/GvpN family, possibly involved in the complex assembly and regulation of its state |
| arCOG02742 | Likely secreted protein with an N-terminal Ig-like domain fused to the C-terminal von Willebrand factor type A (vWA) domain |
| arCOG02487 | Predicted S-layer protein with an Ig-like domain |
| arCOG02884 | Membrane-associated predicted S-layer protein with an extracellular Ig-like domain; 2 or 4 TMs; belongs to COG4743, pfam07760 |
| arCOG03442 | Membrane-associated predicted S-layer protein with an extracellular Ig-like domain; 2 or 4 TMs; belongs to COG4743, pfam07760 |
| arCOG03875 | Membrane protein with 2–3 TMs |
| arCOG07126 | Membrane protein with 5 TMs |
| arCOG11232 | Membrane protein with 5 TMs |
Fig. 3. Genomic neighborhood analysis of arCOG01994 and arCOG01996, putative components of membrane remodeling systems. A. Gene neighborhood for arCOG01994. B. Gene neighborhood for arCOG01996. Designations are as in Fig. 1. Respective arCOG numbers are indicated underneath the arrows.