| Literature DB >> 29263101 |
Michael Y Galperin1, Kira S Makarova2, Yuri I Wolf2, Eugene V Koonin2.
Abstract
The two-component signal transduction (TCS) machinery is a key mechanism of sensing environmental changes in the prokaryotic world. TCS systems have been characterized thoroughly in bacteria but to a much lesser extent in archaea. Here, we provide an updated census of more than 2,000 histidine kinases and response regulators encoded in 218 complete archaeal genomes, as well as unfinished genomes available from metagenomic data. We describe the domain architectures of the archaeal TCS components, including several novel output domains, and discuss the evolution of the archaeal TCS machinery. The distribution of TCS systems in archaea is strongly biased, with high levels of abundance in haloarchaea and thaumarchaea but none detected in the sequenced genomes from the phyla Crenarchaeota, Nanoarchaeota, and Korarchaeota The archaeal sensor histidine kinases are generally similar to their well-studied bacterial counterparts but are often located in the cytoplasm and carry multiple PAS and/or GAF domains. In contrast, archaeal response regulators differ dramatically from the bacterial ones. Most archaeal genomes do not encode any of the major classes of bacterial response regulators, such as the DNA-binding transcriptional regulators of the OmpR/PhoB, NarL/FixJ, NtrC, AgrA/LytR, and ActR/PrrA families and the response regulators with GGDEF and/or EAL output domains. Instead, archaea encode multiple copies of response regulators containing either the stand-alone receiver (REC) domain or combinations of REC with PAS and/or GAF domains. Therefore, the prevailing mechanism of archaeal TCS signaling appears to be via a variety of protein-protein interactions, rather than direct transcriptional regulation.IMPORTANCE Although the Archaea represent a separate domain of life, their signaling systems have been assumed to be closely similar to the bacterial ones. A study of the domain architectures of the archaeal two-component signal transduction (TCS) machinery revealed an overall similarity of archaeal and bacterial sensory modules but substantial differences in the signal output modules. The prevailing mechanism of archaeal TCS signaling appears to involve various protein-protein interactions rather than direct transcription regulation. The complete list of histidine kinases and response regulators encoded in the analyzed archaeal genomes is available online at http://www.ncbi.nlm.nih.gov/Complete_Genomes/TCSarchaea.html.Entities:
Keywords: Archaea; arCOGs; archaeal genomes; gene neighborhoods; genome analysis; genomics; halobacterium; histidine kinase; membrane proteins; metagenomics; methanogens; protein-protein interactions; signal transduction; two-component regulatory systems
Mesh:
Substances:
Year: 2018 PMID: 29263101 PMCID: PMC5847659 DOI: 10.1128/JB.00681-17
Source DB: PubMed Journal: J Bacteriol ISSN: 0021-9193 Impact factor: 3.490
Distribution of TCSs among major archaeal taxa
| Archaeal superphylum, phylum, or class | No. of complete genomes | Total no. of proteins | No. (%) of | |||
|---|---|---|---|---|---|---|
| Histidine kinases | Response regulators | |||||
| All | Containing: | |||||
| REC only | HTH | |||||
| DPANN group | 4 | |||||
| “ | 1 | 952 | 1 | 1 | — | 1 |
| “ | 1 | 1,183 | — | — | — | — |
| | 2 | 1,122 | — | — | — | — |
| 149 | ||||||
| | 7 | 15,162 | 37 | 42 | 32 (76) | |
| | 35 | 123,786 | 745 | 695 | 163 (23) | 93 |
| | 17 | 33,023 | 140 | 150 | 35 (23) | — |
| | 12 | 19,966 | 19 | 23 | 15 (65) | — |
| | 39 | 108,246 | 998 | 697 | 300 (43) | 20 |
| | 1 | 1,687 | — | — | — | — |
| | 25 | 51,618 | 17 | 31 | 17 (55) | — |
| | 11 | 18,221 | 3 | 4 | — | 1 |
| Unclassified euryarchaea | 3 | 4,814 | 4 | 8 | 6 (75) | — |
| TACK group | 61 | |||||
| | 16 | 36,321 | 175 | 224 | 172 (77) | 1 |
| | 44 | 88,281 | — | — | — | — |
| | 1 | 1,602 | — | — | — | — |
| Unclassified archaea | 4 | 5,604 | — | — | — | — |
| Total no. | 218 | 511,588 | 2,139 | 1,875 | 740 | 116 |
Among the proteins that combine the HisKA, HATPase, and REC domains, the 53 that contain REC domains at their C termini were counted as histidine kinases, whereas those (445 in total) that contain REC domains on their N termini were counted as response regulators. —, no proteins found.
FIG 1Census of the archaeal two-component signal transduction systems. (A) Total numbers of sensor histidine kinases (HKs) and response regulators (RRs) encoded in 218 archaeal genomes. (B) Ratios of histidine kinases and response regulators in various archaea. Squares represent HKs, and circles represent RRs; symbols representing data for halobacteria are in orange and red, those for methanogens are in brown, and those for thaumarchaea are in blue (9). The small blue dots indicate RR/HK ratios for individual bacterial genomes.
FIG 2Principal classes of archaeal response regulators. The detailed data are available in Tables S1 and S2 and online at http://www.ncbi.nlm.nih.gov/Complete_Genomes/TCSarchaea.html.
Previously described REC-associated domains in archaeal response regulators
| Domain | Pfam ID | No. of proteins containing domain | Accession no. of representative example in: | Typical domain architecture(s) | Phylogenetic distribution | |
|---|---|---|---|---|---|---|
| GenBank | UniProt | |||||
| CheB | PF01339 | >450 | CheB, REC-CheB | All archaeal phyla | ||
| PAS | PF13426 | >750 | REC-PAS, PEC-PAS-PAS, REC-PAS-GAF | All archaeal phyla | ||
| GAF | PF13492 | >450 | REC-GAF, REC-PAS-GAF | All archaeal phyla | ||
| HisKA | PF00512 | 18 | REC-(PAS) | Most archaeal phyla | ||
| HATPase | PF02518 | REC-(PA) | Most archaeal phyla | |||
| BAT | PF15915 | >250 | REC-PAS-GAF-BAT-HTH_10 | |||
| HTH_10 | PF04967 | >250 | REC-PAS-GAF-BAT-HTH_10 | |||
| HalX | PF08663 | >550 | HalX, REC-HalX, HxlR-REC-HalX | |||
| Glyco_transf-2_3 (BcsA) | PF13641 | 13 | REC-BcsA, REC-REC-BcsA | |||
| iKaiC | PF06745 | 9 | iKaiC-REC | |||
| DUF835 (iKaiC) | PF05763 | 36 | REC-DUF835, REC-PAS-PAS-DUF835 | |||
| MCPsignal | PF00015 | 3 | REC-PAS-PAS-HAMP-MCPsignal | |||
| MEDS (iKaiC) | PF14417 | 2 | MEDS, MEDS-REC, REC-MEDS | “ | ||
| TPR-like | PF13414 | 6 | REC-PAS-TPR, TPR-TPR-TPR-REC | |||
Abbreviated domain names that may represent a group of related Pfam (4) domains, e.g., PAS represents domains from PAS to PAS_11. The Pfam and GAF entries listed are for the versions that are most often found in archaeal RRs.
In archaea, see the respective Pfam entries for the complete listings.
iKaiC, inactivated KaiC-like ATPase domain (described in detail in reference 41).
Novel REC-associated domains in archaeal response regulators
| Domain | Pfam ID | Length (aa) | No. of proteins containing domain | Accession no. of representative example in: | Domain architecture(s) | Phylogenetic distribution | |
|---|---|---|---|---|---|---|---|
| GenBank | UniProt | ||||||
| AcidOD1 | NA | 70 | 2 | REC-AcidOD1 | |||
| HalOD1 | PF18545 | 80 | >2,000 | REC-HalOD1, HalOD1-PAS, HalOD1-iKaiC | |||
| HalOD2 | PF18547 | 130 | 14 | REC-HalOD2 | |||
| MetOD1 | PF18546 | 140 | >100 | MetOD1, REC-MetOD1, REC-PAS-MetOD1 | |||
| MetOD2 | PF18548 | 80 | >70 | REC-MetOD2, MetOD3-REC-MetOD2 | |||
| MetOD3 | NA | 180 | 4 | MetOD3-REC-MetOD2 | |||
| MetOD4 | NA | 80 | 3 | MetOD4, REC-MetOD4 | |||
| MetOD5 | NA | 300 | 3 | MetOD5, MetOD5-REC | |||
| NitrOD1 | PF18549 | 70 | 12 | NitrOD1, REC-NitrOD1 | |||
| NitrOD2 | PF18550 | 90 | 13 | NitrOD2, NitrOD2-REC | |||
| NitrOD3 | NA | 120 | 3 | NitrOD3-REC | |||
| NitrOD4 | NA | 75 | 4 | NitrOD4, NitrOD4-REC | |||
| NitrOD5 | PF11537 | 100 | 16 | NitrOD5, REC-NitrOD5 | |||
| TackOD1 | PF18551 | 200 | 12 | TackOD1, REC-TackOD1, REC-wHTH-TackOD1 | TACK group | ||
Tentative domain names, constructed by combining an abbreviated taxon name with “OD” (output domain). These names are expected to be replaced as soon as these domains are experimentally characterized. Details of the analyses of remote sequence similarities of these domains are presented in Table S5 in the supplemental material.
NA, not available. The numbered domains are expected to be included in Pfam release 33 [2018]).
The number of archaeal proteins containing the domain (with any domain architecture) in the NCBI protein database as of 1 July 2017.
FIG 3Sequence conservation and domain architectures of HalOD1s. (A) Sequence logo generated by the WebLogo program (77) from an alignment obtained by PSI-BLAST run using as query the sequence of halovirus HRTV-8 protein 1 (GenBank accession number AGM10749; UniProt ID R4T552). The first position of the logo corresponds to Arg21 of HRTV8-1 and to Glu161 of Haloferax volcanii response regulator HVO_2306 (GenBank accession number ADE02288; UniProt ID D4GWD4). Secondary structure prediction (cylinders indicate α-helices, and arrows indicate β-strands) was produced by JPred (84). (B) Domain architectures of selected HalOD1-containing proteins, listed under their locus names and GenBank accession numbers. iKaiC, divergent and possibly inactivated ATPase domain of the KaiC superfamily. The domain architectures for each sector are shown only for comparison and are not scaled to size.