| Literature DB >> 26339559 |
Baldiri Salvado1, Ester Vilaprinyo2, Albert Sorribas1, Rui Alves1.
Abstract
Two Component Systems and Phosphorelays (TCS/PR) are environmental signal transduction cascades in prokaryotes and, less frequently, in eukaryotes. The internal domain organization of proteins and the topology of TCS/PR cascades play an important role in shaping the responses of the circuits. It is thus important to maintain updated censuses of TCS/PR proteins in order to identify the various topologies used by nature and enable a systematic study of the dynamics associated with those topologies. To create such a census, we analyzed the proteomes of 7,609 organisms from all domains of life with fully sequenced and annotated genomes. To begin, we survey each proteome searching for proteins containing domains that are associated with internal signal transmission within TCS/PR: Histidine Kinase (HK), Response Regulator (RR) and Histidine Phosphotranfer (HPt) domains, and analyze how these domains are arranged in the individual proteins. Then, we find all types of operon organization and calculate how much more likely are proteins that contain TCS/PR domains to be coded by neighboring genes than one would expect from the genome background of each organism. Finally, we analyze if the fusion of domains into single TCS/PR proteins is more frequently observed than one might expect from the background of each proteome. We find 50 alternative ways in which the HK, HPt, and RR domains are observed to organize into single proteins. In prokaryotes, TCS/PR coding genes tend to be clustered in operons. 90% of all proteins identified in this study contain just one of the three domains, while 8% of the remaining proteins combine one copy of an HK, a RR, and/or an HPt domain. In eukaryotes, 25% of all TCS/PR proteins have more than one domain. These results might have implications for how signals are internally transmitted within TCS/PR cascades. These implications could explain the selection of the various designs in alternative circumstances.Entities:
Keywords: Biological organization principles; Domain organization; Mathematical modeling; Signal transduction
Year: 2015 PMID: 26339559 PMCID: PMC4558063 DOI: 10.7717/peerj.1183
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Two component systems.
(A) Prototypical two component system with one phosphotransfer step between HK and RR. (B) 3-step phosphorelay, with four protein domains involved in the signal transduction process and 3 phosphotransfer steps.
Figure 2Four different patterns of covalent linkage between the protein domains involved in phosphorelays.
(A) A four protein phosphorelay. (B) A phosphorelay with and hybrid kinase at the beginning of the cascade. (C) A two protein phosphorelay where the first two phosphotransfer steps between domains contained in a single protein. (D) A one protein phosphorelay, where all phosphotransfer steps take place between domains of a single protein.
Percentage of species in each phylum with TCS/PR proteins.
| Domain | Phylum | Abbreviaton | No. of species surveyed | % of species with HK and RR domains | % of species with HPt domains |
|---|---|---|---|---|---|
| Bacteria | Actinobacteria | At | 635 | 99.37 | 14.49 |
| Bacteria | Aquificae | Aq | 13 | 100.00 | 76.92 |
| Bacteria | Armatimonadetes | Ar | 1 | 100.00 | 100.00 |
| Bacteria | Bacteroidetes | Ba | 235 | 89.79 | 49.79 |
| Bacteria | Chlorobi | Cb | 14 | 100.00 | 71.43 |
| Bacteria | Caldiserica | Cd | 1 | 100.00 | 0.00 |
| Bacteria | Chlamydiae | Cm | 108 | 98.15 | 1.85 |
| Bacteria | Lentisphaerae | L | 1 | 100.00 | 100.00 |
| Bacteria | Verrucomicrobia | V | 10 | 100.00 | 80.00 |
| Bacteria | Chloroflexi | Cf | 23 | 100.00 | 65.21 |
| Bacteria | Chrysiogenetes | Cr | 1 | 100.00 | 100.00 |
| Bacteria | Cyanobacteria | Cy | 118 | 100.00 | 75.42 |
| Bacteria | Deferribacteres | Df | 4 | 100.00 | 100.00 |
| Bacteria | Deinococcus-Thermus | Dt | 20 | 100.00 | 35.00 |
| Bacteria | Dictyoglomi | Dc | 2 | 100.00 | 0.00 |
| Bacteria | Elusimicrobia | El | 1 | 100.00 | 0.00 |
| Bacteria | Acidobacteria | Ac | 9 | 100.00 | 100.00 |
| Bacteria | Fibrobacteres | Fb | 1 | 100.00 | 100.00 |
| Bacteria | Firmicutes | Fi | 2,066 | 99.42 | 37.80 |
| Bacteria | Fusobacteria | Fu | 38 | 94.74 | 28.95 |
| Bacteria | Gemmatimonadetes | Ge | 1 | 100.00 | 100.00 |
| Bacteria | Nitrospinae | Ni | 1 | 100.00 | 100.00 |
| Bacteria | Nitrospirae | Nt | 4 | 100.00 | 100.00 |
| Bacteria | Planctomycetes | Pl | 20 | 100.00 | 100.00 |
| Bacteria | Alphaproteobacteria | A | 451 | 96.67 | 58.31 |
| Bacteria | Betaproteobacteria | B | 366 | 98.36 | 59.56 |
| Bacteria | Deltaproteobacteria | D | 82 | 98.78 | 98.78 |
| Bacteria | Epsilonproteobacteria | E | 410 | 100.00 | 98.54 |
| Bacteria | Gammaproteobacteria | G | 2,246 | 98.31 | 95.46 |
| Bacteria | Zetaproteobacteria | Z | 1 | 100.00 | 100.00 |
| Bacteria | Spirochaetes | S | 274 | 100.00 | 99.64 |
| Bacteria | Synergistetes | Sy | 11 | 90.91 | 63.64 |
| Bacteria | Tenericutes | T | 111 | 15.32 | 7.21 |
| Bacteria | Thermodesulfobacteria | Th | 2 | 100.00 | 100.00 |
| Bacteria | Thermotogae | Tt | 17 | 100.00 | 100.00 |
| Archaea | Crenarchaeota | C | 51 | 3.92 | 3.92 |
| Archaea | Euryarchaeota | Eu | 179 | 86.03 | 64.25 |
| Archaea | Korarchaeota | K | 1 | 0.00 | 0.00 |
| Archaea | Thaumarchaeota | Ta | 11 | 81.82 | 63.64 |
| Archaea | Nanoarchaeota | N | 1 | 0.00 | 0.00 |
| Archaea | Nanohaloarchaeota | Nh | 1 | 0.00 | 0.00 |
| Eukarya | Alveolates | Av | 5 | 0.00 | 20.00 |
| Eukarya | Amoeboflagellates | Am | 1 | 100.00 | 100.00 |
| Eukarya | Euglenozoa | Eg | 5 | 40.00 | 0.00 |
| Eukarya | Microsporidians | Mi | 2 | 50.00 | 0.00 |
| Eukarya | Ascomycetes | As | 31 | 54.84 | 96.77 |
| Eukarya | Basidiomycetes | Bs | 2 | 100.00 | 100.00 |
| Eukarya | Eudicots | Ed | 2 | 100.00 | 100.00 |
| Eukarya | Monocots | M | 1 | 0.00 | 100.00 |
| Eukarya | Nematodes | – | 1 | 0.00 | 0.00 |
| Eukarya | Arthropods | – | 7 | 0.00 | 0.00 |
| Eukarya | Chordates | – | 10 | 0.00 | 0.00 |
Figure 3Percentage of TCS/PR proteins in the proteome per phylum.
The colored box represents the range of percentage values comprised between the 25% and the 75% quantiles, and the edges of the vertical bar denote the upper and lower percentage values for each phylum. Phylum abbreviations are given in Table 1. Phyla with only one species surveyed are not represented in the figure. Their percentage of TCS/PR proteins per phylum are: Aq (1.30), Ge (3.15), Fb (0.81), L (0.68), Cr (3.73), El (0.78), Ar (0.93), Z (2.47), O (4.95), Ni (1.97), K (0), N (0), Nh (0), Am (0.17) and M (0.04). We have found only 2 TCS/PR proteins in Av (5 sp): 1 HPt in T. annulata and 1 HK in T. parva.
Figure 4Percentage of TCS/PR proteins in the proteome versus total number of proteins in the proteome.
R2 is 0.21 for prokaryotes and 0.49 for eukaryotes. This means that proteome size explains 21% of the variation in the percentage of TCS/PR in prokaryotes and 49% in eukaryotes.
Types of TCS/PR proteins found in the 7,609 surveyed species.
The protein identifier describes the type (HK, HPt, or RR) and number of TCS/PR domains fused in each protein.
| Protein type | Total number of proteins found | Percentage of proteomes with this type of protein | Number of species with this type of protein | Average number of proteins/organism |
|---|---|---|---|---|
| RR | 219,436 | 97,07 | 7,386 | 29.71 |
| HK | 151,849 | 95,98 | 7,303 | 20.79 |
| HKRR | 18,383 | 48,57 | 3,696 | 4.97 |
| HKRRHPt | 9,097 | 40,85 | 3,108 | 2.93 |
| HKHPt | 5,506 | 41,99 | 3,195 | 1.72 |
| HPt | 3,534 | 28,05 | 2,134 | 1.66 |
| RR1RR2 | 2,034 | 17,60 | 1,339 | 1.52 |
| HKRR1RR2 | 2,017 | 13,59 | 1,034 | 1.95 |
| HKRR1HPtRR2 | 982 | 8.12 | 618 | 1.59 |
| HK1RR1RR2RR3 | 580 | 6.58 | 501 | 1.16 |
| HK1HK2 | 450 | 4.07 | 310 | 1.45 |
| HK1RRHK2 | 392 | 3.30 | 251 | 1.56 |
| RRHPt | 312 | 3.47 | 264 | 1.18 |
| HKRRHPt1HPt2HPt3 | 141 | 1.85 | 141 | 1.00 |
| RR1RR2HPt | 130 | 1.45 | 110 | 1.18 |
| HKRRHPt1HPt2HPt3HPt4 | 108 | 1.42 | 108 | 1.00 |
| HK1RR1HK2RR2 | 90 | 0.79 | 60 | 1.50 |
| RR1RR2RR3HPt | 72 | 0.51 | 39 | 1.85 |
| HKRRHPt1HPt2HPt3HPt4HPt5 | 61 | 0.80 | 61 | 1.00 |
| HKRRHPt1HPt2 | 58 | 0.72 | 55 | 1.05 |
| HK1HK2RRHPt | 39 | 0.50 | 38 | 1.03 |
| HK1HK2HPt | 39 | 0.50 | 38 | 1.03 |
| HKHPt1HPt2 | 36 | 0.46 | 35 | 1.03 |
| RR1RR2RR3 | 34 | 0.32 | 24 | 1.42 |
| HKRR1RR2RR3HPt | 33 | 0.37 | 28 | 1.18 |
| HPt1HPt2 | 21 | 0.20 | 15 | 1.40 |
| HKHPt1HPt2HPt3 | 16 | 0.20 | 15 | 1.07 |
| HK1HK2RR1RR2RR3 | 9 | 0.12 | 9 | 1.00 |
| HK1HK2HK3 | 9 | 0.04 | 3 | 3.00 |
| HKRR1RR2RR3RR4RR5HPt | 7 | 0.09 | 7 | 1.00 |
| HKRRHPt1HPt2HPt3HPt4HPt5HPt6HPt7 | 7 | 0.09 | 7 | 1.00 |
| HKRR1RR2RR3RR4 | 6 | 0.08 | 6 | 1.00 |
| HK1HK2HK3HK4RR1RR2 | 6 | 0.08 | 6 | 1.00 |
| HK1HK2RRHPt1HPt2 | 5 | 0.07 | 5 | 1.00 |
| HKRR1RR2RR3RR4HPt | 5 | 0.07 | 5 | 1.00 |
| RR1RR2RR3RR4 | 2 | 0.03 | 2 | 1.00 |
| HK1HK2RR1RR2HPt1HPt2 | 2 | 0.03 | 2 | 1.00 |
| HK1HK2RR1RR2RR3RR4 | 2 | 0.03 | 2 | 1.00 |
| HK1HK2HK3HK4 | 2 | 0.03 | 2 | 1.00 |
| HK1HK2HPt1HPt2 | 2 | 0.03 | 2 | 1.00 |
| HKRR1RR2HPt1HPt2 | 2 | 0.03 | 2 | 1.00 |
| HK1HK2HK3RR | 1 | 0.01 | 1 | 1.00 |
| HPt1HPt2HPt3 | 1 | 0.01 | 1 | 1.00 |
| HK1HK2RRHPt1HPt2HPt3 | 1 | 0.01 | 1 | 1.00 |
| HKRR1RR2RR3HPt1HPt2HPt3 | 1 | 0.01 | 1 | 1.00 |
| HPt1HPt2HPt3HPt4 | 1 | 0.01 | 1 | 1.00 |
| HKRR1RR2HPt1HPt2HPt3 | 1 | 0.01 | 1 | 1.00 |
| HK1HK2RR1RR2HPt | 1 | 0.01 | 1 | 1.00 |
| HK1HK2RR1RR2RR3RR4RR5RR6HPt | 1 | 0.01 | 1 | 1.00 |
| HKRRHPt1HPt2HPt3HPt4HPt5HPt6 | 1 | 0.01 | 1 | 1.00 |
Total number of HKRRHPt and HKRRHK proteins found in prokaryotic phyla.
Phyla in bold are from the bacterial domain. Italicized phyla are from the archaeal domain.
| Phylum | Number of HKRRHPt/HK1RRHK2 proteins found | Number of HKRRHPt/HK1RRHK2 genes with a neighboring RR gene | % of HKRRHPt/HK1RRHK2 genes with a neighboring RR gene |
|---|---|---|---|
| Actinobacteria | 12/4 | 9/1 | 75.00/25.00 |
| Aquificae | 0/0 | 0/0 | −/− |
| Armatimonadetes | 0/0 | 0/0 | −/− |
| Bacteroidetes | 107/9 | 62/4 | 57.94/44.44 |
| Chlorobi | 4/0 | 0/0 | 0.00/− |
| Caldiserica | 0/0 | 0/0 | −/− |
| Chlamydiae | 2/0 | 1/0 | 50.00/− |
| Lentisphaerae | 1/0 | 0/0 | 0.00/− |
| Verrucomicrobia | 12/2 | 9/1 | 75.00/50.00 |
| Chloroflexi | 16/0 | 8/0 | 50.00/− |
| Chrysiogenetes | 1/0 | 0/0 | 0.00/− |
| Cyanobacteria | 193/28 | 41/9 | 21.24/32.14 |
| Deferribacteres | 9/0 | 7/0 | 77.78/− |
| Deinococcus-Thermus | 0/4 | 0/1 | −/25.00 |
| Dictyoglomi | 0/0 | 0/0 | −/− |
| Elusimicrobia | 0/0 | 0/0 | −/− |
| Acidobacteria | 1/5 | 1/2 | 100.00/40.00 |
| Fibrobacteres | 0/0 | 0/0 | −/− |
| Firmicutes | 65/97 | 44/69 | 67.69/71.13 |
| Fusobacteria | 2/0 | 2/0 | 100.00/− |
| Gemmatimonadetes | 3/0 | 3/0 | 100.00/− |
| Nitrospinae | 0/0 | 0/0 | −/− |
| Nitrospirae | 4/0 | 3/0 | 75.00/− |
| Planctomycetes | 40/0 | 18/0 | 45.00/− |
| Alphaproteobacteria | 337/10 | 233/5 | 69.14/50.00 |
| Betaproteobacteria | 364/9 | 274/4 | 75.27/44.44 |
| Deltaproteobacteria | 208/29 | 131/1 | 62.98/3.45 |
| Epsilonproteobacteria | 399/0 | 389/0 | 97.49/− |
| Gammaproteobacteria | 7239/28 | 3336/15 | 46.08/53.57 |
| Zetaproteobacteria | 2/0 | 1/0 | 50.00/− |
| Spirochaetes | 53/147 | 16/3 | 30.19/2.04 |
| Synergistetes | 6/0 | 6/0 | 100.00/− |
| Tenericutes | 0/0 | 0/0 | −/− |
| Thermodesulfobacteria | 2/0 | 1/0 | 50.00/− |
| Thermotogae | 6/0 | 5/0 | 83.33/− |
|
| 0/0 | 0/0 | −/− |
|
| 9/1 | 3/0 | 33.33/0.00 |
|
| 0/0 | 0/0 | −/− |
|
|
|
|
|
Percentage of RR and HK domains in hybrid proteins as a function of the total number of HK and RR proteins in the genome.
Phyla in bold are from the bacterial domain. Italicized phyla are from the archaeal domain. Other phyla are from the eukaryotic domain.
| Phylum | RR | SK |
|---|---|---|
| Gammaproteobacteria | 6.97 + 0.2 | 14 + 0.31 |
| Betaproteobacteria | 1.90 + 0.22 | 4.6 + 0.37 |
| Epsilonproteobacteria | 15.3 − 0.06 | 0.17 + 0.36 |
| Deltaproteobacteria | 17.3 + 0.1 | 31.9 + 0.06 |
| Alphaproteobacteria | 4.1 + 0.29 | 3.8 + 0.4 |
| Firmicutes | −0.6 + 0.1 | 1.7 + 0.09 |
| Tenericutes | – | – |
| Actinobacteria | −4.5 + 0.32 | −0.38 + 0.2 |
| Chlamydiae | – | – |
| Spirochaetes | 5 + 0.47 | 27.2 + 0.07 |
| Acidobacteria | −5.7 + 0.26 | −16 + 0.53 |
| Bacteroidetes | 30.7 + 0.05 | 32 + 0.09 |
| Fusobacteria | −5.8 + | −7.1 + 1.5 |
| Verrumicrobia | 6.7 + 0.3 | 6.6 + 0.4 |
| Planctomycetes | 32.8 − 0.1 | 49.8 − 0.21 |
| Synergistetes | – | – |
| Cyanobacteria | 2.2 + 0.3 | 5.8 + 0.4 |
| Green sulfur bacteria | 31.6 + 0.7 | 33.5 + 0.5 |
| Green non-sulfur bacteria | 5.2 + 0.2 | 8.9 + 0.2 |
| Deinococcus-Thermus | −1.2 + 0.2 | −1.6 + 0.2 |
| Euryarchaeota | 6.9 + 0.6 | 15.4 + 0.1 |
| Crenarchaeota | – | – |
| Nanoarchaeota | – | – |
| Korarchaeota | – | – |
| Oomycetes | – | – |
| Diatoms | – | – |
| Parabasilids | – | – |
| Diplomonads | – | – |
| Euglenozoa | – | – |
| Alveolates | – | 9.5 + 0.4 |
| Amoeboflagellates | – | – |
| Choanoflagellates | – | – |
| Microsporideans | – | – |
| Basidiomycetes | 25.7 + 3.7 | 88 + 1.1 |
| Ascomycetes | 37.4 + 2.5 | 92.4 − 0.07 |
| Red algae | – | – |
| Green algae | 29.2 + | 114.7 − 9 |
| Mosses | – | – |
| Monocots | 12.3 + 0.2 | 32.9 + 1.9 |
| Eudicots | 17.4 + 0.1 | 71 − 0.5 |
Notes.
p-value < 10−3
p-value < 10−8
Non-significant (p-value > 0.1)