| Literature DB >> 27608806 |
Rita Pancsa1, Peter Tompa2,3.
Abstract
UNLABELLED: Intrinsically disordered regions (IDRs) of proteins fulfill important regulatory roles in most organisms. However, the proteins of certain endosymbiont and intracellular pathogenic bacteria with extremely reduced genomes contain disproportionately small amounts of IDRs, consisting almost entirely of folded domains. As their genomes co-evolving with their hosts have been reduced in unrelated lineages, the proteomes of these bacteria represent independently evolved minimal protein sets. We systematically analyzed structural disorder in a representative set of such minimal organisms to see which types of functionally relevant longer IDRs are invariably retained in them. We found that a few characteristic functions are consistently linked with conformational disorder: ribosomal proteins, key components of the protein production machinery, a central coordinator of DNA metabolism and certain housekeeping chaperones seem to strictly rely on structural disorder even in genome-reduced organisms. We propose that these functions correspond to the most essential and probably also the most ancient ones fulfilled by structural disorder in cellular organisms. REVIEWERS: This article was reviewed by Michael Gromiha, Zoltan Gaspari and Sandor Pongor.Entities:
Keywords: Chaperone function; Disorder prediction; Endosymbiont; Essential function; Genome reduction; Genome-reduced bacterium; Intrinsically disordered; Minimal genome; Structural disorder
Mesh:
Substances:
Year: 2016 PMID: 27608806 PMCID: PMC5016991 DOI: 10.1186/s13062-016-0149-y
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Non-ribosomal proteins that retained LDRs in at least three minimal organisms of at least two different bacterial clades
| Taxon ID | Clade | Species | rpoB | rpoC | rpoD | infB | prfA | groL | dnaK | ssb | ftsH |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1053648 | BPB | Cand. Tremblaya princeps PCVAL | ✓ | ✓ | ND | Ø | Ø | ✓ | ✓ | Ø | Ø |
| 1343077 | BPB | Cand. Nasuia deltocephalinicola str. NAS-ALF | ND | ND | ND | Ø | ND | ND | ✓ | ✓ | Ø |
| 573234 | APB | Hodgkinia cicadicola (strain Dsem) | ND | ND | ND | ND | ND | ND | ✓ | Ø | Ø |
| 1266371 | BPB | Cand. Tremblaya phenacola PAVE | ✓ | ND | ND | ND | ✓ | ✓ | ✓ | ND | Ø |
| 667013 | GPB | Cand. Carsonella ruddii DC | ND | ND | ND | ND | ND | ND | ND | Ø | Ø |
| 871271 | BPB | Zinderia insecticola (strain CARI) | ND | ND | ND | ND | ND | ND | ND | ✓ | Ø |
| 1415657 | BAC | endosymbiont of Llaveia axin axin | ND | ND | ND | ✓ | ND | ND | ✓ | Ø | ✓ |
| 1206109 | GPB | Cand. Portiera aleyrodidarum BT-B-HRs | ND | ND | ND | ✓ | ND | ✓ | ✓ | Ø | ND |
| 482235 | FIR | Phytoplasma mali (strain AT) | ND | ND | ND | ND | ND | ND | ✓ | ND | ✓ |
| 347256 | TEN | Mycoplasma hominis (strain ATCC 23114) | ✓ | ✓ | ✓ | ND | ND | Ø | ✓ | ✓ | ✓ |
| 515618 | GPB | Riesia pediculicola (strain USDA) | ND | ND | ✓ | ND | ND | ✓ | ✓ | ✓ | ND |
| 107806 | GPB | Buchnera aphidicola subsp. A. pisum (strain APS) | ND | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ND |
| 1318617 | TEN | Cand. Mycoplasma girerdii | ND | ND | ND | ND | ✓ | Ø | ✓ | ✓ | ✓ |
|
|
|
|
|
| ✔ | ✔ | ✔ |
| ✔ | ✔ | ✔ |
|
|
|
|
|
|
| ✔ |
|
| ✔ | ✔ | ✔ |
|
|
|
| ✔ | ✔ |
| ✔ | ✔ | ✔ | ✔ | ✔ |
|
Taxon ID: the taxon identifier of the organisms with the number of their proteins annotated by UniProt in brackets. Clade: an abbreviation of the corresponding phylogenetic clade (APB Alphaproteobacteria, BPB Betaproteobacteria, GPB Gammaproteobacteria, BAC Bacteroidetes, FIR Firmicutes, TEN Tenericutes). Species and strain information is followed by columns showing information on the 9 proteins with retained LDRs. Different marks mean that i) the given protein was not annotated in the given organism (missing or does not show recognizable sequence homology (Ø)); ii) the protein is not disordered (ND), meaning that the given protein was annotated, but it does not contain an LDR in the given species, and iii) the protein has at least one LDR (✓). Data for non-minimal reference bacteria of three different clades are indicated in the last three lines in bold. A similar table with all corresponding UniProt identifiers indicated in the appropriate cells is available as Additional file 1: Table S2
Fig. 1Conserved disorder in chaperones and hub proteins of minimal organisms. Domain maps, structures, sequence and disorder conservation are depicted for the three proteins (GroEL (a) DnaK (b) and SSB (c)) with conserved long disordered regions. On the grey domain maps, the residue boundaries of conserved disordered segments (in red) and known domains (in darker grey) are provided. The red regions of the domain maps complemented by a few residue positions around are also highlighted as Clustal Omega 1.2.2. multiple sequence alignments below the domain maps. The sequences of the minimal and reference organisms are identified by their Taxonomy/UniProt identifiers, and are depicted in the same order as in Table 1. In the alignments the background of the residues are colored according to the corresponding IUPred predictions; residues with a score >0.5 in darker red, while residues with a score between 0.5 and 0.4 in lighter red. The structures of the corresponding E. coli proteins (PDB: 2NWC for GroEL, 2KHO for DnaK and 1QVC for SSB) are also depicted in light grey with the conserved disordered segments marked by red or added as red dashed lines. In the heptameric GroEL and tetrameric SSB structures one chain is depicted by darker grey than the others