| Literature DB >> 26252577 |
Esmeralda Vicedo1, Avner Schlessinger2, Burkhard Rost3.
Abstract
Many prokaryotic organisms have adapted to incredibly extreme habitats. The genomes of such extremophiles differ from their non-extremophile relatives. For example, some proteins in thermophiles sustain high temperatures by being more compact than homologs in non-extremophiles. Conversely, some proteins have increased volumes to compensate for freezing effects in psychrophiles that survive in the cold. Here, we revealed that some differences in organisms surviving in extreme habitats correlate with a simple single feature, namely the fraction of proteins predicted to have long disordered regions. We predicted disorder with different methods for 46 completely sequenced organisms from diverse habitats and found a correlation between protein disorder and the extremity of the environment. More specifically, the overall percentage of proteins with long disordered regions tended to be more similar between organisms of similar habitats than between organisms of similar taxonomy. For example, predictions tended to detect substantially more proteins with long disordered regions in prokaryotic halophiles (survive high salt) than in their taxonomic neighbors. Another peculiar environment is that of high radiation survived, e.g. by Deinococcus radiodurans. The relatively high fraction of disorder predicted in this extremophile might provide a shield against mutations. Although our analysis fails to establish causation, the observed correlation between such a simplistic, coarse-grained, microscopic molecular feature (disorder content) and a macroscopic variable (habitat) remains stunning.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26252577 PMCID: PMC4529154 DOI: 10.1371/journal.pone.0133990
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Distribution of disorder content in different organisms.
Fractions of proteins with long regions of disorder (here ≥30 consecutive residues) were predicted by three prediction methods (MD, NORSnet and IUPred). (A) The raw values are standardized using the Z-scores (Eq 1; mean and standard deviation σ from a 1613 prokaryotes calculated for each method; positive: higher than the mean; negative: below the mean; integers +/- N imply N*σ above/below the mean). The top panel shows the extremophiles; the lower panel shows the closest phylogenetic relative for each extremophile in the top panel (for relatives discussed in the text and left out for clarity from the figure, for all studied organisms S3 Fig). The archaeal halophiles Haloarcula marismortui ATCC 43049 and Halobacterium sp. NRC-1 were predicted with the highest content of proteins with long disorder. Conversely, the archaeal thermophile Aeropyrum pernix K1 was one of the organisms predicted with the lowest disorder. The taxonomic neighbors section compares the disorder predicted for the closest relatives of the extremophiles. (B-D) Mapping of disorder protein content predictions for all organisms for each prediction method (B: MD [42], C: NORSnet [6], and D: IUPred Clearly, all three methods put the thermophiles on the left (less disorder), while the halophiles appear on the right (high disorder). The blue curves are Gaussian fits based on the mean and σ of our data.
Protein disorder overlap between related proteins in opposite extremophiles.
| HVAL |
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
| -20 | 75.5 ±0.2 | 9.5 ± 0.1 | 66.9 ± 0.1 | 5.53 ± 0.06 |
| -10 | 56.4 ±0.2 | 6.8 ± 0.1 | 55.7 ± 0.2 | 5.04 ± 0.08 |
| 0 | 24.0 ±0.1 | 4.9 ± 0.2 | 30.9 ± 0.1 | 2.7 ± 0.1 |
| 10 | 5.5 ±0.1 | 2.6 ± 0.2 | 9.7 ± 0.1 | 0.51 ± 0.06 |
| 20 | 0.6 ±0.02 | 0.07 ± 0.02 | 1.28 ± 0.03 | 0 |
| 30 | 0.04±0.01 | 0 | 0.20 ± 0.01 | 0 |
a HVAL measures sequence similarity as the distance from the HSSP-curve [55, 89]; e.g. HVAL = 0 implies 20% pairwise sequence identity (PIDE) for >250 aligned residues [57] (or 20+N% PIDE at HVAL = N).
b related gives the percentage of proteins in one organism (CP: Colwellia psychrerythraea 34H or PH: Pyrococcus horikoshii) that have corresponding homologs in the other (PH or CP) at the given HVAL (totals: CP = 4423 and PH = 1573). For instance, 24% of all 4423 CT proteins have a match in one of the 1573 PH proteins at HVAL≥0, while almost 31% of the PH proteins have a homolog in CP at this level of sequence relation. One standard error is marked as ‘±stderr’.
c related+disordered gives the percentage of proteins in one organism (CP or PH) that are related and have at least one disordered region (>30 residues, prediction by MD; other methods and thresholds in SOM) in the other (PH or CP) at the given HVAL . Overall MD predicts 12% of all Colwellia psychrerythraea 34H and 8% of all Pyrococcus horikoshii OT3 proteins to have at least one long disordered region (Table 1; cold = high disorder, heat = low). These numbers imply that the proteins shared between the two extremophiles from opposite ends of the temperature spectrum are depleted in disorder with respect to the entire proteome. For instance only 4.9% are related and disordered from the CP perspective at HVAL≥0 as opposed to 12% for all proteins. The more similar the homologs the more the related proteins were selected to not contain disorder. One standard error is marked as ‘±stderr’.
Fig 2Protein disorder content differs for habitat, not for phyla.
We represent the protein disorder content for the organisms in similar habitats (left panel) and those in the same phyla (right panel). The y-axes give the percentage of proteins with at least one region of ≥30 consecutive residues predicted as disordered by MD (A), NORSnet (B) and IUPred (C). The x-axis on the left side marks the different environmental groups (S2 Table); on the right side marks the studied phylogenetic groups (S14 Table). The groups which are significant for a paired Wilcoxon Test are marked with * (P<0.05) or ** (P<0.005).
Fig 3Protein disorder linked to habitat more than to phylogeny.
The fractions of proteins with long disordered regions are predicted by two disorder predictor methods (MD in green bars and IUPred in red bars). Eukaryotes are predicted with substantially more disorder than prokaryotes. Within the kingdoms predictions vary greatly: organisms in similar habitats tend to resemble each other in terms of disorder more than they resemble their closest phylogenetic relatives. (A) Hyperthermophilic archaea (dark red) are more ordered than their phylogenetic neighbors; halophilic archaea are more disordered (green). (B) Halophilic bacteria also appear more disordered than their relatives. (C) The bacterial thermophile (red) also has less disorder than its relatives. Other extreme organisms included: psychrophile (blue), psychrotolerant (light blue), radiation resistant (purple) and alkalophile (pink). We could also find organisms with relative high/low disorder content explainable separately.