| Literature DB >> 17430559 |
K Mizuguchi1, M Sele, M V Cubellis.
Abstract
BACKGROUND: Thermophilic organisms are able to live at high temperatures ranging from 50 to > 100 degrees C. Their proteins must be sufficiently stable to function under these extreme conditions; however, the basis for thermostability remains elusive. Subtle differences between thermophilic and mesophilic molecules can be found when sequences or structures from homologous proteins are compared, but often these differences are family-specific and few general rules have been derived. The availability of complete genome sequences has now made it feasible to perform a large-scale comparison between mesophilic and thermophilic proteins, the latter of which primarily come from archaeal genomes although a few complete genomes of thermophilic eubacteria are also available.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17430559 PMCID: PMC1885844 DOI: 10.1186/1471-2105-8-S1-S15
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
G/C content and Optimal Growth Temperature of the organisms analysed in this paper.
| Aeropyrum pernix K1 | 56 | 90–95 |
| Methanocaldococcus jannaschii DSM 2661 | 31 | 85 |
| Methanothermobacter thermautotrophicus str. Delta_H | 49 | 65–70 |
| Archaeoglobus fulgidus DSM 4304 | 48 | 83 |
| Thermoplasma acidophilum DSM 1728 | 45 | 59 |
| Thermoplasma volcanium GSS1 | 39 | 60 |
| Sulfolobus solfataricus P2 | 35 | 85 |
| Pyrococcus furiosus DSM 3638 | 40 | 100 |
| Methanopyrus kandleri AV19 | 61 | 98 |
| Picrophilus torridus DSM 9790 | 35 | 60 |
| Thermotoga maritima MSB8 | 46 | 80 |
| Aquifex aeolicus VF5 96 | 43 | 96 |
| Thermoanaerobacter tengcongensis MB4 | 37 | 75 |
| Thermosynechococcus elongatus BP-1 | 53 | 55 |
| Thermus thermophilus HB27 | 69 | 68 |
| Geobacillus kaustophilus HTA426 | 52 | 55 |
| Thermobifida fusca YX | 67 | 50–55 |
Data were obtained from NCBI genome database [32] with the exception of G/C content and OGT of Geobacillus kaustophilus, which were taken from the DSMZ database of organisms (Braunschweg, Germany) [37].
Figure 1Amino acid composition in percent. Bars in blue are for mesophilic, in green for thermophilic archaeal, and in yellow for thermophilic eubacterial proteins. Dots indicate values that significantly differ (P < 0.01) between thermophilic and mesophilic proteins.
Environment specific amino acid compositions in percent.
| HAH | 2.4 | 2.3 | EAH | 2.9 | 2.4 | CAH | 2.4 | 2.2 | 2.8 | ||
| HAE | 11.3 | EAE | 7.4 | 9.3 | CAE | 6.6 | |||||
| HAK | 9.3 | EAK | 8.1 | 7.2 | CAK | 7.4 | |||||
| HAR | 7.4 | 8.5 | EAR | 6.8 | 7.8 | CAR | 5.5 | 6.5 | |||
| HAD | 7.1 | 7.0 | EAD | 4.7 | 5.7 | CAD | 8.7 | 9.5 | |||
| HAS | 5.5 | 5.4 | EAS | 6.9 | 5.1 | CAS | 8.0 | ||||
| HAN | 4.5 | 4.2 | EAN | 3.8 | 3.5 | CAN | 6.4 | 5.9 | |||
| HAQ | 5.8 | EAQ | 4.3 | CAQ | 3.9 | ||||||
| HAC | 0.7 | EAC | 1.4 | 0.4 | 0.5 | CAC | 1.2 | ||||
| HAT | 4.4 | EAT | 9.4 | 7.0 | CAT | 7.1 | |||||
| HAP | 3.2 | 3.0 | 3.6 | EAP | 3.0 | 3.3 | 3.9 | CAP | 8.2 | 7.6 | |
| HAA | 9.9 | 10.6 | EAA | 4.6 | 4.0 | 5.4 | CAA | 6.6 | 6.5 | ||
| HAG | 3.5 | 3.6 | 3.7 | EAG | 3.4 | 3.6 | 3.9 | CAG | 6.4 | 6.4 | 6.8 |
| HAI | 3.7 | 3.8 | EAI | 5.6 | 6.2 | CAI | 3.1 | 3.3 | |||
| HAV | 4.4 | 4.7 | 4.8 | EAV | 8.9 | 10.0 | CAV | 4.5 | 4.7 | 4.7 | |
| HAL | 7.5 | 7.5 | 8.3 | EAL | 6.1 | 5.7 | 7.7 | CAL | 5.3 | 5.6 | |
| HAM | 1.9 | 1.8 | EAM | 1.6 | 1.8 | 1.4 | CAM | 1.5 | 1.7 | ||
| HAF | 2.8 | 2.7 | 2.7 | EAF | 4.1 | 3.7 | 3.7 | CAF | 2.9 | 3.0 | 2.8 |
| HAY | 3.3 | 3.1 | EAY | 5.5 | 5.8 | 5.0 | CAY | 3.3 | 3.2 | ||
| HAW | 1.3 | 1.0 | 1.1 | EAW | 1.7 | 1.0 | 1.2 | CAW | 1.1 | 0.9 | 0.9 |
| HaH | 1.8 | 1.5 | EaH | 1.6 | 1.2 | 1.2 | CaH | 2.8 | 2.3 | 2.7 | |
| HaE | 2.1 | 2.7 | EaE | 1.6 | 1.7 | 1.6 | CaE | 2.5 | 3.0 | 2.8 | |
| HaK | 1.5 | 2.1 | 1.7 | EaK | 1.1 | 1.3 | CaK | 1.6 | 1.9 | ||
| HaR | 2.2 | 2.3 | 2.5 | EaR | 1.7 | 1.6 | 1.7 | CaR | 2.2 | 2.5 | 2.6 |
| HaD | 2.0 | 2.2 | 1.9 | EaD | 2.0 | 2.1 | 2.1 | CaD | 4.3 | 3.8 | 4.1 |
| HaS | 3.9 | 4.6 | 3.3 | EaS | 3.8 | 3.5 | CaS | 6.5 | 6.4 | ||
| HaN | 1.8 | 1.8 | 1.6 | EaN | 1.8 | 1.7 | 1.6 | CaN | 3.7 | 3.4 | 2.8 |
| HaQ | 1.8 | 1.3 | 1.6 | EaQ | 1.5 | CaQ | 1.8 | 1.4 | 1.5 | ||
| HaC | 2.8 | EaC | 3.1 | 1.0 | 1.2 | CaC | 4.1 | ||||
| HaT | 4.3 | 4.5 | 4.3 | EaT | 4.7 | 4.4 | 4.5 | CaT | 6.2 | 6.4 | 6.3 |
| HaP | 1.7 | 1.9 | 2.1 | EaP | 1.6 | 2.0 | 2.0 | CaP | 7.1 | 7.9 | 7.7 |
| HaA | 14.6 | 16.8 | 17.1 | EaA | 8.0 | 8.8 | 9.3 | CaA | 8.9 | 9.4 | 10.7 |
| HaG | 4.8 | 5.2 | 5.6 | EaG | 4.8 | 5.5 | 4.8 | CaG | 7.0 | 7.9 | 7.8 |
| HaI | 9.8 | 9.3 | EaI | 13.4 | 14.7 | CaI | 7.1 | 7.9 | |||
| HaV | 10.3 | 10.4 | 11.2 | EaV | 17.7 | CaV | 8.8 | 9.5 | |||
| HaL | 18.2 | 19.0 | EaL | 14.0 | 11.5 | 14.8 | CaL | 11.2 | 10.2 | 11.7 | |
| HaM | 3.9 | 3.7 | EaM | 2.7 | 2.8 | 2.5 | CaM | 2.8 | 2.6 | 2.6 | |
| HaF | 6.4 | 5.0 | 5.2 | EaF | 7.8 | CaF | 6.0 | 5.2 | 5.3 | ||
| HaY | 3.9 | 3.8 | EaY | 5.0 | 4.1 | 3.7 | CaY | 3.8 | 4.0 | 3.4 | |
| HaW | 2.2 | EaW | 2.0 | CaW | 1.8 | ||||||
Mes stands for amino acid composition of mesophilic proteins, t_arc for amino acid composition of thermophilic archaeal proteins and t_eu for amino acid composition of thermophilic eubacterial proteins. HA stands for exposed alpha helices, Ha for non exposed alpha helices, EA for exposed beta strands, Ea for non exposed beta strands, CA for exposed coil and Ca for non exposed coil. The third letter is the standard code for amino acids. Values in bold significantly differ (P < 0.01) between thermophilic and mesophilic proteins.
Likelihoods of environment specific amino acid substitutions (in percent) that are large and significantly different between mesophiles-mesophiles and mesophiles-thermophilic archaeal homologues.
Mes stands for mesophilic proteins, t_arc for thermophilic archaeal proteins, HA for exposed alpha helices, Ha for non exposed alpha helices, EA for exposed beta strands, Ea for non exposed beta strands, CA for exposed coil and Ca for non exposed coil. Data are shown only if P < 0.01 in the two-tailed t-test and if the difference between mes and t_arc are, in absolute value, larger than 2. Environment specific amino acid substitutions with higher likelihood values in mesophiles-thermophilic than in archaeal homologues are in italics, those with higher likelihood values in mesophiles-mesophiles homologues are in bold. Data are sorted by increasing differences between mes and t_arc.
Likelihoods of environment-specific amino acid substitutions (in percent) that are large and significantly different between mesophiles-mesophiles and mesophiles-thermophilic eubacterial homologues.
| S→A | 5,6 | 7,9 | |||||||||||||||
Mes stands for mesophilic proteins, t_eu for thermophilic eubacterial proteins, HA for exposed alpha helices, Ha for non exposed alpha helices, EA for exposed beta strands, Ea for non exposed beta strands, CA for exposed coil and Ca for non exposed coil. Data are shown only if P < 0.01 in the two-tailed t-test and if the difference between mes and t_eu are, in absolute value, larger than 2. Environment specific amino acid substitutions with higher likelihood values in mesophiles-thermophilic than in eubacterial homologues are in italics, those with higher likelihood values in mesophiles-mesophiles homologues are in bold. Data are sorted by increasing differences between mes and t_eu.