Literature DB >> 19133128

Comparative proteome analysis of psychrophilic versus mesophilic bacterial species: Insights into the molecular basis of cold adaptation of proteins.

Raghu Prasad Rao Metpally1, Boojala Vijay B Reddy.   

Abstract

BACKGROUND: Cold adapted or psychrophilic organisms grow at low temperatures, where most of other organisms cannot grow. This adaptation requires a vast array of sequence, structural and physiological adjustments. To understand the molecular basis of cold adaptation of proteins, we analyzed proteomes of psychrophilic and mesophilic bacterial species and compared the differences in amino acid composition and substitution patterns to investigate their likely association with growth temperatures.
RESULTS: In psychrophilic bacteria, serine, aspartic acid, threonine and alanine are overrepresented in the coil regions of secondary structures, whilst glutamic acid and leucine are underrepresented in the helical regions. Compared to mesophiles, psychrophiles comprise a significantly higher proportion of amino acids that contribute to higher protein flexibility in the coil regions of proteins, such as those with tiny/small or neutral side chains. Amino acids with aliphatic, basic, aromatic and hydrophilic side chains are underrepresented in the helical regions of proteins of psychrophiles. The patterns of amino acid substitutions between the orthologous proteins of psychrophiles versus mesophiles are significantly different for several amino acids when compared to their substitutions in orthologous proteins of within the mesophiles or psychrophiles.
CONCLUSION: Current results provide quantitative substitution preferences (or avoidance) of amino acids that lead to the adaptation of proteins to cold temperatures. These finding would help future efforts in selecting mutations for rational design of proteins with enhanced psychrophilic properties.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19133128      PMCID: PMC2653534          DOI: 10.1186/1471-2164-10-11

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Microorganisms that live under forbidding conditions are called extremophiles, whose discovery points out the unique adaptability of primitive life-forms. These microorganisms are grouped according to their optimal growth conditions in which they exist such as acidophiles (exhibiting optimum growth in acidic pH conditions), alkaliphiles (thriving in alkaline pH conditions), barophiles (surviving under great pressures), endoliths (living in deep inside rocks), halophiles (thriving in high salt concentrations), psychrophiles (optimal temperature below 20°C), and the thermophiles (optimal temperature between 45–80°C), hyperthermophiles (optimal temperature above 80°C) [1]. The largest coverage of known extremophile conditions of the earth's biosphere is below 10°C. For example, three fourths of earth is covered by oceans, which maintain an average temperature of one to three degrees centigrade. Furthermore, the vast land areas of the Arctic and Antarctic are permanently frozen throughout the year [1]. Other few examples of cryo habitats include cold deserts, high alpine soils, sea ice, cold caves, marine sediments, permafrost soils, glacier, snow etc. The majority of known psychrophiles belong to varieties of archaea and bacteria, and a few species of yeast, fungi and algae [2]. The ability to thrive at life-endangering effects of low temperatures, close to freezing point of water, requires a vast array of adaptations from all their cellular components, including their membranes, energy-generating systems, protein synthesis machinery, biodegradative enzymes and the components responsible for nutrient uptake etc., to maintain metabolism, sustain growth and reproduction compatible with life in these low temperature conditions [3,4]. Having evolved with special mechanisms, the psychrophiles successfully colonized these niches [2,5]. Psychrophilic proteins display sequences and structures comparable with those of their meso and (hyper) thermophilic homolog's, especially enzymes with their ability to work efficiently as catalysts at low temperatures [6]. The thermolability of these proteins at moderate temperatures warrant tremendous industrial applications in biotechnology, bioremediation, food, textiles, detergents bio-catalysis under low-water conditions and detergents etc [5-9]. Due to above facts, historically starting from mid-1970's, much attention was paid mainly to sequence and structural attributes contributing to adaptation of proteins (mainly enzymes) to high temperature conditions. Many investigators have compared sequence and structure-based parameters among thermophilic and mesophilic proteins [10]. With the advent of pioneering efforts in late 1990's in solving three dimensional structures of cryophilic enzymes such as alpha-amylase [11]; alkaline protease [12]; triose phosphate isomerase [13]; malate dehydrogenase [14] from Antarctic microorganisms, and due to handful of available structures in the protein data bank (PDB), groups have focused to address the structural basis of proteins in cold adaptation [4,15-20]. The steady increase in sequencing of proteomes of extremophiles has opened many new avenues in understanding adaptations to extreme conditions [16,21-25]. A comprehensive comparison of global amino acid preferences and substitution patterns as deduced from proteomes of different organisms is now possible [26-28]. Using homologous sequences, clustering along with various statistical methods; we conducted an extensive analysis of proteomes of psychrophilic, mesophilic, thermophilic and hyperthermophilic microorganisms to examine a possible correlation of amino acid substitution patterns with adaptation to their respective optimal growth conditions. In this manuscript we discuss the results from comparative analysis of fully sequenced proteomes of six members from each of psychrophilic and mesophilic organisms.

Results

On average we analyzed 2,816 proteins with 875,219 amino acids per proteome of mesophiles and 3665 proteins with 1,169,678 amino acids per proteome of psychrophiles. The amino acid (AA) frequencies given in Table 1 show that some of the AA differed significantly in psychrophile proteomes when compared to mesophile proteomes. When compared to psychrophiles, the mesophile proteomes show larger standard deviation for residues indicating that the six proteomes of mesophiles we used are considerably more divergent than the proteomes of psychrophiles. The frequencies of individual amino acids as well as property groups were further analyzed with student t-test.
Table 1

The composition of individual amino acids and property groups in protein sequences of psychrophilic and mesophilic proteomes.

Amino AcidsPsychrophilesaMesophilesb
P1P2P3P4P5P6AvgSDM1M2M3M4M5M6AvgSDt-test
Ala (A)8.18.59.28.48.912.39.21.68.36.89.58.49.16.78.11.21.373
Cys (C)1.01.40.91.11.00.71.00.21.01.11.21.01.10.61.00.20.146
Asp (D)5.65.16.05.45.85.35.50.35.04.85.15.05.05.95.10.41.801
Glu (E)5.96.35.55.95.85.65.80.36.56.95.86.16.26.66.30.4-2.547
Phe (F)4.44.33.74.44.33.44.10.44.45.43.94.04.14.34.40.6-1.006
Gly (G)6.47.56.66.56.88.47.10.86.75.87.47.36.76.36.70.60.922
His (H)2.22.02.32.12.31.92.10.22.12.12.31.92.41.82.10.20.206
Ile (I)7.27.16.97.46.25.06.60.97.17.26.06.36.07.96.80.8-0.298
Lys (K)6.15.75.26.15.13.35.21.06.38.94.44.34.98.16.22.0-1.019
Leu (L)10.310.410.110.710.110.410.30.210.511.210.711.410.89.610.70.6-1.370
Met (M)2.42.62.82.42.52.02.50.32.42.32.82.02.72.62.50.3-0.137
Asn (N)5.13.84.64.84.52.94.30.84.95.93.94.13.95.64.70.9-0.883
Pro (P)3.53.94.03.63.95.04.00.53.73.34.45.14.03.24.00.70.140
Gln (Q)4.53.74.74.44.83.74.30.54.63.74.45.55.23.64.50.8-0.602
Arg (R)3.85.04.44.04.46.14.60.94.53.55.55.15.03.84.60.80.115
Ser (S)7.26.66.66.76.96.86.80.25.86.85.85.96.36.16.10.43.684
Thr (T)5.65.35.85.45.55.65.50.25.24.45.45.55.25.55.20.41.779
Val (V)6.66.76.66.46.98.06.90.66.75.67.16.67.06.96.60.50.724
Trp (W)1.11.01.21.11.21.51.20.21.10.71.51.61.31.01.20.3-0.023
Tyr (Y)3.13.03.13.13.12.12.90.43.13.72.82.93.04.03.30.5-1.310
Amino Acid Property Groups
Tiny28.329.429.128.129.033.829.62.127.024.929.228.028.425.127.11.82.235
Small49.148.850.348.450.255.150.32.447.344.449.848.948.446.647.61.92.166
aliphatic24.124.223.524.523.223.423.80.524.324.023.824.323.924.424.10.3-1.157
aromatic10.810.310.210.710.98.910.30.810.711.910.510.410.811.110.90.6-1.559
non polar54.256.555.055.255.058.955.81.755.153.157.356.655.952.955.11.80.615
Polar45.843.545.044.845.041.144.21.744.946.942.743.444.147.144.81.8-0.601
charged23.424.123.323.423.422.223.30.624.326.123.122.423.526.224.31.6-1.414
Basic12.012.711.812.211.811.312.00.512.814.512.211.312.313.712.81.2-1.663
Acidic11.511.411.511.211.610.911.30.311.511.610.911.111.212.511.50.5-0.541
Neutral25.925.226.025.226.326.425.80.524.422.825.326.125.823.324.61.32.057
hydrophilic30.829.630.330.630.426.929.81.531.833.629.230.230.233.731.41.9-1.696
hydrophobic44.345.044.445.144.245.444.70.544.744.045.544.245.243.544.50.80.527

The average (Avg) values among each set of proteomes along with their standard deviations (SD) are also given. Significant compositional differences as indicated by t-test parameter are shown in bold.

: Colwellia psychrerythraea 34H (Abbreviation: Cpsy-P1), Desulfotalea psychrophila LSv54 (Dpsy-P2), Psychrobacter cryohalolentis K5 (Pcry-P3), Psychromonas ingrahamii 37 (Ping-P4)Pseudoalteromonas atlantica T6c (Patl-P5) and Renibacterium salmoninarum ATCC 33209 (Rsal-P6).

: Haemophilus influenzae (Hinf-M1), Synechocystis PCC680 (Spcc-M2), Helicobacter pylori 26695 (Hpyl-M3), Escherichia coli K12 (Ecol-M4), Vibrio cholerae (Vcho-M5), Lactobacillus salivarius UCC11 (Lsal-M6).

The composition of individual amino acids and property groups in protein sequences of psychrophilic and mesophilic proteomes. The average (Avg) values among each set of proteomes along with their standard deviations (SD) are also given. Significant compositional differences as indicated by t-test parameter are shown in bold. : Colwellia psychrerythraea 34H (Abbreviation: Cpsy-P1), Desulfotalea psychrophila LSv54 (Dpsy-P2), Psychrobacter cryohalolentis K5 (Pcry-P3), Psychromonas ingrahamii 37 (Ping-P4)Pseudoalteromonas atlantica T6c (Patl-P5) and Renibacterium salmoninarum ATCC 33209 (Rsal-P6). : Haemophilus influenzae (Hinf-M1), Synechocystis PCC680 (Spcc-M2), Helicobacter pylori 26695 (Hpyl-M3), Escherichia coli K12 (Ecol-M4), Vibrio cholerae (Vcho-M5), Lactobacillus salivarius UCC11 (Lsal-M6).

Amino acid composition preferences

The t-test results demonstrate significant preferences in frequencies of amino acid occurrences and property groups in psychrophilic proteomes as compared to mesophilic proteomes or vice versa (Table 1). The compositional trend of AA is somewhat similar in both types of genomes. However, as indicated by t-values from Table 1, there are a few AA residues such as A, D, S and T, significantly preferred in psychrophiles as compared to mesophiles. On the other hand, AA residues E and L are significantly less favored in psychrophile proteomes. When comparing frequencies of occurrences of property groups of AAs, we observe that tiny/small and neutral amino acid groups are significantly preferred in psychrophiles where as charged, basic, aromatic and hydrophilic groups are significantly less favored as shown by their corresponding t-values in Table 1. When we compared the AA compositions of the sequences in alignments of respective orthologous proteins alone (the data not given), we observed similar trends.

Secondary Structural Elements

The composition of AA of psychrophilic and mesophilic proteomes in three major secondary structural elements, α-helices, β-sheets and coils, is given in Table 2. Collectively taken, the psychrophilic proteomes contain significantly less number of residues (~2%) in the α-helices and significantly more number of residues (~2%) in the coil regions. The majority of amino acids exhibit similar compositions in either of the two genome sequences. However, amino acids E, F L, N and Y show significantly low frequencies in α-helices of psychrophilic proteomes and amino acids A, D, G, S, T, and V are significantly high in the coil region of psychrophilic proteomes. The amino acid, E is significantly low in the coil region of psychrophilic proteomes. Except in an increase in Alanine residues, β-sheets of psychrophile proteomes did not show any significant changes as compared to mesophiles.
Table 2

Distribution of amino acids and property group parameters in the predicted secondary structural elements of psychrophilic and mesophilic proteomes.

α-Helixβ-SheetCoil
Amino AcidsPsychroMesoPsychroMesoPsychroMeso
AvgSDAvgSDt-valAvgSDAvgSDt-valAvgSDAvgSDt-val
Ala (A)5.230.954.790.090.9061.150.071.050.121.7452.830.612.270.561.963
Cys (C)0.410.110.420.00-0.0970.220.050.230.04-0.0260.370.080.350.020.524
Asp (D)1.900.131.880.000.1760.520.090.500.020.4663.090.232.750.342.660
Glu (E)3.040.223.420.02-2.4800.700.090.740.04-1.0862.090.102.18-0.08-1.487
Phe (F)2.030.242.240.01-1.3780.970.171.020.15-0.5261.070.071.10-0.03-0.544
Gly (G)1.810.191.870.01-0.5930.660.080.680.05-0.4944.580.714.110.471.392
His (H)0.830.080.830.000.0470.360.040.350.030.3660.920.070.910.010.295
Ile (I)3.260.493.450.03-0.7221.970.291.990.20-0.1341.380.161.320.060.734
Lys (K)2.480.543.000.17-1.1230.650.160.770.21-1.0432.110.362.41-0.30-0.876
Leu (L)6.030.236.420.01-2.6192.010.142.040.19-0.2942.290.132.240.060.612
Met (M)1.220.161.260.01-0.3630.380.050.380.05-0.2450.860.050.840.020.633
Asn (N)1.360.271.590.02-1.3830.410.120.450.07-0.7252.510.462.67-0.16-0.572
Pro (P)0.980.071.010.01-0.4090.260.030.280.04-0.9502.760.512.650.110.353
Gln (Q)2.270.212.450.03-0.9560.530.100.540.08-0.3591.510.181.53-0.02-0.166
Arg (R)2.310.442.320.03-0.0410.700.060.710.12-0.1751.610.371.530.070.383
Ser (S)2.650.172.550.010.7980.900.140.810.101.1903.260.222.760.504.981
Thr (T)2.070.112.060.010.1331.150.091.070.101.3352.320.202.060.252.418
Val (V)2.980.252.970.010.0382.340.202.280.150.5981.560.191.380.181.912
Trp (W)0.620.090.660.01-0.4310.250.040.240.070.1860.320.060.300.020.598
Tyr (Y)1.320.191.510.01-1.5540.750.150.830.11-1.0660.860.090.92-0.06-0.979
All AA44.811.5646.730.18-2.50716.881.5816.980.34-0.15338.311.9036.292.022.379
Amino Acid Property Groups
Tiny12.180.9011.700.140.9134.080.363.840.241.33613.371.5411.560.112.537
Small19.390.9519.160.180.4157.620.587.370.310.94623.292.0321.020.182.435
Aliphatic12.270.5612.850.01-2.2796.330.316.320.150.0905.230.224.940.012.443
Non-polar25.901.0726.620.14-1.24110.970.8011.030.28-0.17118.891.8017.490.241.594
Aromatic4.810.395.240.01-2.2352.320.362.440.17-0.7193.180.173.240.00-0.651
Polar18.910.9520.110.21-1.9875.910.785.950.14-0.12919.420.8318.800.091.340
Charged10.560.5711.460.17-1.9232.930.353.060.14-0.8959.820.169.780.050.192
Basic5.610.366.150.06-1.8631.710.171.820.11-1.3534.640.074.850.04-1.041
Acidic4.940.225.310.03-1.8921.220.181.240.03-0.3355.190.214.930.012.287
Neutral9.640.369.770.05-0.4693.590.433.460.160.67412.590.9811.380.092.430
H-philic13.360.6914.670.18-2.5523.510.523.720.13-0.94012.910.6713.060.11-0.356
H-phobic23.110.8523.730.09-1.35210.040.7110.060.22-0.06711.560.6610.720.022.715

Significant compositional differences as indicated by t-test parameter are shown in bold.

Distribution of amino acids and property group parameters in the predicted secondary structural elements of psychrophilic and mesophilic proteomes. Significant compositional differences as indicated by t-test parameter are shown in bold. When evaluated for the frequencies of occurrence of property groups of amino acids, the majority of them show significantly low frequencies in helices of psychrophilic proteomes. Except in tiny group of AAs, the β-sheet regions of psychrophile proteomes did not show any significant changes (Table 2). The tiny, small, hydrophobic, neutral, acidic, aliphatic, and non-polar amino acid groups showed significantly high frequencies in the coil/loop regions of psychrophilic proteomes.

Comparative Proteome Analysis

Towards identification of residue substitutions, likely to have undergone in psychrophilic proteins as the species adapted to cold temperatures, a comparative proteome analysis was performed on the basis of amino acid substitutions occurred between the orthologous protein sequences of psychrophile and mesophile proteomes. The orthologous sequence pairs for a protein sequence with significant length coverage only were considered for the analysis. Coverage of hits with respect to each proteome is shown in Table 3. On average 16.3% psychrophile proteins have orthologous proteins in mesophiles we used. This was cross-checked using mesophile proteomes as query sequences and the psychrophile proteomes as subject sequences, where we found 13.9% orthologs (Table 3). We used sequence alignments of these sequence pairs to compute substitutions of amino acids between mesophilic and psychrophilic proteomes and vice versa and the obtained values were averaged. On average, ten to twenty five percent of sequences from individual proteomes exhibited best hit homologues from members of other thermal groups (Table 3). This percent depended on the size of the proteome under consideration. The higher the number of proteins in a query proteome the higher percentage of hits from the subject proteomes searched. This may be because some of the paralogous sequences selecting the same protein as its ortholog (this redundancy was removed in our final data). We also considered homologous proteins among the psychrophiles and mesophiles, and calculated the substitutions within them to use as background substitution frequencies [see Eqn. (ii) and (iii)]. We observed, on average 16.7% and 17.1% orthologous proteins within the psychrophilic and mesophilic proteomes, respectively (Table 3).
Table 3

Percentage of orthologs (best hits) in Psychrophiles and Mesophiles with respect to total number of sequences (size) in each of organisms considered.

PsychrophilesMesophilesSize
GenomeCpsyDpsyPcryPingPatlRsalAVGHinfSpccHpylEcolVchoLsalAVG
Cpsy100.012.517.815.514.79.514.021.110.415.312.214.113.014.43545
Dpsy10.410017.614.511.49.512.722.812.120.112.313.315.516.03507
Pcry13.215.510017.915.312.614.928.813.722.315.016.317.418.94910
Ping13.815.321.010015.211.415.328.012.619.615.317.817.418.53234
Patl14.513.420.116.910010.615.123.711.716.713.915.914.816.12511
Rsal9.010.915.412.510.010011.620.311.214.611.311.316.814.34281
AVG12.213.518.415.513.310.716.724.111.918.113.314.815.816.33665
Hinf14.217.624.720.916.413.617.910014.427.619.520.321.220.61657
Spcc8.312.015.411.79.79.711.118.410016.69.910.214.013.73569
Hpyl9.614.418.113.710.710.012.725.411.710011.712.516.415.51576
Ecol11.614.219.516.813.411.714.529.711.818.710016.617.718.94243
Vcho12.814.419.417.814.610.514.927.111.618.914.910016.017.73835
Lsal8.912.816.013.010.112.212.223.711.419.411.811.910015.62017
AVG10.914.218.815.612.511.313.924.912.220.113.514.317.117.02816
Percentage of orthologs (best hits) in Psychrophiles and Mesophiles with respect to total number of sequences (size) in each of organisms considered.

Amino Acid Substitution Patterns

Log odd scores (LOS) of AA substitutions were calculated using the frequency of occurrence of substitutions among orthologous proteins of psychrophiles and mesophiles by normalizing with frequency of occurrence of substitutions within the proteomes of the same temperature sensitive group. In LOS calculation the substitutions influenced by factors other than temperature are nullified and values represent true substitution due to cold adaptation of species. In Table 4 we show LOSMeso values computed using equation (ii) as described in methods. These values clearly show that psychrophilic proteins avoid containing the amino acids E, F, K, N and Y. On the other hand they prefer containing the residues A, D, G, S, and T as compared to mesophile proteins. The individual values in Table 4 show that to what extant certain substitutions are favored or avoided depending on corresponding LOSMeso score +ve or -ve, respectively. For example the W in mesophiles mutating to S in the psychrophiles is highly favored with LOS scores of 10.9. On the other hand G in mesophiles mutating to K in psychrophiles is avoided with LOS score of -12.4. The LOSPsychro [Eqn. (iii)] scores calculated using substitution frequencies among psychrophiles as normalizing factor have shown similar results (data not shown).
Table 4

Log odd scores (LOS) of amino acid substitutions calculated using the equation (ii).

Psychrophiles
ACDEFGHIKLMNPQRSTVWY
MesophilesA2.0-2.53.4-5.1-4.61.4-0.6-1.3-11.2-2.5-2.6-6.4-1.6-2.9-0.31.30.8-0.40.8-7.7
C1.81.21.2-3.2-3.41.76.1-2.0-9.4-2.1-4.2-4.3-1.59.32.31.3-0.8-0.21.5-6.9
D8.22.91.1-5.1-5.24.1-3.02.0-9.5-1.1-0.6-6.0-2.9-2.72.03.82.04.11.7-6.5
E7.51.13.2-2.4-6.74.90.21.5-8.9-1.9-1.6-4.6-3.3-1.40.85.44.43.5-4.9-7.2
F4.7-0.25.5-4.5-0.13.23.40.0-8.7-0.60.4-2.70.8-0.83.05.02.41.40.0-3.2
G3.10.42.2-4.6-3.90.8-8.8-1.7-12.4-3.5-2.0-9.3-0.9-4.0-1.91.60.92.51.7-8.3
H7.50.62.9-4.6-6.24.00.4-1.7-8.2-1.0-2.0-4.0-2.0-1.60.75.81.40.90.1-7.4
I4.8-0.96.1-6.5-2.73.53.3-1.1-7.9-1.60.2-2.60.00.71.04.82.81.2-4.4-5.7
K6.62.74.8-4.8-5.34.4-3.42.4-3.70.0-1.3-3.7-2.6-3.1-0.27.05.23.5-3.1-8.8
L5.41.13.4-5.8-3.12.64.3-0.3-8.5-0.20.8-3.9-0.20.71.54.92.41.4-3.3-4.7
M3.7-3.23.9-6.2-4.23.71.3-2.5-8.7-2.20.6-2.5-0.11.31.03.21.30.7-1.7-6.2
N6.62.94.0-3.6-3.73.6-3.00.5-7.50.1-1.5-1.9-2.4-2.91.45.42.63.2-2.3-4.6
P6.70.44.6-5.4-6.74.40.0-1.6-10.5-3.01.7-5.70.20.9-0.14.13.11.0-1.6-11.0
Q8.25.05.0-4.2-5.46.0-0.81.3-7.2-0.3-0.1-4.00.3-0.51.16.05.24.20.1-7.0
R6.82.95.0-5.4-5.25.4-2.4-0.7-6.8-2.2-2.8-2.50.4-1.80.56.15.02.1-2.5-7.8
S3.1-0.72.4-4.8-2.83.1-0.60.8-8.0-1.2-1.1-4.1-0.5-2.60.11.50.62.40.6-5.4
T3.90.02.8-4.1-3.53.8-0.7-1.5-9.3-1.0-0.7-3.30.4-0.61.62.60.71.3-3.2-7.6
V2.60.13.6-5.4-3.74.41.1-1.8-9.3-2.1-0.1-2.9-0.40.52.13.61.80.3-0.6-8.1
W9.65.45.0-4.9-2.83.32.9-1.1-11.10.51.1-4.20.44.31.810.91.02.43.3-4.1
Y5.80.64.2-2.8-0.34.13.90.3-7.2-0.70.5-3.4-0.9-0.13.47.63.02.3-0.4-1.2
All3.80.32.5-3.8-1.91.8-0.2-0.9-6.9-0.90.0-3.7-0.4-1.30.63.21.81.00.7-3.8
Log odd scores (LOS) of amino acid substitutions calculated using the equation (ii). Student t-test was further applied to evaluate level of significance of LOS substitution scores and these data were shown in Additional files (Additional file 1 &2). Table Additional file 1(a) shows t-values for the LOSMeso scores and Additional file 1(b) shows t-values for the LOSPsychro scores. The LOSMeso scores (Table 4) with their corresponding t-values (Additional file 1) greater than 1.37 or less than -1.37 could be considered as significantly preferred or avoided, respectively, at 90% confidence level (shaded in color, Table 4). It can be seen from the table that there are about 45% substitutions that are shown to be significant. Further, the LOSMeso for AA property group substitutions are given in Table 5 and their corresponding t-values are given in Additional file 2. It is clearly seen from Table 5 that there is high preference for tiny, small and neutral AAs whereas charged (including both basic and acidic) aromatic and hydrophilic AAs are avoided significantly in the psychrophiles as compared to mesophiles.
Table 5

Log odd scores (LOS) of amino acid substitutions calculated using the Eqn (ii).

Psychrophiles
TSAlArNpPCBANHlHb
MesophilesTiny (T)1.471.12-0.68-3.860.79-0.94-3.31-5.40-1.320.87-3.670.74
Small (S)1.891.12-0.59-3.930.58-0.86-2.03-4.96-0.481.11-2.410.44
Aliphatic (Al)3.341.60-0.40-3.19-0.29-0.10-2.60-2.66-2.502.69-2.52-0.33
Aromatic(Ar)4.672.750.05-0.40-0.020.12-0.41-0.32-0.731.94-2.05-0.11
Nonpolar (Np)2.011.31-0.49-1.490.07-1.01-2.91-3.93-1.611.36-3.31-0.08
Polar (P)3.502.030.59-2.661.65-0.84-1.66-2.55-0.761.44-1.911.51
Charged (C)5.522.490.68-2.401.69-1.13-1.54-2.24-0.831.69-1.871.69
Basic (B)5.753.440.47-2.071.15-1.16-1.72-1.76-1.501.56-2.070.89
Acidic (A)5.291.941.00-3.592.39-1.10-1.36-4.84-0.701.86-1.682.84
Neutral (N)1.781.390.16-2.121.02-0.69-2.26-3.11-1.100.95-2.680.98
H-philic (Hi)5.532.410.97-3.902.14-1.07-1.64-2.66-0.721.76-1.782.17
H-phobic (Hb)2.381.55-0.48-1.32-0.08-0.57-2.57-2.98-1.981.97-2.90-0.14
All2.561.60-0.34-1.890.42-0.91-1.83-2.79-0.871.34-2.160.25

The +ve values indicates that the corresponding substitution of mesophilic amino acid property groups to the respective psychrophilic groups is higher than such substitutions within the mesophilic homologous proteins. The values shown to be significant at 90% confidence level by student t-test are shown in bold.

Log odd scores (LOS) of amino acid substitutions calculated using the Eqn (ii). The +ve values indicates that the corresponding substitution of mesophilic amino acid property groups to the respective psychrophilic groups is higher than such substitutions within the mesophilic homologous proteins. The values shown to be significant at 90% confidence level by student t-test are shown in bold.

Discussion

Our objective in this study was to analyze systematically the compositional variation and substitution preferences of amino acids in proteomes of psychrophiles compared to the proteomes of mesophiles to investigate general proteome wide characteristics for cold adaptation. We considered total compositional differences in proteomes as well as compositional differences in their orthologous proteins alone. We performed analysis at different levels, through simple amino acid compositions, student t-test and finally by substitution patterns in their orthologous proteins. Some of the methods we used were previously applied [26,29-31] but not to the complete proteome analysis. In psychrophiles individual residue compositions show that there is a significant preference for A, D, S and T content and significant avoidance of E and L content and moderate preference for G and avoidance for F and K content (Table 1). All these residue preferences and avoidance directly show a strong correlation with respect to avoidance for helical content in psychrophiles, as S, D and G are helix breakers [32] and T is a helix indifferent. Likewise, the presence of E tends to favor formation of helical structures and L tends to stabilize helical structures [32] that are highly avoided in psychrophiles. Amino acid D is observed to be unstable at high temperatures and therefore its frequency observed to decrease as optimal growth temperature of organisms increase [33]. Reverse trends are observed for E to counter the trend in favor of making ion pair interactions to form salt-bridges at higher temperatures [34,35]. Helix destabilizing beta-branched residues (I, T and V) are preferred in beta sheets and loop regions of psychrophilic proteins [36,37]. The substitution pattern in the orthologous proteins of two temperature groups show several interesting features that are not readily seen in the simple AA compositions. On other hand, they strongly support observed differences in compositions apart from giving additional insights, such as what specific substitutions were more favored or avoided as shown by LOS values.

Nonpolar Amino Acids

Our results in this study confirm that overall composition of nonpolar AA group did not show much difference in proteins belonging to psychrophiles and mesophiles, but there is a decrease in nonpolar AA group frequency in helices and a significant increase in loop regions among proteins of psychrophiles. This is in accordance with earlier findings that there are more nonpolar amino acids on the exposed surface area of the majority of psychrophilic proteins [35,38] as more loops are observed on surface regions. Among nonpolar residues, I, L and V belong to aliphatic group of residues, which are significantly reduced to favor protein flexibility. It has been widely accepted that the aliphatic amino acids would contribute to the hydrophobic interaction for maintaining conformational stability and rigidity in core region of the proteins [36,39,40]. Observed low average hydropathy and low aliphatic residues in psychrophiles are mainly contributed by significantly low in L composition [35].

Tiny and Small Amino acids

Tiny and small amino acids are those with short side chains and are unable to participate in long range interactions among secondary structural elements and are usually confined to form local interactions. Overall their compositions are significantly increased in beta sheets and loops of psychrophilic proteins over mesophilic counter parts. This is also clear from substitutions observed in orthologous proteins. The amino acid G is devoid of side chain, is more flexible with greater rotational freedom, is capable of making cavities in the core parts of the proteins structures [39] and was shown to be in less frequency in thermophiles [41]. Whereas P with pyrolidine ring structure has restricted conformations and was shown to occur in higher frequency in thermophiles [40]. Our LOS values confirm that the amino acid G is preferred and the P is avoided in psychrophiles as compared to mesophiles.

Charged amino acids

Charged residues are polar and hydrophilic. They contribute to ion pair electrostatic interactions that are important binding force for maintaining conformational stability in surface of the proteins [37,39,42]. The more charged residues were found in thermophilic proteins than in mesophilic proteins [36,40]. The charged AA (especially basic and hydrophilic) residues are significantly avoided in psychrophiles (Table 5). Present analysis also supports the notion that these residues are significantly avoided in psychrophiles as observed from compositions as well as LOS scores. The most striking feature of psychrophilic proteins is an increase in amino acid D and decrease in amino acid E over mesophiles. The charged residues in mesophiles are mainly replaced with small and tiny residues in psychrophiles.

Aromatic Amino acids

Psychrophilic proteins and their secondary structural elements show significant decrease in aromatic amino acids. Especially F and Y that are capable of binding to cationic amino acid side chains of K and R in forming cation-π interactions and play important role in stabilizing three dimensional structure of proteins [43-45]. Our studies show that, a significant decrease in aromatic residues occurs by substitution with tiny/small and neutral amino acids in psychrophiles (Table 5). Finally, our LOS values, combining with PAM or BLOSUM mutual substitution scores, assist in selecting suitable mutations in designing mesophilic proteins to optimally function in cold temperatures or vice versa. For example substitution of E in mesophilic proteins with D may not change chemical characteristics significantly but may result in optimizing protein function in cold temperatures. In similar lines we observe that amino acid K is highly avoided and R is preferred in psychrophiles although both being basic amino acids.

Conclusion

We analyzed compositions of individual amino acid residues, amino acid groups and their distribution pattern in secondary structures and then computed and quantified their substitution patterns and directional preferences between mesophiles and psychrophiles. Significant differences in composition of amino acid residues were observed between the mesophilic and psychrophilic proteins as summarized below: (i) we observed an increase in frequency of individual amino acids like A, D, S and T that avoid helices and a decrease in E and L amino acids; (ii) There is an increase in small/tiny and neutral group residues which contribute to protein flexibility and a decrease in charged amino acids, particularly basic as well as hydrophilic residues that contribute to ionic interactions; (iii) there is a decrease in aromatic amino acids residues that contribute to the cation-π interactions; (iv) there is a decrease in aliphatic residues which provide good covering and masking to produce hydrophobic pockets that are involved in stabilizing protein structure; (v) there is a reduction in amino acid preferences for helices and an increase of coil forming residues; (vi) we also observed a significant level of substitutions of aliphatic and charged amino acids by tiny/small or neutral amino acids; (vii) the results from this analysis, especially significant t-values of LOS substitution pairs, can be used as a knowledge base in rational design of mutations for engineering of mesophilic proteins to function optimally in cold temperatures or vice versa.

Methods

Proteome sequences from six members each of available completely sequenced species of psychrophiles and mesophiles (listed in the foot note of Table 1) are collected. They were selected randomly from independent genuses of mesophiles and 6 of 9 available completely sequenced genomes of psychrophiles to control plausible variations from phylogenetic non-independence (PNI) as related species may share the similar traits due to shared ancestry. These proteome sequences were downloaded from the NCBI proteome project server in the fasta format. The growth temperatures of these species were obtained from NCBI [46] and/or PGTdb (Prokaryotic Growth Temperature database) [47]. We computed the frequencies of amino acid residues in the protein sequences of psychrophilic and mesophilic proteomes. We also grouped the amino acids into 12 property groups [48] as follows: Acidic amino acids group include D and E; aliphatic: I, L and V; aromatic: H, F, W and Y; basic: R, H, and K; charged: R, D, E, H and K; hydrophilic: D, E, K, N, Q and R; hydrophobic: A, C, F, I, L, M, V, W and Y neutral: G, Q, H, S and T; non-polar: A, C, G, I, L, M, F, P, V, W and Y; polar: R, N, D, E, Q, H, K, S and T; small: A, C, D, G, N, P, S, T, V and tiny: A, C, G, S and T. The sum of frequencies of amino acids that fall in each property group are calculated for psychrophiles and mesophiles and are also compared (Table 1). Some of the amino acids are included in more than one property group.

Student t-test

To compare the means of two groups of data, t-test is essentially a good tool for the signal-to-noise metaphor used in research analysis. IIn present analysis, we compared the mean frequencies of single amino acids, 12 different property groups of amino acids and three secondary structural elements from all protein sequences of psychrophilic and mesophilic proteomes considered in this analysis. The t-values are calculated as follows: Where Varand Varare the variance of residues or property groups; (F) and (F) are mean frequencies of psychrophilic and mesophilic proteomes respectively. The nand nare the total number of psychrophilic and mesophilic proteomes investigated in this study, respectively. Based on student's t-distribution table of significance, critical values for such t-test at various probabilities are as follows (see Table 6):
Table 6

t-test critical values

Probability10%5%2.5%2%1%0.5%
Critical Value1.3721.8122.2282.3592.7643.169
t-test critical values If t-value is positive and greater than critical value at 10% probability (1.372) then the mean frequency (F) of psychrophilic proteomes is significantly greater than that of the mesophilic proteomes (F) at 90% or higher confidence level. If the frequency of residue or property group t-value is negative and less than -1.372 then the mean frequency of psychrophilic proteomes (F) is significantly less than that of mesophilic proteomes (F) at 90% or higher confidence level [18,40,49].

Secondary Structure Prediction

We predicted secondary structural elements in protein sequences using GTOP [50] and/or PSIPRED [51]. We used these predictions to compute frequencies of different amino acids and property groups of residues in three major secondary structural regions, helix (H), strand (E) and coil (C). PSIPRED is a highly reliable secondary structure prediction method with ~83% reported prediction accuracy. We have also tested its prediction accuracy on some of the known psychrophilic and mesophilic proteins to see if there is any significant difference in its prediction for psychrophilic proteins. A total of about 25 proteins, approximately 5000 residues, from each we observed that PSIPRED prediction accuracy was 78.42% and 80.89% for psychrophilic and mesophilic proteins, respectively.

Comparative Analysis of Amino Acid Substitutions

All protein sequences from each mesophilic species in dataset were searched against each proteome of psychrophylic species and vice versa, using BLASTP [52] with 10-3 expectation value cutoff and considerable length coverage. We picked up pair-wise alignments obtained from BLAST results of each protein sequence in a query proteome that showed best hit homolog (ortholog) in the subject proteome. The pair-wise alignments (exclusion of gapped regions) were parsed to calculate amino acid substitution counts between the two proteins from respective proteomes. The substitution counts were normalized to total amino acids present in their respective proteomes pairs individually and finally to all the pairs. The resultant frequency of substitutions was further used to calculate two types of likelihood log odd scores (LOS): Where F(X→ Y) represent normalized frequency of amino acid X in mesophile substituted by an amino acid Y in psychrophile. The LOS values are calculated by using background substitution frequencies among the mesophilic and/or psychrophilic proteomes in the denominator. The LOS scores, therefore, indicated the pattern of substitutions that are predominantly due to their thermal adaptation and therefore minimize the effect of substitutions due to any speciation events in the evolution process.

Authors' contributions

MRPR and BVBR conceived of the study and participated in its design and coordination. MRPR carried out the analysis and authored the first draft of this manuscript. BVBR was involved in useful discussions and provided comments and revisions to the final version of this text. Both authors read and approved the final manuscript.

Additional file 1

T-test values for amino acid substitution preferences between proteins of Mesophilic and Psychrophilic proteomes. The t-values calculated using LOS substitution scores of mutation frequencies normalized with substitution frequencies within the (a) mesophiles (m2m) and (b) psychrophiles (p2p). Click here for file

Additional file 2

T-test values for amino acid property groups' substitution preferences between proteins of Mesophilic and Psychrophilic proteomes. The t-values calculated using LOS substitution scores of mutation frequencies normalized with substitution frequencies within the mesophiles Click here for file
  49 in total

1.  Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins.

Authors:  M M Gromiha; M Oobatake; A Sarai
Journal:  Biophys Chem       Date:  1999-11-15       Impact factor: 2.352

2.  Elucidation of determinants of protein stability through genome sequence analysis.

Authors:  S Chakravarty; R Varadarajan
Journal:  FEBS Lett       Date:  2000-03-17       Impact factor: 4.124

3.  Factors enhancing protein thermostability.

Authors:  S Kumar; C J Tsai; R Nussinov
Journal:  Protein Eng       Date:  2000-03

4.  The NCBI. Publicly available tools and resources on the Web.

Authors:  J P Jenuth
Journal:  Methods Mol Biol       Date:  2000

5.  Adaptation of model proteins from cold to hot environments involves continuous and small adjustments of average parameters related to amino acid composition.

Authors:  Emmanuele De Vendittis; Immacolata Castellano; Roberta Cotugno; Maria Rosaria Ruocco; Gennaro Raimo; Mariorosario Masullo
Journal:  J Theor Biol       Date:  2007-09-14       Impact factor: 2.691

6.  Structural adaptation to low temperatures--analysis of the subunit interface of oligomeric psychrophilic enzymes.

Authors:  Daniele Tronelli; Elisa Maugini; Francesco Bossa; Stefano Pascarella
Journal:  FEBS J       Date:  2007-08-14       Impact factor: 5.542

Review 7.  Differences in amino acids composition and coupling patterns between mesophilic and thermophilic proteins.

Authors:  X-X Zhou; Y-B Wang; Y-J Pan; W-F Li
Journal:  Amino Acids       Date:  2007-08-21       Impact factor: 3.520

Review 8.  The impact of extremophiles on structural genomics (and vice versa).

Authors:  Francis E Jenney; Michael W W Adams
Journal:  Extremophiles       Date:  2007-06-13       Impact factor: 2.395

9.  Elucidating the protein cold-adaptation: Investigation of the parameters enhancing protein psychrophilicity.

Authors:  Mina Jahandideh; Seyyed Mohsen Hosseini Barkooie; Samad Jahandideh; Parviz Abdolmaleki; Mohammad Mehdi Movahedi; Somayyeh Hoseini; Ebrahim Barzegari Asadabadi; Fatemeh Javani Jouni; Zahra Karami; Nader Hodjati Firoozabadi
Journal:  J Theor Biol       Date:  2008-07-31       Impact factor: 2.691

10.  Relationship between insertion/deletion (indel) frequency of proteins and essentiality.

Authors:  Simon K Chan; Michael Hsing; Fereydoun Hormozdiari; Artem Cherkasov
Journal:  BMC Bioinformatics       Date:  2007-06-28       Impact factor: 3.169

View more
  42 in total

1.  Thirty-degree shift in optimum temperature of a thermophilic lipase by a single-point mutation: effect of serine to threonine mutation on structural flexibility.

Authors:  Monika Sharma; Rakesh Kumar; Ranvir Singh; Jagdeep Kaur
Journal:  Mol Cell Biochem       Date:  2017-02-11       Impact factor: 3.396

2.  Compositional and structural features related to thermal stability in the archaea SRP19 and SRP54 signal recognition particle proteins.

Authors:  Francisco Miralles
Journal:  J Mol Evol       Date:  2011-04-20       Impact factor: 2.395

3.  Structural prediction of a novel laminarinase from the psychrophilic Glaciozyma antarctica PI12 and its temperature adaptation analysis.

Authors:  Sepideh Parvizpour; Jafar Razmara; Ashraf Fadhil Jomah; Mohd Shahir Shamsir; Rosli Mohd Illias
Journal:  J Mol Model       Date:  2015-02-27       Impact factor: 1.810

Review 4.  Multifactorial level of extremostability of proteins: can they be exploited for protein engineering?

Authors:  Debamitra Chakravorty; Mohd Faheem Khan; Sanjukta Patra
Journal:  Extremophiles       Date:  2017-03-10       Impact factor: 2.395

5.  Engineering deamidation-susceptible asparagines leads to improved stability to thermal cycling in a lipase.

Authors:  K Bhanuramanand; Shoeb Ahmad; N M Rao
Journal:  Protein Sci       Date:  2014-08-05       Impact factor: 6.725

Review 6.  Some like it cold: understanding the survival strategies of psychrophiles.

Authors:  Pieter De Maayer; Dominique Anderson; Craig Cary; Don A Cowan
Journal:  EMBO Rep       Date:  2014-03-26       Impact factor: 8.807

7.  Structural and functional analysis of a novel psychrophilic β-mannanase from Glaciozyma antarctica PI12.

Authors:  Sepideh Parvizpour; Jafar Razmara; Aizi Nor Mazila Ramli; Rosli Md Illias; Mohd Shahir Shamsir
Journal:  J Comput Aided Mol Des       Date:  2014-05-22       Impact factor: 3.686

8.  Bacterial growth at -15 °C; molecular insights from the permafrost bacterium Planococcus halocryophilus Or1.

Authors:  Nadia C S Mykytczuk; Simon J Foote; Chris R Omelon; Gordon Southam; Charles W Greer; Lyle G Whyte
Journal:  ISME J       Date:  2013-02-07       Impact factor: 10.302

Review 9.  A role for A-to-I RNA editing in temperature adaptation.

Authors:  Sandra C Garrett; Joshua J C Rosenthal
Journal:  Physiology (Bethesda)       Date:  2012-12

10.  Genomic analysis of cold-active Colwelliaphage 9A and psychrophilic phage-host interactions.

Authors:  Jesse R Colangelo-Lillis; Jody W Deming
Journal:  Extremophiles       Date:  2012-12-07       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.