| Literature DB >> 20660111 |
Abstract
Many proteins require the assistance of molecular chaperones in order to fold efficiently. Chaperones are known to mask the effects of mutations that induce misfolding because they can compensate for the deficiency in spontaneous folding. One of the best studied chaperones is the eubacterial GroEL/GroES system. In Escherichia coli, three classes of proteins have been distinguished based on their degree of dependency on GroEL for folding: 1) those that do not require GroEL, 2) those that require GroEL in a temperature-dependent manner, and 3) those that obligately require GroEL for proper folding. The buffering effects of GroEL have so far been observed in experimental regimens, but their effect on genomes during evolution has not been examined. Using 446 sequenced proteobacterial genomes, we have compared the frequency of amino acid replacements among orthologs of 236 proteins corresponding to the three categories of GroEL dependency determined for E. coli. Evolutionary rates are significantly correlated with GroEL dependency upon folding with GroEL dependency class accounting for up to 84% of the variation in amino acid substitution rates. Greater GroEL dependency entails increased evolutionary rates with GroEL obligatory proteins (Class III) evolving on average up to 15% faster than GroEL partially dependent proteins (Class II) and 35% faster than GroEL-independent proteins (Class I). Moreover, GroEL dependency class correlations are strictly conserved throughout all proteobacteria surveyed, as is a significant correlation between folding class and codon bias. The results suggest that during evolution, GroEL-dependent folding increases evolutionary rate by buffering the deleterious effects of misfolding-related mutations.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20660111 PMCID: PMC3296371 DOI: 10.1093/gbe/evq044
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FEvolutionary rates of proteins in the three GroEL dependency classes within 445 Proteobacteria compared with their Escherichia coli strain O157H7 EDL933 ortholog. Each dot in the figure represents the mean distance of all proteins in the same class within the same species from their ortholog in E. coli O157H7 EDL933.
Statistical Tests for Homogeneity of Medians among the GroEL Dependency Classes
| Variable | Taxonomic Group | Homogeneity of Medians ( | Post hoc Comparisons |
| Genus: Escherichia | 7.5 × 10−15* | I < II, III and II = III | |
| Order: Enterobacteriales | <2.2 × 10−16* | I < II < III | |
| Class: Gammaproteobacteria | |||
| Phylum: Proteobacteria | |||
| Protein distance | Genus: Escherichia | 1.1 × 10−16* | I < II, III and II = III |
| Order: Enterobacteriales | <2.2 × 10−16* | I < II < III | |
| Class: Gammaproteobacteria | |||
| Phylum: Proteobacteria | |||
| CAI | Genus: Escherichia | <2.2 × 10−16* | I > II, III and II = III |
| Order: Enterobacteriales | |||
| Class: Gammaproteobacteria | I > II > III | ||
| Phylum: Proteobacteria | I > II, III and II = III |
Using Friedman test.
α = 0.05, using Tukey’s test.
Roman numbers denote the classes. The notation I < II means that the values of the tested variable are significantly smaller in Class I proteins than in Class II proteins.
*P value << 0.01.
Explained Variability and Mean Ratios of Class-Specific Values for All Tested Samples
| Genus: Escherichia | Order: Enterobacteriales | Class: Gammaproteobacteria | Phylum: Proteobacteria | |
| Explained variability | 0.36 | 0.4 | 0.87 | 0.8 |
| Class III/II | 0.92 | 1.06 | 1.14 | 1.1 |
| Class III/I | 1.1 | 1.4 | 1.31 | 1.18 |
| Protein distance | ||||
| Explained variability | 0.6 | 0.3 | 0.84 | 0.76 |
| Class III/II | 0.87 | 1.06 | 1.15 | 1.1 |
| Class III/I | 1.17 | 1.36 | 1.35 | 1.2 |
| CAI | ||||
| Explained variability | 0.96 | 0.57 | 0.48 | 0.53 |
| Class III/II | 0.99 | 1 | 0.99 | 1 |
| Class III/I | 0.95 | 0.98 | 0.97 | 0.97 |
Explained variability was calculated by partial η = with Friedman test.
Escherichia coli K12 MG1655 and E. coli O157H7 comparisons resulted in zero distance for Class I proteins and were omitted from the calculation.
FDistribution of protein expression levels (Lu et al. 2007) (top) and number of protein-protein interactions (Hu et al. 2009) (bottom) in the three GroEL dependency classes.
Statistical Tests for Differences in Evolutionary Rates among the Three GroEL Dependency Classes with a Covariate
| Response Variable ( | Covariate ( | Taxonomic Group | Pooled Regression | Homogeneity of Slopes among Groups | Homogeneity of Intercepts among Groups |
| Protein expression level | Genus: Escherichia | 0.026* | 0.074 | 0.0049* | |
| Order: Enterobacteriales | 6.5 × 10−6** | 0.52 | <2.2 × 10−16** | ||
| Class: Gammaproteobacteria | <2.2 × 10−16** | <2.2 × 10−16** | n.a. | ||
| Phylum: Proteobacteria | <2.2 × 10−16** | <2.2 × 10−16** | n.a. | ||
| Protein distance | Protein expression level | Genus: Escherichia | 0.0044* | 0.15 | 6.5 × 10−4 |
| Order: Enterobacteriales | 1.6 × 10−4** | 0.49 | <2.2 × 10−16 | ||
| Class: Gammaproteobacteria | <2.2 × 10−16** | 1.1 × 10−16** | n.a. | ||
| Phylum: Proteobacteria | <2.2 × 10−16** | <2.2 × 10−16** | n.a. | ||
| CAI | Genus: Escherichia | 1.3 × 10−9** | 5.5 × 10−4** | n.a. | |
| Order: Enterobacteriales | <2.2 × 10−16** | <2.2 × 10−16** | n.a. | ||
| Class: Gammaproteobacteria | <2.2 × 10−16** | 6.1 × 10−6** | n.a. | ||
| Phylum: Proteobacteria | <2.2 × 10−16** | 0.74 | <2.2 × 10−16** | ||
| Protein distance | CAI | Genus: Escherichia | 7.7 × 10−13** | <2.2 × 10−16** | n.a. |
| Order: Enterobacteriales | <2.2 × 10−16** | 5.1 × 10−9** | n.a. | ||
| Class: Gammaproteobacteria | <2.2 × 10−16** | 1.9 × 10−13** | n.a. | ||
| Phylum: Proteobacteria | <2.2 × 10−16** | 0.42 | <2.2 × 10−16** |
NOTE.—Results of the ANCOVA test and its underlying assumptions (Sokal and Rohlf 1995) are presented. To adjust for overall differences among species, the response variable was divided by the genomic average.
Using F-test for linear relation between the response and covariate y = ax + b testing the null hypothesis H0: a = 0.
Using F-test for equality of slopes among the groups. Each group is fitted with a linear regression yclass = aclassxclass + bclass followed by testing the null hypothesis H0: aclass I = aclass II = aclass III.
Using F-test for equality of intercepts among the groups. This is equivalent to a test for equality of means with the null hypothesis H0: μclass I = μclass II = μclass III.
*P value < 0.05.
**P value << 0.01.
FCAI of proteins in the three GroEL dependency classes.