| Literature DB >> 27189986 |
A Samer Kadibalban1, David Bogumil2, Giddy Landan1, Tal Dagan3.
Abstract
Many proteins depend on an interaction with molecular chaperones in order to fold into a functional tertiary structure. Previous studies showed that protein interaction with the GroEL/GroES chaperonine and Hsp90 chaperone can buffer the impact of slightly deleterious mutations in the protein sequence. This capacity of GroEL/GroES to prevent protein misfolding has been shown to accelerate the evolution of its client proteins. Whether other bacterial chaperones have a similar effect on their client proteins is currently unknown. Here, we study the impact of DnaK (Hsp70) chaperone on the evolution of its client proteins. Evolutionary parameters were derived from comparison of the Escherichia coli proteome to 1,808,565 orthologous proteins in 1,149 proteobacterial genomes. Our analysis reveals a significant positive correlation between protein binding frequency with DnaK and evolutionary rate. Proteins with high binding affinity to DnaK evolve on average 4.3-fold faster than proteins in the lowest binding affinity class at the genus resolution. Differences in evolutionary rates of DnaK interactor classes are still significant after adjusting for possible effects caused by protein expression level. Furthermore, we observe an additive effect of DnaK and GroEL chaperones on the evolutionary rates of their common interactors. Finally, we found pronounced similarities in the physicochemical profiles that characterize proteins belonging to DnaK and GroEL interactomes. Our results thus implicate DnaK-mediated folding as a major component in shaping protein evolutionary dynamics in bacteria and supply further evidence for the long-term manifestation of chaperone-mediated folding on genome evolution.Entities:
Keywords: GroEL; chaperones; genome evolution; microbial evolution
Mesh:
Substances:
Year: 2016 PMID: 27189986 PMCID: PMC4898814 DOI: 10.1093/gbe/evw102
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Comparison of Evolutionary Rates Among DnaK-Dependency Classes
| Taxonomic depth | |||
|---|---|---|---|
| Genus: | <2.2 × 10−16 | III > II = I | |
| Order: Enterobacteriales | <2.2 × 10−16 | III > II > I | |
| Class: Gammaproteobacteria | <2.2 × 10−16 | III > II > I | |
| Phylum: Proteobacteria | <2.2 × 10−16 | III > II > I | |
| Protein distance | Genus: | <2.2 × 10−16 | III > II = I |
| Order: Enterobacteriales | <2.2 × 10−16 | III > II > I | |
| Class: Gammaproteobacteria | <2.2 × 10−16 | III > II > I | |
| Phylum: Proteobacteria | <2.2 × 10−16 | III > II > I |
aUsing Friedman Mack–Skilling test (Mack and Skillings 1980).
bOrder of the DnaK-dependency classes (IDnaK, IIDnaK, IIIDnaK) sorted by the mean of the relevant measure (α = 0.05, using Tukey’s post hoc test).
FNonsynonymous substitution rates and amino acid replacement rates of DnaK-dependency classes in proteobacteria. Each data point represents the mean of distances of all class members in one genome from their orthologs in E. coli K12. Comparisons are shown in four taxonomic depths: genus (Escherichia), order (Enterobacteriales), class (Gammaproteobacteria), and phylum (Proteobacteria). The taxonomic depth samples are mutually exclusive.
Mean Ratio of Class Rates for DnaK (left) and GroEL (right) Dependency Classes
| Genus | Order | Class | Phylum | |
|---|---|---|---|---|
| dN | ||||
| Class III/I | 4.3; 2.8 | 2.4; 1.5 | 1.4; 1.3 | 1.2; 1.2 |
| Class II/I | 2.4; 1.4 | 1.6; 1.3 | 1.3; 1.1 | 1.1; 1.1 |
| Protein distance | ||||
| Class III/I | 3.9; 2.6 | 2.4; 1.5 | 1.5; 1.2 | 1.2; 1.1 |
| Class II/I | 2.2; 1.2 | 1.6; 1.3 | 1.3; 1.1 | 1.1; 1 |
Note.—The taxonomic depth samples are mutually exclusive.
FExpression level and CAI of the DnaK-dependency classes in E. coli. Expression data are available for only 248 of the substrate proteins, necessitating the use of CAI as a proxy for expression. All classes are significantly different from each other for both expression level and CAI (α = 0.05, using Kruskal–Wallis and Tukey’s post hoc tests).
Analysis of Covariance
| Response variable | Covariate | Taxonomic depth | |||
|---|---|---|---|---|---|
| Protein expression level | Genus | 0.349 | n.a. | n.a. | |
| Order | 0.001 | 0.15 | 3.5 × 10−6
| ||
| Class | 0.009 | 0.6 | 1.01 × 10−7
| ||
| Phylum | 0.23 | n.a. | n.a. | ||
| Protein distance | Protein expression level | Genus | 0.23 | n.a. | n.a. |
| Order | 0.009 | 0.602 | 1.01 × 10−7 | ||
| Class | 0.049 | 0.54 | 3.22 × 10−6 | ||
| Phylum | 1.29 × 10−4 | 0.08 | 3.63 × 10−7 | ||
| CAI | Genus | 0.76 | n.a. | n.a. | |
| Order | 9.73 × 10−11 | 0.62 | 3.1 × 10−30
| ||
| Class | 1.14 × 10−27 | 0.25 | 4.37 × 10−16
| ||
| Phylum | 3.26 × 10−13
| 0.56 | 1.42 × 10−15
| ||
| Protein distance | CAI | Genus | 0.37 | n.a. | n.a. |
| Order | 9.73 × 10−11
| 0.63 | 3.1 × 10−30
| ||
| Class | 1.14 × 10−27
| 0.26 | 4.37 × 10−16
| ||
| Phylum | 3.26 × 10−19
| 0.56 | 1.42 × 10−15
|
Note.—The ANCOVA test and its underlying assumptions (Zar 1999). The analysis in each taxonomic depth was performed using a single representative genome while maximizing the sample size for the test. The best linear models were obtained when both responses and covariates were transformed logarithmically. Note that the pooled regression hypothesis was rejected in all combinations at the genus depth. This may be due to the small range of rates measured within the Escherichia genomes.
aRepresentative genomes: Genus: E. coli BW2952; Order: Klebsiella oxytoca E718; Class: Aeromonas hydrophila ATCC 7966; Phylum: Burkholderia pseudomallei K96243.
bRepresentative genomes: Genus: E. coli SE15; Order: Klebsiella pneumoniae NTUH_K2044; Class: A. hyrophila ATCC_7966; Phylum: B. pseudomallei K96243.
cRepresentative genomes: Genus: E. coli K12_W3110; Order: Shigella nonnei 53G; Class: A. hyrophila ML09_119; Phylum: Burkholderia sp. 383.
dP value of an F-test with the null hypothesis H0: the response and covariate variables are linearly correlated.
eP value of an F-test for equality of slopes among the classes.
fP value of an F-test for equality of intercepts among the classes.
*P value < 0.05.
**P value <0.01.
FEvolutionary rates of combined DnaK and GroEL dependency classes. Each data point represents the mean of distances of all class members in one genome from their orthologs in E. coli K12. Comparisons are shown in four taxonomic depths: genus (Escherichia), order (Enterobacteriales), class (Gammaproteobacteria), and phylum (Proteobacteria). The taxonomic depth samples are mutually exclusive.
Comparison of Evolutionary Rates Among the Three Combined Classes
| Taxonomic depth | |||
|---|---|---|---|
| dN | Genus | 4.9 × 10−8 | Dg > Gd |
| Order | <2.2 × 10−16 | DG > dG > Dg | |
| Class | <2.2 × 10−16 | DG > Dg > dG | |
| Phylum | <2.2 × 10−16 | DG > Dg > dG | |
| Protein distance | Genus | 1.2 × 10−5 | DG > dG > Dg |
| Order | <2.2 × 10−16 | DG > dG | |
| Class | <2.2 × 10−16 | DG > Dg > dG | |
| Phylum | <2.2 × 10−16 | DG > Dg > dG |
aUsing Friedman Mack–Skilling test (Mack and Skillings 1980).
bα = 0.05, using Tukey’s post hoc test of mean evolutionary rates of the class within each species.
FEnrichment of physiochemical properties in chaperone dependency classes. Enrichment ratios are calculated as the mean value of Class III substrates divided by the mean value of Class I substrates. Ratios for DnaK are in red and for GroEL in blue. Kruskal–Wallis test significant P values are marked with *P < 0.05 and **P < 0.01.