| Literature DB >> 22417914 |
David Bogumil1, Giddy Landan, Judith Ilhan, Tal Dagan.
Abstract
It has long been known that many proteins require folding via molecular chaperones for their function. Although it has become apparent that folding imposes constraints on protein sequence evolution, the effects exerted by different chaperone classes are so far unknown. We have analyzed data of protein interaction with the chaperones in Saccharomyces cerevisiae using network methods. The results reveal a distinct community structure within the network that was hitherto undetectable with standard statistical tools. Sixty-four yeast chaperones comprise ten distinct modules that are defined by interaction specificity for their 2,691 interacting proteins. The classes of interacting proteins that are in turn defined by their dedicated chaperone modules are distinguished by various physiochemical protein properties and are characterized by significantly different protein expression levels, codon usage, and amino acid substitution rates. Correlations between substitution rate, codon bias, and gene expression level that have long been known for yeast are apparent at the level of the chaperone-defined modules. This indicates that correlated expression, conservation, and codon bias levels for yeast genes are attributable to previously unrecognized effects of protein folding. Proteome-wide categories of chaperone-substrate specificity uncover novel hubs of functional constraint in protein evolution that are conserved across 20 fungal genomes.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22417914 PMCID: PMC3381671 DOI: 10.1093/gbe/evs025
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FThe network of CSIs. A graphic representation of the network with chaperones on the x axis (i = 1 … 69) and substrates on the y axis (j = 1 … 3,595). Cells in the matrix represent a protein–protein interaction between chaperone i and substrate j. The cells are colored by the module color if both substrate and chaperone are included in the module, and in gray otherwise. Cells of noninteracting proteins are colored in black. Hsp70 group includes the five ungrouped chaperones: Ssb1, Ssa1, Sse1, Ssa2, and Ssb2.
Comparison of Substrate Properties among the Modules
| Variable | As Is | Random | Correlation with Expression Level in the Network | |
| Expression | Expression level | 2.22 × 10−16** | 0.62 | — |
| CAI | 2.38 × 10−06** | 0.37 | 0.54** | |
| Optimal codons | 1.18 × 10−05** | 0.76 | 0.53** | |
| Secondary structure | Alpha helix | 0.0067** | 0.08 | 0.02 |
| Coiled coils | 0.0256** | 0.4 | 0.21** | |
| Beta sheets | 0.0833 | 0.53 | 0.21** | |
| Physiochemical properties | Protein length | 4.13 × 10−09** | 0.94 | −0.17** |
| Hydrophobic amino acids | 0.2177 | 0.23 | 0.18** | |
| Negative amino acids | 0.0008** | 0.56 | 0.08** | |
| Positive amino acids | 0.5682 | 0.72 | −0.06** | |
| Polar amino acids | 0.0081** | 0.83 | −0.31** | |
| Aromaticity index | 0.0017** | 0.43 | −0.04** | |
| Gravy | 0.171 | 0.58 | 0.14** | |
| Amino acid frequencies | Alanine | 6.60 × 10−07** | 0.89 | 0.36** |
| Arginine | 0.3581 | 0.8 | −0.09** | |
| Asparagine | 0.0384* | 0.58 | −0.27** | |
| Aspartate | 4.71 × 10−05** | 0.08 | 0.03 | |
| Cysteine | 0.5354 | 0.23 | −0.09** | |
| Glutamine | 0.0064** | 0.87 | −0.08** | |
| Glutamate | 0.2669 | 0.97 | 0.09** | |
| Glycine | 0.0172** | 0.24 | 0.25** | |
| Histidine | 0.4528 | 0.07 | −0.10** | |
| Isoleucine | 0.0027** | 0.11 | −0.06** | |
| Leucine | 0.0031** | 0.47 | −0.08** | |
| Lysine | 0.4807 | 0.75 | 0.03** | |
| Methionine | 0.3369 | 0.61 | −0.08** | |
| Phenyl-alanine | 0.0012** | 0.48 | −0.04** | |
| Proline | 0.0074** | 0.43 | −0.07** | |
| Serine | 0.0417* | 0.07 | −0.29** | |
| Threonine | 0.4651 | 0.72 | −0.05** | |
| Tryptophan | 0.0612 | 0.48 | 0.03 | |
| Tyrosine | 0.0586 | 0.31 | 0.02 | |
| Valine | 0.0185** | 0.27 | 0.27** | |
| Evolutionary rate | Substitution rate | 2.15 × 10−06** | 0.36 | −0.42** |
| % Identical amino acids | 1.35 × 10−07** | 0.81 | 0.47** | |
| Substitutions per site | 2.58 × 10−07** | 0.75 | −0.46** |
Using Kruskal–Wallis test for equality of median ranks with the null hypothesis, H0: μmodule1 = μmodule2 = … = μmodule10.
Using Spearman rank correlation coefficient.
P value < 0.05.
P value < 0.05 using false discovery rate test for multiple comparisons.
FComparison of expression level (a), codon adaptation index (b), and relative amino acid substitution rates (c) among the modules. A matrix representation of post hoc multiple comparison results (α = 0.05, using Tukey test). Cell a in the matrix is colored red if the corresponding variable module i > module j, blue if module i < module j, and white if no significant difference between the modules was found.
FEvolutionary distances of yeast substrates in the ten modules compared with their positional ortholog in 20 fungal species. The x axis shows the variation of amino acid substitution rates within different fungal genomes in comparison with yeast. The y axis shows the rate variation among proteins in the different modules within the same genome. Module colors correspond to the ranking by substrate expression levels with highly expressed modules in red shades and lowly expressed modules in blue shades. Hsp70 group includes five ungrouped chaperones: Ssb1, Ssa1, Sse1, Ssa2, and Ssb2. Arabic numerals correspond to fungal species: 1) Saccharomyces paradoxus, 2) Saccharomyces mikatae, 3) Saccharomyces bayanus, 4) Saccharomyces castellii, 5) Kluyveromyces Lactis, 6) Ashbya gossypii, 7) Kluyveromyces waltii, 8) Lachancea kluyveri, 9) Candida glabrata, 10) Candida guilliermondii, 11) Candida albicans, 12) Candida tropicalis, 13) Lodderomyces elongosporus, 14) Yarrowia lipolytica, 15) Aspergillus nidulans, 16) Neurospora crassa, 17) Schizosaccharomyces pompe, 18) Schizosaccharomyces japonicus, 19) Debaryomyces hansenii, and 20) Candida parapsilosis.