| Literature DB >> 29438383 |
Dominik Schwarz1, Orit Adato2, Amnon Horovitz3, Ron Unger2.
Abstract
The GroE chaperonin system, which comprises GroEL and GroES, assists protein folding in vivo and in vitro. It is conserved in all prokaryotes except in most, but not all, members of the class of mollicutes. In Escherichia coli, about 60 proteins were found to be obligatory clients of the GroE system. Here, we describe the properties of the homologs of these GroE clients in mollicutes and the evolution of chaperonins in this class of bacteria. Comparing the properties of these homologs in mollicutes with and without chaperonins enabled us to search for features correlated with the presence of GroE. Interestingly, no sequence-based features of proteins such as average length, amino acid composition and predicted folding/disorder propensity were found to be affected by the absence of GroE. Other properties such as genome size and number of proteins were also found to not differ between mollicute species with and without GroE. Our data suggest that two clades of mollicutes re-acquired the GroE system, thereby supporting the view that gaining the system occurred polyphyletically and not monophyletically, as previously debated. Our data also suggest that there might have been three isolated cases of lateral gene transfer from specific bacterial sources. Taken together, our data indicate that loss of GroE does not involve crossing a high evolutionary barrier and can be compensated for by a small number of changes within the few dozen client proteins.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29438383 PMCID: PMC5810989 DOI: 10.1371/journal.pone.0192619
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A scheme of the different groups of proteins used in this study.
The ellipse shows the groups on which this work focuses but some properties are compared across all groups.
Comparing properties of GroE+ and GroE- homologs.
| Homologs in GroE+ mollicutes | Homologs in GroE- mollicutes | ||||
|---|---|---|---|---|---|
| Clients | 355.9 ± 72.8 | 362.3 ± 78.6 | 344.9 ± 69.9 | 352.3 ± 78.3 | P-value: 0.563 |
| Controls | 331.9 ± 53.3 | 334.9 ± 65.7 | 343.0 ± 58.9 | 348.0 ± 61.5 | P-value: 0.574 |
| Clients | 0.014 ± 0.009 | 0.025 ± 0.014 | 0.017 ± 0.007 | 0.024 ± 0.014 | P-value: 0.192 |
| Controls | 0.017 ± 0.011 | 0.017 ± 0.010 | 0.019 ± 0.021 | 0.022 ± 0.014 | P-value: 0.781 |
| Clients | 0.170 ± 0.041 | 0.171 ± 0.051 | 0.171 ± 0.041 | 0.179 ± 0.047 | P-value: 0.980 |
| Controls | 0.203 ± 0.098 | 0.220 ± 0.092 | 0.196 ± 0.092 | 0.217 ± 0.081 | P-value: 0.841 |
a,b The values for the corresponding E. coli proteins (clients and control). Note that if an E. coli protein does not have even a single homolog in the relevant mollicute group then it was not included in the calculation. Thus, we get slightly different values for E. coli in the two columns.
c The statistical significance between the GroE+ and GroE- values.
Amino acid composition [%] of the GroEL clients and non-clients in E. coli and their homologs in GroE+ and GroE- mollicutes.
| Proteome | 5.06 | 5.90 | 2.06 | 10.03 | 9.14 | 2.89 | 0.83 | 5.31 | 4.36 | 5.64 | 3.83 | 5.02 | 6.39 | 0.67 | 7.09 | 1.65 | 9.45 | 3.02 | 6.31 | 5.34 |
| Class IV client homologs | 6.84 | 6.54 | 2.22 | 9.23 | 8.83 | 3.12 | 0.57 | 4.48 | 3.62 | 5.13 | 3.40 | 6.92 | 5.88 | 0.98 | 6.44 | 2.29 | 9.14 | 2.43 | 6.47 | 5.48 |
| Nonclient homologs | 6.62 | 7.34 | 2.09 | 10.05 | 9.67 | 3.51 | 0.53 | 3.95 | 3.52 | 5.30 | 3.73 | 5.72 | 5.81 | 0.71 | 5.66 | 2.02 | 9.59 | 2.85 | 5.90 | 5.42 |
| Proteome | 5.11 | 5.57 | 1.84 | 9.63 | 9.31 | 2.71 | 1.06 | 5.42 | 4.03 | 5.16 | 3.36 | 5.02 | 7.00 | 0.69 | 7.38 | 1.41 | 10.06 | 2.99 | 6.84 | 5.42 |
| Class IV client homologs | 6.90 | 6.26 | 2.15 | 8.99 | 9.62 | 3.12 | 0.72 | 4.42 | 3.56 | 4.79 | 2.88 | 6.60 | 6.07 | 0.94 | 6.34 | 1.89 | 9.54 | 2.61 | 6.95 | 5.73 |
| Nonclient homologs | 6.91 | 7.23 | 1.77 | 9.27 | 9.64 | 3.22 | 0.64 | 3.99 | 3.21 | 5.42 | 3.10 | 6.28 | 6.14 | 0.83 | 6.03 | 1.81 | 9.50 | 2.85 | 6.34 | 5.82 |
| Proteome | 9.52 | 7.07 | 2.82 | 10.67 | 6.01 | 4.43 | 1.53 | 3.89 | 2.85 | 5.40 | 4.44 | 7.37 | 5.80 | 1.16 | 3.95 | 2.27 | 4.41 | 5.51 | 5.76 | 5.15 |
| Class IV clients | 10.08 | 6.84 | 3.02 | 9.69 | 5.51 | 4.50 | 1.28 | 3.68 | 2.88 | 5.15 | 4.13 | 8.04 | 5.16 | 1.32 | 3.70 | 2.85 | 4.15 | 5.95 | 6.15 | 5.92 |
| Nonclients | 10.51 | 7.19 | 3.09 | 10.63 | 6.18 | 4.52 | 1.67 | 3.87 | 2.81 | 5.34 | 4.15 | 7.86 | 5.56 | 1.22 | 3.77 | 2.33 | 4.35 | 4.61 | 5.36 | 4.96 |
The overall compositions of E. coli and the GroE+ and GroE- mollicutes are also provided. The blue cells mark amino acid for which the frequency in mollicute is higher (color gradient starting with 10% difference) compared to the corresponding amino acid frequency in E. coli, while the red cells mark amino acid with frequency that is lower (color gradient starting with 10% difference) compared with the corresponding frequency in E. coli.
Fig 2Distribution of the GroE system in a 16S rRNA-based evolution tree of mollicutes.
The branches of the GroE+ and GroE- species are marked in red and blue, respectively. The numbers on the branching points represent the bootstrapping frequencies, the branch length represents the sequence distance and the scale shows a distance of 5%. The diamonds and squares show the two major clades that contain the GroE system. The triangle marks the species where lateral gene transfer (LGT) was suggested before and circles mark the two additional species for which we suggest LGT events.
Fig 3Evolutionary tree of GroEL sequences of GroE+ mollicutes.
The diamonds and squares show the two major clades that contain the GroE system. The triangle marks the species where lateral gene transfer (LGT) was suggested before and circles mark the two additional species for which we suggest LGT events.