| Literature DB >> 29784778 |
Lauren L Porter1, Loren L Looger2.
Abstract
A central tenet of biology is that globular proteins have a unique 3D structure under physiological conditions. Recent work has challenged this notion by demonstrating that some proteins switch folds, a process that involves remodeling of secondary structure in response to a few mutations (evolved fold switchers) or cellular stimuli (extant fold switchers). To date, extant fold switchers have been viewed as rare byproducts of evolution, but their frequency has been neither quantified nor estimated. By systematically and exhaustively searching the Protein Data Bank (PDB), we found ∼100 extant fold-switching proteins. Furthermore, we gathered multiple lines of evidence suggesting that these proteins are widespread in nature. Based on these lines of evidence, we hypothesized that the frequency of extant fold-switching proteins may be underrepresented by the structures in the PDB. Thus, we sought to identify other putative extant fold switchers with only one solved conformation. To do this, we identified two characteristic features of our ∼100 extant fold-switching proteins, incorrect secondary structure predictions and likely independent folding cooperativity, and searched the PDB for other proteins with similar features. Reassuringly, this method identified dozens of other proteins in the literature with indication of a structural change but only one solved conformation in the PDB. Thus, we used it to estimate that 0.5-4% of PDB proteins switch folds. These results demonstrate that extant fold-switching proteins are likely more common than the PDB reflects, which has implications for cell biology, genomics, and human health.Entities:
Keywords: conformational diversity; metamorphic proteins; protein fold switching; protein function; protein structure
Mesh:
Substances:
Year: 2018 PMID: 29784778 PMCID: PMC6003340 DOI: 10.1073/pnas.1800168115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.(A–D, Left) Four categories of fold-switching proteins. Names of each category are in the upper left-hand corners of their respective white and gray boxes. Specific examples of each fold switch category are included. Blue and red regions of protein structure indicate regions that change folds. Secondary structure diagrams corresponding to the blue and red regions are placed between the two structures for comparison, with the names of the proteins above. Gray protein regions maintain essentially the same secondary structure in both conformations. PDB ID codes and chains (left to right) are as follows: 5jytA, 2qkeE (class A); 1rk4A, 1k0nA (class B); and 1x0gA&C, 1x0gB&D; 4twaA, 4ydqB (class C). Structures are meant to illustrate conformational differences, not the most biologically relevant conformations. Thus, both conformations of KaiB, CLIC1, and IscA are shown as monomers, monomers, and tetramers, respectively. Colored boxes at the bottom indicate the biological function of each protein; one box corresponds to one protein. All images of 3D protein structures in this figure and all others were made in PyMOL (38). (A–D, Right) Biological functions are grouped by triggers and separated by dashed lines. The numeral 4 in the white box in class D represents four proteins of the class “other” that did not all fit in the allotted space. Four proteins with other triggers are omitted from this figure.
Fig. 2.Distributions of randomly generated and fold switch-derived QRs (measures of independent folding cooperativity from the SEED algorithm) differ significantly (P < 10−34, Kolmogorov–Smirnov test). The x axis is limited between 0 and 2, and the majority of both of these populations (98%, randomly generated; 89%, fold switch-derived) lie within these limits.
Predictions of experimentally supported fold switchers
| PDB ID codes + chains | Predicted fold-switching regions | Methods of experimental validation |
| 2kxoA | 1–89 | NMR |
| 2lshA | 29–115 | NMR |
| 2mz7A | 267–312 | NMR |
| 4pmkA | 27–62 | NMR |
| 2n4oA | 16–69 | NMR |
| 2ktmA | 167–201 | NMR |
| 2le3A | Not predicted | NMR |
| 2x9cA | Not predicted | NMR |
| 4ov8A | 247–318 | Cryo-EM |
| 3j9eD | 2–71 | Cryo-EM |
| 5suzA | 474–509 or 415–509 | Cell-based assays |
| 4hlsA | 146–222 | Circular dichroism + size exclusion chromatography |
| 1s5pA | 48–107, 98–189, 208–274 | Isothermal titration calorimetry |
| 3tkaA | 236–313 | Small-angle X-ray scattering |
| 3gaxA | 48–120 | Fibrillar deposits identified in vivo |
contains literature justifications.