| Literature DB >> 34498740 |
Soumya Mishra1,2, Loren L Looger2, Lauren L Porter1,3.
Abstract
Extant fold-switching proteins remodel their secondary structures and change their functions in response to cellular stimuli, regulating biological processes and affecting human health. Despite their biological importance, these proteins remain understudied. Predictive methods are needed to expedite the process of discovering and characterizing more of these shapeshifting proteins. Most previous approaches require a solved structure or all-atom simulations, greatly constraining their use. Here, we propose a high-throughput sequence-based method for predicting extant fold switchers that transition from α-helix in one conformation to β-strand in the other. This method leverages two previous observations: (a) α-helix ↔ β-strand prediction discrepancies from JPred4 are a robust predictor of fold switching, and (b) the fold-switching regions (FSRs) of some extant fold switchers have different secondary structure propensities when expressed by themselves (isolated FSRs) than when expressed within the context of their parent protein (contextualized FSRs). Combining these two observations, we ran JPred4 on 99-fold-switching proteins and found strong correspondence between predicted and experimentally observed α-helix ↔ β-strand discrepancies. To test the overall robustness of this finding, we randomly selected regions of proteins not expected to switch folds (single-fold proteins) and found significantly fewer predicted α-helix ↔ β-strand discrepancies. Combining these discrepancies with the overall percentage of predicted secondary structure, we developed a classifier to identify extant fold switchers (Matthews correlation coefficient of .71). Although this classifier had a high false-negative rate (7/17), its false-positive rate was very low (2/136), suggesting that it can be used to predict a subset of extant fold switchers from a multitude of available genomic sequences.Entities:
Keywords: bioinformatics; fold-switching proteins; metamorphic proteins; protein folding
Mesh:
Substances:
Year: 2021 PMID: 34498740 PMCID: PMC8545793 DOI: 10.1002/bip.23471
Source DB: PubMed Journal: Biopolymers ISSN: 0006-3525 Impact factor: 2.240
SCHEME 1Summary of predictive approach
FIGURE 1Helix ↔ strand discrepancies predicted by JPred4 correspond to experimentally observed α‐helix ↔ β‐strand differences in fold‐switching regions. Dotted line represents best linear fit of all datapoints (black and red circles; Pearson correlation: .82). Red circles correspond to benchmark set of 17‐fold switchers. Only 16 can be observed because two KaiB variants (4KSO and 1WWJ) overlap exactly at (0.31, 0.26)
FIGURE 2JPred4 predicts different secondary structures for isolated and contextualized FSRs of extant fold switchers with substantive transitions between α‐helix and β‐strand. Each panel shows the experimentally determined secondary structures of both conformations of the fold switcher (purple and green) along with JPred4 secondary structure predictions of the whole sequence (black) and FSR (gray). Purple and green regions of protein structure correspond to FSR sequence shown in diagram; gray corresponds to structurally constant regions (SCRs). Predicted secondary structures that were at least two contiguous residues long are shown. The KaiB variant (2QKE) represents all members of the KaiB family; the other three (1WWJ, 4SKO, 1R5P) are not shown; amylin is also not shown due to lack of space. The differential secondary structure predictions for ORF9b were reported previously.[ ] The green secondary structure diagram of BAX is shaded with lines to signify that its structure has not been solved, though other experimental evidence strongly suggests that it folds into a β‐sheet. All three‐dimensional protein structures were made using PyMOL[ ]
FIGURE 3JPred4 discriminates between single folders and fold switchers. Single folders/fold switchers are blue circles/red triangles. The dashed line represents the threshold for classifying fold switchers by fraction of predicted secondary structure/fraction of α‐helix ↔ β‐strand discrepancies (0.2). Datapoints at or above this threshold are predicted to switch folds. Only 16/17‐fold switchers can be seen because two KaiB variants have identical coordinates (0.37, 0.26)