| Literature DB >> 33892639 |
Luciano Kagami1, Joel Roca-Martínez1,2,3, Jose Gavaldá-García1,2,3, Pathmanaban Ramasamy1,2,3,4,5, K Anton Feenstra6,7, Wim F Vranken8,9,10.
Abstract
BACKGROUND: The SARS-CoV-2 virus, the causative agent of COVID-19, consists of an assembly of proteins that determine its infectious and immunological behavior, as well as its response to therapeutics. Major structural biology efforts on these proteins have already provided essential insights into the mode of action of the virus, as well as avenues for structure-based drug design. However, not all of the SARS-CoV-2 proteins, or regions thereof, have a well-defined three-dimensional structure, and as such might exhibit ambiguous, dynamic behaviour that is not evident from static structure representations, nor from molecular dynamics simulations using these structures. MAIN: We present a website ( https://bio2byte.be/sars2/ ) that provides protein sequence-based predictions of the backbone and side-chain dynamics and conformational propensities of these proteins, as well as derived early folding, disorder, β-sheet aggregation, protein-protein interaction and epitope propensities. These predictions attempt to capture the inherent biophysical propensities encoded in the sequence, rather than context-dependent behaviour such as the final folded state. In addition, we provide the biophysical variation that is observed in homologous proteins, which gives an indication of the limits of their functionally relevant biophysical behaviour.Entities:
Keywords: Biophysical features; COVID-19; Proteins; SARS-CoV-2; Single sequence based predictions
Year: 2021 PMID: 33892639 PMCID: PMC8062939 DOI: 10.1186/s12860-021-00362-w
Source DB: PubMed Journal: BMC Mol Cell Biol ISSN: 2661-8850
Fig. 1Overview of the employed workflow and the incorporated predictions
Overview of the prediction software used
| Software | Type | Color | Description |
|---|---|---|---|
| DynaMine | Black | Rigidity of the backbone, higher values mean backbone movements are more likely to be limited; values > 1.0 indicate membrane spanning regions, > 0.8 rigid conformations, < 0.69 flexible regions. Residues with 0.80–0.69 values are ‘context’ dependent and capable of being either rigid or flexible. | |
| Grey | Rigidity of the sidechain, higher values mean the sidechain is more likely to be conformationally restricted. These values are highly dependent on the amino acid type (i.e. a Trp will be rigid, an Asp flexible). | ||
| Blue, red, purple | The propensity of the residue, based on local amino acid context, to adopt helix, sheet or coil conformations. Higher values indicate higher propensities. | ||
| EFoldMine | Green | Likelihood that this residue will adopt a persistent conformation based on only local interactions with other amino acids. Values > 0.169 indicate residues most likely to start the protein folding process. | |
| DisoMine | Yellow | Likelihood that this residue will be disordered (highly dynamic, many conformations). Values > 0.5 indicate that this is most likely a disordered residue. | |
| Agmata | Dark green | Residues with higher values indicate residues likely to be involved in β-sheet aggregation. The values displayed in the plots are divided by a factor 20 from the original. | |
| SeRenDIP | Cyan | Indicates residues likely to participate in protein-protein interactions (PPIs). Values > 0.5 indicate residues that are most likely involved in PPIs. | |
| SeRenDIP-CE | Sea-green | Indicates residues likely to be part of a conformational (discontinuous) epitope (CE) region. Values > 0.5 indicate residues that are most likely involved in CEs. |
Fig. 2Predictions for the P0DTC9 SARS-CoV-2 protein amino acids (x-axis) for a) backbone dynamics (black), helix (red) and sheet (blue) propensity, and b) early folding (light green), disorder (yellow) β-sheet aggregation (dark green), protein interaction (cyan) and epitopes (seagreen). The two regions for which structures have been determined are indicated by black boxes (a, b), with annotations for consensus α-helix (red boxes) and β-strands (blue boxes) based on these structures included. Regional highlights not evident in these structures (AR, I, II) are discussed in the text. On the interactive plots on the server, predictions can be toggled on and off by clicking on their name