| Literature DB >> 34305174 |
Kunal Dutta1, Ammar D Elmezayen2, Anas Al-Obaidi2, Wei Zhu3, Olga V Morozova4, Sergey Shityakov5,6,7, Ibrahim Khalifa8.
Abstract
At the very beginning of the new decade, the COVID-19 pandemic has badly hit modern human societies. SARS-CoV-2, the causative agent of COVID-19 acquiring mutations and circulating as new variants. Herein, we have found three new antiviral peptides (AVPs) against the SARS-CoV-2. These AVPs are analogous to the spike glycoprotein of the SARS-CoV-2. Antiviral peptides, i.e., Seq12, Seq12m, and Seq13m, can block the receptor-binding domain (RBD) of the SARS-CoV-2, which is necessary for communicating with the angiotensin-converting enzyme 2 (ACE2). Also, these AVPs sustain their antiviral properties, even after the insertion of 25 mutations in the RBD (Rosetta and FoldX based). Further, Seq12 and Seq12m showed negligible cytotoxicity. Besides, the binding free energies calculated using MM-PB/GBSA method are also in agreement with the molecular docking studies. The molecular interactions between AVPs and the viral membrane protein (M) also showed a favorable interaction suggesting it could inhibit the viral re-packaging process. In conclusion, this study suggests Seq12, Seq12m, and Seq13m could be helpful to fight against SARS-CoV-2. These AVPs could also aid virus diagnostic tools and nasal spray against SARS-CoV-2 in the future.Entities:
Keywords: SARS-CoV-2; antiviral peptide; mutation; receptor-binding domain; spike glycoprotein
Year: 2021 PMID: 34305174 PMCID: PMC8283670 DOI: 10.1016/j.molstruc.2021.131113
Source DB: PubMed Journal: J Mol Struct ISSN: 0022-2860 Impact factor: 3.196
Fig. 1Peptide sequence mapping and three-dimensional structure of the AVPs. (a) Seq12, (b) Seq12m, (c) seq13m, (d) sequence mapping of the AVPs with the spike glycoprotein of the SARS-CoV-2 (Q913 – G972).
Physicochemical and biological properties of the antiviral peptides
| Properties | Antiviral peptides | ||
|---|---|---|---|
| Seq12 | Seq12m | Seq13m | |
| APVpred | AVP | AVP | AVP |
| Meta-iAVP | 0.716 | 0.752 | 0.85 |
| PROB-HemoPI | 0.17 | 0.17 | 0.46 |
| Hydrophobicity | -0.23 | -0.21 | -0.10 |
| Steric hindrance | 0.69 | 0.67 | 0.65 |
| Solvation | 0.38 | 0.39 | 0.65 |
| Hydropathicity | -0.78 | -0.70 | -0.25 |
| Amphiphilicity | 0.72 | 0.62 | 0.73 |
| Hydrophilicity | 0.09 | 0.03 | -0.20 |
| Net hydrogen | 47.0 | 46.0 | 30.0 |
| Charge (pH = 7) | 1.1 | 0.0 | -1.0 |
| Isoelectric point (pI) | 8.71 | 7.10 | 4.79 |
| Molecular weight | 5302.68 | 5374.75 | 4657.83 |
| Instability index | 37.74s | 46.15u | 53.32u |
| Aliphatic index | 72.77 | 72.77 | 76.10 |
| GRAVY | -0.777 | -0.711 | -0.0246 |
| Estimated half life | 0.8 h | 0.8 h | 1 h |
| Water solubility | Good | Good | Poor |
| Epitope for IgE | Non-allergen | Non-allergen | Non-allergen |
| IL-4 inducer | Non-inducer* | Non-inducer* | Non-Inducer* |
| IL-10 inducer | Inducer | Inducer | Inducer |
| IC50 | 39.38 µM | 39.31 µM | 38.54 µM |
AVPpred is available at http://crdd.osdd.net/servers/avppred.
Motif.
Align
Composition
Physicochemical composition.
Meta-iAVP is available at http://codes.bio/meta-iavp/.
Hemolytic prediction = PROB score HemoPI was calculated using HemoPI-1/1+motif and (SVM+Motif (HemoPI-2)).
Molecular weight in dalton.
Instability index : s = stable, u = unstable.
GRAVY = Grand average of hydropathy.
Predicted half-life in mammalian reticulocytes.
Calculated using AlgPred, it is available at http://crdd.osdd.net/raghava/algpred/.
IL-4 inducer predictions are made using the SVM method (SVM Threshold = 0.5). *However, the complete antiviral sequence contains two motifs to induce the IL-4 and hydride method (SVM+motif ) score is 1.04, suggesting it is an IL-4 inducer peptide.
L-10 inducing probability was calculated using RANDOM FOREST probability threshold = 0.5 and SVM method.
The half-maximal inhibitory concentration (IC50). It is a theoretical value.
Fig. 2Comparison of antiviral properties of Seq12, Seq12m, and Seq13m with other anti-SARS-CoV-2 peptides. *PROB score is the normalized SVM score and ranges between 0 and 1, i.e., 1 very likely to be hemolytic, 0 improbable to be hemolytic. **AVP-IC50Pred is a multiple machine learning technique based prediction of peptides' antiviral activity in terms of half-maximal inhibitory concentration (IC50).
Fig. 3Comparative structural analysis of the receptor-binding domain (RBD) of the SARS-CoV-2. (a) Wild type spike protein of SARS-CoV-2, RBD is highlighted in yellow (b) RRDis map of wild type and manual mutant model of the RBD (c) RRDis map of energy mutant, evolutionary mutant, combine mutant (d) overlapping three-dimensional structures of wild type RBD (gold) and manual mutant model of RBD (cyan), (e) overlapping three-dimensional structures of different mutant models of RBD, energy mutant (pink), evolutionary mutant (gold), combine mutant (cyan), (f) correspondence analysis of different structural models (based on Dali Z-scores), (g) heatmap of different mutant models of the RBD. *Corresponding analysis is a multidimensional scaling method. It positions data with the most similar structural neighborhoods near each other.
In silico mutant models of the receptor-binding domain of the SARS-CoV-2
| Amino acid residues (Wild type to mutant type) | FoldX | Rosetta | |
|---|---|---|---|
| Combined mutant -27.63 | A348P | -1.66 | -2.34 |
| N354E | -0.45 | - | |
| A372T | 0.20 | - | |
| S373M | -2.30 | -2.82 | |
| T393F | -1.41 | -6.03 | |
| N394S | -0.63 | - | |
| S399M | -3.17 | -3.09 | |
| R403K | 0.48 | - | |
| K417V* | 0.21 | - | |
| N460K | -0.75 | - | |
| I468L | -0.40 | - | |
| T470Y | -2.10 | -2.19 | |
| S477G | -0.03 | - | |
| S494Y | -1.47 | -2.12 | |
| G502P | -1.94 | -2.61 | |
| V503P | -1.18 | -2.72 | |
| H519N | -0.02 | - | |
| Energy mutant: -25.41 | A348P | -1.66 | -2.34 |
| S373M | -2.30 | -2.82 | |
| T393F | -1.41 | -6.03 | |
| S399M | -3.17 | -3.09 | |
| T470Y | -2.10 | -2.19 | |
| S494Y | -1.47 | -2.12 | |
| G502P | -1.94 | -2.61 | |
| V503P | -1.18 | -2.72 | |
| Evolution mutant: -6.29 | A348P | -1.66 | -1.66 |
| N354E | -0.45 | -0.45 | |
| A372T | 0.20 | 0.20 | |
| T393S | 0.47 | 0.47 | |
| N394S | -0.63 | -0.63 | |
| R403K | 0.48 | 0.48 | |
| K417V* | 0.21 | 0.21 | |
| N460K | -0.75 | -0.75 | |
| I468L | -0.40 | -0.40 | |
| T470N | 0.06 | -0.06 | |
| S477G | -0.03 | -0.03 | |
| H519N | -0.02 | -0.02 |
In the manual mutant model of RBD (RBDm), F486P, Y489F, Q493M, G496P, Q498L, T500Y, N501L, and Y505A substitutions were manually incorporated. *Among SARS-CoV-2 variants, mutation at N501 is common in B1.1.7, P.1, B1.351, and at K417 in B.1.617.2.1
Structural analysis of mutant models of the receptor-binding domain (RBD) of the SARS-CoV-2
| Sl. No. | Mutant models of RBD | RMSD | Percentage of identities | Z-score* |
|---|---|---|---|---|
| 1. | RBDm | 1.7 | 94 | 25.3 |
| 2. | RBDc | 1.7 | 96 | 25.3 |
| 3. | RBDe | 1.5 | 88 | 25.2 |
| 4. | RBDb | 1.7 | 84 | 23.9 |
RMSD = root mean square deviations
Percentage of identities with the wild type RBD structure, RBDm = manual mutant, RBDc = combined model, RBDe = energy mutant, RBDb = evolutionary mutant model of the receptor-binding domain. *Dali Z-score.
HADDOCK docking properties of the AVP-RBD complexes
| Properties | Systems | |||||
|---|---|---|---|---|---|---|
| Seq12 + RBD | Seq12 + RBDm3 | Seq12m + RBD | Seq12m + RBDm | Seq13m + RBD | Seq13m + RBDm | |
| HADDOCK Score | -111.2 | -92.8 | -76.8 | -76.7 | -81.4 | -41.7 |
| Cluster size | 104 | 36 | 13 | 27 | 40 | 14 |
| RMSD | 0. 5 | 0.4 | 2.6 | 4.7 | 0.5 | 3.5 |
| VDW2 energy | -68.8 | -71.3 | -71.9 | -78.3 | -75.2 | -63.4 |
| Electrostatic energy | -182.4 | -142.0 | -145.0 | -53.2 | -70.3 | -73.8 |
| Desolvation energy | -36.9 | -36.6 | -4.9 | -43.1 | -20.9 | -28.5 |
RMSD = Root mean square deviation from the overall lowest-energy structure. 2VDW = van der Waals energy. 3RBDm = mutant model of the receptor-binding domain of the SARS-CoV-2. Energy units are in kcal/mol.
Fig. 4The binding posture of the AVPs with the receptor-binding domain of the SARS-CoV-2. wild type receptor binding domain complex with (a) Seq12, (b) Seq12m, (c) Sqe13m, mutant receptor binding domain complex with (d) Seq12, (e) Seq12m, (f) Sqe13m.
Fig. 5Membrane protein (M) of the SARS-CoV-2. (a) predicted transmembrane domains of the M-protein are highlighted in yellow, (b) analyses of molecular docking results between the AVPs and the transmembrane domains.
Fig. 6Comparisons of 50 ns MD-Simulation results of the antiviral peptides Seq12, Seq12m, and Seq13m.
Fig. 7Comparisons of the 100 ns MD-Simulation results of the AVP-RBD complexes.
Fig. 8Difference of RMSF values obtained from 100 ns MD-simulation of the AVP-RBD complexes. The major fluctuations of the C-alpha atoms are highlighted by black in the cartoon presentation of the RBD, from the 149th-158th amino acid residue (*) and from 200th to the rest of the amino acid residues (**).
Binding free energy (kcal/mol) resulting from MM-PBSA analysis of different AVP-RBD complexes
| Energy components* | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Δ | Δ | Δ | Δ | Δ | Δ | Δ | Δ | Δ | |
| A | -34.32 | -55.44 | 46.67 | -7.53 | -89.76 | 39.13 | 12.35 | -62.98 | -50.62 |
| B | 0.56 | -92.36 | 17.31 | -10.39 | -91.80 | 6.91 | 17.87 | -102.76 | -84.88 |
| C | -42.41 | -65.07 | 56.69 | -8.94 | -107.49 | 47.75 | 14.27 | -74.02 | -59.74 |
| D | -12.33 | -81.68 | 26.52 | -9.13 | -94.01 | 17.39 | 14.19 | -90.82 | -76.62 |
| E | -24.69 | -41.09 | 34.51 | -5.36 | -65.78 | 29.14 | 9.8 | -46.46 | -36.64 |
| F | -40.48 | -93.84 | 56.78 | -10.89 | -134.33 | 45.89 | 16.3 | -104.74 | -88.44 |
A = Seq12+RBD, B = Seq12+RBDm, C = Seq12m+RBD, D = Seq12m+RBDm, E = Seq13m+RBD, F = Seq13m+RBDm. *MM-PBSA was calculated using CaFE tools with the 100 ns MD-trajectory generated using NAMD software.
Binding free energy (kcal/mol) resulting from MM-GBSA analysis of different AVP-RBD complexes
| Energy components* | ||||||||
|---|---|---|---|---|---|---|---|---|
| Vdw | EEL | EGB | ESURF | G gas | G solv | Δ | Δ | |
| A | -67.18 | -140.14 | 172.36 | -10.40 | -207.33 | 161.96 | -45.36 | -92.09 |
| B | -97.66 | -62.18 | 120.23 | -14.01 | -159.85 | 106.22 | -53.63 | -93.31 |
| C | -93.66 | -208.49 | 254.69 | -13.60 | -302.16 | 241.09 | -61.07 | -50.09 |
| D | -102.19 | -44.99 | 97.17 | -12.94 | -147.18 | 84.22 | -62.95 | -75.13 |
| E | -84.81 | -155.38 | 210.48 | -10.98 | -240.20 | 199.49 | -40.70 | -57.96 |
| F | -70.96 | -87.50 | 129.40 | -9.52 | -158.47 | 119.88 | -38.59 | -53.90 |
A = Seq12+RBD, B = Seq12+RBDm, C = Seq12m+RBD, D = Seq12m+RBDm, E = Seq13m+RBD, F = Seq13m+RBDm. ⁎MM-GBSA was calculated for 100 ns using Abmer 16 package. Generalized Born ESURF calculated using 'LCPO' surface areas. #ΔGGBSA was calculated using HawkDock.
Fig. 9Linear regression between predicted ΔGPB/GBSA calculated for 50 ns (*) and predicted ΔGPB/GBSA calculated for 100 ns (**). The dotted line around the solid line indicates 95% confidence intervals.
Fig. 10Distribution of binding free energies (MM-GBSA) per amino acid residues. (a) Seq12 + RBD, (b) Seq12m + RBD, (c) Seq13m + RBD, (d) Seq12 + RBDm, (e) Seq12m + RBDm, (f) Seq13m + RBDm. *Only top twenty participating amino acid are presented according to the ascending / desending order of energies (MM-GBSA, kcal/mol).
Comparison of translational, vibrational, and rotational entropy among the AVP-RBD complexes calculated using the two-phase thermodynamic (2PT) model.
| Systems | Δ | Δ | Δ | Total Δ |
|---|---|---|---|---|
| Seq12 + RBD | -15.17 | -14.68 | -16.28 | -46.13 |
| Seq12 + RBDm$ | -15.17 | -14.71 | -13.44 | -43.33 |
| Sqe12m + RBD | -15.16 | -14.68 | -15.16 | -45.01 |
| Sqe12m + RBDm$ | -15.16 | -14.69 | -10.88 | -40.74 |
| Seq13m + RBD | -15.07 | -14.41 | -17.10 | -46.58 |
| Seq13m + RBDm$ | -15.07 | -14.33 | -0.43 | -29.85 |
Entropy results were (harmonic approximation) calculated with NMODE. All entropy results have units kcal/mol (Temperature is 298.15 K).