| Literature DB >> 35644302 |
Gloria Gallo1, Uilla Barcick1, Camila Coelho1, Murilo Salardani1, Maurício F Camacho1, Daniela Cajado-Carvalho2, Flávio V Loures1, Solange M T Serrano2, Leon Hardy3, André Zelanis1, Martin Würtele4.
Abstract
The main protease Mpro of SARS-CoV-2 is a well-studied major drug target. Additionally, it has been linked to this virus' pathogenicity, possibly through off-target effects. It is also an interesting diagnostic target. To obtain more data on possible substrates as well as to assess the enzyme's primary specificity a two-step approach was introduced. First, Terminal Amine Isobaric Labeling of Substrates (TAILS) was employed to identify novel Mpro cleavage sites in a mouse lung proteome library. In a second step, using a structural homology model, the MM/PBSA variant MM/GBSA (Molecular Mechanics Poisson-Boltzmann/Generalized Born Surface Area) free binding energy calculations were carried out to determine relevant interacting amino acids. As a result, 58 unique cleavage sites were detected, including six that displayed glutamine at the P1 position. Furthermore, modeling results indicated that Mpro has a far higher potential promiscuity towards substrates than expected. The combination of proteomics and MM/PBSA modeling analysis can thus be useful for elucidating the specificity of Mpro, and thus open novel perspectives for the development of future peptidomimetic drugs against COVID-19, as well as diagnostic tools.Entities:
Keywords: COVID-19; Free energy calculations; MM/PBSA; Main protease; Proteomics; TAILS
Mesh:
Substances:
Year: 2022 PMID: 35644302 PMCID: PMC9134770 DOI: 10.1016/j.peptides.2022.170814
Source DB: PubMed Journal: Peptides ISSN: 0196-9781 Impact factor: 3.867
Cleavage sites displaying the canonical specificity of Mpro identified by TAILS analysis. Six peptides spanning the canonical Gln at P1 site were identified by TAILS.
| Cleavage site (P5-P5’) | Identified peptide (P1’-Pn’) | Gene name | UniProt Accession | Protein name |
|---|---|---|---|---|
| GAEIQ↓DGRFN | DGRFNLFKVQQGR | FIP1 | Q9D824–4 | Pre-mRNA 3'-end-processing factor FIP1 |
| GETEQ↓KRIRK | KRIRKKKAKKR | CA131 | Q8CIL4 | Uncharacterized protein C1orf131 homolog |
| SSPSQ↓SGDTQ | SGDTQTFAQKLQLR | GOGA4 | Q91VW5 | Golgin subfamily A member 4 |
| QLNSQ↓LFVGG | LFVGGKSSRQKGFFGCIR | CTP5C | Q0V8T7 | Contactin-associated protein like 5–3 |
| KRSQQ↓EDDQE | EDDQEFFEDR | FBX43 | Q8CDI2 | F-box only protein 43 |
| SSSVQ↓IDPPL | IDPPLSSWKDLRTFKQR | COSA1 | Q2UY11 | Collagen alpha-1(XXVIII) chain |
Since TAILS approach targets prime-side peptides, non-prime side positions (i.e. P5 to P1) were bioinformatically inferred from peptide identifications, after database searches using the MS data.
MM/GBSA analysis of Mbinding peptides. The main energy components from the MM/GBSA calculations , the conformational entropy term , as estimated with normal mode analysis and the total free binding energy are given in kcal/mol.
| Peptide | ΔHGBSA (kcal/mol) | TΔS (kcal/mol) | ΔG (kcal/mol) |
|---|---|---|---|
| GAEIQDGRFN | -64.5 + /- 0.9 | -37.9 + /- 1.2 | -26.6 + /- 1.3 |
| GETEQKRIRK | -74.3 + /- 1.1 | -47.5 + /- 1.0 | -26.8 + /- 1.6 |
| KRSQQEDDQE | -73.7 + /- 1.2 | -45.3 + /- 1.1 | -28.4 + /- 1.3 |
| QLNSQLFVGG | -78.6 + /- 1.0 | -42.7 + /- 1.6 | -35.9 + /- 1.7 |
| SSPSQSGDTQ | -56.2 + /- 1.1 | -39.9 + /- 1.5 | -16.3 + /- 1.6 |
| SSSVQIDPPL | -62.2 + /- 0.7 | -37.7 + /- 1.1 | -24.5 + /- 1.4 |
Fig. 1Structural alignment of Mpeptide complexes from homology modeling. A) The GAEIQDGRFN peptide identified in the TAILS analysis was homology modeled with MODELLER using a SARS-CoV peptide complex structure (PDB ID 2Q6G). The SARS-CoV-2 Mpro peptide binding site is shown as electrostatic potential surface. B) Structural superimposition of the six peptides displaying the canonical Gln at P1 position.
MM/GBSA energy components. Different energy components from the MM/GBSA analysis of the binding of the peptides displaying the canonical Gln at P1 position to SARS-CoV-2 Mpro are given in kcal/mol.
| Peptide | MM/GBSA Term | Ligand Energy | Ligand SEM | Receptor Energy | Receptor SEM | Total Energy | Total SEM |
|---|---|---|---|---|---|---|---|
| GAEIQDGRFN | ΔHGBSA | -25.0 | 1.3 | -39.8 | 0.6 | -64.8 | 1.9 |
| ΔGnonpolar | -6.80 | 0.12 | -4.83 | 0.04 | -11.63 | 0.16 | |
| ΔGpolar | 38.2 | 4.3 | 21.4 | 1.2 | 59.7 | 5.5 | |
| ΔEel | -11.7 | 4.7 | -11.7 | 1.4 | -23.4 | 6.1 | |
| ΔEVdW | -44.7 | 0.9 | -44.7 | 0.3 | -89.4 | 1.1 | |
| GETEQKRIRK | ΔHGBSA | -23.3 | 1.3 | -51.8 | 0.6 | -75.1 | 1.9 |
| ΔGnonpolar | -7.36 | 0.12 | -5.20 | 0.03 | -12.57 | 0.15 | |
| ΔGpolar | 160.8 | 5.4 | 130.2 | 1.4 | 290.9 | 6.8 | |
| ΔEel | -129.1 | 5.6 | -129.1 | 1.7 | -258.2 | 7.3 | |
| ΔEVdW | -47.6 | 0.9 | -47.6 | 0.3 | -95.3 | 1.2 | |
| KRSKQEDDQE | ΔHGBSA | -22.5 | 1.7 | -50.8 | 0.8 | -73.4 | 2.5 |
| ΔGnonpolar | -7.29 | 0.11 | -5.25 | 0.02 | -12.55 | 0.13 | |
| ΔGpolar | 61.0 | 6.3 | 30.7 | 3.5 | 91.7 | 9.9 | |
| ΔEel | -26.6 | 7.1 | -26.6 | 4.2 | -53.2 | 11.2 | |
| ΔEVdW | -49.6 | 1.1 | -49.6 | 0.3 | -99.3 | 1.5 | |
| QLNSQLFVGG | ΔHGBSA | -35.4 | 1.4 | -43.0 | 0.5 | -78.4 | 2.0 |
| ΔGnonpolar | -7.16 | 0.13 | -4.95 | 0.03 | -12.12 | 0.16 | |
| ΔGpolar | 90.2 | 2.5 | 80.5 | 1.3 | 170.7 | 3.8 | |
| ΔEel | -70.2 | 3.0 | -70.2 | 1.6 | -140.3 | 4.6 | |
| ΔEVdW | -48.4 | 1.0 | -48.4 | 0.3 | -96.7 | 1.4 | |
| SSPSQSGDTQ | ΔHGBSA | -19.1 | 1.4 | -37.0 | 0.6 | -56.1 | 2.0 |
| ΔGnonpolar | -6.05 | 0.13 | -3.99 | 0.03 | -10.03 | 0.16 | |
| ΔGpolar | 53.1 | 3.0 | 33.3 | 1.3 | 86.4 | 4.3 | |
| ΔEel | -29.3 | 3.5 | -29.3 | 1.6 | -58.6 | 5.1 | |
| ΔEVdW | -36.9 | 1.1 | -36.9 | 0.4 | -73.8 | 1.5 | |
| SSSVQIDPPL | ΔHGBSA | -26.2 | 1.2 | -36.5 | 0.4 | -62.7 | 1.6 |
| ΔGnonpolar | -6.28 | 0.10 | -4.51 | 0.02 | -10.79 | 0.12 | |
| ΔGpolar | 44.9 | 2.7 | 32.7 | 1.1 | 77.6 | 3.8 | |
| ΔEel | -21.7 | 3.1 | -21.7 | 1.2 | -43.4 | 4.3 | |
| ΔEVdW | -43.0 | 0.9 | -43.0 | 0.4 | -86.1 | 1.2 |
Fig. 2Per-residue binding energy decomposition of peptides. A) GAEIQDGRFN. B) GETEQKRIRK. C) KRSQQEDDQE. D) QLNSQLFVGG. E) SSPSQSGDTQ. F) SSSVQIDPPL. Using MM/GBSA analysis, the interaction energy of each amino acid with the rest of the system was estimated. The importance of the residues at P1 and P2 positions become apparent. Variations at e.g. P4 and P1’ positions could also explain improved binding of the corresponding peptides. Values are given in kcal/mol.
Fig. 3MM/GBSA energy components of protein and peptide. A) GAEIQDGRFN. B) GETEQKRIRK. C) KRSQQEDDQE. D) QLNSQLFVGG. E) SSPSQSGDTQ. F) SSSVQIDPPL. Polar solvation energy (ΔGpolar) represents the electrostatic interaction between the solute and the continuum solvent. It mostly compensates favorable electrostatic interactions between peptide and protein (ΔEel). Values are given in kcal/mol.
Fig. 4Binding energy decomposition analysis averaged over all peptide positions. A) The data from the MM/GBSA analysis of the six peptides displaying the canonical Gln at P1 position obtained through TAILS analysis indicate the relative importance of the amino acid residues at the P1, P2 and P4 positions. B) Results from the 51 non-canonical peptides obtained, showing similar per-residue energy decompositions. Results are given in kcal/mol.