| Literature DB >> 27792169 |
Muhammad Usman Mirza1,2, Nazia Ikram3.
Abstract
The Ebola virus (EBOV) has been recognised for nearly 40 years, with the most recent EBOV outbreak being in West Africa, where it created a humanitarian crisis. Mortalities reported up to 30 March 2016 totalled 11,307. However, up until now, EBOV drugs have been far from achieving regulatory (FDA) approval. It is therefore essential to identify parent compounds that have the potential to be developed into effective drugs. Studies on Ebola viral proteins have shown that some can elicit an immunological response in mice, and these are now considered essential components of a vaccine designed to protect against Ebola haemorrhagic fever. The current study focuses on chemoinformatic approaches to identify virtual hits against Ebola viral proteins (VP35 and VP40), including protein binding site prediction, drug-likeness, pharmacokinetic and pharmacodynamic properties, metabolic site prediction, and molecular docking. Retrospective validation was performed using a database of non-active compounds, and early enrichment of EBOV actives at different false positive rates was calculated. Homology modelling and subsequent superimposition of binding site residues on other strains of EBOV were carried out to check residual conformations, and hence to confirm the efficacy of potential compounds. As a mechanism for artefactual inhibition of proteins through non-specific compounds, virtual hits were assessed for their aggregator potential compared with previously reported aggregators. These systematic studies have indicated that a few compounds may be effective inhibitors of EBOV replication and therefore might have the potential to be developed as anti-EBOV drugs after subsequent testing and validation in experiments in vivo.Entities:
Keywords: Ebola virus; VP35; VP40; molecular docking; phytochemicals; retrospective validation; virtual screening
Mesh:
Substances:
Year: 2016 PMID: 27792169 PMCID: PMC5133775 DOI: 10.3390/ijms17111748
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Ebola situation report March 2016. Ebola virus disease cases and deaths. Data are based on official information reported by the Ministry of Health in an Ebola situation report published on 30 March 2016.
Figure 2Chemical structures of virtual hits are represented in 2D format.
Figure 3Schematic workflow summarising the screening of Ebola virus (EBOV) inhibitors through a series of steps.
Predicted ADMET assessment of 13 virtual hits.
| ADMET | Compd. 1 | Compd. 2 | Compd. 3 | Compd. 4 | Compd. 5 | Compd. 6 | Compd. 7 | Compd. 8 | Compd. 9 | Compd. 10 | Compd. 11 | Compd. 12 | Compd. 13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BBB | + | + | + | + | + | + | + | + | + | + | + | + | + |
| HIA | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Caco-2 permeable | − | _ | − | − | + | + | − | − | + | + | + | − | − |
| Solubility | −3.16 | −3.1 | −3.25 | −3.25 | −5.24 | −4.33 | −4.38 | −2.44 | −2.53 | −2.89 | −2.62 | −3.63 | −2.53 |
| P-gp | |||||||||||||
| Substrate | + | − | − | + | + | − | + | − | − | − | + | − | − |
| Inhibitor | − | − | − | − | + | − | + | − | − | − | − | − | − |
| ROCT | − | − | − | − | − | − | − | − | − | + | − | − | |
| CYP450 substrate | |||||||||||||
| 2C9 | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 2D6 | − | − | − | − | − | − | − | − | − | − | + | − | − |
| 3A4 | − | − | − | − | + | − | − | − | − | + | + | + | |
| CYP450 inhibitor | |||||||||||||
| 1A2 | + | + | + | + | + | − | − | − | − | − | − | − | |
| 2C9 | − | − | − | − | + | − | − | − | − | − | − | − | − |
| 2D6 | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 2C19 | + | − | − | − | + | − | − | − | − | − | − | − | − |
| 3A4 | − | − | + | − | + | − | − | − | − | − | + | − | − |
| CYP IP | Low | Low | Low | Low | Low | Low | Low | Low | Low | Low | Low | Low | Low |
| AMES toxicity | − | − | − | − | − | − | − | − | − | − | − | − | − |
| Carcinogens | − | − | − | − | − | − | − | − | − | − | − | − | − |
ADME, absorption distribution metabolism elimination; BBB, blood–brain barrier; HIA, human intestinal absorption; CYP450, cytochrome P450; CYP IP, CYP inhibitory promiscuity; ROCT, renal organic cation transportation; +, present; −, not present.
Screening of 13 virtual hits from series of filters being applied.
| Virtual Hits | Oral Bioavailability | ADMET | Drug Safety Profiling | Filtered State | Undesirable Structuresmioties | Aggregator Advisor | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Drug Likeness | Lipinski’s Ro5 | Veber Rule | Egan Rule | GSK 4/400 Rule | Pfizer 3/75 Rule | Lilly MedChem Rules | Aggregator Likelihood | Similar with | |||||
| √ | √ | √ | √ | √ | √ | × | √ | Rejected | High_risk coumarines | Reported as an aggregator | |||
| Log | |||||||||||||
| √ | √ | √ | √ | √ | √ | √ | √ | Intermediate | Low_risk coumarines | Non-aggregator Log | |||
| √ | √ | √ | √ | √ | √ | × | √ | Rejected | High_risk coumarines | Reported as an aggregator | |||
| Log | |||||||||||||
| √ | √ | √ | √ | √ | √ | √ | × | Intermediate | Low_risk coumarines | Reported as an aggregator | |||
| Log | |||||||||||||
| √ | √ | √ | √ | √ | √ | × | √ | Intermediate | Low_risk coumarines | Not similar to any known aggregator in in-house database High Log | |||
| √ | √ | √ | √ | √ | √ | × | × | Intermediate | Low_risk coumarines | Reported as an aggregator | |||
| Log | |||||||||||||
| √ | √ | √ | √ | √ | √ | √ | × | Rejected | High_risk epoxide | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | √ | √ | Accepted | No | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | √ | √ | Accepted | No | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | √ | √ | Accepted | No | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | × | √ | Rejected | Frequent_hitter dopamine; Low_risk benzodioxolane | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | √ | √ | Accepted | No | Non-aggregator | |||
| √ | √ | √ | √ | √ | √ | √ | √ | Accepted | No | Reported as an aggregator | |||
| Log | |||||||||||||
Veber Rule; Bad or Good oral bioavailability rule (rotatable bonds ≤ 10) and (TPSA ≤ 140 Å or H-Bonds Donors + H-Bonds Acceptors ≤ 12), Egan Rule; Bad or Good oral bioavailability rule (0 ≥ TPSA ≤ 132) and (−1 ≥ logp ≤ 6), GSK4/400 rule; Bad or Good ADMET profile (logp ≤ 4) and (Molecular Weight ≤ 400), Pfizer 3/75 rule; Rule which narrates to compounds with a logp (>3) and TPSA (<75) are almost 2.5 times more likely to be toxic as to be clean, Lilly MedChem Rules; a set of 275 rules to identify compounds that may interfere with biological assays in terms of reactivity, interference with assay measurements, activities that damage proteins, and lack of druggability, √; Compounds fulfilled the criteria, ×; couldn’t pass the criteria, Accepted; Compounds with no structural cautions, Intermediate; Compounds with low-risk structural cautions, Rejected; Compounds that include high-risk structural cautions, Tc; Tonimoto coefficient.
Molecular docking analysis.
| Source-ID/Name/Formula | Autodock Vina (ΔG) | DSX Drugscore | Molecular Interactions | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EBOV-Z VP35 (kcal/mol) | EBOV-Z VP40 (kcal/mol) | EBOV-Z VP35 | EBOV-Z VP40 | EBOV-Z VP35 | EBOV-Z VP40 | ||||||||||
| Score (kcal/mol) | PCS | Score (kcal/mol) | PCS | Binding Site Interacting Residues | No. of H-Bond Interactions | No. of Hydrophobic Bonds Interactions | Total Number of Bonds | Binding Site Interacting Residues | No. of H-Bond Interactions | No. of Hydrophobic Bonds Interactions | Total number of Bonds | ||||
| Timtec-ST45161107 | −9.2 | −7.1 | −156.867 | −0.265 | −120.189 | −0.255 | Lys248, Pro293, Ile295, Val245, Phe328, Ile297, Asp302, Pro304, Gln244, Tyr229, Gly236, His240, Gln241, Phe235 | 0 | 31 | 31 | Gly126, Thr123, His124, Thr173, Phe174, Ile82, Phe172, Tyr171, Phe125, Gly126 | 0 | 30 | 30 | |
| Otava-7118230235 | −8.6 | −6.9 | −133.08 | −0.291 | −95.698 | −0.229 | Ile303, Pro304, Asp302, Gln244, Val245, Gln241, Phe328, Ile297, Lys248, Pro293, Ile295 | 0 | 27 | 27 | His124, Phe125, Thr123, Tyr171, Thr173, Phe174, Ile82, Phe172 | 1 (3.18A) Thr173 | 30 | 31 | |
| Timtec-ST50912611 | −8.2 | −7.2 | −152.793 | −0.317 | −115.494 | −0.247 | Val245, Pro293, Ile303, Ile295, Lys248, Gln241, Pro304, Gln244, Asp302, Phe328, Ile297 | 1 (3.12) His296 | 28 | 29 | Phe174, Ile82, Thr173, His124, Thr123, Phe125, Arg134, Tyr171, Phe172 | 0 | 29 | 29 | |
| Timtec-ST50616170 | −8.3 | −6.4 | −111.212 | −0.24 | −93.831 | −0.203 | Val245, Pro293, Ile303, Ile295, Lys248, Gln241, Pro304, Gln244, Asp302, Phe328, Ile297, Val294 | 0 | 29 | 29 | Phe174, Thr173, His124, Thr123, Phe125, Tyr171, Phe172 | 1 (3.12A) Thr173 | 28 | 29 | |
| Analyticon-NP-010155 | −8 | −6.3 | −105.282 | −0.25 | −98.943 | −0.232 | Val294, Ile295, Pro304, Gln241, Gln244, Lys248, Val245, Ala221, Lys251, Pro293 | 0 | 26 | 26 | Thr123, Thr173, Phe125, His124, Gln170, Gly126, Tyr171 | 1 (2.85A) Thr173 | 23 | 24 | |
| Otava-0115540195 | −8 | −6.9 | −124.21 | −0.269 | −91.619 | −0.283 | Ile295, Ile297, Asp302, Phe328, Phe235, Tyr229, Gly236, Pro304, His240, Gln241, Pro293, Val294, Lys248, Val245, Gln244 | 1 (3.11) Gln241 | 28 | 28 | Phe125, Arg134, His124, Thr123, Phe172, Thr173, Tyr171 | 0 | 20 | 20 | |
| Analyticon-NP-019744 | −7.8 | −7.3 | −102.276 | −0.198 | −91.924 | −0.213 | Gln244, Lys248, Ile295, Pro293, Leu249, Val294, Val245, His296 | 1 (2.93) His296 | 34 | 34 | Phe125, Arg134, His124, Thr123, Phe172, Tyr171 | 1 (3.27) Arg134 | 21 | 23 | |
| Analyticon-NP-005474 | −7.7 | −7 | −123.944 | −0.253 | −116.708 | −0.23 | Val245, Lys248, Gln244, Ile295, Pro293, Val294, His296 | 3 (3.11A, 3.12, 3.22A) His296 | 26 | 29 | Phe172, Thr123, His124, Gln170, Gly126, Phe125, Tyr171, Thr173 | 2 (3.01A, 3.03A) Thr173 | 33 | 39 | |
| PubChem-CID17597017 | −7.6 | −6.7 | −98.897 | −0.258 | −91.398 | −0.213 | Pro293, Ile295, Phe328, Pro304, Gln244, Val245, Lys248, Val294 | 0 | 19 | 19 | Phe172, Thr123, Phe125, Tyr171, Thr173 | 0 | 19 | 19 | |
| Analyticon-NP-000375 | −7.3 | −7.5 | −95.53 | −0.214 | −116.548 | −0.217 | Val 294, Pro 293, Gln244, Lys248, Val245, Ile295, Val294 | 1 (3.27) His296 | 29 | 30 | Phe172, Thr123, His124, Gln170, Gly126, Phe125, Tyr171 | 1 (3.10) Thr123 | 33 | 35 | |
| Analyticon_NP-014205 | −7.3 | −6.9 | −111.517 | −0.281 | −95.502 | −0.246 | Phe328, Val294, Gln244, Val245, Lys248, Ile295, Pro293 | 0 | 24 | 24 | Phe172, Thr123, His124, Phe174, Tyr171, Thr173 | 1 (3.00) Thr123 | 21 | 23 | |
| Analyticon-NP-014522 | −7.2 | −6.9 | −97.222 | −0.227 | −96.182 | −0.243 | Phe328, Val245, Val294, Gln244, Val245, Lys248, Ile295, Pro293 | 0 | 26 | 26 | Phe172, His124, Thr123, Asn136, Arg134, Tyr171, Thr173 | 1 (2.89A) Thr173 1 (3.08A) Phe172 1 (3.29A) His124 | 21 | 26 | |
| Analyticon_NP-003228 | −7 | −6.4 | −105.752 | −0.238 | −91.824 | −0.194 | Val245, Asp302, Ile297, Gln241, Val294, Gln244, Val245, Lys248, Ile295, Pro293 | 1 | 19 | 19 | Ile82, Phe172, His124, Tyr171, Phe125, Thr123, Thr173, | 1 (2.96A) Tyr171 1 (3.13A) Thr123 | 19 | 19 | |
Post-docking analysis is representing Binding energies G (kcal/mol), Rescoring binding energies, Per Contact Score (PCS), molecular interactions with EBOV-Z VP35 and VP40 (In bold are common compounds for both viral proteins.)
Figure 4Protein ligand analysis by ligplot. Ligplots showed the conserved binding modes (outlined by the orange lines) of virtual hits with respective targets: EBOV-Z VP35 (A) and VP40 (B). Conserved interacting residues are displayed in red circles.
Figure 5Molecular surface representation of EBOV-Z VP35 with respective ligands. (A–M) Molecular surface representation (in green) with respective ligands displayed in stick format (in magenta). Binding site residues are in pink. Alongside each 3D complex are schematic representations of the 2D interactions (with a cutoff distance of 4 Å) between each ligand and EBOV-Z VP35 amino acid residue using Ligplot analysis. Residues involved in hydrophobic contacts are black and demarcated by a spoked red arc, while those involved in hydrogen bonding are green, along with the values of the distances. Atoms are shown in white for carbon, red for oxygen, and blue for nitrogen.
Figure 6Molecular surface representation of EBOV-Z VP40 with respective ligands. (A–M) Molecular surface representation (in green) with respective ligands displayed in stick format (in magenta). Binding site residues are in pink. Residues Thr123, Phe125, and Arg134 form the catalytic triad in the active sites that interact with RNA are displayed in yellow. Alongside each 3D complex are schematic representations of the 2D interactions (with a cutoff distance of 4 Å) between each ligand and EBOV-Z VP40 amino acid residue using Ligplot analysis. Residues involved in hydrophobic contacts are black and demarcated by a spoked red arc, while those involved in hydrogen bonding are green, along with the values of the distances. Atoms are shown in black for carbon, red for oxygen, and blue for nitrogen.
Figure 7Structural analysis of different Ebola strains. (A) Multiple sequence alignment of viral protein 35 (VP35) and 40 (VP40) of Ebola Zaire, Reston, Côte d’Ivoire, Sudan and Bundibugyo along with their chain length. Residues are coloured according to default colouring scheme of clustalX. Important structural residues of binding sites of VP35 and VP40 among the five Ebola subtypes are highlighted with a black outline, and labelled by residue name, as indicated by an arrowhead. The red arrowhead indicates RNA interacting residues (Thr123, Phe125 and Arg134) of VP40, which are also conserved among all subtypes. Sequences of representative Ebola virus subtypes were retrieved from GenBank and aligned using ClustalW. The alignment was further hand curated using Jalview 2.7; (B) Structural comparisons of conserved binding site residues. Individual RMSD values of conserved binding site residues (VP35 in left, VP40 in right) of all Ebola strains are plotted against the corresponding residues of EBOV-Z. The RMSD analysis shows that binding site residues of EBOV-S, EBOV-R, EBOV-B and EBOV-T contain a close structural similarity with corresponding residues of EBOV-Z; Structural superimposition of VP35 (C) and VP40 (D) are displayed in the bottom. Amino acid side chains highlight the conserved binding site residues of VP35 and VP40 in all strains as EBOV-Z (tan), EBOV-S (plum), EBOV-T (green), EBOV-B (sky blue), EBOV-R (coral), top ligands (wire form) are docked in the binding site of EBOV-Z. Molecular surface of the EBOV-Z binding site is highlighted in brown.
Homology modelling of EBOV different strains.
| Target Proteins | Ebola Strains | Template PDB ID | Query Cover | E-Value | Maximum Identity |
|---|---|---|---|---|---|
| VP40 | EBOV-T | 3TCQ.A | 100% | 0 | 76% |
| EBOV-R | 3TCQ.A | 98% | 0 | 79% | |
| EBOV-B | 3TCQ.A | 86% | 0 | 88% | |
| VP35 | EBOV-S | 3KS4.A | 51% | 6.00 × 10−105 | 81% |
| EBOV-T | 3KS4.A | 49% | 3.00× 10−102 | 82% | |
| EBOV-B | 3KS4.A | 49% | 2.00× 10−100 | 80% |
PSI-BLAST aligned templates against Protein Data Bank (PDB) repository, of EBOV VP35 (Sudan, Tai Forest and Bundibugyo) and VP40 (Tai Forest, Bundibugyo and Reston) with query coverage, E-value, and maximum identity.
Figure 8ROC curve analysis. ROC curve analysis of actives and inactives for detecting the performance of the VS method being employed, with the true- and false-positive rates in the X- and Y-axis, respectively; (A) The AUC of 0.931 (95% CI, 0.67–0.98) against EBOV-VP35; (B) the AUC of 0.830 (95% CI, 0.69–0.90) against EBOV-VP40. ROC, receiver operating characteristic, both by binomial and empirical approach; AUC, area under the ROC curve; CI, confidence interval; Z-value, Z-score for testing the designated hypothesis test; p-value, probability level associated with the Z-value; EF, Enrichment factor.
Figure 9Metabolic sites prediction by MetaPrint2D. Diagrammatical representation of MetaPrint2D plots of common compounds (2-D format) for Ebola viral proteins, VP35 and VP40. Sites of metabolism are representing by colours as Red: most favourable site, Orange: medium/moderate, Green: Low, very low is uncoloured; and grey: no data available. NOR represents Normalized Occurrence Ratio; high NOR showed the more frequently reported metabolic sites in the metabolite database.