Faraz Shaikh1,2, Yuguang Zhao2, Luis Alvarez2, Maria Iliopoulou2, Christopher Lohans3, Christopher J Schofield3, Sergi Padilla-Parra2,4, Shirley W I Siu1, Elizabeth E Fry2, Jingshan Ren2, David I Stuart2,5. 1. Department of Computer and Information Science, Faculty of Science and Technology , University of Macau , E11, Macau 999078 , China. 2. Division of Structural Biology , University of Oxford , The Henry Wellcome Building for Genomic Medicine , Headington, Oxford OX3 7BN , U.K. 3. Department of Chemistry , University of Oxford , Mansfield Road , Oxford OX1 3TA , U.K. 4. Biocruces-Bizkaia Health Research Institute , Ikerbasque, Basque Foundation for Science , Bilbao 48011 , Spain. 5. Diamond Light Source Limited , Harwell Science & Innovation Campus , Didcot OX11 0DE , U.K.
Abstract
Potent Ebolavirus (EBOV) inhibitors will help to curtail outbreaks such as that which occurred in 2014-16 in West Africa. EBOV has on its surface a single glycoprotein (GP) critical for viral entry and membrane fusion. Recent high-resolution complexes of EBOV GP with a variety of approved drugs revealed that binding to a common cavity prevented fusion of the virus and endosomal membranes, inhibiting virus infection. We performed docking experiments, screening a database of natural compounds to identify those likely to bind at this site. Using both inhibition assays of HIV-1-derived pseudovirus cell entry and structural analyses of the complexes of the compounds with GP, we show here that two of these compounds attach in the common binding cavity, out of eight tested. In both cases, two molecules bind in the cavity. The two compounds are chemically similar, but the tighter binder has an additional chlorine atom that forms good halogen bonds to the protein and achieves an IC50 of 50 nM, making it the most potent GP-binding EBOV inhibitor yet identified, validating our screening approach for the discovery of novel antiviral compounds.
Potent Ebolavirus (EBOV) inhibitors will help to curtail outbreaks such as that which occurred in 2014-16 in West Africa. EBOV has on its surface a single glycoprotein (GP) critical for viral entry and membrane fusion. Recent high-resolution complexes of EBOV GP with a variety of approved drugs revealed that binding to a common cavity prevented fusion of the virus and endosomal membranes, inhibiting virus infection. We performed docking experiments, screening a database of natural compounds to identify those likely to bind at this site. Using both inhibition assays of HIV-1-derived pseudovirus cell entry and structural analyses of the complexes of the compounds with GP, we show here that two of these compounds attach in the common binding cavity, out of eight tested. In both cases, two molecules bind in the cavity. The two compounds are chemically similar, but the tighter binder has an additional chlorine atom that forms good halogen bonds to the protein and achieves an IC50 of 50 nM, making it the most potent GP-binding EBOV inhibitor yet identified, validating our screening approach for the discovery of novel antiviral compounds.
Ebola hemorrhagic fever,
a deadly disease infecting both human
and nonhuman primates, is caused by the highly virulent negative-stranded
RNA, membrane-enveloped filovirus—Ebolavirus (EBOV). The 2014–16
West African outbreak claimed over 11 000 lives because suitable
therapeutics were not available. The membrane envelope of EBOV is
decorated by trimers of glycoprotein (GP), each monomer of which is
cleaved by furin into two polypeptides, GP1 and GP2. GP is solely
responsible for host cell attachment, endosomal entry, and membrane
fusion,[1−8] making it an obvious target for therapeutic intervention. A large
number of Food and Drug Administration (FDA)-approved drugs have been
found to be active against EBOV infection in vitro using either EBOV
or pseudotyped virus assays;[9−17] however, the precise mechanisms of inhibition remain largely unknown.
We have recently demonstrated, using X-ray crystallography, that nine
such drugs (Figure S1) interact directly
with EBOV GP.[18−20] The approved drugs bind in a cavity between the attachment
(GP1) and fusion (GP2) subunits, stabilized by predominantly hydrophobic
interactions. The cavity lies at the entrance to a large tunnel linking
to equivalent tunnels from the other monomers of the trimer at the
threefold axis. Residues lining the binding site are highly conserved
among filoviruses, with the exception of Marburg viruses (MARVs).
The cavity is occupied by residues 192–194 (DFF lid, which
immediately follow the putative cathepsin B/L cleavage site) in the
apo structure of the GP. Inhibitor binding expels the DFF lid from
the cavity, reducing the stability of the protein as judged by its
melting temperature. These results suggested that inhibitor binding
might trigger the premature release of GP2, preventing fusion between
the viral and endosome membranes. Alternatively because inhibitor
binding alters the conformation of the cathepsin B/L cleavage site,
it might inhibit cleavage, preventing removal of the glycan cap domain,
thus blocking the engagement of GP with its receptor NPC1.[19,20]The discovery of an inhibitor-binding site on EBOV GP offers
opportunities
for structure-based drug design against EBOV. Natural compounds have
been shown to be effective against different stages of viral infection[21,22] and have considerable structural diversity and remain a major source
of new drugs. We have therefore performed structure-based in silico
screening of a traditional Chinese medicine (TCM) database against
EBOV GP to identify novel drug leads. This approach has been combined
with thermal shift assays, pseudovirus entry assays, and crystallography
to identify and validate potential inhibitors. Our study reveals that
although the predictive power of the in silico screening is limited,
it still identified two novel compounds (out of eight tested) that
display inhibitory activity, as confirmed by pseudovirus entry assays
and proof of binding from crystallography. Indeed, one of these compounds
appears to be the most potent GP binder yet identified.
Results
Virtual Screening
Method Validation Using a Set of Known Binders
Our previous
work provides a set of eight drugs known to inhibit
EBOV by direct interaction with the GP (Figure S1).[18−20] To validate the docking methods, all drugs were subjected
to a virtual screening workflow, which we established using the Schrödinger
suite (http://www.schrodinger.com/). The IC50 values of the drugs against EBOV[10] were converted to pIC50 (−log
IC50) values and together with the docking scores Glide
XP[23,24] and quantum mechanics-polarized ligand docking
(QPLD)[25] are listed in Table S1. The correlation between the docking scores and pIC50 values is shown in Figure . Docking scores of Glide XP show only very weak correlation
to experimental pIC50 values (R2 = 0.18), whereas docking scores of QPLD have better correlation
to the experimental pIC50 values (R2 = 0.51), although, given the small number of compounds tested,
this result is not in itself robust. We think that the relatively
poor correlation between the docking scores and the experimental pIC50 values may be attributed to the conformational flexibility
of the side chains within the binding pocket of the GP, which cannot
be accounted for by the docking program. Nevertheless, this validation
method suggests that our virtual screening workflow combining Glide
XP docking and QPLD may be able to select binding compounds for the
target Ebola GP (although both false positives and negatives would
also be expected). As QPLD Emodel scores show some correlation with
the experimental IC50, we used these to rank screened compounds
for conducting further experiments (QPLD was also found to be useful
for the prediction of binding to another viral protein, suggesting
general utility[26]).
Figure 1
Correlation between the
docking score and IC50 for eight
known EBOV GP binders. (A) Plot of Glide XP Emodel against pIC50. (B) Plot of QPLD Emodel against pIC50.
Correlation between the
docking score and IC50 for eight
known EBOV GP binders. (A) Plot of Glide XP Emodel against pIC50. (B) Plot of QPLD Emodel against pIC50.
Virtual Screening of Novel
Natural Compound Inhibitors
To identify novel inhibitors
of Ebola GP, we screened the ZINC natural
compound library from the TCM database.[27] Out of nearly 2.5 million compounds, high-throughput virtual screening
(HTVS) selected 416 compounds for subsequent docking calculations.
Among these candidates, 88 compounds were selected based on their
Glide XP docking and ligand efficiency scores using scores from a
known inhibitor (toremifene) as cutoff values. QPLD calculations further
reduced the list to 16 compounds, all of which have passed filters
for pan assay interference compounds.[28] A total of eight of these best-scoring compounds were purchased
for in vitro experiments and crystallographic studies based on availability
and price (Figure and Table S2).
Figure 2
Natural compounds selected
from the in silico screen results for
experimental evaluations. Toremifene is included as a reference. (A)
ZINC32540717 (118), 1-{2-[3-hydroxy-4-(4-phenyl-1H-pyrazol-3-yl)phenoxy]ethyl}piperidine-4-carboxamide. (B)
ZINC00167626, 5-amino-2-{[3-(trifluoromethyl)phenyl]sulfanyl}benzonitrile.
(C) ZINC09410451 (118a), 1-[2-[(4Z)-4-[4-(4-chlorophenyl)-5-methyl-1,2-dihydropyrazol-3-ylidene]-3-oxocyclohexa-1,5-dien-1-yl]oxyethyl]piperidine-4-carboxamide.
(D) ZINC00407254, 2-(1-benzofuran-2-yl)-1-(2,4-dihydroxyphenyl)ethan-1-one.
(E) ZINC12893941, (2E)-3-(2H-chromen-3-yl)-1-(4-hydroxyphenyl)prop-2-en-1-one.
(F) ZINC04772639, (2E)-3-(6-bromo-2,4-dihydro-1,3-benzodioxin-8-yl)-1-(5-fluoro-2-hydroxyphenyl)prop-2-en-1-one.
(G) ZINC11865143, 1-(2,4-dihydroxy-6-methylphenyl)-2-(4-methoxyphenyl)ethan-1-one.
(H) ZINC00056827, 4-[(4-aminophenyl)disulfanyl]aniline. (I) Toremifene,
2-[4-[(Z)-4-chloro-1,2-diphenylbut-1-enyl]phenoxy]-N,N-dimethylethanamine.
Natural compounds selected
from the in silico screen results for
experimental evaluations. Toremifene is included as a reference. (A)
ZINC32540717 (118), 1-{2-[3-hydroxy-4-(4-phenyl-1H-pyrazol-3-yl)phenoxy]ethyl}piperidine-4-carboxamide. (B)
ZINC00167626, 5-amino-2-{[3-(trifluoromethyl)phenyl]sulfanyl}benzonitrile.
(C) ZINC09410451 (118a), 1-[2-[(4Z)-4-[4-(4-chlorophenyl)-5-methyl-1,2-dihydropyrazol-3-ylidene]-3-oxocyclohexa-1,5-dien-1-yl]oxyethyl]piperidine-4-carboxamide.
(D) ZINC00407254, 2-(1-benzofuran-2-yl)-1-(2,4-dihydroxyphenyl)ethan-1-one.
(E) ZINC12893941, (2E)-3-(2H-chromen-3-yl)-1-(4-hydroxyphenyl)prop-2-en-1-one.
(F) ZINC04772639, (2E)-3-(6-bromo-2,4-dihydro-1,3-benzodioxin-8-yl)-1-(5-fluoro-2-hydroxyphenyl)prop-2-en-1-one.
(G) ZINC11865143, 1-(2,4-dihydroxy-6-methylphenyl)-2-(4-methoxyphenyl)ethan-1-one.
(H) ZINC00056827, 4-[(4-aminophenyl)disulfanyl]aniline. (I) Toremifene,
2-[4-[(Z)-4-chloro-1,2-diphenylbut-1-enyl]phenoxy]-N,N-dimethylethanamine.
Evaluation of the Virtual Screen Results
by Thermal Shift Assay
and Crystallography
We first performed thermal shift assays
to test if the eight selected compounds could perturb the thermal
stability of GP. The results show that compounds ZINC32540717 (118) and ZINC09410451 (118a) [derivatives of
natural isoflavones[29] interact with the
dye (SYPRO Orange) and interfere with the fluorescence emission; the
remaining six compounds at pH 5.2 and at 500 μM concentration
do not alter the melting temperature of the GP (data not shown). Nevertheless,
we then carried out crystal soaking experiments at two compound concentrations,
2.5 and 5.0 mg/mL for all compounds. For each compound, eight crystals
were soaked, four at each concentration for 5–20 min. Soaked
crystals were mounted in loops and frozen in liquid nitrogen for diffraction
data collection on beamlines I04-1 and I24 of Diamond Light Source.
High multiplicity X-ray data were collected from crystals soaked with
compounds 118 and 118a to 2.05 and 2.30
Å, respectively. |Fo – Fc| difference electron density maps phased with
rigid-body refined models based on our previously published GP–bepridil
structure, excluding the ligand and water molecules,[18] indicated that compounds 118 and 118a bind to GP (Figure ). No binding was observed for the other six compounds for data collected
under the same regime.
Figure 4
Overall structure of
EBOV GP and electron density maps. (A) Surface
representation of the trimeric EBOV GP (PDB ID 6HRO). GP1, GP2, and
the glycan cap domain are colored in blue, red, and cyan, respectively;
for clarity, one GP monomer is in bright color and other two in light
colors. The bound inhibitor 118a is shown as cyan sticks.
(B) Close-up of the inhibitor-binding site. (C,D) Simulated annealing
omit |Fo – Fc| electron density maps contoured at 3σ for bound compounds 118 (C) and 118a (D). In both cases, two inhibitor
molecules are bound. There is no density for the pyrazole ring (indicated
by a red arrow) of 118 molecule I.
Compounds 118 and 118a Inhibit Ebola
Pseudovirus Infection
Compounds 118 and 118a were tested for their ability to inhibit EBOV infection
in vitro, using HIV-1-derived pseudoviruses expressing the Ebola virus
envelope GPs (EBOV pseudoparticle, EBOVpp) as described previously.[19] We used TZM-bl cells in the EBOVpp infection
assay because TZM-bl cells contain a β-Gal expression cassette
with an HIV-1-induced promoter, infected cells can be identified through
the hydrolysis of X-gal.[30,31] The best-known inhibitor
that directly interacts with EBOV GP, toremifene,[18−20] was used as
a positive control. Multiple concentrations (0.01–25 μM)
of 118, 118a, and toremifene were evaluated
in the EBOVpp infection assay, and the experiment was done in triplicate
(Figures and S2). The results show that all three compounds
inhibit EBOVpp infection and fusion (Figure S2) in a dose-dependent manner. The IC50s derived from the
experiment are 3.1 ± 0.02, 0.05 ± 0.01, and 0.09 ±
0.08 μM for 118, 118a, and toremifene,
respectively. Thus, 118a has the lowest IC50 among the inhibitors known to bind EBOV GP directly.[18−20] Although the experiments were not performed on live EBOV, the relative
ranking of inhibition constants is likely to be indicative of relative
potency against EBOV.
Figure 3
Compound 118a is a potent inhibitor of EBOVpp
infection
in live cells. (A–C) Infectivity assays to recover the IC50 and IC80 with different dilutions for compounds 118a (A), 118 (B), and toremifene (C) were performed
using a β-Gal assay. The percentage of infection inhibition
in a number of cells per condition is plotted against inhibitor concentration.
The error bars show the standard error coming from three independent
measurements, and the solid lines show a fit using a sigmoidal mathematical
model. The IC50 for 118a is 0.05 ± 0.01,
3.1 ± 0.02 μM for 118, and 0.09 ± 0.08
μM for toremifene.
Compound 118a is a potent inhibitor of EBOVpp
infection
in live cells. (A–C) Infectivity assays to recover the IC50 and IC80 with different dilutions for compounds 118a (A), 118 (B), and toremifene (C) were performed
using a β-Gal assay. The percentage of infection inhibition
in a number of cells per condition is plotted against inhibitor concentration.
The error bars show the standard error coming from three independent
measurements, and the solid lines show a fit using a sigmoidal mathematical
model. The IC50 for 118a is 0.05 ± 0.01,
3.1 ± 0.02 μM for 118, and 0.09 ± 0.08
μM for toremifene.
Overall Structures of GP–118 and GP–118a Complexes
The complexes were refined with good R-factors and stereochemistry (Table ). The resultant electron density maps unambiguously
show the binding of the compounds. In both cases, there are two inhibitor
molecules bound in each GP-binding pocket (Figure ). Hereafter, we name the molecule that binds closest to Y517
as molecule I (this is the most interior and presumably the tighter
binder) and that closest to M548 as molecule II.
Table 1
Data Collection and Refinement Statistics
GP–118
GP–118a
Data Collection
space
group
R32
Cell Dimensions
a, b, c (Å)
114.2, 114.2, 305.4
115.1, 115.1, 307.9
α, β,
γ
(deg)
90, 90, 120
90, 90, 120
resolution (Å)
57.1–2.05 (2.09–2.05)a
83.8–2.30 (2.34–2.30)
Rmerge
0.063 (-)
0.098 (-)
I/σI
21.1 (1.1)
36.5 (1.4)
completeness (%)
98.4 (90.4)
100 (99.9)
redundancy
15.8 (6.0)
91.0 (16.8)
CC1/2
1.0 (0.44)
1.0 (0.75)
Refinement
resolution (Å)
57.1–2.05
83.8–2.30
no. of reflections
32976/1758
33711/1696
Rwork/Rfree
0.191/0.221
0.180/0.211
No. Atoms
protein
3045
3029
ligand/glycan/ion
213
223
water
165
143
Mean B-Factors
protein
66
92
ligand/glycan/ion
104
134
water
54
77
rms Deviations
bond lengths (Å)
0.005
0.003
bond angles (deg)
0.8
0.7
Values in parentheses are for the
highest-resolution shell.
Overall structure of
EBOV GP and electron density maps. (A) Surface
representation of the trimeric EBOV GP (PDB ID 6HRO). GP1, GP2, and
the glycan cap domain are colored in blue, red, and cyan, respectively;
for clarity, one GP monomer is in bright color and other two in light
colors. The bound inhibitor 118a is shown as cyan sticks.
(B) Close-up of the inhibitor-binding site. (C,D) Simulated annealing
omit |Fo – Fc| electron density maps contoured at 3σ for bound compounds 118 (C) and 118a (D). In both cases, two inhibitor
molecules are bound. There is no density for the pyrazole ring (indicated
by a red arrow) of 118 molecule I.Values in parentheses are for the
highest-resolution shell.There is no electron density for the pyrazole ring of 118 molecule I, although other groups of the molecule have reasonably
well-defined density (Figure C). We initially thought that the density might represent
an impurity molecule that is very similar to compound 118 produced during its synthesis. However, the NMR spectrum showed
no sign of other molecules in the sample. Because the pyrazole ring
can potentially undergo hydrolysis, it is conceivable that for some
molecules the pyrazole ring of 118 I is hydrolyzed during
crystal soaking.[32,33]The overall structures
of the protein parts of the GP–118 and GP–118a complexes are very similar
to each other, as well as to the previously published GP–drug
complex structures.[18,19] Apart from some local conformation
changes around the binding cavity, the binding of different inhibitors
does not introduce significant variations in the overall structure
of the protein. For example, by superimposition of these complexes
using SHP,[34] GP–118a overlaps 386 (out of 388), 382, and 381 Cαs of
GP–118, GP–toremifene, and the unliganded
GP with root-mean-square deviations of 0.38, 0.53, and 0.55 Å,
respectively.
Two Molecules of 118a and 118 Bind
in Each Cavity of GP
The inhibitor-binding site of EBOV GP
is located between the N-terminus of GP1 and the stem of the GP2 fusion
loop (Figure ). In
the apo structure, the inhibitor-binding cavity is occupied by residues
192–194 (the DFF lid) of GP1, which may function to hold the
putative cleavage site[35−39] in position for the removal of the glycan cap by the host cathepsin
B/L—allowing binding of the receptor NPC1 in the late endosome/lysosome.[19,40,41] Binding of an inhibitor in the
cavity expels the DFF lid. The inhibitor-binding cavity is also the
entrance of a tunnel that connects with the corresponding tunnels
in the other monomers of the GP trimer at the threefold axis. The
β1−β2 hairpin, β3, β6, and β13
of GP1 and the stem of the fusion loop (β19−β20)
and α3 of GP2 contribute residues lining the inhibitor-binding
pocket (Figures B
and S3). The volume occupied by F193 and
F194 (the FF volume) in apo GP is important for inhibitor binding
and is occupied by all inhibitors whose complex structures with GP
are known.[18−20] In the cases of benztropine and imipramine, this
is achieved by two drug molecules, one molecule occupies part of the
volume in front of M548 and the other molecule fills the space in
front of Y517. Despite 118a and 118 being
the largest inhibitors known to bind GP, once again in each case two
inhibitor molecules bind (Figures and S3). Compound 118a has a molecular volume of 382 Å3, and,
excluding atoms lying outside the cavity, the two bound molecules
alone sample about 640 Å3 of the total ∼1000
Å3 volume of the pocket (in contrast, the previous
nine inhibitors, with molecular volumes ranging from 188 Å3 for ibuprofen to 362 Å3 for toremifene, in
aggregate occupy 878 Å3).
Interactions between EBOV
GP and 118a
Molecule I of compound 118a binds with its chlorophenyl
ring deep in the subpocket adjacent to residues V66 and A101, making
extensive interactions with the side chains of residues V106, A101,
L515, and Y517, as well as main-chain atoms of G67 and G102 (Figure A). This subpocket
is partially occupied by F194 in apo GP, by a phenyl ring in toremifene,
and by the benzodioxol ring of paroxetine and the phenyl ring of bepridil
in their complex structures. The chlorine atom makes a strong halogen
bond with the carbonyl oxygen of G67 with a bond distance of 3.1 Å,
C–Cl–O angle of 167°, and Cl–O–C
angle of 116° (a classic halogen bonding interaction of chlorine
with the backbone Lewis bases at a glycine residue,[42]Figure C). The V66 side chain in this subpocket rotates to avoid clashes
on binding. The methylpyrazole group, apart from the close contacts
with the corresponding group of molecule II as discussed below, is
positioned to make parallel ring stacking interactions with the side
chain of Y517 and hydrophobic interactions with L515 and M548. The
hydroxyphenoxy ethyl moiety interacts extensively with the side chains
of its flanking residues, R64 and T519. The position and orientation
of the phenoxy ring are similar to the phenoxy and benzyl ring of
bound toremifene and bepridil, respectively. The piperidine carboxamide
group extends fully into the tunnel, exploiting protein interactions
not used by other known GP binders. Upon binding, residue D522 refolds
toward and makes bifurcated hydrogen bonds (of length 3.0 and 3.2
Å) with the nitrogen group of the piperidine ring. In addition,
the carboxamide moiety hydrogen-bonds to the side chains of N61 and
R587 (from a neighboring monomer) (Figure A).
Figure 5
Protein–inhibitor interactions for compound 118a. (A,B) Molecule I (panel A) and molecule II (panel B)
in the binding
pocket (PDB ID 6HRO). Protein main chains are drawn as thick blue (GP1) and red (GP2)
sticks, the inhibitor as cyan sticks, and water molecules as red balls.
Protein residues that interact with the inhibitor are shown as gray
sticks. Yellow broken sticks represent hydrogen bonds. Residue R587
from a neighboring monomer is labeled with a # prefix. The position
of the chlorine atom is indicated by a green *. (C,D) Environment
of the chlorine atoms of molecule I (panel C) and molecule II (panel
D). Halogen bonds are shown as magenta lines with bond distances and
angles labeled. Protein residues are drawn as blue (GP1) and red (GP2)
sticks.
Protein–inhibitor interactions for compound 118a. (A,B) Molecule I (panel A) and molecule II (panel B)
in the binding
pocket (PDB ID 6HRO). Protein main chains are drawn as thick blue (GP1) and red (GP2)
sticks, the inhibitor as cyan sticks, and water molecules as red balls.
Protein residues that interact with the inhibitor are shown as gray
sticks. Yellow broken sticks represent hydrogen bonds. Residue R587
from a neighboring monomer is labeled with a # prefix. The position
of the chlorine atom is indicated by a green *. (C,D) Environment
of the chlorine atoms of molecule I (panel C) and molecule II (panel
D). Halogen bonds are shown as magenta lines with bond distances and
angles labeled. Protein residues are drawn as blue (GP1) and red (GP2)
sticks.Compound 118a molecule
II binds the GP with its chlorophenyl
group nestled in a subpocket adjacent to α3 interacting with
the side chains of I38, L186, M548, and L554 (Figure B). This subvolume is occupied by F193 in
the apo structure and also by inhibitors in other inhibitor complexes,
for example, by a phenyl ring of benztropine molecule A and the isobutoxy
group of bepridil. The chlorine atom makes a halogen bond with the
carbonyl oxygen of L554 with a bond distance of 3.7 Å, C–Cl–O
angle of 143°, and Cl–O–C angle of 106°, although
this is less ideal geometry than that seen in molecule I (Figure D). The methylpyrazole
ring of molecule II is sandwiched between, and extensively contacted
by, L186 and the pyrazole ring of molecule I. The methyl group contacts
all five nonhydrogen atoms of the pyrazole ring of molecule I, and
the interactions are so intimate (separation 3.2–3.8 Å)
that the electron density of the two groups is connected even at high
contour level (Figure D), which may not be favorable for binding. The hydroxyphenoxy group
of 118a molecule II makes off-center ring stacking interactions
with P187 and contacts the side chain of M548. The hydroxyphenoxy
group is also protected from the solvent by the main chain of residues
189–191, the putative cathepsin B/L cleavage site, which becomes
partially ordered in the complex. The piperidine carboxamide moiety
hangs out of the binding cavity and has weak electron density.Several water molecules are trapped in the binding cavity. Three
have direct interactions with the inhibitor, the first bridges interactions
from the carboxamide to the carboxyl group of E100 and carbonyl oxygen
of L63, the second hydrogen-bonds to the hydroxyl oxygen of the hydroxylphenoxy,
and the third hydrogen-bonds to the pyrazole ring of molecule II.
Interactions between EBOV GP and 118
Compared
to 118a, compound 118 has phenyl and pyrazole
groups instead of the chlorophenyl and methylpyrazole groups. As noted
above, the pyrazole ring of 118 molecule I appears to
be hydrolyzed. In addition, the hydroxyl group of the hydroxyphenoxy
moiety may be modified because there is extra density connected to
this group. The rest of 118 molecule I is bound in a
very similar fashion to molecule I of 118a; the hydrogen
bond interactions from the piperidine carboxamide group and even the
nearby water molecules are conserved (Figure A). The phenyl ring is positioned similarly
in the subpocket adjacent to V66 and A101; however, lacking the chlorine
atom, it does not make any interactions with the main chain of G67
and G102. Molecule II of 118 binds in a similar position
to molecule II of 118a (Figure B); although it lacks the chlorine atom on
the phenyl ring and the methyl group on the pyrazole ring, it is positioned
slightly deeper in the cavity and closer to molecule I. The phenyl
and pyrazole rings make similar interactions with I38, L186, M548,
and L554 and with molecule I to those made by molecule II of 118a. The hydroxyphenoxy moiety makes fewer contacts to P187
and has no interactions with the putative cathepsin B/L cleavage site
(residues 190–191), which is disordered.
Figure 6
Protein–inhibitor
interactions for compound 118. (A,B) Molecule I (panel
A) and molecule II (panel B) in the binding
pocket (PDB ID 6HS4). Protein chains, water molecules, and hydrogen bonds are shown
as in Figure ; compound 118 is drawn as orange sticks.
Protein–inhibitor
interactions for compound 118. (A,B) Molecule I (panel
A) and molecule II (panel B) in the binding
pocket (PDB ID 6HS4). Protein chains, water molecules, and hydrogen bonds are shown
as in Figure ; compound 118 is drawn as orange sticks.
Structural Changes Introduced by Inhibitor Binding
All 11
inhibitors, of considerable chemical diversity,[18−20] bind within
the same hydrophobic cavity, with affinity derived from
shape complementary enabled by small conformational changes in the
protein. Superimposition of inhibitor-bound structures on apo GP shows
that while inhibitor binding does not introduce significant main-chain
structural changes around the major binding area of the cavity (Figure A), there are various
side-chain rearrangements, most notably for residues V66, M548, L554,
and L558. V66 changes conformation to allow binding of different chemical
groups in the subpocket adjacent to it, whereas M548, L554, and L558
changes are associated with different shaped groups occupying the
subvolume adjacent to α3. The piperidine carboxamide of 118 or 118a and the dimethylethanamine group
of toremifene are positioned to make direct interactions with D522
via either hydrogen bonds or hydrophobic contacts and stabilize the
N-terminal end of the fusion loop (residues 522–526) in a different
conformation to the apo form (for smaller inhibitors, these residues
become disordered and show only weak electron density) (Figure B). The structural changes,
in turn, lead to ordering of two or three residues contributed from
the expression vector pNeosec at the N-terminus of GP1.
Figure 7
Protein structural
changes and binding modes of different inhibitors.
(A) Structural differences at the inhibitor-binding site of apo GP
and 10 GP–inhibitor complexes. Superimpositions were done using
the whole GP; the apo structure is shown as thicker gray sticks and
inhibitor bound structures as thinner sticks. (B) Piperidine carboxamide
group of 118a (red and cyan, PDB ID 6HRO) and the dimethylethanamine
group of toremifene (blue and yellow, PDB ID 5JQ7) introduced structural
changes at the N-terminus of the fusion loop compared to the apo GP
(gray). The bifurcated hydrogen bonds from D522 to the nitrogen atom
of the piperidine ring are shown as broken sticks. (C) Comparison
of the binding mode of 118 (orange sticks, PDB ID 6HS4) and 118a (cyan sticks, PDB ID 6HRO) in the cavity. (D–G) Comparison of the binding
pose of 118a with toremifene (D), bepridil (E, PDB ID 6F5U), thioridazine (F,
PDB ID 6G95),
and clomipramine (G, PDB ID 6G9I).
Protein structural
changes and binding modes of different inhibitors.
(A) Structural differences at the inhibitor-binding site of apo GP
and 10 GP–inhibitor complexes. Superimpositions were done using
the whole GP; the apo structure is shown as thicker gray sticks and
inhibitor bound structures as thinner sticks. (B) Piperidine carboxamide
group of 118a (red and cyan, PDB ID 6HRO) and the dimethylethanamine
group of toremifene (blue and yellow, PDB ID 5JQ7) introduced structural
changes at the N-terminus of the fusion loop compared to the apo GP
(gray). The bifurcated hydrogen bonds from D522 to the nitrogen atom
of the piperidine ring are shown as broken sticks. (C) Comparison
of the binding mode of 118 (orange sticks, PDB ID 6HS4) and 118a (cyan sticks, PDB ID 6HRO) in the cavity. (D–G) Comparison of the binding
pose of 118a with toremifene (D), bepridil (E, PDB ID 6F5U), thioridazine (F,
PDB ID 6G95),
and clomipramine (G, PDB ID 6G9I).
Comparison between the
Predicted and Observed Binding Modes
The QPLD docking algorithm
docked both 118 and 118a compounds at the
site corresponding to molecule I of
the crystal structure with the phenyl or chlorophenyl ring in the
subpocket adjacent to V66 and A101 (Figure and Tables S3 and S4). However, the pyrazole ring in both cases tilts away from Y517.
Because the side chain of D522 points away from the binding site in
the structure used for docking, the docking program was unable to
predict the hydrogen bond interactions to the piperidine ring that
requires a side-chain rotation of 180°. The hydrogen bond between
the carboxamide group and N61 was predicted for compound 118. We used the structure of GP observed when toremifene binds for
the in silico screening. In this structure, the side chain of L554
partially occupies the subpocket adjacent to α3 where the chlorophenyl
ring of 118a molecule II (or phenyl ring of 118) is bound, perhaps explaining why the two compounds were not docked
at the site.
Figure 8
Comparison of the crystal structure and the docked pose.
(A) Docked
pose of compound 118a is overlaid with the crystal structure
of GP–118a. 118a in the crystal structure
is shown in cyan and gray for the docked pose; protein side chains
that have large conformational differences or hydrogen bond to the
inhibitor are shown as blue (GP1) and red (GP2) sticks for the crystal
structure and gray sticks for the structure used in docking. (B) Comparison
of the docked pose of compound 118 (gray sticks) with
the bound mode in the crystal structure (orange sticks). Protein side
chains are colored as in (A). The orange broken bonds represent hydrogen
bonds.
Comparison of the crystal structure and the docked pose.
(A) Docked
pose of compound 118a is overlaid with the crystal structure
of GP–118a. 118a in the crystal structure
is shown in cyan and gray for the docked pose; protein side chains
that have large conformational differences or hydrogen bond to the
inhibitor are shown as blue (GP1) and red (GP2) sticks for the crystal
structure and gray sticks for the structure used in docking. (B) Comparison
of the docked pose of compound 118 (gray sticks) with
the bound mode in the crystal structure (orange sticks). Protein side
chains are colored as in (A). The orange broken bonds represent hydrogen
bonds.
Discussion and Conclusions
Compound 118a is the best EBOV inhibitor known to
directly interact with the viral GP. Its potency is about 2-fold better
than toremifene. Two molecules of 118a bind each monomer
of GP, whereas only one molecule of toremifene binds each monomer
of GP. Molecule I of 118a has stronger electron density
and an average B-factor 75% lower than for molecule
II and is therefore the major contributor to inhibitory potency. In
line with this, molecule I of 118a overlaps well with
its predicted binding position and also with bound toremifene and
bepridil (Figure ).
By comparing binding modes and potencies, we previously noted that
the FF volume and the subvolumes adjacent to V66 and α3 are
crucial for binding affinity.[18,20] Here, we show the piperidine
carboxamide group in molecule I of 118a exploring additional
volume inside the tunnel and making hydrogen bonds with the protein.
We have previously suggested that substitution of the chlorine atom
of toremifene with a benzyl ring to occupy the subvolume adjacent
to α3 to mimic the interactions made by clomipramine and thioridazine
might improve the potency of toremifene (Figure ). With knowledge of the GP–118a structure, we suggest that a further modification to
toremifene by substituting the dimethylamine group with the piperidine
carboxamide group of 118a may greatly increase its potency.
Similarly, modifications could also be made to bepridil by replacing
the isobutoxy group with a benzyl ring and addition of an oxyethylpiperidine-4-carboxamide
group of 118a to its benzyl ring. As more GP complex
structures with chemically divergent inhibitors are determined, the
knowledge of protein–inhibitor interactions will guide design
of potent drugs to combat Ebola virus.This study reveals that
a structure-based high-throughput in silico
screen in combination with an inhibitory assay of EBOVpp and crystallography
can be an effective way to identify highly potent small-molecule inhibitors
effective against EBOV. Because only the top hits of the screen need
to be verified experimentally, the method is much more efficient in
the requirements of both time and manpower compared to viral or pseudovirus
entry assays. Nevertheless, it should be borne in mind that only two
of the eight compounds selected by in silico screening showed inhibitory
properties. This probably reflects the limited reliability of the
scoring functions used and also perhaps the difficulty of predicting
binding in the face of extensive side-chain flexibility in the binding
cavity. For such flexible targets, we suggest that where a database
of potential binding cavity structures is experimentally available,
then screening against all possible structures and selecting the best
docking score might increase robustness. A fundamental limitation
of course will remain—the method can only identify the inhibitors
that directly interact with EBOV GP. Despite these caveats, the method
used was able to identify two inhibitors of a novel chemical group
of EBOV GP inhibitors, one of which represents the most potent known
to directly interact with the GP. Given its strong potency, 118a should be tested in vivo using the murine infection model
reported by Johansen et al. to determine its protective ability.[10]Both inhibitors bind with a unique binding
mode, especially their
piperidine carboxamide group, exploiting hydrogen bond interactions
that have not been seen before, while, in the case of 118a, the presence of strong halide bonding is likely to explain much
of the additional potency. The inhibitor-binding cavity of the GP
is large and can accommodate various inhibitors with chemically divergent
structures so that we believe that features such as the halide bonding
might be usefully grafted onto other chemical scaffolds. Such approaches
could provide more potent inhibitors to combat EBOV infection; indeed,
several compounds have been designed based on these structures and
will be made and tested soon.
Experimental Section
Data Collection
and Ligand Library Preparation
The
natural ligand library comprising about 2.5 million compounds was
downloaded from the TCM database@Taiwan.[27] Prior to screening of the natural ligand library, a drug candidate
list with proven inhibitory activity against EBOV was collected.[10] Eight compounds with known IC50 values
and known complex structures with EBOV GP were used for the evaluation
of our in silico workflow. Molecular structures of all drugs were
retrieved from the Drug Bank.[43] Ligands
were prepared for simulation using the ligprep module from the preparation
step of the HTVS workflow of Schrödinger suite (http://www.schrodinger.com/).
Docking Structure Preparation
The crystal structure
of EBOV GP in complex with toremifene was taken from the Protein Data
Bank (PDB ID 5JQ7).[19] The protein structure was preprocessed
using the protein preparation wizard of Schrödinger/Maestro
11.1. The processed structure was subjected to energy minimization
using the OPLS3 force field in the Impact module. The grid box for
docking was created in Glide[23,24] by picking toremifene
as the center and expanding the box size to cover residues of the
whole binding pocket. The final grid box dimensions were 44.9 ×
15.6 × 8 Å3.
Virtual Screening and Binding
Affinity Calculation
The natural ligand library was subjected
to three levels of docking
using the virtual screening workflow in Glide.[23,24] Each molecule was docked in the HTVS mode, from which the top 10%
of the compounds were selected for standard precision (SP) docking,
followed by refinement in the extra precision (XP) docking. The last
step was crucial to reduce false positives returned from SP and better
predict binding poses using a more expensive scoring function. The
final docking poses and binding affinities of known inhibitors and
candidate compounds returned from XP docking were subjected to QPLD
calculations.[25] This method combines Glide
docking with QSite to redock ligands using quantum mechanically derived
partial charges on them in the pocket accounting for the polarization
effect from the protein.
Reagents
The eight selected compounds
used for evaluation
of the virtual screen results were purchased from MolPort with a specified
purity of >90%. The high degree of purity of compound 118 was further confirmed by NMR analysis and, together with 118a, demonstrated to be the active component by crystallographic structure
determination in complex with EBOV GP.
Ebola Pseudovirus Production
and Titration
HIV-1-derived
pseudoviruses expressing the Ebola virus envelope GPs (EBOV pseudoparticle,
EBOVpp) were produced as described previously.[19] HEK-293T cells were seeded in T175 flasks one day prior
to transfection. Cells were transfected with 2 μg pR8ΔEnv,
2 μg BlaM-Vpr, 1 μg pcREV, and 3 or 2 μg of plexm-EBOV_GP
plasmids (containing Zaire EBOV GP residues 1–676 under control
of a β-actin/CMV chimeric promoter). After 10 h of transfection,
the medium was replaced by fresh Dulbecco’s modified Eagle’s
medium (DMEM) with 10% fetal bovine serum (FBS). Virus-containing
medium was collected at 48 and 72 h and passed through a 0.45 μm
filter to isolate the viral particles which were then concentrated
using the Lenti-X concentrator (Clontech). Virus titers were determined
by infecting TZM-bl cells (PTA-5659, no mycoplasma contamination)
with a serial dilution of concentrated pseudovirus, followed by a
β-Gal assay. Because the TZM-bl cells contain a β-Gal
expression cassette with an HIV-1-induced promoter, infected cells
can be identified through the hydrolysis of X-gal.[31]
Infectivity Assay
TZM-bl cells were
plated 24 h before
the assay at 2 × 104 cells per well in black clear-bottomed
96-well plates. On the day of assay, cells were cooled on ice before
the addition of EBOVpp. Viral supernatants were added onto the cells
with 1 in 10 dilution, and they were centrifuged at 2100g for 30 min at 4 °C. Viral supernatants were removed, and cells
were washed with 1× phosphate-buffered saline (PBS). Then, 100
μL of DMEM plus 5% FBS-containing toremifene, a well-characterized
ebola fusion inhibitor,[19]118a and 118 in a concentration range of 100–0.125
μM, or no drug, was added to each well before placing in a 37
°C, CO2 incubator to initiate viral entry. After 48
h, cells were fixed using 2% paraformaldehyde (PFA) for 20 min, followed
by a β-Gal assay. Cells were then imaged using a wide-field
Olympus microscope equipped with 20× air objective and transmitted
light. All cells in each well were measured and tiled using Cellsens
software (Olympus). The relative number of infected cells versus the
total population of cells was calculated using an automated algorithm
(spot tracker) with Icy software (http://icy.bioimageanalysis.org/).
BlaM Assay and Analysis
The β-lactamase assay[30,31] was applied to assess EBOVpp fusion. The procedure was similar to
that used for the infectivity assay, except that TZM-bl cells were
plated at 4 × 104 cells per well, viral supernatants
were added at MOI 0.5, and after removal of the virus supernatant
and washing, DMEM plus 10% FBS-containing toremifene, 118 or 118a in a concentration range of 12–0.4 μM,
or no drug, was added to each well. After 120 min, cells were loaded
with 1× CCF2-AM from the LiveBLAzer FRET—B/G Loading Kit
(Life Technologies) and incubated at room temperature in the dark
for 2 h. After CCF2-AM removal, cells were washed with 1× PBS
and fixed with 2% PFA before viewing. Cells were excited using a 405
nm continuous laser (Leica), and the emission spectra between 430
and 560 nm were recorded pixel by pixel (512 × 512) using a Leica
SP8 X-SMD microscope with a 20× objective. The ratio of blue
emission (440–480 nm, cleaved CCF2-AM) to green (500–540
nm, uncleaved CCF2-AM) was then calculated pixel by pixel using a
customized macro34 for ImageJ (http://imagej.nih.gov/ij/) with 25 different observation fields
for each condition. A blue/green threshold (fusion threshold) was
set using only media. The fusion threshold was calculated recovering
the signal (blue/green intensity ratio) coming from individual cells
plus 2 × standard deviation from ∼300 cells in each observation
field using a custom-made macro with ImageJ.[30] This threshold was then applied to all conditions. Cells above the
threshold were pseudocolored in red and cells below the threshold
were pseudocolored in blue. “Red” cells were then compared
with blue cells (nonfusogenic) as an accurate measure of fusion in
different conditions.
Protein Expression, Purification, and Crystallization
The production of Zaire EBOV (strain Mayinga-76) recombinant GP
extracellular
domain has been described previously.[18,19] In brief,
the construct contains residues 32–312 and 464–632 of
the GP with mutations T42A and H613A and a C-terminal tag of a fold
on trimerization sequence from the bacteriophage T4 fibritin and 6
histidines. The construct was cloned in the mammalian expression vector
pNeosec[44] and then transfected into HEK293T
cells with polyethylenimine and supplemented with 5 μM kifunensine
(Cayman Chemical). The His-tagged protein from dialyzed conditional
media was captured with talon beads, treated with endo-β-acetylglucosaminidase
F1, and further purified by size exclusion chromatography. The resulting
protein has three amino acids (ETG) from the expression vector pNeosec
added at the N-terminus. Crystallization of EBOV GP was performed
using microcrystal seeding and the sitting-drop vapor diffusion method
as described previously.[18,19] Crystals were grown
in conditions containing 9% (w/v) PEG 6000 and 0.1 M sodium citrate
tribasic dihydrate at pH 5.2.
Thermal Shift Assay
Thermal shift assays were performed
using a Mx3005p qPCR machine following exactly the method and protocol
described previously.[18,19]
Crystal Soaking, X-ray
Data Collection, and Structure Determination
To obtain GP–inhibitor
complexes, the inhibitors were diffused
into the GP crystals by soaking. The inhibitors were first dissolved
in 100% dimethyl sulfoxide and then diluted with a solution containing
15% (w/v) PEG 6000 and 0.1 M sodium citrate (pH 5.2) to concentrations
of 5 and 2.5 mg/mL. Eight crystals (four for each inhibitor concentration)
were soaked for each inhibitor in the above solutions for different
lengths of time, ranging from 2 to 20 min.The inhibitor-soaked
crystals were mounted in loops and then dipped into cryoprotectants
containing 75% inhibitor soaking solution and 25% (v/v) glycerol for
a couple of seconds before freezing in liquid nitrogen prior to data
collection.All diffraction data were collected at 100 K with
a frame size
of 0.1° rotation using PILATUS 6M detectors at Diamond Light
Source, UK. GP–118 data were acquired on beamline
I24 with a beam size of 50 × 50 μm2 and a wavelength
of 0.9686 Å. The exposure time per data frame was 0.01 s with
45% beam transmission.[45,46] GP–118a data
were collected on beamline I04-1 with a beam size of 60 × 50
μm2 and a wavelength of 0.9282 Å. Data (360°)
were collected from every crystal that diffracted.Diffraction
images were indexed, integrated, and scaled with the
automated data processing program Xia2 using the 3dii or Dials protocols.[47,48] Data from each crystal were initially phased with rigid-body refinement
using the GP–bepridil structure (PDB ID 6F5U) by omitting the
inhibitor and water molecules. The electron density maps calculated
at this stage were checked carefully. Only those data sets that gave
high-quality electron density for the soaked inhibitors were used
for the later structure refinement. Thus, the final data set for GP–118 to 2.05 Å resolution is from a single crystal, while
the GP–118a complex to 2.3 Å is merged from
five crystals.Structure refinement used REFMAC5[49] or
PHENIX,[50] and models were rebuilt with
COOT.[51] Data collection and structure refinement
statistics are given in Table . Structural comparisons used SHP,[34] simulated annealing omit electron density maps were calculated with
CNS,[52] volumes of the drug-binding cavity
and drug molecules were calculated with VOLUMES (Robert Esnouf, unpublished),
and figures were prepared with PyMOL.[53]
Authors: Arnab Basu; Bing Li; Debra M Mills; Rekha G Panchal; Steven C Cardinale; Michelle M Butler; Norton P Peet; Helena Majgier-Baranowska; John D Williams; Ishan Patel; Donald T Moir; Sina Bavari; Ranjit Ray; Michael R Farzan; Lijun Rong; Terry L Bowlin Journal: J Virol Date: 2011-01-26 Impact factor: 5.103
Authors: Nadine Biedenkopf; Kerstin Lange-Grünweller; Falk W Schulte; Aileen Weißer; Christin Müller; Dirk Becker; Stephan Becker; Roland K Hartmann; Arnold Grünweller Journal: Antiviral Res Date: 2016-11-15 Impact factor: 5.970
Authors: Han Cheng; Adam Schafer; Veronica Soloveva; Dima Gharaibeh; Tara Kenny; Cary Retterer; Rouzbeh Zamani; Sina Bavari; Norton P Peet; Lijun Rong Journal: Antiviral Res Date: 2017-06-20 Impact factor: 5.970
Authors: Derek Dube; Matthew B Brecher; Sue E Delos; Sean C Rose; Edward W Park; Kathryn L Schornberg; Jens H Kuhn; Judith M White Journal: J Virol Date: 2009-01-14 Impact factor: 5.103
Authors: Shyama Sidique; Sergey A Shiryaev; Boris I Ratnikov; Ananda Herath; Ying Su; Alex Y Strongin; Nicholas D P Cosford Journal: Bioorg Med Chem Lett Date: 2009-08-03 Impact factor: 2.823
Authors: Michael B Plewe; Nadezda V Sokolova; Vidyasagar Reddy Gantla; Eric R Brown; Shibani Naik; Alexandra Fetsko; Donald D Lorimer; David M Dranow; Hayden Smutney; Jameson Bullen; Rana Sidhu; Arshil Master; Junru Wang; E Adam Kallel; Lihong Zhang; Birte Kalveram; Alexander N Freiberg; Greg Henkel; Ken McCormack Journal: ACS Med Chem Lett Date: 2020-05-01 Impact factor: 4.345
Authors: Irina N Gaisina; Norton P Peet; Letitia Wong; Adam M Schafer; Han Cheng; Manu Anantpadma; Robert A Davey; Gregory R J Thatcher; Lijun Rong Journal: J Med Chem Date: 2020-06-17 Impact factor: 7.446
Authors: Robert H Bortz; Anthony C Wong; Michael G Grodus; Hannah S Recht; Marc C Pulanco; Gorka Lasso; Simon J Anthony; Eva Mittler; Rohit K Jangra; Kartik Chandran Journal: J Virol Date: 2020-08-31 Impact factor: 5.103
Authors: Jasmine Moshiri; David A Constant; Bowen Liu; Roberto Mateo; Steven Kearnes; Paul Novick; Ritika Prasad; Claude Nagamine; Vijay Pande; Karla Kirkegaard Journal: mBio Date: 2020-11-10 Impact factor: 7.867