Bo Hou1, Yu-Min Zhang2, Han-Yi Liao3, Li-Feng Fu3, De-Dong Li3, Xin Zhao3, Jian-Xun Qi3, Wei Yang3, Geng-Fu Xiao2, Lian Yang1, Zheng-Yu Zuo1, Lin Wang4, Xiang-Lei Zhang4, Fang Bai4, Liu Yang1, George F Gao3, Hao Song3,5, Jiang-Miao Hu1, Wei-Juan Shang2, Jun Zhou1. 1. State Key Laboratory of Phytochemistry and Plant Resources in West China and Yunnan Key Laboratory of Natural Medicinal Chemistry, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, People's Republic of China. 2. State Key Laboratory of Virology, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan 430071, People's Republic of China. 3. CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, People's Republic of China. 4. Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, Shanghai Tech University, Shanghai 201210, People's Republic of China. 5. Research Network of Immunity and Health (RNIH), Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, People's Republic of China.
Abstract
The coronavirus disease 2019 (COVID-19) pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has led to more than 5 million deaths worldwide to date. Due to the limited therapeutic options so far available, target-based virtual screening with LC/MS support was applied to identify the novel and high-content compounds 1-4 with inhibitory effects on SARS-CoV-2 in Vero E6 cells from the plant Dryopteris wallichiana. These compounds were also evaluated against SARS-CoV-2 in Calu-3 cells and showed unambiguous inhibitory activity. The inhibition assay of targets showed that compounds 3 and 4 mainly inhibited SARS-CoV-2 3CLpro, with effective Kd values. Through docking and molecular dynamics modeling, the binding site is described, providing a comprehensive understanding of 3CLpro and interactions for 3, including hydrogen bonds, hydrophobic bonds, and the spatial occupation of the B ring. Compounds 3 and 4 represent new, potential lead compounds for the development of anti-SARS-CoV-2 drugs. This study has led to the development of a target-based virtual screening method for exploring the potency of natural products and for identifying natural bioactive compounds for possible COVID-19 treatment.
The coronavirus disease 2019 (COVID-19) pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has led to more than 5 million deaths worldwide to date. Due to the limited therapeutic options so far available, target-based virtual screening with LC/MS support was applied to identify the novel and high-content compounds 1-4 with inhibitory effects on SARS-CoV-2 in Vero E6 cells from the plant Dryopteris wallichiana. These compounds were also evaluated against SARS-CoV-2 in Calu-3 cells and showed unambiguous inhibitory activity. The inhibition assay of targets showed that compounds 3 and 4 mainly inhibited SARS-CoV-2 3CLpro, with effective Kd values. Through docking and molecular dynamics modeling, the binding site is described, providing a comprehensive understanding of 3CLpro and interactions for 3, including hydrogen bonds, hydrophobic bonds, and the spatial occupation of the B ring. Compounds 3 and 4 represent new, potential lead compounds for the development of anti-SARS-CoV-2 drugs. This study has led to the development of a target-based virtual screening method for exploring the potency of natural products and for identifying natural bioactive compounds for possible COVID-19 treatment.
The recently described SARS-CoV-2, the causative agent of the coronavirus disease 2019
(COVID-19) pandemic, has caused over 320 million cases of infection and more than 5 million
deaths thus far, leading to worldwide economic and social disruption.[1,2] There are only a few effective
antiviral drugs available for the prophylactic or therapeutic treatment of the highly
contagious SARS-CoV-2 in humans. Also, a few pre-existing agents and drug candidates, such
as remdesivir (RDV), have been used as treatment regimens against COVID-19. An RDV prodrug,
a nucleoside-analogue inhibitor, impairs viral infection by targeting RNA-dependent RNA
polymerase (RdRp). RDV potently inhibits SARS-CoV-2 replication at EC50 values of
0.77–3.68 μM.[3,4] Recent reports of the compassionate use of RDV were reported in 68% of
patients with severe COVID-19.[2,5,6] With different mechanisms, inhibitors or blockers
such as bemnifosbuvir (AT-527) and PF-07321332, have been developed for structure-based drug
discovery.[7−10] Bemnifosbuvir, an orally administered double prodrug of a guanosine
nucleotide analogue, has potent antiviral activity by targeting both RdRp and the Nidovirus
RdRp-associated nucleotidyltransferase (NiRAN) in vitro, including SARS-CoV-1, SARS-CoV-2,
and hepatitis C virus. This prodrug is currently in phase II clinical trials for SARS-CoV-2
inhibition, and phase III clinical trials are expected to be completed very
soon.[11,12]
PF-07321332, designed by Pfizer, is a second-generation, orally available SARS-CoV-2-3CL
protease (3CLpro) inhibitor and has been evaluated in a phase I clinical trial study
(NCT04756531) for safety. Phase II/III clinical trials (NCT05011513 and NCT05047601) to
evaluate safety and efficacy are currently in progress, and combination therapy with
low-dose ritonavir is expected to help 3CLpro remain active in the body for longer
periods.[13,14] The
conclusions of phase trials are pending and will determine whether useful agents such as
bemnifosbuvir, plitidepsin, and PF-07321332 are effective in the treatment of
COVID-19.[12] Thus far, target-specific drugs are still expected to be
exploited for SARS-CoV-2.Natural products and their derivatives have played an important role in the new drug
development process, and nearly 50% of these products are FDA-approved drugs.[15] Natural products represent a good source of bioactive molecules and are
characterized by extensive structural diversity.[16,17] In an effort to combat SARS-CoV-2, medicinal chemists have
developed different and practicable strategies for new drugs. Target-based virtual screening
involves docking many molecules to potential multidrug targets by a virtual screening
procedure. The key docking scores and the number of directed edges were used to guide the
selection of potentially active molecules. This approach is easy to implement, and many new
medicinal functions of natural products can be exploited (Figure ).[18] This strategy was used to screen natural
products of Dryopteris species (Table S1, Supporting
Information) to obtain molecules that may have a blocking or inhibitory effect on
coronavirus. The results of the virtual screening showed that compounds
1BIV-10 and 1BIV-11 have the
characteristics of good docking scores and multiple directed edges (Figure S1, Supporting Information). When predicted active molecules
(1BIV-10 and 1BIV-11) were found in
the compound library of Dryopteris species, fragment-based potential
molecules were obtained by LC/MS and HPLC methods. According to these structures, molecular
masses of 510 and 718 Da were found to have similar fragments. In addition, the
investigation of compounds with unusual terpenoid moieties attached to acylphloroglucinol
residues from Dryopteris species is of interest. The next step was to
isolate and identify target compounds by HLPC, LC/MS, NMR, and X-ray crystallographic
techniques. LC/MS-guided isolation of compounds from Dryopteris wallichiana
(Spreng.) Hylander (Dryopteridaceae) afforded two new acylphloroglucinols, wallichins E and
F (1 and 2, Figure A),
along with the previously known compounds wallichins C (3) and D
(4).[19]
Figure 1
Target-based virtual screening used for natural product development against SARS-CoV-2.
(A) Possible targets for new drug development
(2′-O-methyltransferase (2′OMT, PDB:
6W75), 3CL protease (3CLpro,
PDB: 6M2N), nonstructural proteins
3 (NSP3, PDB: 6W6Y), nonstructural
proteins 15 (NSP15, PDB: 6WXC),
nonstructural proteins 1 (NSP1, PDB: 7K3N), helicase (Hel, PDB: 6JYT), nonstructural proteins 15 (NSP15, PDB: 6WXC), RNA-dependent RNA polymerase (RdRp, PDB: 7B3C), spike glycoprotein (SGP, PDB:
6VXX), spike-RBD (RBD, PDB:
7B14), transmembrane protease
serine 2 (TMPRSS2, PDB: 7MEQ),
nucleocapsid (Npro, PDB: 6WZQ),
cathepsin B (CTSB, PDB: 3AI8),
angiotensin-converting enzyme 2 (ACE2, PDB: 6CS2)). (B) Compound database of Dryopteris species. (C)
Predicted target network of target-based virtual screening. (D) Predicted molecular
fragment and molecular mass. (E) LC/MS analysis of the diethyl ether fraction from
D. wallichiana. Acquity Zorbax SB-C18 column (4.6 mm
× 250 mm, 5 μm) at a flow rate of 1.0 mL/min; gradient elution starting at 5%
MeOH in water, ramping to 100% in 100 min, then to 100% MeOH with 0.5% HCOOH.
Figure 2
(A) Chemical structures. (B) Key HMBC, COSY, and NOE correlations of
1–4.
Target-based virtual screening used for natural product development against SARS-CoV-2.
(A) Possible targets for new drug development
(2′-O-methyltransferase (2′OMT, PDB:
6W75), 3CL protease (3CLpro,
PDB: 6M2N), nonstructural proteins
3 (NSP3, PDB: 6W6Y), nonstructural
proteins 15 (NSP15, PDB: 6WXC),
nonstructural proteins 1 (NSP1, PDB: 7K3N), helicase (Hel, PDB: 6JYT), nonstructural proteins 15 (NSP15, PDB: 6WXC), RNA-dependent RNA polymerase (RdRp, PDB: 7B3C), spike glycoprotein (SGP, PDB:
6VXX), spike-RBD (RBD, PDB:
7B14), transmembrane protease
serine 2 (TMPRSS2, PDB: 7MEQ),
nucleocapsid (Npro, PDB: 6WZQ),
cathepsin B (CTSB, PDB: 3AI8),
angiotensin-converting enzyme 2 (ACE2, PDB: 6CS2)). (B) Compound database of Dryopteris species. (C)
Predicted target network of target-based virtual screening. (D) Predicted molecular
fragment and molecular mass. (E) LC/MS analysis of the diethyl ether fraction from
D. wallichiana. Acquity Zorbax SB-C18 column (4.6 mm
× 250 mm, 5 μm) at a flow rate of 1.0 mL/min; gradient elution starting at 5%
MeOH in water, ramping to 100% in 100 min, then to 100% MeOH with 0.5% HCOOH.(A) Chemical structures. (B) Key HMBC, COSY, and NOE correlations of
1–4.
Results and Discussion
Structural Identification
Compound 1, isolated as a yellow gum, exhibited a molecular formula of
C31H42O6, as deduced from the molecular ion peak
observed at m/z 511.3054 [M + H]+ (calcd
511.3054) in its HRESIMS, having 11 degrees of unsaturation. Compound 1
contained five 1H NMR methyl signals, including one
gem-dimethyl group (Table ),
one methyl signal of an acetyl group, and three additional methyl signals. The
1H NMR spectrum of 1 in acetone-d6
displayed a singlet at a very low field (δH 19.02 ppm, −OH), which
was assigned to an enolizable β-triketo carbonyl group. The 1H NMR
signals of the gem-dimethyl group at 1.30 and 1.34 ppm (3H each, s) were
attributed to the presence of an acylfilicinic acid ring system. Two doublets resonating
at 2.34–2.45 ppm were observed in the 1H NMR spectrum of 1,
indicating a methylene bridge, and its 13C NMR chemical shift (17.9 ppm) was
consistent with the CH2 moiety being connected to a terpenoid unit and an
acylfilicinic acid moiety. The HMBC spectrum of 1 showed cross-peaks between
H-11/C-10 (Figure B), as well as between
H-14″/C-7′, supporting the occurrence of an acetyl moiety (108.3 ppm, C-6)
on the A ring and the linkage position of the terpenoid moiety. The position of the side
chain at C-9″ could be determined from HMBC coupling between C-12″ and
H-9″ (Table ). Except for the
contributions of the A ring to the molecular formula of compound 1, the
elemental composition of the terpenoid fragment was determined as
C21H32O2. Therefore, the terpenoid residue should have
six degrees of unsaturation. The 1H NMR spectrum of 1 displayed a
CH2 single double bond (m 4.52, 1H-17″ and m 4.82, 1H-17″) and
trisubstituted double bonds (t, 6.4, 1H-12″). The signal at C-18″
(δC 180.0 ppm) in the 13C NMR spectrum of 1
suggested a carboxylic acid group substitution. The remaining terpenoid fragment contained
three ring systems. The side chain forming a pyran ring was observed from the
13C NMR signal at the C-14″ position (δc 83.8 ppm).
The E configuration was assigned through the associated NOESY signals
between H-12″/H-14″ and H-11″/H-16″ (Figure
B). Other NOESY signals of H-20″/H-11″,
H-20″/H-19″, and H-5″/H-9″ supported the relative
configuration assignment made for the terpenoid moiety.[20] According to
the proposed configurations of the terpenoid group, two theoretical structures could be
proposed with R and S configurations at C-14″. To
clarify the absolute configuration of 1 (Figure A), the geometries of the R and S
configurations in gas were optimized initially at the density functional theory
(DFT)-B3LYP/6-31G(d) level. The electronic circular dichroism (ECD) spectra of the gas
model and polarizable continuum model (PCM) were obtained at the obtained geometries using
the time-dependent DFT (TD-DFT) method and the same basis set. The conformational
contributions afforded two of the lowest-energy conformers (accounting for >90%). The
experimental ECD spectrum of (+)-1 displayed negative Cotton effects (CEs) at
200–223 nm and positive CEs at 223–374 nm. The computed ECD spectrum at the
14″S configuration matched well with the experimental ECD
spectrum of 1 (Figure A). The n
→ π* electronic transitions from S0 of the acylphloroglucinol moiety to S3 of
the olefin group (Figure C) afforded a rotatory
strength Δε = +0.8 at 304 nm, which was in agreement with the weak positive
CEs Δε = +1.5 in the experimental ECD spectrum.[21] The other
positive rotatory strength at electronic transitions between 304 and 350 nm was dominated
by other π → π* characters of the acylphloroglucinol group (>90%
contributions), which was consistent with the experimental positive CEs in the ECD
spectrum. The negative rotatory band (Δε = −3.8) at 210 nm was in
accordance with the experimental negative CE Δε = −2.8 in the ECD
spectrum. The S configuration of C-14″ in 1 could be
identified from the comparison results of the ECD spectrum using TD-DFT at the
B3LYP/6-31G(d) level in the gas model and PCM. Single-crystal X-ray diffraction analysis
confirmed this (Figure and Figure S2, Supporting Information) using Cu Kα radiation, which allowed
for assignment of the absolute configuration of (+)-1 (wallichin E) as
4″S,5″S,9″R,10″S,14″S.
Table 1
1H (600 MHz) and 13C (150 MHz) NMR Spectroscopic Data of
Compounds 1 and 2 (δ in ppm and J in
Hz)
(+)-1
(+)-2
no.
δC, type
δH, mult. (J in
Hz)
HMBC
δC, type
δH, mult. (J in
Hz)
HMBC
1
189.1, C
189.1, C
2
108.3, Ca
108.3, Ca
3
174.9, C
175.0, C
4
47.8, C
47.8, C
5
196.8, C
197.0, C
6
108.3, Ca
108.3, Ca
7′
17.9, CH2
2.34–2.45, m
14″
18.0, CH2
2.35–2.47, m
14″
8
25.2, CH3
1.34, s
3, 5
25.2, CH3
1.34, s
3, 5
9
24.9, CH3
1.30, s
3, 5
24.8, CH3
1.30, s
3, 5
10
200.6, C
200.7, C
11
27.8, CH3
2.51, s
10
27.9, CH3
2.51, s
10
1″
39.1, CH2
1.23, td (12.9, 3.5)
2″, 3″, 9″, 10″, 19″, 20″
39.1, CH2
1.24, td (12.8, 3.9)
2″, 9″, 19″, 20″
1.85–1.87, ma
2″, 4″, 9″, 19″
1.86, m
2″, 4″, 9″, 19″
2″
19.2, CH2
1.58, ma
1″, 4″, 5″, 19″
19.2, CH2
1.58, ma
1″, 4″, 5″, 19″
3″
37.8, CH2
1.81, m
2″, 4″, 18″, 19″
37.8, CH2
1.81, m
1″, 2″, 4″, 5″, 18″, 19″
1.58–1.60, ma
2″, 4″, 18″, 19″
1.58–1.60, ma
1″, 2″, 4″, 18″, 19″
4″
47.8, C
47.8, C
5″
50.4, CH
2.02, d (3.3)
1″, 4″, 6″, 8″, 9″, 10″,
18″, 19″, 20″
50.4, CH
2.03a
1″, 4″, 6″, 8″, 9″, 10″,
18″, 19″, 20″
6″
27.3, CH2
1.46, td (12.8, 4.3)
5″, 7″, 8″
27.3, CH2
1.46, td (12.5, 3.8)
5″, 7″, 8″, 10″
1.38, ma
8″, 10″
1.38, ma
7″
38.4, CH2
2.00–2.06, ma
38.4, CH2
2.00–2.06, ma
2.29–2.37, m
5″, 6″, 8″, 9″, 17″
2.29–2.37, m
5″, 6″, 8″, 9″, 17″
8″
148.9, C
149.0, C
9″
58.0, CH
1.79–1.87a
1″, 5″, 7″, 8″ 10″, 11″,
12″, 17″, 20″
58.0, CH
1.79–1.87a
1″, 5″, 8″, 7″, 10″, 11″,
12″, 17″, 20″
10″
39.6, C
39.6, C
11″
23.1, CH2
2.16, ddd (16.2, 10.8, 6.4)
8″, 9″, 12″, 13″
23.1, CH2
2.18, m
9″, 12″, 13″
2.36–2.39, m
8″, 9″, 12″, 13″, 17″
2.36–2.39, m
8″, 9″, 12″, 13″, 17″
12″
129.4, CH
5.48 t (6.4)
5″, 9″, 11″, 14″, 16″,
129.8, CH
5.47, t (6.3)
11″, 14″, 16″
13″
133.2, C
133.3, C
14″
83.8, CH
4.52, m
7′, 12″, 13″, 15″, 16″
84.0, CH
4.52, m
7′, 12″, 13″, 16″
15″
25.8, CH2
1.83–1.98, m, 2H
NDb
25.9, CH2
1.83–1.98, m
NDb
16″
12.4, CH3
1.73, s
12″, 13″, 14″
12.3, CH3
1.73, s
12″, 13″, 14″
17″
108.3, CH2
4.52, m
7″, 8″, 9″
108.3, CH2
4.49, s
7″, 8″, 9″
4.82, m
4.82, s
18″
180.0, C
180.1, C
19″
17.1, CH3
1.14, s
3″, 4″, 5″
17.2, CH3
1.14, s
3″, 4″, 5″
20″
15.0, CH3
0.79, s
1″, 5″, 9″, 10″
15.0, CH3
0.79, s
1″, 5″, 9″, 10″
OH-5
19.02, brs
19.01, brs
Overlapping signals.
ND: not determined due to overlap with other signals or solvent signals.
Figure 3
(A, B) Comparison of the calculated ECD spectra for
4″S,5″S,9″R,10″S,14″S
and
4″S,5″S,9″R,10″S,14″R
at the TD-DFT-B3LYP/6-31G(d) level with the experimental spectra of compounds
1 and 2 in MeOH (the red trace indicates the gas phase;
the blue trace indicates the PCM in MeOH). (C) Natural transition orbitals (NTOs) of
the most stable conformer involved in each transition computed at the
TD-DFT-B3LYP/6-31G(d) level.
Figure 4
X-ray diagram of 1 (14″S), showing a keto
absolute configuration and a pyran skeleton at the C ring. The bond length of 1.2368
Å at the C-5/O-5 positions is less than 1.2938 Å at the C-1/O-1 positions;
thus the double bonds could be assigned to the C-5/O-5 positions.
(A, B) Comparison of the calculated ECD spectra for
4″S,5″S,9″R,10″S,14″S
and
4″S,5″S,9″R,10″S,14″R
at the TD-DFT-B3LYP/6-31G(d) level with the experimental spectra of compounds
1 and 2 in MeOH (the red trace indicates the gas phase;
the blue trace indicates the PCM in MeOH). (C) Natural transition orbitals (NTOs) of
the most stable conformer involved in each transition computed at the
TD-DFT-B3LYP/6-31G(d) level.X-ray diagram of 1 (14″S), showing a keto
absolute configuration and a pyran skeleton at the C ring. The bond length of 1.2368
Å at the C-5/O-5 positions is less than 1.2938 Å at the C-1/O-1 positions;
thus the double bonds could be assigned to the C-5/O-5 positions.Overlapping signals.ND: not determined due to overlap with other signals or solvent signals.Compound 2 (wallichin F) gave the same molecular formula,
C31H42O6, as compound 1. Both the
1H and 13C NMR spectroscopic data of 2 were closely
comparable to those of 1, but key differences were displayed at C-12″,
C13″, and C-14″ (Table ). Thus, it
was evident that 1 and 2 are isomers. The [α]D
value of 2 was found to be +9.5, while the corresponding value for
1 was +61.8. To clarify the diastereomeric nature of 1, the
ECD spectra of 2 were calculated, and the absolute configuration at the
C-14″ position was established as R by analysis of the computed
ECD data (Figure B). Therefore, the structure of
(+)-2 was assigned as
4″S,5″S,9″R,10″S,14″R.
Inhibition of SARS-CoV-2 and 3CLpro
The signals of mAU detected at 280 nm indicated that it contains two major compounds
(3 and 4) at a ratio of approximately 1.2:1, with retention
times of 3 and 4 at 11.8 and 15.2 min, respectively (Figure S3A,
Supporting Information). The SARS-CoV-2 inhibitory activities
of the isolated compounds were evaluated in vitro in Vero E6 cells. Separate line charts
of the inhibitory percentage at a final concentration of 10 μM are shown in Figure A. Among these compounds, the mixed compounds
3 and 4 presented a high inhibition rate of 87.7%, and
2 had a 51.7% inhibition rate. Thus the mixed compounds 3 and
4 were used to test further activities. The natural content ratio (1.2:1)
of the mixture showed an inhibitory activity of 6.8 μM EC50 (Figure B). Mixed compounds 3 and
4 and pure 3 and 4 displayed inhibitory
activities against SARS-CoV-2, with EC50 values from 4.5 to 12.1 μM in
Vero E6 cells. Figure B shows not only that the
natural ratio of 3 and 4 maintained SARS-CoV-2 inhibition
activity but also that purified compound 3 exhibited improved inhibitory
activity toward SARS-CoV-2 at the cellular level. When comparing the EC50
values of 3 and 4, there were slight differences due to the
chirality at C-14″ R or C-14″ S. The in
vitro cytotoxicity of the compounds was tested using a CCK-8 assay. The resulting
CC50 values of compounds in Vero E6 cells were over 140 μM (Figure S3B,
Supporting Information), indicating low cytotoxicity. The
corresponding selectivity index (SI = CC50/EC50) values were >27,
>35, and >11 for a mixture of 3 and 4 and pure
3 and 4, respectively. To validate the activity mechanism,
Vero E6 cells were infected with both VSV-delG/VSV and VSV-delG/SARS-CoV-2 pseudoviruses
at a multiplicity of infection (MOI) of 0.01 in different concentrations of a mixture of
3 and 4 and pure 3 and 4 between 50
and 100 μM (Figure S3C and D, Supporting Information). An inhibition rate below 50% at 100 μM showed
that these compounds are not able to block SARS-CoV-2-infected cells. The results
indicated that compounds 3 and 4 may act as SARS-CoV-2
inhibitors. It was then investigated as to whether the cell-type-dependent differences in
entry inhibition translated into differential inhibition of authentic SARS-CoV-2. The
EC50 values of pure 3 and 4 were examined in Calu-3
cells infected with SARS-CoV-2 at an MOI of 0.05. The test results showed that
3 and 4 suppressed SARS-CoV-2 with EC50 values of
20.2 and 30.0 μM, respectively (Figure C).
The CC50 values of 3 and 4 in Calu-3 cells were
64.3–98.6 μM, which suggested that these compounds had moderate cytotoxicity
for Calu-3 cells (Figure S3E, Supporting Information). Therefore, compounds 3 and
4 showed inhibitory activities against Calu-3 cells infected with
SARS-CoV-2.
Figure 5
Activities of 1–4 against SARS-CoV-2. (A) The %
inhibition of SARS-CoV-2 was measured at a final concentration of 10 μM
(n = 3, independent experiments). (B) Vero E6 cells at an MOI of
0.01 were infected with SARS-CoV-2 and treated with a series of concentrations of the
indicated antivirals for 24 h p.i. (n = 3). (C) Calu-3 cells were
infected with SARS-CoV-2 at an MOI of 0.05 and treated with different doses of the
indicated antivirals for 24 h p.i. (n = 3). (D) Representative
inhibition curves for baicalein, a mixture of compounds 3 and
4, and pure 3 and 4 against 3CLpro
(n = 3). (E–H) Affinities of mixed 3 and
4, pure 3 and 4, and baicalein for 3CLpro
analyzed using an SPR assay.
Activities of 1–4 against SARS-CoV-2. (A) The %
inhibition of SARS-CoV-2 was measured at a final concentration of 10 μM
(n = 3, independent experiments). (B) Vero E6 cells at an MOI of
0.01 were infected with SARS-CoV-2 and treated with a series of concentrations of the
indicated antivirals for 24 h p.i. (n = 3). (C) Calu-3 cells were
infected with SARS-CoV-2 at an MOI of 0.05 and treated with different doses of the
indicated antivirals for 24 h p.i. (n = 3). (D) Representative
inhibition curves for baicalein, a mixture of compounds 3 and
4, and pure 3 and 4 against 3CLpro
(n = 3). (E–H) Affinities of mixed 3 and
4, pure 3 and 4, and baicalein for 3CLpro
analyzed using an SPR assay.Compounds 3 and 4 were tested against seven target proteins
(TMPRSS2, CTSB, RBD, Npro, Spike-FL, 3CLpro, and ACE2). These compounds were not active
against the TMPRSS2, CTSB, Npro, Spike-FL, and ACE2 proteins, according to results of
surface plasmon resonance (SPR). Compounds 3 and 4 weakly bound
to the RBD protein with Kd values of 64.7–97.1 μM
(detailed information on the SPR method is provided in the Supporting Information, Figures S4–S6 and Table S2). However,
3 and 4 had a strong interaction with 3CLpro from 5.6 to 8.1
μM, as determined by a fluorescence resonance energy transfer (FRET) protease assay
(IC50 in Figure D and Table ). Moreover, SPR provided affinity values and
indicated that 3 and 4 strongly interacted with 3CLpro with
Kd values of 12.0–16.6 μM (Figure E–G and Figure S7A–C, Supporting Information) compared with baicalein, with a
Kd value of 31.3 μM (Figure H and Figure S7D, Supporting Information). Taken together, the results showed that
3 and 4 mainly target SARS-CoV-2 3CLpro, and 3
may serve as a potential SARS-CoV-2 inhibitor.
Table 2
Predicted Active Sites and Activities of Test Compounds
Molecular docking was performed with AutoDock4.2 to create an interaction model and
identify the essential amino acid residues of 3CLpro. The binding sites of baicalein
were determined from the PDB file (6M2N).
The octanol–water partition coefficient (ClogP) was obtained using the
ALOGPS 2.1 program.
Molecular docking was performed with AutoDock4.2 to create an interaction model and
identify the essential amino acid residues of 3CLpro. The binding sites of baicalein
were determined from the PDB file (6M2N).The octanol–water partition coefficient (ClogP) was obtained using the
ALOGPS 2.1 program.
Structural Insights from Docking Experiments
To predict the binding behaviors between 3CLpro and 3, docking studies were
carried out with the AutoDock 4.2 program.[22] The interactions between
3CLpro and 3 were determined to be mainly hydrogen bonds and hydrophobic,
consisting of residues from loop 1, loop 2, and helix 1 (Figure A). The hydrogen bonds displayed strong interactions among ASN142
(loop 1), ARG188 (loop 2), GLU166 (helix 1), and the phenolic hydroxy group of the B ring,
and the acetyl group at the A ring formed a hydrogen bond with GLN192 (loop 2) and was
close to helical regions, enhancing interactions of the activity region of 3CLpro. The
butyl chain and phenol groups of the B ring formed double hydrophobic interactions with
PHE140 and MET165 (helix 1), respectively. The butyl chain forming hydrophobic
interactions with PHE140 of loop 1 might account for the stable spatial occupation.
Although the E ring offers hydrogen bonds to the amino acids of 3CLpro, hydrogen-bonding
interactions from the active site were not discerned in Figure A. Compound 3 adopted a lock-shaped conformation in
the 3CLpro pocket (Figure B), which was larger
than the binding domain in the inhibitor baicalein (Figure C).[23] Furthermore, a molecular dynamics (MD)
simulation of 3CLpro and 3 was performed using Gromacs 5.0.[24] The stabilization of 3 in the active site of 3CLpro was
observed by root-mean-square deviation (RMSD) in Figure D–F. These results provided information on the ligand (Figure D), 3CLpro (Figure E), and 3CLpro–ligand complex stability (Figure F). During the 100 ns MD simulation process, three of the RMSD
values ranged within 0.3 Å. The ligand (3) RMSD remained stable
throughout the MD equilibration process after 15 ns, and no obvious changes were observed.
The RMSD values of 3CLpro varied greatly compared with ligand RMSDs during 0–15 ns.
To explain this phenomenon, principal component analysis (PCA) was performed with the
MODE-TASK script.[25] The main differences in the MD trajectory were
classified into two clusters including the red zone at 100 ns and the blue zone at 10 ns
(Figure G). The representative structures are
aligned in Figure H. The differences were
attributed to loose loop regions of 3CLpro allowing for the side structure to be more
flexible. These simulation results provide the interaction details together of 3CLpro
inhibited by 3 and an interpretation for the potent activity determined for
compound 3 against SARS-CoV-2.
Figure 6
(A) Structural analysis of inhibitor 3 binding to 3CLpro. Residues of
3CLpro are shown in green, and 3 is shown in light gray. (B)
Electrostatic potential of 3 docked to the structure of 3CLpro (PDB code
6M2N). The substrate-binding
region is shown by different colors (−5.0 to 5.0 kcal/mol). (C) Structural
analysis of 3CLpro and baicalein crystal complexes. (D–F) RMSD values of ligand
(3), 3CLpro, and 3CLpro+ligand (3) heavy atoms show that
the docked positions are stable during 100 ns MD simulations. (G) Results of PCA on a
MD trajectory of 3CL protease. The Kaiser–Meyer–Olkein (KMO) value was
0.99, and the KMO value ranged from 0 to 1, with 1 indicating that the MD simulation
was sufficiently equilibrated. (H) The 10 ns structure of 3CLpro+3 was
aligned to the 100 ns structure.
(A) Structural analysis of inhibitor 3 binding to 3CLpro. Residues of
3CLpro are shown in green, and 3 is shown in light gray. (B)
Electrostatic potential of 3 docked to the structure of 3CLpro (PDB code
6M2N). The substrate-binding
region is shown by different colors (−5.0 to 5.0 kcal/mol). (C) Structural
analysis of 3CLpro and baicalein crystal complexes. (D–F) RMSD values of ligand
(3), 3CLpro, and 3CLpro+ligand (3) heavy atoms show that
the docked positions are stable during 100 ns MD simulations. (G) Results of PCA on a
MD trajectory of 3CL protease. The Kaiser–Meyer–Olkein (KMO) value was
0.99, and the KMO value ranged from 0 to 1, with 1 indicating that the MD simulation
was sufficiently equilibrated. (H) The 10 ns structure of 3CLpro+3 was
aligned to the 100 ns structure.
Experimental Section
General Experimental Procedures
The melting point was measured on an X-4 micro melting point apparatus. Optical rotations
were determined using a Rudolph Autopol III polarimeter (Rudolph Research Analytical,
Hackettstown, NJ, USA). UV spectra were recorded with a Shimadzu UV-2401A
spectrophotometer. The Fourier transform infrared (FT-IR) spectra were recorded using a
Thermo Scientific Nicolet iS10 FT-IR spectrometer. The ECD data were recorded on a
Chirascan V100 spectrometer. NMR data were obtained on a Bruker AV-600 (13C:
150 MHz, 1H: 600 MHz) NMR spectrometer. HRESIMS data and analytical LC/MS data
were obtained on an Agilent 6540 Q-TOF mass spectrometer.
Plant Material
Rhizomes of D. wallichiana were collected in November 2019 from Fu Gong
County of Yunnan Province, People’s Republic of China, and identified by Dr.
Zheng-Yu Zuo. A voucher specimen (KUN 1519972) was deposited at the State Key Laboratory
of Phytochemistry and Plant Resource in West China, Kunming Institute of Botany, Chinese
Academy of Sciences.
Extraction and Isolation
The chopped, air-dried rhizomes of D. wallichiana (∼4700 g) were
extracted for 2 weeks at 40 °C with Et2O, and the resulting extract
(∼125 g) was subjected to silica gel chromatography to afford two fractions, 1
(∼31 g) and 2 (∼17 g). Fraction 1 was treated by column chromatography (CC)
on a Sephadex LH-20 column and further purified by HPLC [Ultrasphere Si (10 mm × 250
mm, 5 μm)] using n-hexane–EtOAc (95:5) together with HOAc
(0.5%), 2 mL/min, to obtain 3 (320 mg, ∼0.5% yield) and 4
(260 mg, ∼0.4% yield). Fraction 2 was reprocessed by HPLC [Hypersil BDS
C18 (10 mm × 250 mm, 5 μm),
H2O–MeOH–isopropanol (30:35:35) with 0.5% HCOOH, 2.0 mL/min)] to
purify 1 and 2. Fraction B also yielded 1 (45 mg,
∼0.075% yield) and 2 (10 mg, ∼0.016% yield).
Crystals of 1 were obtained from
CH2Cl2–CH3OH (1:1) at 4 °C, which was
analyzed by X-ray diffraction with Cu Kα radiation (Figure S2, Supporting Information). The intensity data were collected at 100 K on a
Bruker APEX DUO diffractometer equipped with an APEX II CCD using Cu Kα radiation.
Cell refinement and data reduction were performed with Bruker SAINT. Structures were
resolved by direct methods, expanded by Fourier techniques, and refined by the program and
full-matrix least-squares calculations. All non-hydrogen atoms were refined
anisotropically, and all hydrogen atoms were fixed at the calculated positions.
Crystallographic data were deposited at the Cambridge Crystallographic Data Center
(number: CCDC 2106340).
Crystal Data of 1
C31H42O6, M = 510.64,
a = 6.1780(2) Å, b = 7.9180(3) Å,
c = 14.8892(6) Å, α = 93.3090(10)°, β =
96.4600(10)°, γ = 98.8030(10)°, V = 713.11(5)
Å3, T = 100(2) K, space group P1,
Z = 1, μ(Cu Kα) = 0.650 mm–1,
21 537 reflections measured, 5432 independent reflections
(Rint = 0.0532). The final R1
values were 0.0548 (I > 2σ(I)). The final
wR(F2) values were 0.1417
(I > 2σ(I)). The final
R1 values were 0.0555 (all data). The final
wR(F2) values were 0.1434 (all data). The
goodness of fit on F2 was 1.016. The absolute configuration
was determined by the Flack parameter = −0.01(10).
Molecular Modeling for 3CLpro and Compound 3
The 3CLpro structure was docked with 3 and 4 via the AutoDock
4.2 software package.[26] The conversion of 3 and
4 from 2D to 3D structures was carried out using Open Babel GUI
software.[27] Processing of 3CL protease (3CLpro, PDB entry 6M2N) was performed with Auto Dock Tools
(ADT). Water molecules and original ligands were removed using the edit module. Gasteiger
charges and polar hydrogen atoms were assigned to the 3D structure file, and the genetic
algorithm (GA) run was selected as 50 to obtain the conformations. The center coordinate
was obtained from the original ligand, and a docking grid with a size of 60 × 60
× 60 was used. Compound 3 was then docked into the active site of 3CLpro
to obtain a complex structure. The structure was subjected to MD simulation using GROMACS
5.0. The antechamber tool was used to generate the restricted electrostatic point charge
(RESP) of 3 after calculation by Gaussian 09 at the HF/6-31G(d,p) level. A
force field of 3 was taken from the general AMBER force field. ff99SB was
applied to 3CLpro, and the TIP3P water model was added with a boundary 2.0 Å away
from 3CLpro atoms. The whole system was neutralized by the counterions, and then the
steepest descent algorithm was used to minimize within a force tolerance of 10 kJ/mol.
Consequently, the systems with position restraints were equilibrated to 310 K on the
3CLpro and ligand atom. The pressure and temperature were controlled by a
Parrinello–Rahman barostat (2 bar) with τp = 2.0 ps and a
compressibility of 4.5 × 10–5 bar and the V-rescale method with
τT = 0.1 ps, respectively. Long-range electrostatic interactions were
involved within a 1.2 nm cutoff using the particle-mesh Ewald (PME) algorithm.
Constraining the bond lengths and angles of hydrogen bonds was applied by the lincs
algorithm. The initial 1.0 ns MD simulation of NPT was position restrained during the
equilibration process. Canonical 100 ns MD simulation runs were performed for
protein–ligand complexes. The time step and the saved coordinates were set to 2.0
fs and 1.0 ps. 3CLpro–ligand interaction files were taken from the PLIP online tool
(https://plip-tool.biotec.tu-dresden.de/plip-web/plip/index) and visualized using
PyMOL.
3CLpro Enzyme Activity Inhibition Test
The 3CLpro inhibitory activity of the compounds was determined by a FRET protease assay.
MCA-AVLQSGFR-Lys(Dnp)-Lys-NH2 solution was used as the fluorogenic substrate
(Genscript). According to previous reports, a FRET-based protease assay was carried
out.[4,28,29] Briefly, 20 μL of a series of test compound solutions in 50 mM
Tris and 1 mM EDTA buffer at pH = 6.74 were mixed with 5 μL of 1.2 μM 3CLpro
and incubated for 10 min. Then, 20 μL of 20 μM peptide substrate was added to
the solution. The relative fluorescence units (RFU) at 37 °C for 10 min were recorded
with 320 nm excitation and 405 nm emission wavelengths by using SpectraMax iD3 (Molecular
Devices, USA). The percent inhibition ratio of 3 and 4 and
baicalein was calculated using the equation as follows: inhibition activity (%) =
(F3CLpro –
FComps)/(F3CLpro –
FSub) × 100%, where F3CLpro
is the fluorescence value of the 3CLpro control (3CLpro, buffer, and substrate),
FComps is the fluorescence value of the tested compounds
(3CLpro, compound solution, and substrate), and FSub is the
fluorescence value of the substrate control. Each experiment was performed in
triplicate.
Pseudovirus Infection Inhibition Assay
The pseudoviruses of SARS-CoV-2 were built using a GFP-encoding replication-deficient
vesicular stomatitis virus (VSV) vector backbone (VSV-delG-GFP) and prepared as previously
described.[30] Briefly, pseudovirus (50 μL) and serially diluted
test compounds (100 to 0.39 μM) were added to the preplated Vero cells. Vero cells
infected with VSV-delG/VSV were used as controls, and then the plates were imaged 15 h
p.i. The imaging analysis of fluorescent cells and the transducing unit (TU) numbers were
conducted using a CQ1 confocal image cytometer (Yokogawa, Japan). Each experiment was
performed with six replicates.
Evaluation of Cytotoxicity and Antiviral Activities of Compounds 3 and
4
The cytotoxicity of 3 and 4 to Vero E6 cells and Calu-3 cells
was tested by a CCK-8 assay kit (Beyotime, People’s Republic of China). To estimate
the antiviral efficiency of isolated compounds, Vero E6 cells and Calu-3 cells at a
density of 5 × 104 cells per well were seeded in 48-well cell culture
plates and cultured approximately 12 h before infection. The cells were treated with
multiple doses of 3 and 4 for 1 h, and a virus MOI of 0.01 for
Vero E6 cells (0.05 for Calu-3 cells) was added gradually and infected for 2 h. Then, the
virus and compound mixture were removed, and Vero E6 and Calu-3 cells were cultured with
new compound-containing medium. Twenty-four hours p.i., the cell supernatant was collected
and lysed in lysis buffer for further qRT-PCR quantification analysis to obtain
EC50 values. The experiment was completed at biosafety level-3 (BSL-3).
SPR Assay of Compounds
The binding affinity of compounds and 3CLpro was analyzed using a Biacore S200 system (GE
Healthcare). Briefly, 100 μg of 3CLpro was immobilized on a CM5 sensor chip using a
pH = 4.0 acetic acid solution. Compounds 3 and 4 and baicalein
at different concentrations (50, 40, 30, 15, 7.5, 3.75, and 1.875 μM) were injected
for 120 s at a flow rate of 40 μL/min. A cleaning solution of PBS with 50% DMSO was
used to regenerate the CM5 chip. All response values were collected at 25 °C. Biacore
Evaluation Software 2.0 was used to generate Kd values.