Literature DB >> 35084181

Target-Based Virtual Screening and LC/MS-Guided Isolation Procedure for Identifying Phloroglucinol-Terpenoid Inhibitors of SARS-CoV-2.

Bo Hou1, Yu-Min Zhang2, Han-Yi Liao3, Li-Feng Fu3, De-Dong Li3, Xin Zhao3, Jian-Xun Qi3, Wei Yang3, Geng-Fu Xiao2, Lian Yang1, Zheng-Yu Zuo1, Lin Wang4, Xiang-Lei Zhang4, Fang Bai4, Liu Yang1, George F Gao3, Hao Song3,5, Jiang-Miao Hu1, Wei-Juan Shang2, Jun Zhou1.   

Abstract

The coronavirus disease 2019 (COVID-19) pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has led to more than 5 million deaths worldwide to date. Due to the limited therapeutic options so far available, target-based virtual screening with LC/MS support was applied to identify the novel and high-content compounds 1-4 with inhibitory effects on SARS-CoV-2 in Vero E6 cells from the plant Dryopteris wallichiana. These compounds were also evaluated against SARS-CoV-2 in Calu-3 cells and showed unambiguous inhibitory activity. The inhibition assay of targets showed that compounds 3 and 4 mainly inhibited SARS-CoV-2 3CLpro, with effective Kd values. Through docking and molecular dynamics modeling, the binding site is described, providing a comprehensive understanding of 3CLpro and interactions for 3, including hydrogen bonds, hydrophobic bonds, and the spatial occupation of the B ring. Compounds 3 and 4 represent new, potential lead compounds for the development of anti-SARS-CoV-2 drugs. This study has led to the development of a target-based virtual screening method for exploring the potency of natural products and for identifying natural bioactive compounds for possible COVID-19 treatment.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35084181      PMCID: PMC8806002          DOI: 10.1021/acs.jnatprod.1c00805

Source DB:  PubMed          Journal:  J Nat Prod        ISSN: 0163-3864            Impact factor:   4.050


The recently described SARS-CoV-2, the causative agent of the coronavirus disease 2019 (COVID-19) pandemic, has caused over 320 million cases of infection and more than 5 million deaths thus far, leading to worldwide economic and social disruption.[1,2] There are only a few effective antiviral drugs available for the prophylactic or therapeutic treatment of the highly contagious SARS-CoV-2 in humans. Also, a few pre-existing agents and drug candidates, such as remdesivir (RDV), have been used as treatment regimens against COVID-19. An RDV prodrug, a nucleoside-analogue inhibitor, impairs viral infection by targeting RNA-dependent RNA polymerase (RdRp). RDV potently inhibits SARS-CoV-2 replication at EC50 values of 0.77–3.68 μM.[3,4] Recent reports of the compassionate use of RDV were reported in 68% of patients with severe COVID-19.[2,5,6] With different mechanisms, inhibitors or blockers such as bemnifosbuvir (AT-527) and PF-07321332, have been developed for structure-based drug discovery.[7−10] Bemnifosbuvir, an orally administered double prodrug of a guanosine nucleotide analogue, has potent antiviral activity by targeting both RdRp and the Nidovirus RdRp-associated nucleotidyltransferase (NiRAN) in vitro, including SARS-CoV-1, SARS-CoV-2, and hepatitis C virus. This prodrug is currently in phase II clinical trials for SARS-CoV-2 inhibition, and phase III clinical trials are expected to be completed very soon.[11,12] PF-07321332, designed by Pfizer, is a second-generation, orally available SARS-CoV-2-3CL protease (3CLpro) inhibitor and has been evaluated in a phase I clinical trial study (NCT04756531) for safety. Phase II/III clinical trials (NCT05011513 and NCT05047601) to evaluate safety and efficacy are currently in progress, and combination therapy with low-dose ritonavir is expected to help 3CLpro remain active in the body for longer periods.[13,14] The conclusions of phase trials are pending and will determine whether useful agents such as bemnifosbuvir, plitidepsin, and PF-07321332 are effective in the treatment of COVID-19.[12] Thus far, target-specific drugs are still expected to be exploited for SARS-CoV-2. Natural products and their derivatives have played an important role in the new drug development process, and nearly 50% of these products are FDA-approved drugs.[15] Natural products represent a good source of bioactive molecules and are characterized by extensive structural diversity.[16,17] In an effort to combat SARS-CoV-2, medicinal chemists have developed different and practicable strategies for new drugs. Target-based virtual screening involves docking many molecules to potential multidrug targets by a virtual screening procedure. The key docking scores and the number of directed edges were used to guide the selection of potentially active molecules. This approach is easy to implement, and many new medicinal functions of natural products can be exploited (Figure ).[18] This strategy was used to screen natural products of Dryopteris species (Table S1, Supporting Information) to obtain molecules that may have a blocking or inhibitory effect on coronavirus. The results of the virtual screening showed that compounds 1BIV-10 and 1BIV-11 have the characteristics of good docking scores and multiple directed edges (Figure S1, Supporting Information). When predicted active molecules (1BIV-10 and 1BIV-11) were found in the compound library of Dryopteris species, fragment-based potential molecules were obtained by LC/MS and HPLC methods. According to these structures, molecular masses of 510 and 718 Da were found to have similar fragments. In addition, the investigation of compounds with unusual terpenoid moieties attached to acylphloroglucinol residues from Dryopteris species is of interest. The next step was to isolate and identify target compounds by HLPC, LC/MS, NMR, and X-ray crystallographic techniques. LC/MS-guided isolation of compounds from Dryopteris wallichiana (Spreng.) Hylander (Dryopteridaceae) afforded two new acylphloroglucinols, wallichins E and F (1 and 2, Figure A), along with the previously known compounds wallichins C (3) and D (4).[19]
Figure 1

Target-based virtual screening used for natural product development against SARS-CoV-2. (A) Possible targets for new drug development (2′-O-methyltransferase (2′OMT, PDB: 6W75), 3CL protease (3CLpro, PDB: 6M2N), nonstructural proteins 3 (NSP3, PDB: 6W6Y), nonstructural proteins 15 (NSP15, PDB: 6WXC), nonstructural proteins 1 (NSP1, PDB: 7K3N), helicase (Hel, PDB: 6JYT), nonstructural proteins 15 (NSP15, PDB: 6WXC), RNA-dependent RNA polymerase (RdRp, PDB: 7B3C), spike glycoprotein (SGP, PDB: 6VXX), spike-RBD (RBD, PDB: 7B14), transmembrane protease serine 2 (TMPRSS2, PDB: 7MEQ), nucleocapsid (Npro, PDB: 6WZQ), cathepsin B (CTSB, PDB: 3AI8), angiotensin-converting enzyme 2 (ACE2, PDB: 6CS2)). (B) Compound database of Dryopteris species. (C) Predicted target network of target-based virtual screening. (D) Predicted molecular fragment and molecular mass. (E) LC/MS analysis of the diethyl ether fraction from D. wallichiana. Acquity Zorbax SB-C18 column (4.6 mm × 250 mm, 5 μm) at a flow rate of 1.0 mL/min; gradient elution starting at 5% MeOH in water, ramping to 100% in 100 min, then to 100% MeOH with 0.5% HCOOH.

Figure 2

(A) Chemical structures. (B) Key HMBC, COSY, and NOE correlations of 1–4.

Target-based virtual screening used for natural product development against SARS-CoV-2. (A) Possible targets for new drug development (2′-O-methyltransferase (2′OMT, PDB: 6W75), 3CL protease (3CLpro, PDB: 6M2N), nonstructural proteins 3 (NSP3, PDB: 6W6Y), nonstructural proteins 15 (NSP15, PDB: 6WXC), nonstructural proteins 1 (NSP1, PDB: 7K3N), helicase (Hel, PDB: 6JYT), nonstructural proteins 15 (NSP15, PDB: 6WXC), RNA-dependent RNA polymerase (RdRp, PDB: 7B3C), spike glycoprotein (SGP, PDB: 6VXX), spike-RBD (RBD, PDB: 7B14), transmembrane protease serine 2 (TMPRSS2, PDB: 7MEQ), nucleocapsid (Npro, PDB: 6WZQ), cathepsin B (CTSB, PDB: 3AI8), angiotensin-converting enzyme 2 (ACE2, PDB: 6CS2)). (B) Compound database of Dryopteris species. (C) Predicted target network of target-based virtual screening. (D) Predicted molecular fragment and molecular mass. (E) LC/MS analysis of the diethyl ether fraction from D. wallichiana. Acquity Zorbax SB-C18 column (4.6 mm × 250 mm, 5 μm) at a flow rate of 1.0 mL/min; gradient elution starting at 5% MeOH in water, ramping to 100% in 100 min, then to 100% MeOH with 0.5% HCOOH. (A) Chemical structures. (B) Key HMBC, COSY, and NOE correlations of 1–4.

Results and Discussion

Structural Identification

Compound 1, isolated as a yellow gum, exhibited a molecular formula of C31H42O6, as deduced from the molecular ion peak observed at m/z 511.3054 [M + H]+ (calcd 511.3054) in its HRESIMS, having 11 degrees of unsaturation. Compound 1 contained five 1H NMR methyl signals, including one gem-dimethyl group (Table ), one methyl signal of an acetyl group, and three additional methyl signals. The 1H NMR spectrum of 1 in acetone-d6 displayed a singlet at a very low field (δH 19.02 ppm, −OH), which was assigned to an enolizable β-triketo carbonyl group. The 1H NMR signals of the gem-dimethyl group at 1.30 and 1.34 ppm (3H each, s) were attributed to the presence of an acylfilicinic acid ring system. Two doublets resonating at 2.34–2.45 ppm were observed in the 1H NMR spectrum of 1, indicating a methylene bridge, and its 13C NMR chemical shift (17.9 ppm) was consistent with the CH2 moiety being connected to a terpenoid unit and an acylfilicinic acid moiety. The HMBC spectrum of 1 showed cross-peaks between H-11/C-10 (Figure B), as well as between H-14″/C-7′, supporting the occurrence of an acetyl moiety (108.3 ppm, C-6) on the A ring and the linkage position of the terpenoid moiety. The position of the side chain at C-9″ could be determined from HMBC coupling between C-12″ and H-9″ (Table ). Except for the contributions of the A ring to the molecular formula of compound 1, the elemental composition of the terpenoid fragment was determined as C21H32O2. Therefore, the terpenoid residue should have six degrees of unsaturation. The 1H NMR spectrum of 1 displayed a CH2 single double bond (m 4.52, 1H-17″ and m 4.82, 1H-17″) and trisubstituted double bonds (t, 6.4, 1H-12″). The signal at C-18″ (δC 180.0 ppm) in the 13C NMR spectrum of 1 suggested a carboxylic acid group substitution. The remaining terpenoid fragment contained three ring systems. The side chain forming a pyran ring was observed from the 13C NMR signal at the C-14″ position (δc 83.8 ppm). The E configuration was assigned through the associated NOESY signals between H-12″/H-14″ and H-11″/H-16″ (Figure B). Other NOESY signals of H-20″/H-11″, H-20″/H-19″, and H-5″/H-9″ supported the relative configuration assignment made for the terpenoid moiety.[20] According to the proposed configurations of the terpenoid group, two theoretical structures could be proposed with R and S configurations at C-14″. To clarify the absolute configuration of 1 (Figure A), the geometries of the R and S configurations in gas were optimized initially at the density functional theory (DFT)-B3LYP/6-31G(d) level. The electronic circular dichroism (ECD) spectra of the gas model and polarizable continuum model (PCM) were obtained at the obtained geometries using the time-dependent DFT (TD-DFT) method and the same basis set. The conformational contributions afforded two of the lowest-energy conformers (accounting for >90%). The experimental ECD spectrum of (+)-1 displayed negative Cotton effects (CEs) at 200–223 nm and positive CEs at 223–374 nm. The computed ECD spectrum at the 14″S configuration matched well with the experimental ECD spectrum of 1 (Figure A). The n → π* electronic transitions from S0 of the acylphloroglucinol moiety to S3 of the olefin group (Figure C) afforded a rotatory strength Δε = +0.8 at 304 nm, which was in agreement with the weak positive CEs Δε = +1.5 in the experimental ECD spectrum.[21] The other positive rotatory strength at electronic transitions between 304 and 350 nm was dominated by other π → π* characters of the acylphloroglucinol group (>90% contributions), which was consistent with the experimental positive CEs in the ECD spectrum. The negative rotatory band (Δε = −3.8) at 210 nm was in accordance with the experimental negative CE Δε = −2.8 in the ECD spectrum. The S configuration of C-14″ in 1 could be identified from the comparison results of the ECD spectrum using TD-DFT at the B3LYP/6-31G(d) level in the gas model and PCM. Single-crystal X-ray diffraction analysis confirmed this (Figure and Figure S2, Supporting Information) using Cu Kα radiation, which allowed for assignment of the absolute configuration of (+)-1 (wallichin E) as 4″S,5″S,9″R,10″S,14″S.
Table 1

1H (600 MHz) and 13C (150 MHz) NMR Spectroscopic Data of Compounds 1 and 2 (δ in ppm and J in Hz)

 (+)-1
(+)-2
no.δC, typeδH, mult. (J in Hz)HMBCδC, typeδH, mult. (J in Hz)HMBC
1189.1, C  189.1, C  
2108.3, Ca  108.3, Ca  
3174.9, C  175.0, C  
447.8, C  47.8, C  
5196.8, C  197.0, C  
6108.3, Ca  108.3, Ca  
7′17.9, CH22.34–2.45, m14″18.0, CH22.35–2.47, m14″
825.2, CH31.34, s3, 525.2, CH31.34, s3, 5
924.9, CH31.30, s3, 524.8, CH31.30, s3, 5
10200.6, C  200.7, C  
1127.8, CH32.51, s1027.9, CH32.51, s10
1″39.1, CH21.23, td (12.9, 3.5)2″, 3″, 9″, 10″, 19″, 20″39.1, CH21.24, td (12.8, 3.9)2″, 9″, 19″, 20″
1.85–1.87, ma2″, 4″, 9″, 19″1.86, m2″, 4″, 9″, 19″
2″19.2, CH21.58, ma1″, 4″, 5″, 19″19.2, CH21.58, ma1″, 4″, 5″, 19″
3″37.8, CH21.81, m2″, 4″, 18″, 19″37.8, CH21.81, m1″, 2″, 4″, 5″, 18″, 19″
1.58–1.60, ma2″, 4″, 18″, 19″1.58–1.60, ma1″, 2″, 4″, 18″, 19″
4″47.8, C  47.8, C  
5″50.4, CH2.02, d (3.3)1″, 4″, 6″, 8″, 9″, 10″, 18″, 19″, 20″50.4, CH2.03a1″, 4″, 6″, 8″, 9″, 10″, 18″, 19″, 20″
6″27.3, CH21.46, td (12.8, 4.3)5″, 7″, 8″27.3, CH21.46, td (12.5, 3.8)5″, 7″, 8″, 10″
1.38, ma8″, 10″1.38, ma
7″38.4, CH22.00–2.06, ma 38.4, CH22.00–2.06, ma 
2.29–2.37, m5″, 6″, 8″, 9″, 17″2.29–2.37, m5″, 6″, 8″, 9″, 17″
8″148.9, C  149.0, C  
9″58.0, CH1.79–1.87a1″, 5″, 7″, 8″ 10″, 11″, 12″, 17″, 20″58.0, CH1.79–1.87a1″, 5″, 8″, 7″, 10″, 11″, 12″, 17″, 20″
10″39.6, C  39.6, C  
11″23.1, CH22.16, ddd (16.2, 10.8, 6.4)8″, 9″, 12″, 13″23.1, CH22.18, m9″, 12″, 13″
2.36–2.39, m8″, 9″, 12″, 13″, 17″2.36–2.39, m8″, 9″, 12″, 13″, 17″
12″129.4, CH5.48 t (6.4)5″, 9″, 11″, 14″, 16″,129.8, CH5.47, t (6.3)11″, 14″, 16″
13″133.2, C  133.3, C  
14″83.8, CH4.52, m7′, 12″, 13″, 15″, 16″84.0, CH4.52, m7′, 12″, 13″, 16″
15″25.8, CH21.83–1.98, m, 2HNDb25.9, CH21.83–1.98, mNDb
16″12.4, CH31.73, s12″, 13″, 14″12.3, CH31.73, s12″, 13″, 14″
17″108.3, CH24.52, m7″, 8″, 9″108.3, CH24.49, s7″, 8″, 9″
4.82, m4.82, s
18″180.0, C  180.1, C  
19″17.1, CH31.14, s3″, 4″, 5″17.2, CH31.14, s3″, 4″, 5″
20″15.0, CH30.79, s1″, 5″, 9″, 10″15.0, CH30.79, s1″, 5″, 9″, 10″
OH-5 19.02, brs  19.01, brs 

Overlapping signals.

ND: not determined due to overlap with other signals or solvent signals.

Figure 3

(A, B) Comparison of the calculated ECD spectra for 4″S,5″S,9″R,10″S,14″S and 4″S,5″S,9″R,10″S,14″R at the TD-DFT-B3LYP/6-31G(d) level with the experimental spectra of compounds 1 and 2 in MeOH (the red trace indicates the gas phase; the blue trace indicates the PCM in MeOH). (C) Natural transition orbitals (NTOs) of the most stable conformer involved in each transition computed at the TD-DFT-B3LYP/6-31G(d) level.

Figure 4

X-ray diagram of 1 (14″S), showing a keto absolute configuration and a pyran skeleton at the C ring. The bond length of 1.2368 Å at the C-5/O-5 positions is less than 1.2938 Å at the C-1/O-1 positions; thus the double bonds could be assigned to the C-5/O-5 positions.

(A, B) Comparison of the calculated ECD spectra for 4″S,5″S,9″R,10″S,14″S and 4″S,5″S,9″R,10″S,14″R at the TD-DFT-B3LYP/6-31G(d) level with the experimental spectra of compounds 1 and 2 in MeOH (the red trace indicates the gas phase; the blue trace indicates the PCM in MeOH). (C) Natural transition orbitals (NTOs) of the most stable conformer involved in each transition computed at the TD-DFT-B3LYP/6-31G(d) level. X-ray diagram of 1 (14″S), showing a keto absolute configuration and a pyran skeleton at the C ring. The bond length of 1.2368 Å at the C-5/O-5 positions is less than 1.2938 Å at the C-1/O-1 positions; thus the double bonds could be assigned to the C-5/O-5 positions. Overlapping signals. ND: not determined due to overlap with other signals or solvent signals. Compound 2 (wallichin F) gave the same molecular formula, C31H42O6, as compound 1. Both the 1H and 13C NMR spectroscopic data of 2 were closely comparable to those of 1, but key differences were displayed at C-12″, C13″, and C-14″ (Table ). Thus, it was evident that 1 and 2 are isomers. The [α]D value of 2 was found to be +9.5, while the corresponding value for 1 was +61.8. To clarify the diastereomeric nature of 1, the ECD spectra of 2 were calculated, and the absolute configuration at the C-14″ position was established as R by analysis of the computed ECD data (Figure B). Therefore, the structure of (+)-2 was assigned as 4″S,5″S,9″R,10″S,14″R.

Inhibition of SARS-CoV-2 and 3CLpro

The signals of mAU detected at 280 nm indicated that it contains two major compounds (3 and 4) at a ratio of approximately 1.2:1, with retention times of 3 and 4 at 11.8 and 15.2 min, respectively (Figure S3A, Supporting Information). The SARS-CoV-2 inhibitory activities of the isolated compounds were evaluated in vitro in Vero E6 cells. Separate line charts of the inhibitory percentage at a final concentration of 10 μM are shown in Figure A. Among these compounds, the mixed compounds 3 and 4 presented a high inhibition rate of 87.7%, and 2 had a 51.7% inhibition rate. Thus the mixed compounds 3 and 4 were used to test further activities. The natural content ratio (1.2:1) of the mixture showed an inhibitory activity of 6.8 μM EC50 (Figure B). Mixed compounds 3 and 4 and pure 3 and 4 displayed inhibitory activities against SARS-CoV-2, with EC50 values from 4.5 to 12.1 μM in Vero E6 cells. Figure B shows not only that the natural ratio of 3 and 4 maintained SARS-CoV-2 inhibition activity but also that purified compound 3 exhibited improved inhibitory activity toward SARS-CoV-2 at the cellular level. When comparing the EC50 values of 3 and 4, there were slight differences due to the chirality at C-14″ R or C-14″ S. The in vitro cytotoxicity of the compounds was tested using a CCK-8 assay. The resulting CC50 values of compounds in Vero E6 cells were over 140 μM (Figure S3B, Supporting Information), indicating low cytotoxicity. The corresponding selectivity index (SI = CC50/EC50) values were >27, >35, and >11 for a mixture of 3 and 4 and pure 3 and 4, respectively. To validate the activity mechanism, Vero E6 cells were infected with both VSV-delG/VSV and VSV-delG/SARS-CoV-2 pseudoviruses at a multiplicity of infection (MOI) of 0.01 in different concentrations of a mixture of 3 and 4 and pure 3 and 4 between 50 and 100 μM (Figure S3C and D, Supporting Information). An inhibition rate below 50% at 100 μM showed that these compounds are not able to block SARS-CoV-2-infected cells. The results indicated that compounds 3 and 4 may act as SARS-CoV-2 inhibitors. It was then investigated as to whether the cell-type-dependent differences in entry inhibition translated into differential inhibition of authentic SARS-CoV-2. The EC50 values of pure 3 and 4 were examined in Calu-3 cells infected with SARS-CoV-2 at an MOI of 0.05. The test results showed that 3 and 4 suppressed SARS-CoV-2 with EC50 values of 20.2 and 30.0 μM, respectively (Figure C). The CC50 values of 3 and 4 in Calu-3 cells were 64.3–98.6 μM, which suggested that these compounds had moderate cytotoxicity for Calu-3 cells (Figure S3E, Supporting Information). Therefore, compounds 3 and 4 showed inhibitory activities against Calu-3 cells infected with SARS-CoV-2.
Figure 5

Activities of 1–4 against SARS-CoV-2. (A) The % inhibition of SARS-CoV-2 was measured at a final concentration of 10 μM (n = 3, independent experiments). (B) Vero E6 cells at an MOI of 0.01 were infected with SARS-CoV-2 and treated with a series of concentrations of the indicated antivirals for 24 h p.i. (n = 3). (C) Calu-3 cells were infected with SARS-CoV-2 at an MOI of 0.05 and treated with different doses of the indicated antivirals for 24 h p.i. (n = 3). (D) Representative inhibition curves for baicalein, a mixture of compounds 3 and 4, and pure 3 and 4 against 3CLpro (n = 3). (E–H) Affinities of mixed 3 and 4, pure 3 and 4, and baicalein for 3CLpro analyzed using an SPR assay.

Activities of 1–4 against SARS-CoV-2. (A) The % inhibition of SARS-CoV-2 was measured at a final concentration of 10 μM (n = 3, independent experiments). (B) Vero E6 cells at an MOI of 0.01 were infected with SARS-CoV-2 and treated with a series of concentrations of the indicated antivirals for 24 h p.i. (n = 3). (C) Calu-3 cells were infected with SARS-CoV-2 at an MOI of 0.05 and treated with different doses of the indicated antivirals for 24 h p.i. (n = 3). (D) Representative inhibition curves for baicalein, a mixture of compounds 3 and 4, and pure 3 and 4 against 3CLpro (n = 3). (E–H) Affinities of mixed 3 and 4, pure 3 and 4, and baicalein for 3CLpro analyzed using an SPR assay. Compounds 3 and 4 were tested against seven target proteins (TMPRSS2, CTSB, RBD, Npro, Spike-FL, 3CLpro, and ACE2). These compounds were not active against the TMPRSS2, CTSB, Npro, Spike-FL, and ACE2 proteins, according to results of surface plasmon resonance (SPR). Compounds 3 and 4 weakly bound to the RBD protein with Kd values of 64.7–97.1 μM (detailed information on the SPR method is provided in the Supporting Information, Figures S4–S6 and Table S2). However, 3 and 4 had a strong interaction with 3CLpro from 5.6 to 8.1 μM, as determined by a fluorescence resonance energy transfer (FRET) protease assay (IC50 in Figure D and Table ). Moreover, SPR provided affinity values and indicated that 3 and 4 strongly interacted with 3CLpro with Kd values of 12.0–16.6 μM (Figure E–G and Figure S7A–C, Supporting Information) compared with baicalein, with a Kd value of 31.3 μM (Figure H and Figure S7D, Supporting Information). Taken together, the results showed that 3 and 4 mainly target SARS-CoV-2 3CLpro, and 3 may serve as a potential SARS-CoV-2 inhibitor.
Table 2

Predicted Active Sites and Activities of Test Compounds

compound(s)ligand binding sitesaKd (μM)IC50 (μM, mean ± SD)EC50 in Vero E6 cells (μM, mean ± SD)ClogPb
3+4 12.05.6 ± 0.86.8 ± 1.6 
3PHE140, ASN142, MET165, GLU166, ARG188, GLN19216.67.5 ± 2.14.5 ± 1.75.41
4THR24, THR25, HIS41, SER46, MET165, PRO168, ASP187, ARG188, GLN19214.38.1 ± 2.012.1 ± 2.35.41
baicaleinHIS41, MET49, LEU141, CYS145, GLU16631.35.1 ± 1.2 3.19

Molecular docking was performed with AutoDock4.2 to create an interaction model and identify the essential amino acid residues of 3CLpro. The binding sites of baicalein were determined from the PDB file (6M2N).

The octanol–water partition coefficient (ClogP) was obtained using the ALOGPS 2.1 program.

Molecular docking was performed with AutoDock4.2 to create an interaction model and identify the essential amino acid residues of 3CLpro. The binding sites of baicalein were determined from the PDB file (6M2N). The octanol–water partition coefficient (ClogP) was obtained using the ALOGPS 2.1 program.

Structural Insights from Docking Experiments

To predict the binding behaviors between 3CLpro and 3, docking studies were carried out with the AutoDock 4.2 program.[22] The interactions between 3CLpro and 3 were determined to be mainly hydrogen bonds and hydrophobic, consisting of residues from loop 1, loop 2, and helix 1 (Figure A). The hydrogen bonds displayed strong interactions among ASN142 (loop 1), ARG188 (loop 2), GLU166 (helix 1), and the phenolic hydroxy group of the B ring, and the acetyl group at the A ring formed a hydrogen bond with GLN192 (loop 2) and was close to helical regions, enhancing interactions of the activity region of 3CLpro. The butyl chain and phenol groups of the B ring formed double hydrophobic interactions with PHE140 and MET165 (helix 1), respectively. The butyl chain forming hydrophobic interactions with PHE140 of loop 1 might account for the stable spatial occupation. Although the E ring offers hydrogen bonds to the amino acids of 3CLpro, hydrogen-bonding interactions from the active site were not discerned in Figure A. Compound 3 adopted a lock-shaped conformation in the 3CLpro pocket (Figure B), which was larger than the binding domain in the inhibitor baicalein (Figure C).[23] Furthermore, a molecular dynamics (MD) simulation of 3CLpro and 3 was performed using Gromacs 5.0.[24] The stabilization of 3 in the active site of 3CLpro was observed by root-mean-square deviation (RMSD) in Figure D–F. These results provided information on the ligand (Figure D), 3CLpro (Figure E), and 3CLpro–ligand complex stability (Figure F). During the 100 ns MD simulation process, three of the RMSD values ranged within 0.3 Å. The ligand (3) RMSD remained stable throughout the MD equilibration process after 15 ns, and no obvious changes were observed. The RMSD values of 3CLpro varied greatly compared with ligand RMSDs during 0–15 ns. To explain this phenomenon, principal component analysis (PCA) was performed with the MODE-TASK script.[25] The main differences in the MD trajectory were classified into two clusters including the red zone at 100 ns and the blue zone at 10 ns (Figure G). The representative structures are aligned in Figure H. The differences were attributed to loose loop regions of 3CLpro allowing for the side structure to be more flexible. These simulation results provide the interaction details together of 3CLpro inhibited by 3 and an interpretation for the potent activity determined for compound 3 against SARS-CoV-2.
Figure 6

(A) Structural analysis of inhibitor 3 binding to 3CLpro. Residues of 3CLpro are shown in green, and 3 is shown in light gray. (B) Electrostatic potential of 3 docked to the structure of 3CLpro (PDB code 6M2N). The substrate-binding region is shown by different colors (−5.0 to 5.0 kcal/mol). (C) Structural analysis of 3CLpro and baicalein crystal complexes. (D–F) RMSD values of ligand (3), 3CLpro, and 3CLpro+ligand (3) heavy atoms show that the docked positions are stable during 100 ns MD simulations. (G) Results of PCA on a MD trajectory of 3CL protease. The Kaiser–Meyer–Olkein (KMO) value was 0.99, and the KMO value ranged from 0 to 1, with 1 indicating that the MD simulation was sufficiently equilibrated. (H) The 10 ns structure of 3CLpro+3 was aligned to the 100 ns structure.

(A) Structural analysis of inhibitor 3 binding to 3CLpro. Residues of 3CLpro are shown in green, and 3 is shown in light gray. (B) Electrostatic potential of 3 docked to the structure of 3CLpro (PDB code 6M2N). The substrate-binding region is shown by different colors (−5.0 to 5.0 kcal/mol). (C) Structural analysis of 3CLpro and baicalein crystal complexes. (D–F) RMSD values of ligand (3), 3CLpro, and 3CLpro+ligand (3) heavy atoms show that the docked positions are stable during 100 ns MD simulations. (G) Results of PCA on a MD trajectory of 3CL protease. The Kaiser–Meyer–Olkein (KMO) value was 0.99, and the KMO value ranged from 0 to 1, with 1 indicating that the MD simulation was sufficiently equilibrated. (H) The 10 ns structure of 3CLpro+3 was aligned to the 100 ns structure.

Experimental Section

General Experimental Procedures

The melting point was measured on an X-4 micro melting point apparatus. Optical rotations were determined using a Rudolph Autopol III polarimeter (Rudolph Research Analytical, Hackettstown, NJ, USA). UV spectra were recorded with a Shimadzu UV-2401A spectrophotometer. The Fourier transform infrared (FT-IR) spectra were recorded using a Thermo Scientific Nicolet iS10 FT-IR spectrometer. The ECD data were recorded on a Chirascan V100 spectrometer. NMR data were obtained on a Bruker AV-600 (13C: 150 MHz, 1H: 600 MHz) NMR spectrometer. HRESIMS data and analytical LC/MS data were obtained on an Agilent 6540 Q-TOF mass spectrometer.

Plant Material

Rhizomes of D. wallichiana were collected in November 2019 from Fu Gong County of Yunnan Province, People’s Republic of China, and identified by Dr. Zheng-Yu Zuo. A voucher specimen (KUN 1519972) was deposited at the State Key Laboratory of Phytochemistry and Plant Resource in West China, Kunming Institute of Botany, Chinese Academy of Sciences.

Extraction and Isolation

The chopped, air-dried rhizomes of D. wallichiana (∼4700 g) were extracted for 2 weeks at 40 °C with Et2O, and the resulting extract (∼125 g) was subjected to silica gel chromatography to afford two fractions, 1 (∼31 g) and 2 (∼17 g). Fraction 1 was treated by column chromatography (CC) on a Sephadex LH-20 column and further purified by HPLC [Ultrasphere Si (10 mm × 250 mm, 5 μm)] using n-hexane–EtOAc (95:5) together with HOAc (0.5%), 2 mL/min, to obtain 3 (320 mg, ∼0.5% yield) and 4 (260 mg, ∼0.4% yield). Fraction 2 was reprocessed by HPLC [Hypersil BDS C18 (10 mm × 250 mm, 5 μm), H2O–MeOH–isopropanol (30:35:35) with 0.5% HCOOH, 2.0 mL/min)] to purify 1 and 2. Fraction B also yielded 1 (45 mg, ∼0.075% yield) and 2 (10 mg, ∼0.016% yield).

Wallichin E (1):

yellow gum; mp 240–247 °C; [α]D25 +61.8 (c 0.5, MeOH); UV (MeOH) λmax (log ε) 203 (4.03), 226 (3.82), 242 (3.82), 276 (3.64), 325 (3.78) nm; IR (ATR) νmax 3080, 2931, 2866, 1693, 1658, 1621, 1569, 1525, 1472 cm–1; ECD (c 5.5 × 10–4 M, MeOH) λmax (Δε) 209 (−2.78), 240 (+1.98), 322 (+1.96); 1H NMR (600 MHz, acetone-d6) and 13C NMR (150 MHz, acetone-d6), Table ; HRESIMS m/z [M + H]+ 511.3054 (calcd for C31H43O6 511.3054).

Wallichin F (2):

brown gum; [α]D25 +9.5 (c 0.4, MeOH); UV (MeOH) λmax (log ε) 203 (4.05), 227 (3.83), 242 (3.85), 276 (3.67), 326 (3.83) nm; IR (ATR) νmax 3080, 2931, 2865, 1694, 1657, 1620, 1569, 1524, 1471 cm–1; ECD (c 7.8 × 10–4 M, MeOH) λmax (Δε) 199 (2.43), 229 (−1.83), 247 (−0.72), 291 (−0.73), 318 (−1.14); 1H NMR (600 MHz, acetone-d6) and 13C NMR (150 MHz, acetone-d6), Table ; HRESIMS m/z [M + H]+ 511.3060 (calcd for C31H43O6 511.3054).

Crystal Structure Analysis of Compound 1

Crystals of 1 were obtained from CH2Cl2–CH3OH (1:1) at 4 °C, which was analyzed by X-ray diffraction with Cu Kα radiation (Figure S2, Supporting Information). The intensity data were collected at 100 K on a Bruker APEX DUO diffractometer equipped with an APEX II CCD using Cu Kα radiation. Cell refinement and data reduction were performed with Bruker SAINT. Structures were resolved by direct methods, expanded by Fourier techniques, and refined by the program and full-matrix least-squares calculations. All non-hydrogen atoms were refined anisotropically, and all hydrogen atoms were fixed at the calculated positions. Crystallographic data were deposited at the Cambridge Crystallographic Data Center (number: CCDC 2106340).

Crystal Data of 1

C31H42O6, M = 510.64, a = 6.1780(2) Å, b = 7.9180(3) Å, c = 14.8892(6) Å, α = 93.3090(10)°, β = 96.4600(10)°, γ = 98.8030(10)°, V = 713.11(5) Å3, T = 100(2) K, space group P1, Z = 1, μ(Cu Kα) = 0.650 mm–1, 21 537 reflections measured, 5432 independent reflections (Rint = 0.0532). The final R1 values were 0.0548 (I > 2σ(I)). The final wR(F2) values were 0.1417 (I > 2σ(I)). The final R1 values were 0.0555 (all data). The final wR(F2) values were 0.1434 (all data). The goodness of fit on F2 was 1.016. The absolute configuration was determined by the Flack parameter = −0.01(10).

Molecular Modeling for 3CLpro and Compound 3

The 3CLpro structure was docked with 3 and 4 via the AutoDock 4.2 software package.[26] The conversion of 3 and 4 from 2D to 3D structures was carried out using Open Babel GUI software.[27] Processing of 3CL protease (3CLpro, PDB entry 6M2N) was performed with Auto Dock Tools (ADT). Water molecules and original ligands were removed using the edit module. Gasteiger charges and polar hydrogen atoms were assigned to the 3D structure file, and the genetic algorithm (GA) run was selected as 50 to obtain the conformations. The center coordinate was obtained from the original ligand, and a docking grid with a size of 60 × 60 × 60 was used. Compound 3 was then docked into the active site of 3CLpro to obtain a complex structure. The structure was subjected to MD simulation using GROMACS 5.0. The antechamber tool was used to generate the restricted electrostatic point charge (RESP) of 3 after calculation by Gaussian 09 at the HF/6-31G(d,p) level. A force field of 3 was taken from the general AMBER force field. ff99SB was applied to 3CLpro, and the TIP3P water model was added with a boundary 2.0 Å away from 3CLpro atoms. The whole system was neutralized by the counterions, and then the steepest descent algorithm was used to minimize within a force tolerance of 10 kJ/mol. Consequently, the systems with position restraints were equilibrated to 310 K on the 3CLpro and ligand atom. The pressure and temperature were controlled by a Parrinello–Rahman barostat (2 bar) with τp = 2.0 ps and a compressibility of 4.5 × 10–5 bar and the V-rescale method with τT = 0.1 ps, respectively. Long-range electrostatic interactions were involved within a 1.2 nm cutoff using the particle-mesh Ewald (PME) algorithm. Constraining the bond lengths and angles of hydrogen bonds was applied by the lincs algorithm. The initial 1.0 ns MD simulation of NPT was position restrained during the equilibration process. Canonical 100 ns MD simulation runs were performed for protein–ligand complexes. The time step and the saved coordinates were set to 2.0 fs and 1.0 ps. 3CLpro–ligand interaction files were taken from the PLIP online tool (https://plip-tool.biotec.tu-dresden.de/plip-web/plip/index) and visualized using PyMOL.

3CLpro Enzyme Activity Inhibition Test

The 3CLpro inhibitory activity of the compounds was determined by a FRET protease assay. MCA-AVLQSGFR-Lys(Dnp)-Lys-NH2 solution was used as the fluorogenic substrate (Genscript). According to previous reports, a FRET-based protease assay was carried out.[4,28,29] Briefly, 20 μL of a series of test compound solutions in 50 mM Tris and 1 mM EDTA buffer at pH = 6.74 were mixed with 5 μL of 1.2 μM 3CLpro and incubated for 10 min. Then, 20 μL of 20 μM peptide substrate was added to the solution. The relative fluorescence units (RFU) at 37 °C for 10 min were recorded with 320 nm excitation and 405 nm emission wavelengths by using SpectraMax iD3 (Molecular Devices, USA). The percent inhibition ratio of 3 and 4 and baicalein was calculated using the equation as follows: inhibition activity (%) = (F3CLpro – FComps)/(F3CLpro – FSub) × 100%, where F3CLpro is the fluorescence value of the 3CLpro control (3CLpro, buffer, and substrate), FComps is the fluorescence value of the tested compounds (3CLpro, compound solution, and substrate), and FSub is the fluorescence value of the substrate control. Each experiment was performed in triplicate.

Pseudovirus Infection Inhibition Assay

The pseudoviruses of SARS-CoV-2 were built using a GFP-encoding replication-deficient vesicular stomatitis virus (VSV) vector backbone (VSV-delG-GFP) and prepared as previously described.[30] Briefly, pseudovirus (50 μL) and serially diluted test compounds (100 to 0.39 μM) were added to the preplated Vero cells. Vero cells infected with VSV-delG/VSV were used as controls, and then the plates were imaged 15 h p.i. The imaging analysis of fluorescent cells and the transducing unit (TU) numbers were conducted using a CQ1 confocal image cytometer (Yokogawa, Japan). Each experiment was performed with six replicates.

Evaluation of Cytotoxicity and Antiviral Activities of Compounds 3 and 4

The cytotoxicity of 3 and 4 to Vero E6 cells and Calu-3 cells was tested by a CCK-8 assay kit (Beyotime, People’s Republic of China). To estimate the antiviral efficiency of isolated compounds, Vero E6 cells and Calu-3 cells at a density of 5 × 104 cells per well were seeded in 48-well cell culture plates and cultured approximately 12 h before infection. The cells were treated with multiple doses of 3 and 4 for 1 h, and a virus MOI of 0.01 for Vero E6 cells (0.05 for Calu-3 cells) was added gradually and infected for 2 h. Then, the virus and compound mixture were removed, and Vero E6 and Calu-3 cells were cultured with new compound-containing medium. Twenty-four hours p.i., the cell supernatant was collected and lysed in lysis buffer for further qRT-PCR quantification analysis to obtain EC50 values. The experiment was completed at biosafety level-3 (BSL-3).

SPR Assay of Compounds

The binding affinity of compounds and 3CLpro was analyzed using a Biacore S200 system (GE Healthcare). Briefly, 100 μg of 3CLpro was immobilized on a CM5 sensor chip using a pH = 4.0 acetic acid solution. Compounds 3 and 4 and baicalein at different concentrations (50, 40, 30, 15, 7.5, 3.75, and 1.875 μM) were injected for 120 s at a flow rate of 40 μL/min. A cleaning solution of PBS with 50% DMSO was used to regenerate the CM5 chip. All response values were collected at 25 °C. Biacore Evaluation Software 2.0 was used to generate Kd values.
  1 in total

Review 1.  Bioactive natural products in COVID-19 therapy.

Authors:  Zhonglei Wang; Ning Wang; Liyan Yang; Xian-Qing Song
Journal:  Front Pharmacol       Date:  2022-08-19       Impact factor: 5.988

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.