Literature DB >> 35531003

Molecular docking, pharmacophore based virtual screening and molecular dynamics studies towards the identification of potential leads for the management of H. pylori.

Manoj G Damale1, Rajesh B Patil2, Siddique Akber Ansari3, Hamad M Alkahtani3, Abdulrahman A Almehizia3, Devanand B Shinde4, Rohidas Arote5, Jaiprakash Sangshetti6.   

Abstract

The enzyme pantothenate synthetase panC is one of the potential new antimicrobial drug targets, but it is poorly characterized in H. pylori. H. pylori infection can cause gastric cancer and the management of H. pylori infection is crucial in various gastric ulcers and gastric cancer. The current study describes the use of innovative drug discovery and design approaches like comparative metabolic pathway analysis (Metacyc), exploration of database of essential genes (DEG), homology modelling, pharmacophore based virtual screening, ADMET studies and molecular dynamics simulations in identifying potential lead compounds for the H. pylori specific panC. The top ranked virtual hits STOCK1N-60270, STOCK1N-63040, STOCK1N-44424 and STOCK1N-63231 can act as templates for synthesis of new H. pylori inhibitors and they hold a promise in the management of gastric cancers caused by H. pylori. This journal is © The Royal Society of Chemistry.

Entities:  

Year:  2019        PMID: 35531003      PMCID: PMC9070323          DOI: 10.1039/c9ra03281a

Source DB:  PubMed          Journal:  RSC Adv        ISSN: 2046-2069            Impact factor:   4.036


Introduction

The Helicobacter pylori infection in the patients with chronic gastritis and peptic ulcer can become the primary cause of gastric cancer.[1-5] Gastric cancer is the fourth common malignancy worldwide causing over 700 000 deaths per year. H. pylori is a microaerophilic, spiral-shaped Gram-negative bacterium which colonizes in the human stomach eventually causing duodenal and gastric ulcers. Broad spectrum antibacterials and antibiotics such as metronidazole, clarithromycin, levofloxacin, amoxicillin, tetracycline, furazolidone, and rifabutin are used in the treatment and management of H. pylori infection. Unfortunately, the efficacy of these antibiotics against H. pylori has weakened due to a strong resistance developed by H. pylori organism.[6,7] Furthermore, many factors such as the strain of H. pylori, the host genetic factor like polymorphism in the interleukin-1, gender, and individual's habits like smoking and their diet may aggravate the H. pylori infection. It is established that the colonization of the H. pylori with the nitro sating bacteria in the achlorhydric stomach becomes the primary cause of gastric cancer.[8] Therefore, eradication of the H. pylori infection and proper management and treatment of the duodenal and gastric ulcers is essential in the prevention of ensuing gastric cancer. Emergence of strong resistance is the main concern with most of the currently used broad spectrum antibacterials and antibiotics in H. pylori infection. Hence, design and development of the newer potential drug candidates effective against the newer targets specific for H. pylori may be advantageous. The enzyme pantothenate synthetase, encoded by the panC gene, catalyzes the biosynthesis of pantothenate (vitamin B5) from an adenosine triphosphate (ATP)-dependent condensation of the d-pantoate and the β-alanine in bacteria.[9] The pantothenate is a key precursor of the coenzyme A and the acyl carrier protein. Many intracellular processes such as fatty acid metabolism, cell signaling, synthesis of the polypeptides and the non-ribosomal peptides are regulated by the coenzyme A and the acyl carrier protein. Interestingly, mammals lack the pantothenate synthetase and its biosynthetic pathway and derive the pantothenate from their diet.[10] Thus, it is advantageous to target bacterium specific pantothenate synthetase in the development of potential inhibitors. Many experimental crystal structures of pantothenate synthetase are available which could be exploited in design and development of its potential inhibitors. But, most of these crystal structures and inhibitors are reported for the Mycobacterium tuberculosis. To best of our knowledge the H. pylori specific panC inhibitors has not been reported so far and may be due to the unavailability of the experimental crystal structure of the H. pylori specific pantothenate synthetase. The complete genome sequencing has identified the key proteins of H. pylori.[11-13] The European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI)[14] has assigned the accession number to gene of H. pylori panC as EBI-7515141 and the details of the interactions and protein features are available in UniProtKB database with ID P56061.[15] In absence of the experimentally solved crystal structure, homology modeling is the most reliable method to construct the theoretical models of proteins under study.[16] The homology modeling derived validated theoretical models of proteins can be used in molecular docking studies, which is the most popular technique to understand the binding mode and the affinities of ligands at the binding site of such proteins.[17] In fact, such molecular modeling approaches has lead to identification of many potential anti-cancer agents.[18-22] Furthermore, the best binding modes of the ligands with high affinity and the structural traits of such ligands can be used to build the pharmacophore.[23,24] The features in such pharmacophore are usually exploited in the virtual screening which is an efficient and alternative approach to the High Throughput Screening to derive the promising hits.[25-27] The deeper insights into the binding modes and energetic of such hits at the binding site of the modeled protein in simulated biological environment can be achieved through molecular dynamics simulations. The effects and risk of the promising hits can be accessed through the prediction of absorption, distribution, metabolism, elimination and toxicity (ADMET) parameters which aid in the design of drug like compounds.[28,29] The fact that the inhibitors of the H. pylori specific pantothenate synthetase could be the best way to tackle the problem of poor prognosis of the H. pylori infection and the associated risk of the gastric cancer provoked us to investigate the computational aspects of this issue and provide the possible candidate molecules as inhibitors. Thus, the present work describes the homology modeling of the H. pylori specific pantothenate synthetase, the molecular docking studies with the known inhibitors, the pharmacopore development from the best docked poses of the top rank compounds, pharmacophore based virtual screening, the ADMET studies and the detail investigation of the possible mechanism of inhibition through the molecular dynamics simulations of the best hit molecules.

Materials and methods

Homology modeling

The H. pylori specific metabolic pathways were identified from the MetaCyc Metabolic Pathway Database (https://metacyc.org/).[30-32] The key enzymes in these pathways were further indentified with the help of choke point finder tool available in this database. The database of essential genes (DEG) (http://www.essentialgene.org)[33] was also explored to identify the essential genes and the corresponding metabolic enzymes in the Helicobacter pylori. The sequence of the key metabolic enzymes thus identified was retrieved from the Uniprot database (https://www.uniprot.org/).[15,34] This protein sequence was subjected to the BLASTP search in order to identify the matching protein.[35] These preliminary investigations lead to the identification of pantothenate synthetase, the unique protein specific for the H. pylori. The primary sequence of the panC specific for the H. pylori (Uniprot accession number P56061) was subjected to the similarity search in the Molecular Operating Environment (MOE).[36] The similarity search was executed by transferring the sequence in the sequence query and with the search parameters set to the values for gap start −12, gap extend −2, E-value cutoff 10, E-value accept 0.5, Z iterations 100 and Z cutoff 6. Subsequently, the MOE homology model tool linked with Generalized Born/Volume Integral (GB/VI) scoring function and Amber99 force field was used to generate the homology models.[37] MOE's site finder promodel module was employed to identify and validate the binding site. In this module the atoms on the protein's surface were identified, and the centers of spheres defined by combinations of four such points are marked. The clusters of such spheres defined the potential binding sites on the protein's surface, or voids within it. The likelihood of a hydrophilic contact was marked and this method identified 10 potential binding sites amongst which the first was used in the docking studies. Further validation of these models based on various computational approaches was carried out and the most suitable model was used in docking studies.

Molecular docking

Molecular docking studies were carried out on MOE molecular modeling suite. The validated homology model structure was subjected to the structure preparation wizard to curate the inaccuracies in the protein structure using the LigX standard protocol. The promodel module was used for the active site prediction. One hundred fifteen already known M. tuberculosis PanC inhibitors[38-42] were used for the docking studies (Table 1).

M. tuberculosis panC inhibitors from the literature

Compound no.ScaffoldR1R2R3X
1A
2A
3A
4A
5A
6A
7A
8A
9A
10A–NO2
11A–H–H
12A–NO2
13A–NO2
14A–NO2
15B–H(CH3)3CNH– C
16BCH3(CH3)3CNH– C
17B(CH3)3CNH– N
18B–H C
19BCH3 C
20B N
21B–H(CH3)3CNH– C
22BCH3(CH3)3CNH– C
23B(CH3)3CNH– N
24B–H C
25BCH3 C
26B N
27B–H(CH3)3CNH– C
28BCH3(CH3)3CNH– C
29B(CH3)3CNH– N
30B–H C
31BCH3 C
32B N
33B–H(CH3)3CNH–HC
34BCH3(CH3)3CNH–HC
35B(CH3)3CNH–HN
36B–H HC
37BCH3 HC
38B HN
39B–H(CH3)3CNH– C
40B–H(CH3)3CNH– C
41B(CH3)3CNH– N
42B–H C
43BCH3 C
44B N
45C–H
46C–H
47C–H
48C–H
49C–H
50C–H
51C–H
52C–H
53C–H
54C–H
55C–H
56C–CH3
57C–CH3
58C–CH3
59C–CH3
60C–CH3
61C–CH3
62C–CH3
63C–CH3
64C–CH3
65C–CH3
66C–CH3
67D–H
68D–H
69D–H
70D–H
71D–H
72D–CH3
73D–CH3
74D–CH3
75D–CH3
76D–CH3
77D–Cl
78D–Cl
79D–Cl
80D–Cl
81D–Cl
82D–Br
83D–Br
84D–Br
85D–Br
86D–Br
87D–C2H5
88D–C2H5
89D–C2H5
90D–C2H5
91D–C2H5
92D–C3H7
93D–C3H7
94D–C3H7
95D–C3H7
96D–C3H7
97E
98E
99E
100E
101E
102E
103E
104E
105E
106E
107E
108E
109E
110E
111E
112E
113E
114E
115E
The 2D structures of these inhibitors were drawn in the LigX module and converted into the 3D conformers and subsequently subjected to the energy minimization step using MMFF force field. The docked structures were ranked as per their binding affinity score called as the S-score. Re-docking experiment to judge and ensure the docking accuracy was also carried out using already optimized inhibitor structures and panC structure. The root mean square deviations and deviations in docking score were accounted in deciding the accuracy of docking protocol.

Pharmacophore modeling and virtual screening

The top ranked docked conformers of the known PanC inhibitors were used for the construction of the pharmacophore model.[43] The phase module of MOE was used to derive the pharmacophoric features on the basis of the structural and energy features of the protein and the inhibitors. The 3D pharmacophoric model thus generated having various pharmacophoric features was used for the virtual screening through virtual screening module of the MOE. The InterBioScreen natural database (https://www.ibscreen.com/natural-compounds)[44,45] consisting of 60 000 natural compounds was screened against this pharmacophore.

ADMET analysis

The potential hits obtained in the virtual screening were subjected to the ADMET screening for the predication of drug-like properties with the FAF-Drug2 tool for the ADMET prediction.[46] Further, the data of the ADMET studies were filtered and validated for the Lipinski's rule of 5 and the Veber rule.[47,48] In the ADMET studies the values of TPSA (<100), log P (<5) and molecular weight (<500) indicates good oral bioavailability, whereas number of rotatable (<25), rigid bonds (<10) and number hydrogen bond acceptors (<10)/donors (<5) indicates good intestinal availability.

Molecular dynamics (MD) simulation studies

The top-ranked hits with desired ADMET features were subjected to the MD simulations using Gromacs 5.1.2.[49,50] The remote server of the Bioinformatics Resources and Applications Facility (BRAF), C-DAC, Pune was used to perform the production phase MD simulations. The topology of the protein was generated using the CHARMM36 all-atom force field[51] while the ligand topology was generated with the CHARMM General Force Field (CGenFF) from the CGenFF server available at https://cgenff.paramchem.org. After solvating the system with the simple point charge solvent-216 model in a dodecahedral unit cell the system was neutralized with the addition of appropriate counter ions such as sodium and chloride. The system was subsequently energy minimized and equilibrated with respect to the constant volume and the pressure position restraint dynamics at a constant temperature of 300 K for 100 picoseconds. The system was subsequently subjected to the energy minimization in order to remove the steric clashes. Subsequently, the position restraint dynamics under NVT and NPT (constant volume and pressure) conditions at 300 K for 100 ps was carried out. During the 10 nanosecond production phase MD, the covalent bonds were restrained with the LINCS algorithm[52] and the long range electrostatics such as coulombic and Lennard Jones interaction energies were controlled with the cutoff value of 12 A° with the Particle Mesh Ewald method (PME).[53] The results of the MD were analyzed with the help of different modules in the GROMACS package.

Results and discussion

The results of Metacyc analysis showed that total 144 unique metabolic pathways are present in the H. pylori with 570 known enzymes. Amongst these enzymes, 172 enzymes are unique to the H. pylori. Further, 24 enzymes amongst these unique enzymes serve as essential enzymes executing the respective metabolic pathway and can serve as promising targets in the drug design. The analysis of these enzymes suggested that the pantothenate synthetase (PanC, Pantoate-β-alanine ligase) is unique to H. pylori, but not expressed in Homo sapiens. The identification of key genes responsible for the survival of the H. pylori through the database of essential genes also revealed that PanC is one of the essential gene. The primary sequence of the panC with 276 amino acids was retrieved from the UniPort database. The results of the preliminary BLAST analysis suggest that best model has the sequence identity of 43.87% with the template of the X-ray crystallographic structure of the panC from Thermotoga maritima (PDB ID: 2EJC). The atomic resolution being the key parameter for the selection of the template structure, this template structure with atomic resolution 2.4 Å was found appropriate. Further, the full genomes of H. pylori and T. maritima were retrieved (https://www.ncbi.nlm.nih.gov/genome) and subjected to species variation studies. It was found that there is 25.4% species similarity suggesting the suitability of species for homology modeling. The differences in the residue composition of the binding site of panC from Thermotoga maritima and homology model of H. pylori were also investigated. The binding site of panC from T. maritime constitutes the residues Pro28, Thr29, Met30, Gly31, Tyr32, Leu33, His34, His37, Leu40, Gln61, Glu66, Tyr71, Leu116, Arg121, His124, Phe125, Tyr145, Phe146, Gly147, Lys149, Asp150, Ala151, Gln152, Gln153, Phe154, Leu157, Ser185, Ser186, Arg187, Tyr241, Arg272 and Ile274. All these residues are also part of the binding site of homology model of H. pylori except few differences such as absence of Tyr32, Phe154 and Ile274 and presence of Ala29, Leu260 and Leu271. Sequence alignment of template and homology model has performed in MOE Sequence Editor in order to check the similarity and identity as shown in Fig. 1.
Fig. 1

Sequence alignment of model and template.

The homology models generated were subjected to the structure validation tools like Procheck, ProSA and SPDBV.[54-56] In order to validate the homology model the Cα deviation and all atom fit were calculated in SPDBV tool. The Cα deviation and all atom fit values 0.42 and 0.1 Å respectively suggest that the model structure is acceptable for further studies. The structural alignment of the homology model and template structure is shown in Fig. 2.
Fig. 2

Structural alignment of homology model and template structure (A) Cα backbone alignment; (B) all atom alignment.

The final model was chosen in which the 99.2% of the residues were in the allowed region as shown in the Ramachandran plot (Fig. 3).
Fig. 3

Structure validation parameter Ramachandran plot and ProSa Z score of model structure.

ProSA-web server provides the estimate of errors in experimental and theoretical models of the proteins. It uses the atomic co-ordinates of each residue of the protein and provides the Z score and residue score. The Z score is measure of overall quality of model protein and determined on the basis of Z scores of all experimentally determined protein chains in model protein. The Z score of final model was −7.84 suggesting the overall good model quality. The results of binding site analysis through MOE's site finder module showed that the residues Pro25, Glu63, Gln58, Tyr68, Leu270, Leu271, Phe59, Gln153, His124, Phe125, Asp150, Leu271, Leu270, Ser185 and Lys149 are the key residues in the binding pocket which contribute to the various types of interactions with ligand (Fig. 4). The binding site of the final model correlated well with the binding site of the template structure and most of these residues are present at the binding site of both the structures.
Fig. 4

PanC binding site residues predicted by site finder module of MOE.

The identification of the key contributing residues in the binding pocket of the panC model was carried out by the measurement of the contribution of interaction energy of each residue with the ligand through the Promodel module of MOE-2013. During the docking studies all the known inhibitors (total 115 such inhibitors) of M. tuberculosis were docked into the binding site of the modeled H. pylori panC. The M. tuberculosis specific panC inhibitors were chosen because of diversity in the scaffolds of these inhibitors. Further, these inhibitors were reported to bind at the catalytic site of M. tuberculosis panC where the pantoate and β-alanine substrates are processed to the patothenate. A similar biochemical mechanism is expected at the binding site of H. pylori panC and for computational design such inhibitors can be a good starting point. The docking results showed that the residues Pro25, Glu63, Gln58, Tyr68, Leu270, Leu271, Phe59, Gln153, His124, Phe125, Asp150, Leu271, Leu270, Ser185 and Lys349 are the key residues in the binding pocket which contribute to the various types of interactions with ligand. The analysis of docking interactions showed that the core scaffold of all the known inhibitors plays an important role in the key interactions. The details of the structures of all the known inhibitors with the docking scores, the interacting residues and the type of key interactions are provided in the Table 2.

Docking and re-docking results

Comp. no.Docking score (kcal mol−1)Redocking score (kcal mol−1)Interacting residuesKinds of interactions
H-BondvdWPi
1−4.23−4.6GLN58, GLN153, ARG187301
2−4.07−4.34MET27, GLU63, TYR68, ASN62, GLU117312
3−4.1−4.29HIS34, LEU116, ASP150, SER186, LEU271214
4−4.81−4.9MET27, TYR68, ASP150, LEU271, ARG121, LEU117, TYR238312
5−4.25−4.33MET27, GLN58, GLN153, ARG121312
6−5.1−5.19MET27, HIS24, GLU64, ASN62, ARG121, GLN58, GLN153, GLU117413
7−5.12−5.39HIS34, HIS124, ARG121, ASP150, LYS149215
8−4.57−4.63MET27, GLU63, TYR68, ASN62, ARG121313
9−5.03−5.22HIS31, HIS34, LYS149, SER186, ARG269, ARG187, SER185633
10−6.08−6.24PRO25, MET27, HIS34, LEU116, ARG269, ARG121, ASP150, GLN153621
11−4.71−4.87AP150, LEU271, LEU260, ARG269312
12−4.7−4.71MET27, LEU116, ARG121, ARG269, GLN153, HIS124511
13−4.91−5.01MET27, HIS34, SER185, SER186, ARG187, ARG269412
14−4.54−4.57GLY28, HIS34, LYS149, LEU271, ARG269411
15−4.8995−3.6702HIS124, ASP150, ARG269, LYS149, LEU271232
16−4.2209−4.9713HIS124, ASP150, LEU271212
17−5.8087−5.756MET27, GLU63, HIS124, ARG187113
18−5.2863−5.39HIS124, ASP150, ARG187, ARG269, LEU271214
19−5.1268−5.6967MET27, PHE59, LYS149, ARG187, HIS124, ARG187114
20−4.0056−4.1323MET27, PHE59, TYR68, GLU63, ARG269, GLN159, GLN58314
21−4.2869−4.3971HIS124, ASP150, LEU271211
22−4.4866−4.7215MET27, PHE59, TYR68, HIS124, LEU271, ASP150214
23−4.6603−4.6689PHE59, HIS124, ASP150, LEU271212
24−5.8087−5.9781PRO25, MET27, LEU37, HIS34, PHE146, GLY147, VAL132216
25−4.8805−5.0012MET27, HIS34, PHE125, HIS124, ARG187223
26−4.27−4.6732GLU63, ARG269, LEU271, LEU260212
27−3.9108−4.2996TYR68, ARG187210
28−4.3609−4.9383GLU63, ARG121, HIS124114
29−4.4464−4.4861GLN58, HIS124, LEU271, ASP150212
30−4.42414.6267GLN58, ASP150, LYS149, ARG269, LEU271312
31−5.2496−5.342MET27, TYR68, ARG187, HIS124, ARG121, GLN152, LEU116322
32−5.11−5.1287MET27, GLY28, HIS31, ALA29, HIS34, ARG187, SER186, TYR68415
33−3.6937−3.8271ASP150, LEU271111
34−3.935−4.135HIS124, LEU271, ASP150212
35−3.747−3.8005MET27, HIS34, ASP150, GLN153311
36−4.954−4.991ARG121, ASP150, LEU271213
37−4.6733−4.7029ASP150, ARG269, LEU271113
38−4.5544−4.6281ARG121, ASP150, ALA151, LEU160, LEU271413
39−4.2869−4.3187GLU63, ARG121, HIS124312
40−3.9108−4.0013GLU63, ARG269, LEU260112
41−4.4464−4.4556GLN58, HIS124, LEU271, ASP150213
42−3.935−4.1032HIS124, LEU271, ASP150111
43−4.954−5.0031MET27, HIS34, PHE125, HIS124, ARG187212
44−4.6733−4.7631MET27, HIS34, HIS124, ARG187211
45−3.57−3.66TYR68, ASP150, LEU271, SER186, ARG187311
46−4.78−4.81HIS34, HIS124, PHE125, ALA151, LEU271, LEU260, ASP150, ARG269, LEU271415
47−4.08−4.12GLU63, ALA151, GLN152, GLN153, ARG121, LYS149, HIS124412
48−4.84−4.93LEU116, ARG121, ALA151, LYS149, ALA183, LEU270, ARG269515
49−4.16−4.14PRO69, ALA29, TYR68, ASP150, SER186, TYR190, LEU271, ARG187213
50−4.38−4.14ALA29, MET27, GLN58, HIS124, ASP150, ARG187, LYS149, SER185, SER186424
51−4.39−4.41TYR68, ASP150, LEU271, SER186, ARG187, HIS124, GLN153, LYS149515
52−4.25−4.31MET27, TYR68, LYS149, VAL176, SER185, SER186414
53−5.84−5.91PRO25, TYR26, ALA29, HIS34, GLN153, GLY147, ASP150, SER186313
54−5.79−5.789ARG121, LYS149, GLN152, ASN183, LEU270, LEU271, ARG269314
55−5.93−6.01HIS34, LYS149, SER185, ARG187, ARG269, LEU271303
56−5.68−5.79MET27, PHE59, TYR68, ARG187, ASP150, LEU271314
57−5.77−5.87PRO25, MET27, HIS31, HIS34, LEU37, SER187, VAL128, VAL132, SER186, GLY147324
58−4.63−4.76MET27, GLU63, TYR68, GLN153, ARG121313
59−5.1−5.12PRO25, MET27, HIS34, MET135, ILE131, ARG187, ARG269224
60−4.05−4.21HIS34, TYR68, PHE271112
61−5.33−5.39MET27, GLN58, HIS124, VAL128, TYR145, GLY147, ASP150, GLN153, LEU271413
62−5.15−5.21PRO25, MET27, THR26, ILE131, MET135, LYS149, GLN153422
63−7.7−7.79GLN58, GLU63, GLN153, ARG121, TYR68511
64−5.02−5.19TYR68, ASP150, GLN152, ARG269, SER186322
65−3.96−4.18HIS34, ARG187, ASP150, GLN152, LEU271114
66−4.13−4.19PRO25, MET27, HIS31, SER187, ILE131, ARG187, MET135123
67−2.8687−2.8729PHE59, TYR58, HIS124, ASP150, ARG187, LEU271413
68−3.3455−3.4411LEU30, HIS31, HIS34, LYS149, SER185313
69−4.7611−4.812LEU30, HIS31, HIS34, LYS149, SER185313
70−4.1494−4.4271LEU30, HIS34, LYS149, SER185, SER186, ARG187414
71−3.6942−3.7316GLU63, PHE125, ARG121, HIS124, ARG269213
72−4.2448−4.3993MET27, HIS31, HIS34, LYS149, SER185, ARG187314
73−3.5961−3.6821ASN62, GLU63, HIS124, PHE125, ARG187, ARG269313
74−5.7105−5.7389GLY60, ALA61, ASN62, GLU63, LYS149, ARG187, ARG269, ASN271, ARG121613
75−4.1277−4.1791MET27, HIS31, HIS34, LYS149, SER185, ARG187, LEU271214
76−3.9968−4.156MET27, HIS34, ASP150, LEU271113
77−5.0105−5.1995ASN62, GLU63, ARG121, ARG187, ARG269313
78−6.1995−6.2677ASN62, GLU63, ARG121, LYS149, SER185, ARG187, ARG269813
79−3.3602−3.3871LYS149, ALA183, SER185, ARG187, ARG269, LEU271, ILE202, LEU182423
80−4.2577−4.3289PRO25, MET27, GLY28, HIS31, GLY33, HIS34, LYS149, LEU184412
81−4.3562−4.3891PRO25, HIS31, GLY33, HIS34, LYS149, LEU184226
82−4.8416−4.9562GLN58, GLU63, TYR68, HIS124, ARG269, ARG187312
83−4.8644−4.9493GLU63, GLN58, ASP64, TYR68, HIS124, ARG269, ARG187314
84−4.0638−4.1452MET27, PHE59, TYR68, ARG187212
85−5.323−5.3671GLY28, ALA29, HIS34, SER185, LYS149, ARG269, ARG187, SER186, LEU184513
86−5.1447−5.1591HIS34, PHE59, TYR68, ARG187, LEU184423
87−3.2977−3.3592LYS149, SER185, ARG187, ARG269, LEU270, LEU271313
88−2.9499−2.8429MET27, GLU63, HIS124, VAL128, ARG187, ARG169213
89−4.4058−4.4851PHE59, PRO25, HIS31, GLY33, HIS34, LYS149, LEU184226
90−4.4837−4.5201MET34, LYS149, ALA183, SER185, ARG187, ARG269, LEU271, ILE202, LEU182423
91−2.011−2.293PHE59, TYR58, HIS124, ASP150, ARG187112
92−3.4441−3.4896LYS149, ALA183, SER185, ARG187, ARG269, LEU271, ILE202, LEU182424
93−2.9499−2.9618MET27, GLU63, HIS124, VAL128, ARG187, ARG169213
94−3.1448−3.1844MET27, HIS34, GLU63, HIS124, VAL128, ARG187, ARG169214
95−4.4864−4.5891PRO25, HIS31, HIS34, GLY33, HIS34, LYS149, LEU184, SER185326
96−2.9824−3.3168MET27, GLU63, HIS124, VAL128, ARG187, ARG169214
97−3.8659−3.962GLU63, ARG121, SER185, LEU271, HIS124412
98−4.5719−4.782PRO25, HIS34, LYS149, SER185, ARG187, ARG269, TYR238413
99−4.5506−4.62HIS34, LYS149, SER185, ARG187, ARG269, TYR238313
100−4.0345−4.491GLU63, ARG121, SER185, LEU271, HIS124313
101−4.8854−4.887PRO25, HIS31, HIS34, LYS149, SER185, ARG187, ARG269, TYR238413
102−3.7926−3.8416HIS34, ARG121, SER185, LEU271, HIS124411
103−5.305−5.4591PRO25, HIS31, HIS34, LYS149, SER185, ARG187, ARG269, TYR238, LEU271514
104−5.1502−5.3819PRO25, HIS34, LYS149, SER185, ARG187, ARG269, TYR238, LEU271414
105−5.831−5.8702PRO25, HIS31, HIS34, LYS149, SER185, SER186, ARG187, ARG269, TYR238514
106−6.0256−6.0482PRO25, HIS31, HIS34, LYS149, SER185, SER186, ARG187, ARG269, TYR238, LEU27514
107−4.8235−4.8491MET27, HIS31, HIS34, LYS149, SER185, ARG187, ARG269, TYR238514
108−6.0962−6.173PRO25, MET27, HIS31, HIS34, LYS149, SER185, SER186, ARG187, ARG269, TYR238614
109−4.0069−4.1639HIS34, ARG121, SER185, LEU271, HIS124313
110−4.9985−5.0173MET27, HIS34, LYS149, SER185, ARG187, ARG269, TYR238414
111−4.9124−5.0792GLU63, GLN58, ASP64, TYR68, HIS124, ARG269, ARG187314
112−6.2852−6.4021PRO25, MET27, HIS31, HIS34, LYS149, SER185, SER186, ARG187, ARG269, TYR238624
113−5.1362−5.1562PRO25, HIS34, LYS149, SER185, ARG269, TYR238, LEU271421
114−5.7806−5.8492PRO25, HIS34, GLY60, LYS149, SER185, ARG187, ARG269, TYR238, LEU271514
115−4.7319−4.872PRO25, HIS34, LYS149, SER185, ARG187, ARG269, TYR238413
The tetrahydropyrido thieno pyrimidin-4-one derivative 63 having the lowest binding free energy of −7.70 kcal mol−1 was found forming two hydrogen bond interactions with the nitrogen atoms of core scaffold and the residues Glu63 and Tyr68 and two hydrogen bond interactions with the oxygen atom of o-nitro substituent with the Gln58 and Gln153. This inhibitor also forms the hydrophobic π–π stacking interaction between the phenyl ring and Arg121 (Fig. 5). The lowest binding free energy for this ligand may be due to such hydrogen bond formation and hydrophobic interaction. The re-docking of the ligands produced similar interactions with slight variations in the docking scores suggesting the accuracy of docking protocol followed.
Fig. 5

The binding pose and molecular interactions of inhibitor 63 into the active site of the model structure of the PanC.

Pharmacophore-model construction and virtual screening

These pharmacophoric features provide the important insights of functionalities contributing in the molecular activity of the inhibitors. The key pharmacophoric features such as the hydrogen bond acceptor, the metal ligator, the aromatic ring, the hydrogen bond donor responsible for the inhibitory activity were identified. The pharmacophoric model was constructed on the basis of the key interactions of top ligands with the residues at the binding site of the PanC. For this purpose the set of predefined features such as hydrogen bond acceptor (HBA), aromatic ring (Aro) from such interactions were exploited. The mapping of developed 3D-pharmacophore model with the top docked hit 63 into the active site of the homology model of the PanC is shown in Fig. 6.
Fig. 6

3D-pharmacophore models of the top docked hit 63 into the active site of the homology model of the PanC.

The results of the pharmacophore design suggest that the ligand 63 has four key pharmacophoric features namely two hydrogen bond acceptor sites (F1 & F3, designated as Acc), two aromatic rings (F2 & F4 designated as Aro). The best pharmacophore model was selected on the basis of the highest correlation coefficient, root mean square deviation (rmsd) values and E-value. As per the protocol shown in Fig. 7, virtual screening of InterBioScreen natural compounds database was carried out.
Fig. 7

Schematic representation of the virtual screening protocol.

The pharmacophoric features F1–F4 were chosen as the filtering criteria during virtual screening. The virtual screening experiment gave 20 virtual hits as given in the Table 3.

Systematic representation of in silico docking data of the top ranked virtual hits

Sr. no.Hits_IDDocking score (kcal mol−1)Interacting residuesKinds of interactions
H-BondvdWPi
1STOCK1N-76518−6.38HIS34, GLU63, ARG121, HIS124, ASP150, ARG269, SER185, SER186, GLU271433
2STOCK1N-74127−5.45THR26, MET27, TYR68, HIS34, GLN58, ASP150, LYS149, SER187332
3STOCK1N-73916−5.7THR26, MET27, HIS34, GLN58, GLU63, LYS149, RG187623
4STOCK1N-71293−5.81THR26, MET27, GLY28, PRO25, ARG121, LYS149, ARG269, LEU271422
5STOCK1N-68553−6.69ASN62, GLU63, ARG121, ARG187, LEU271, ARG269511
6STOCK1N-67377−7.87HIS34, TYR68, ARG121, PHE125, HIS124, ASP150, SER185, TYR238, ARG269525
7STOCK1N-64449−6.71ARG121, HIS124, LYS149, ASP150, SER185, ARG187, ARG269333
8STOCK1N-64228−6.07PRO25, GLN58, ASP150, ARG269, GLN153321
9STOCK1N-63827−8.27PRO25, THR26, MET27, HIS34, GLN58, GLU63, LYS149, ARG187, LEU271544
10 STOCK1N-63231 −8.67 TYR68, MET27, HIS124, ALA151, GLN152, ASP150, LYS149 6 1 4
11 STOCK1N-63040 −9.51 ASN62, GLU63, ARG121, GLN152, GLN153, ARG187, LEU271, ARG269 5 3 3
12 STOCK1N-60270 −10.7 LEU30, MET27, HIS34, TYR68, ARG121, ARG268, GLN153, ASP150, PHE146, GLN152, SER186, ARG269 6 1 4
13STOCK1N-59730−8.33HIS34, TYR68, PHE125, GLU63, LYS149, ARG187, ARG269333
14STOCK1N-26126−7.87ARG121, LYS149, ASP150, SER185, SER186, ARG269, ARG187611
15STOCK1N-28765−4.65ARG121, ASP150, LYS149, SER185, SER186, ARG269611
16STOCK1N-32864−7.46PRO25, GLN58, ASP150, GLN153, ARG269421
17STOCK1N-36335−4.49PHE59, GLU63, ARG187, HIS124, ARG121, ARG269, LEU271511
18 STOCK1N-44424 −8.91 TYR68, ARG121, PHE125, ASP150, ARG187, TYR238, ARG269, LEU271 6 2 3
19STOCK1N-45307−7.14TYR68, MET27, HIS124, ALA151, GLN152, ASP150, LYS149614
20STOCK1N-45539−6.39ARG121, HIS124, LYS149, ASP150, SER185, ARG187, ARG269, LEU271411
The virtual hits thus obtained were subjected to docking studies to investigate the interactions at the binding site. The virtual hits STOCK1N-60270, STOCK1N-63040, STOCK1N-44424 and STOCK1N-63231 showed the lowest binding free energy of −10.1, −9.51, −8.91 and −8.67 kcal mol−1 respectively in the docking studies. These virtual hits were found making key hydrogen bond and hydrophobic π–π stacking interactions with residues such as Gln153, Tyr68, Glu63 and Arg121. The analysis of the docking score and the binding poses of the virtual hits it was observed that STOCK1N-60270, STOCK1N-63040, STOCK1N-44424 and STOCK1N-63231 are the best possible hits. The binding poses of these hit molecules at the binding site is shown in Fig. 8.
Fig. 8

The binding pose of top hits at binding site, (A) STOCK1IN 63040; (B) STOCK1IN 44424; (C) STOCK1IN 63231 and; (D) STOCK1IN 60270.

The hits with matching features of the best docked ligand may have the highest potential to inhibit the pantothenate synthetase of the H. pylori. The docked conformer of the tetrahydropyrido thieno pyrimidin-4-one derivative 63 having lowest binding free energy and the docked conformer of the potential virtual hit STOCK1N-60270 overlaid at the binding site is shown in Fig. 9.
Fig. 9

The docked conformers of ligand 63 (magenta) and top virtual hit STOCK1N-60270 (green) at the binding site.

In silico ADME predictions

The drug-like properties has been predicted by analyzing absorption, distribution, metabolism, elimination and toxicity risk characteristics. These characteristics are calculated and analyzed in terms of various physicochemical parameters and pharmaceutical properties using the FAFDrug2 ADMET prediction tool and the data is summarized in Table 4. The data obtained for the virtual hits with STOCK1N ID numbers 73916, 64228, 63827, 63231, 63040, 60270, 44424 and 45307 were found within the acceptable range of the ADMET criteria.

Pharmacokinetic parameters of virtual hits for good oral bioavailabilitya

Sr. no.Ligand_ID% ABSMWlog PTPSARotatableBRigidBHBDHBARingsRatio H/CToxicity
1STOCK1N-7651877.15467.514.3992.329241720.31Non toxic
2STOCK1N-7412781.52354.362.2779.657190620.37Non toxic
3STOCK1N-7391686.73425.484.4564.554300530.23Non toxic
4STOCK1N-7129376.16356.332.9495.206191720.37Non toxic
5STOCK1N-6855380.64260.292.0882.196133310.36Non toxic
6STOCK1N-6737759.45519.594.48143.638285520.36Non toxic
7STOCK1N-6444982.55266.291.8476.666102410.46Non toxic
8STOCK1N-6422878.99431.444.8187.006261630.28Non toxic
9STOCK1N-6382761.59543.572.73137.437344740.33Non toxic
10STOCK1N-6323167.47453.443.42120.398252820.38Non toxic
11STOCK1N-6304062.68507.542.24134.278293730.37Non toxic
12STOCK1N-6027049.96485.573.07171.1313145810.48Non toxic
13STOCK1N-5973067.47455.463.32120.399212820.38Non toxic
14STOCK1N-2612685.43514.703.5768.312330530.23Non toxic
15STOCK1N-2876576.94269.261.8892.933172520.54Non toxic
16STOCK1N-3286467.47389.362.66120.398162820.47Non toxic
17STOCK1N-3633561.49587.742.33137.713375830.27Non toxic
18STOCK1N-4442472.49401.453.51105.849192610.32Non toxic
19STOCK1N-4530773.31417.422.30103.464302620.41Non toxic
20STOCK1N-4553971.14473.525.46109.756312630.25Non toxic

ABS – absorption; TPSA – topological polar surface area; n-ROTB – number of rotatable bonds; MV – molecular volume; MW – molecular weight; log P logarithm of partition coefficient; nON – number of hydrogen bond acceptors; n-OHNH – number of hydrogen bonds donors.

ABS – absorption; TPSA – topological polar surface area; n-ROTB – number of rotatable bonds; MV – molecular volume; MW – molecular weight; log P logarithm of partition coefficient; nON – number of hydrogen bond acceptors; n-OHNH – number of hydrogen bonds donors. The value of the topological polar surface area (TPSA) and the log P of the hits indicate that they have very good oral bioavailability. The parameters like the number of rotatable bonds and the number of rigid bonds are linked with the intestinal absorption and showed that all the hits may have good intestinal absorption. In silico assessment of the hits also showed that they have very good pharmacokinetic properties based on their physicochemical values. The structures of the best virtual hits with their docking scores are given in the Fig. 10. The structures of other virtual hits are provided in ESI S1.†
Fig. 10

Structures of the top virtual hits.

Molecular dynamics simulation

The molecular dynamics simulations offer a more precise estimate of the interactions, energetic and the conformational changes of the ligands at the binding site of the protein and such studies actually simulate the biological environment which is more advantageous than the in vacuo conditions. In order to gain deeper insights into the exact mode of the binding of the best virtual hits and to evaluate the energetic of the ligand enzyme interactions more accurately the MD simulation studies were undertaken. The docked complexes of the panC with the potential hits (STOCK1N-60270, STOCK1N-63040, STOCK1N-44424 and STOCK1N-63231) with the top ranked conformers were used in the 10 ns MD simulations. The results of the MD simulation are evaluated through the measurement of root mean square deviation (RMSD), root mean square fluctuation (RMSF), number of H-bonds formed between the ligand and the key residues and the energetic of each complex. In case of the RMSD measurement the deviations from the original starting positions of the backbone atoms or the ligand atoms are measured during entire MD simulation. Lower deviations are indicative of the better stability. The average RMSD values for the complexes of STOCK1N IDs 60270, 63040, 44424 and 63231 were 0.265 ± 0.002, 0.289 ± 0.002, 0.228 ± 0.003 and 0.1236 ± 0.006 respectively. This suggests better stability of the STOCK1IN-63231 than the other hits. The average RMSD values for the ligands in these complexes 60270, 63040, 44424 and 63231 were 0.165 ± 0.002, 0.189 ± 0.003, 0.178 ± 0.001 and 0.108 ± 0.004 respectively, which suggests the stability of the hit 63231 better than the other hits. The results of the MD studies with RMSD are provided in Fig. 11.
Fig. 11

RMSD for panC (A) the backbone atoms and (B) the atoms of virtual hits.

The RMSF is a measure of elasticity of the protein under the investigation in terms of the fluctuations in the protein backbone during the MD simulation. The results of the RMSF evaluations are provided in Fig. 12.
Fig. 12

RMS fluctuations in the backbone atoms during the MD simulations.

The RMSF evaluations suggest that the backbone fluctuations are minimal in case of the complex of 63231 suggesting its better stability than the other complexes. The minimal fluctuations here may be attributed to the favorable interactions at the active site. The non bonded interactions such as hydrogen bonds between the ligand and the key residues at the binding site of the protein contribute to the binding affinity and activity of the ligand. The more number of hydrogen bonds between hydrogen bond donor–acceptor atoms of the ligand with such atoms of residues in binding site suggests the more binding affinity. In MD simulation studies the maximum number of hydrogen bonds formed between the ligands 60270, 63040, 44424 and 63231 and residues at binding site are 11, 15, 10 and 3 respectively. The average number of hydrogen bonds formed during entire simulation for the ligands 60270, 63040, 44424 and 63231 are 5.1, 2.1, 1.6 and 1.8 respectively suggesting the better binding affinity of the complex 60270. The results of hydrogen bond analysis are provided in Fig. 13.
Fig. 13

Hydrogen bonds formed with the chosen virtual hits during entire MD simulation (A) 44424; (B) 60270; (C) 63040; and (D) 63231.

The docking studies also support the evaluation of hydrogen bonds. Other non bonded interaction energies such as the short-range coulombic and the Lennard-Jones and the total interaction energies were also computed to understand the strength of the interaction between compounds under study and the panC. The average short-range coulombic and Lennard-Jones and total interaction energies are given in Table 5. The results suggest that only in compound 44424 the short range coulombic interactions occur.

Energy evaluation for hit molecules

Compound STOCK1IN IDShort-range coulombic interaction energy (kJ mol−1)Short-range Lennard-Jones interaction energy (kJ mol−1)Total short-range interaction energy (kJ mol−1)
602700 −149.88 ± 4 −4.794482 × 105
630400−229.108 ± 6.7−4.798895 × 105
44424−87.6557 ± 6.7−174.416 ± 5.6−4.719041 × 105
63231−80.0583 ± 1.9−177.261 ± 4−3.89751 × 105
This interaction may be due to the presence of the ionizable carboxylate groups extended through the pentyl chain. The Lennard-Jones (LJ) interaction energy estimate suggest that the compound 60270 has the highest LJ interaction energy which may be contributing in the highest activity of this compound. The results of energy evaluations are also provided in Fig. 14.
Fig. 14

Energy evaluation between the protein and the hit molecules. (A) The short range coulombic interaction energy; (B) the short range Lennard-Jones interaction energy.

On the basis of these MD results such as the number of hydrogen bonds formed during the simulation, RMSD, RMSF and the corresponding energetics of the simulation, the virtual hit STOCK1IN 60270 could be the best ligand as a potential inhibitor of modeled PanC.

Conclusion

In the present study the novel comparative pathway analysis approach was employed to find the unique metabolic pathways and to identify the key enzymes in such pathways in the H. pylori. The study has shown that the pantothenate synthetase of the H. pylori can serve as the potential drug target as it is essential for its survival. The combined structure-based drug design approaches namely homology modeling, molecular docking, pharmacophore modeling, virtual screening and molecular dynamics simulations provided the virtual hits with better structural, binding, pharmacokinetic and toxicity features. The virtual hits STOCK1N-60270, 63040, 44424 and 63231 has been identified as potential H. pylori specific panC inhibitors. Based on the docking and MD studies the virtual hit STOCK1IN 60270 which is a succinic acid derivative could serve as a lead for further development of potential inhibitors of the H. pylori specific panC. These virtual hit could be potentially beneficial in gastric carcinomas due to underlying H. pylori infections.

Conflicts of interest

There are no conflicts to declare.
  46 in total

1.  The generalized Born/volume integral implicit solvent model: estimation of the free energy of hydration using London dispersion instead of atomic surface area.

Authors:  Paul Labute
Journal:  J Comput Chem       Date:  2008-07-30       Impact factor: 3.376

2.  Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology.

Authors:  Peter D Karp; Suzanne M Paley; Markus Krummenacker; Mario Latendresse; Joseph M Dale; Thomas J Lee; Pallavi Kaipa; Fred Gilham; Aaron Spaulding; Liviu Popescu; Tomer Altman; Ian Paulsen; Ingrid M Keseler; Ron Caspi
Journal:  Brief Bioinform       Date:  2009-12-02       Impact factor: 11.622

Review 3.  Sequence Similarity Searching.

Authors:  Gang Hu; Lukasz Kurgan
Journal:  Curr Protoc Protein Sci       Date:  2018-08-13

4.  The challenge of Helicobacter pylori resistance to antibiotics: the comeback of bismuth-based quadruple therapy.

Authors:  Francis Mégraud
Journal:  Therap Adv Gastroenterol       Date:  2012-03       Impact factor: 4.409

5.  The protein-protein interaction map of Helicobacter pylori.

Authors:  J C Rain; L Selig; H De Reuse; V Battaglia; C Reverdy; S Simon; G Lenzen; F Petel; J Wojcik; V Schächter; Y Chemama; A Labigne; P Legrain
Journal:  Nature       Date:  2001-01-11       Impact factor: 49.962

6.  Molecular modeling based synthesis and evaluation of in vitro anticancer activity of indolyl chalcones.

Authors:  Rashmi Gaur; Dharmendra K Yadav; Shiv Kumar; Mahendra P Darokar; Feroz Khan; Rajendra Singh Bhakuni
Journal:  Curr Top Med Chem       Date:  2015       Impact factor: 3.295

7.  Design, synthesis and biological evaluation of imidazo[2,1-b]thiazole and benzo[d]imidazo[2,1-b]thiazole derivatives as Mycobacterium tuberculosis pantothenate synthetase inhibitors.

Authors:  Ganesh Samala; Parthiban Brindha Devi; Shalini Saxena; Nikhila Meda; Perumal Yogeeswari; Dharmarajan Sriram
Journal:  Bioorg Med Chem       Date:  2016-02-02       Impact factor: 3.641

8.  Pharmacophore modeling and virtual screening studies to identify novel selective SIRT2 inhibitors.

Authors:  Gokcen Eren; Agostino Bruno; Sezen Guntekin-Ergun; Rengul Cetin-Atalay; Fikriye Ozgencil; Yesim Ozkan; Mahmut Gozelle; Selen Gozde Kaya; Gabriele Costantino
Journal:  J Mol Graph Model       Date:  2019-03-05       Impact factor: 2.518

9.  FAF-Drugs2: free ADME/tox filtering tool to assist drug discovery and chemical biology projects.

Authors:  David Lagorce; Olivier Sperandio; Hervé Galons; Maria A Miteva; Bruno O Villoutreix
Journal:  BMC Bioinformatics       Date:  2008-09-24       Impact factor: 3.169

Review 10.  The impacts of H. pylori virulence factors on the development of gastroduodenal diseases.

Authors:  Wei-Lun Chang; Yi-Chun Yeh; Bor-Shyang Sheu
Journal:  J Biomed Sci       Date:  2018-09-11       Impact factor: 8.410

View more
  2 in total

1.  Drug-target affinity prediction using graph neural network and contact maps.

Authors:  Mingjian Jiang; Zhen Li; Shugang Zhang; Shuang Wang; Xiaofeng Wang; Qing Yuan; Zhiqiang Wei
Journal:  RSC Adv       Date:  2020-06-01       Impact factor: 4.036

2.  Pharmacoinformatics approaches to identify potential hits against tetraacyldisaccharide 4'-kinase (LpxK) of Pseudomonas aeruginosa.

Authors:  Manoj G Damale; Shahebaaz K Pathan; Rajesh B Patil; Jaiprakash N Sangshetti
Journal:  RSC Adv       Date:  2020-09-04       Impact factor: 4.036

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.