Literature DB >> 29049350

An integrative in-silico approach for therapeutic target identification in the human pathogen Corynebacterium diphtheriae.

Syed Babar Jamal1, Syed Shah Hassan1,2, Sandeep Tiwari1, Marcus V Viana1, Leandro de Jesus Benevides1, Asad Ullah2, Adrián G Turjanski3, Debmalya Barh4, Preetam Ghosh5, Daniela Arruda Costa1, Artur Silva6, Richard Röttger7, Jan Baumbach7, Vasco A C Azevedo1,8.   

Abstract

Corynebacterium diphtheriae (Cd) is a Gram-positive human pathogen responsible for diphtheria infection and once regarded for high mortalities worldwide. The fatality gradually decreased with improved living standards and further alleviated when many immunization programs were introduced. However, numerous drug-resistant strains emerged recently that consequently decreased the efficacy of current therapeutics and vaccines, thereby obliging the scientific community to start investigating new therapeutic targets in pathogenic microorganisms. In this study, our contributions include the prediction of modelome of 13 C. diphtheriae strains, using the MHOLline workflow. A set of 463 conserved proteins were identified by combining the results of pangenomics based core-genome and core-modelome analyses. Further, using subtractive proteomics and modelomics approaches for target identification, a set of 23 proteins was selected as essential for the bacteria. Considering human as a host, eight of these proteins (glpX, nusB, rpsH, hisE, smpB, bioB, DIP1084, and DIP0983) were considered as essential and non-host homologs, and have been subjected to virtual screening using four different compound libraries (extracted from the ZINC database, plant-derived natural compounds and Di-terpenoid Iso-steviol derivatives). The proposed ligand molecules showed favorable interactions, lowered energy values and high complementarity with the predicted targets. Our proposed approach expedites the selection of C. diphtheriae putative proteins for broad-spectrum development of novel drugs and vaccines, owing to the fact that some of these targets have already been identified and validated in other organisms.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29049350      PMCID: PMC5648181          DOI: 10.1371/journal.pone.0186401

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Corynebacterium diphtheriae is responsible for causing diphtheria which remains a major global cause of death (http://www.who.int/immunization_monitoring/diseases/diphteria/), and has conventionally been divided into four subgroups of biovars i.e., gravis, intermedius, mitis and belfanti based on biochemical characteristics according to Funke et al., 1997 [1] and Whitman et al., 2012 [2]. It was once a major cause of infant mortality, which spread as an epidemic and resulted in thousands of deaths [3]. The death rates dropped over time specifically in countries where living standards have improved, and the death rates rapidly declined after the introduction of immunization programs [3]. Despite these measures, it remains a significant pathogen around the globe, even today. A variety of mechanisms were responsible for causing such death rates; for example the ‘strangling angel’ effect on children that ascended from the wing shaped pseudo-membranes formed in the oropharynx. Disarticulation and impaction of these pseudo-membranes triggers acute airway obstruction and can result in sudden death [3, 4]. Since there has been a plethora of reported cases on both non-lethal and lethal diphtheria across various countries in the past few years, and that significant population displacements in the form of immigration are happening, more such cases are bound to follow. A passable handling requires quick inroads in discovering diphtheria antitoxin and antibiotic treatment [5]. Computational methods and other approaches, like reverse vaccinology, have been established for the rapid identification of novel targets in the post-genomic era [6, 7]. Approaches like subtractive and comparative microbial genomics as well as differential genome analysis [8] are being used for the identification of targets in a number of human pathogens like M. tuberculosis [, Burkholderia pseudomalleii [10], Helicobacter pylori [11] Pseudomonas aeruginosa [12], Neisseria gonorrhea [13] and Salmonella typhi [14]. The main principle is to find targeted genes/proteins that are essential for the pathogen and possess no homology counterpart in the host [15], such that drugs targeting these “pathogen-essential non-host homologs” can be applied with little (or no) off targets in the host. Some pathogen-essential proteins, though, may possess a certain degree of homology to host proteins. However, they might still be selected as potential molecular targets for structure-based selective inhibitor development. Significant differences in the active sites or in other druggable pockets might exist, such that the pathogenic protein could still be targeted [16, 17]. Here, we exploit an integrative in silico approach for the predictive proteome of C. diphtheriae species to associate the genomic information with the identification of putative therapeutic targets based on their three dimensional structure. It can be utilized for the identification of potent inhibitors, which might possibly lead to the discovery of compounds that inhibit pathogenic growth. The predicted proteomes from the 13 genomes of C. diphtheriae were modeled (pan-modelome) using the MHOLline workflow as proposed by Hassan et al., 2014 [18]. Furthermore, intra-species conserved proteins with adequate 3D models (core-modelome) were filtered on the basis of predicted essentiality for the bacteria, which leads to the identification of eight essential bacterial proteins. They were found non homologous to all host proteins and have been subjected to virtual screening using multiple compound libraries. We provided a list of putative targets in C. diphteriae, and possible mechanisms to design peptide vaccines, and suggest novel lead, natural and drug-like compounds that could bind to the proposed target proteins.

Materials and methods

Genomes selection

The thirteen C. diphtheriae strains, including three of the four biovars: gravis, mitis and belfanti ( were included in this study. The gene and protein sequences of these thirteen C. diphtheriae strains were retrieved from NCBI (ftp://ftp.ncbi.nih.gov/genomes/Bacteria). The different steps involved in this computational approach for genome-scale modelome prediction and for the prioritization of putative drug and vaccine targets are given in (Figs ).

Intra-species subtractive modelomics workflow for conserved target identification in C. diphtheriae species.

The table represents the total number of protein sequences as an input data fed to the MHOLline workflow (upper red arrow). The blue arrow represents the core genes of thirteen Cd strains. The rectangular boxes show how this workflow processes and filters a large quantity of genomic data for putative drug and vaccine target identification of a pathogen.

Prediction of core-modelome and identification of core genome

To construct the core-modelome of C. diphtheriae, we followed a slightly modified protocol described by Hassan et al., 2014 [18]. High throughput structural modeling, MHOLline (http://www.mholline.lncc.br), was used to predict the modelome (whole-proteome set of protein 3D models) for each strain. MHOLline uses comparative modeling approach for protein 3D structure prediction through MODELLER [19]. Our workflow also includes BLASTp (Basic Local Alignment Search Tool for Protein) [20], HMMTOP (Prediction of transmembrane helices and topology of proteins), [21] BATS (Blast Automatic Targeting for Structures), FILTERS, ECNGet (Get Enzyme Commission Number), MODELLER, and PROCHECK [22]. MHOLline work on the basis of available template. It is probable that MHOLline cannot detect all the common conserved proteins due to the unavailability of the template. To overcome this probability, we used EDGAR (an Efficient Database framework for comparative Genome Analyses using BLAST score Ratios for pan-genomics analysis) to collect common conserved genome as well of all Cd strains [23]. Later, the results from MHOLine and EDGAR were compared and crosschecked to obtain the final dataset of common conserved proteins.

Identification of intra-species conserved proteins

Primarily, for the identification of highly conserved proteins with available 3D models in all Cd strains (≥ 95% sequence identity), the standalone release of NCBI BLASTp+ (v2.2.26) was adapted from the NCBI ftp. Site (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/) and installed on a local machine. Furthermore, a search was performed using NCTC13129 as a random reference genome for all strains. Comparative genomics/proteomics approach was next adopted for selecting the highly conserved proteins using an all-against-all BLASTp analysis with a cut-off value of E = 0.0001, as in many other essentiality studies before [6, 13, 15, 18, 24].

Essential and non-host homologous (ENH) protein targets

A subtractive genomics approach was next followed for the selection of conserved targets, which were essential to the bacteria [15]. Concisely, the set of proteins derived from the core-modelome of C. diphtheriae was subjected to the Database of Essential Genes (DEG) for homology analyses. The DEG encompasses experimentally validated data of currently available essential genomic elements like protein-coding genes and non-coding RNAs, from bacteria, archaea and eukaryotes. For a bacterium, essential genes form a minimal genome, i.e., a set of functional modules that has key roles in the emerging field of synthetic biology [25]. The cutoff values used for BLASTp were: E-value = 0.0001, bit score ≥100 and identity ≥ 25% [15, 18]. The pool of essential genes was then subjected to NCBI-BLASTp (E-value = 0.0001, bit score ≥100 and identity ≥ 25%) against the human genome for filtering pathogen-essential host-homologs [6]. The remaining set of pathogen-essential non-host homologs were additionally crosschecked with NCBI-BLASTp PDB database using the default values to find any remote structural similarity with the existing host homolog protein structures, keeping the cutoff level to ≤ 15% for query coverage. The biochemical pathways of these proteins have been checked using KEGG (Kyoto Encyclopedia of Genes and Genomes) [26], functionality using UniProt (Universal Protein Resource) [27], virulence using PAIDB (Pathogenicity island database) [28], and cellular localization using CELLO (subCELlular LOcalization predictor) [29]. The final list of targets was based on criteria described by Barh et al., 2011 & Hassan et al., 2014 [15, 18].

Essential and host homologous (EH) protein targets

We further extended our analyses to the set of protein targets that were essential to C. diphtheriae but homologous to host proteins. The essential protein targets deviating from the cutoff values for essential non-host homologous proteins were treated as host homologous proteins. This set of targets was also checked for pathway involvement, functional annotation virulence, and cellular localization as mentioned above.

Computational identification of druggable pockets

The information obtained from 3D structures and druggability analyses are important features for prioritizing and authenticating putative pathogen targets [30, 31]. As mentioned above, for druggability analyses, the final list of essential non-host and host homologous protein targets were subjected to DoGSiteScorer in PDB format [32]. The DoGSiteScorer is an automated pocket detection and analysis tool for calculating the druggability of protein cavities. For each detected cavity the tool returns the pocket residues and a druggability score ranging from 0 to 1. Values closer to 1 indicate highly druggable protein cavity, i.e. the predicted cavities are likely to bind ligands with high affinity [32]. The DoGSiteScorer also calculates volume, depth, surface area, lipophilic surface, and further parameters for each predicted cavity.

Ligand libraries preparation, virtual screening and docking analyses

The ligand libraries were prepared from four different sources, compounds from ZINC database (ZINC drug-like molecules, ZINC Natural Product), natural compounds from literature survey [33] and the Di-terpenoid Iso-steviol derivatives (). ZINC (drug-like molecules) contains 11,193 drug-like molecules, with Tanimoto cutoff level of 60% [34] and ZINC (Natural Product) contain 11,203 molecules. Whereas, the small library of natural compounds contained 28 molecules and the library of Di-terpenoid Iso-steviol derivatives contained 31 molecules respectively. The structures of these molecules were constructed using MOE-Builder tool. The 3D structures were modeled and partial charges were calculated using MOE (Molecular Operating Environment). The energies of the modeled molecules were minimized using the energy minimization algorithm of MOE tool (gradient: 0.05, Force Field: MMFF94X, Chiral Constraint) [35]. The modeled molecules were saved in the.mol2 file format and subjected to docking analysis. The 3D structures of proteins were examined for structural errors such as missing atoms, wrong bonds and protonation states in the MVD (Molegro Virtual Docker) [36]. The consensus set of protein cavities and those predicted with DogSiteScorer (druggability ≥ 0.80) were compared with the MVD detected cavities, for all Cd targets. The maximum numbers of residues from DoGSiteScorer falling in the cavities detected by MVD were merged and final grid was generated based on the consensus between the highest scoring pocket from DoGSiteScorer and cavities detected by MVD for docking. The most druggable cavity was subjected to virtual screening using MVD. The program comprises of three search algorithms for molecular docking analyses namely MolDock Simplex Evolution (SE), MolDock Optimizer [36] and Iterated Simplex (IS). We employed the MolDock Optimizer search algorithm, which is based on a differential evolutionary algorithm, using the default parameters that are a) population size = 50, b) scaling factor = 0.5 and c) crossover rate = 0.9. The orientations of docked molecules from the library of natural compounds and from the derivatives of Di-terpenoid Iso-steviol were analyzed in Chimera [37]. The 200 top ranked compounds (ZINC drug-like molecules, ZINC Natural Product) for each target protein were evaluated for shape complementarity and hydrogen bond interactions. This led to the selection of a final set of compounds with polypharmacology and polypharmacy characteristics for target proteins in C. diphtheriae.

Results and discussion

Modelome prediction and conserved targets identification in C. diphtheriae

Among 13 strains of C. diphtheriae species, our employed methodology produced high-confidence 3D structural models from orthologous proteins in C. diphtheriae species through the efficient MHOLline workflow (). A comparative structural genomics approach was followed where all the G2 sequences classified as “Very High”, “High”, “Good” and “Medium to Good quality” by MHOLline, from the 12 Cd strains, were aligned to the Cd NCTC13129 strain as a reference genome. First, we identified a set of common conserved proteins with a pre-defined sequence similarity of 95–100%. This resulted in a set of 463 protein sequences, being conserved in all Cd strains ().

Efficiency of the MHOLline biological workflow for genome-scale modelome (3D models) prediction.

Predicted proteomes from the genomes of 13 C. diphtheriae strains were fed to the MHOLline workflow in FASTA format. The grey bars represent the number of input data. The remaining bars (MHOLline output data) show the number of not aligned sequences (G0, green bars), sequences for which there is a template structure available at RCSB PDB (blue bars), and sequences with acceptable template structures that were modeled in the MHOLline workflow (G2, red bars).

Protein targets as putative drug and vaccine candidates

The identification of essential proteins in C. diphtheriae was carried out where the core-modelome was compared to DEG (Database of Essential Genes). This filter drastically reduced the number of selected targets to 23 final targets. Further comparison of the corresponding protein sequences to the human host proteome resulted in a set of 8 targets as essential non-host homologous (ENH, ) and a set of 15 targets as essential host homologous proteins (EH, ). aDruggability predicted with DoGSiteScorer software. A druggability score above 0.60 is considered to be good, but a score above 0.80 is favored [32]. bMolecular weight was determined using ProtParam tool (http://web.expasy.org/protparam/). cMolecular function (MF) and biological process (BP) for each target protein was determined using UniProt. dCellular localization of pathogen targets was performed using CELLO. eKEGG was used to find the role of these targets in different cellular pathways. fPAIDB was used to check if the putative targets are involved in the pathogen’s virulence. gDS = Drug Score aDruggability predicted with DoGSiteScorer software. A druggability score above 0.60 is usually considered, but a score above 0.80 is favored [32]. b Molecular weight was determined using ProtParam tool (http://web.expasy.org/protparam/). c Molecular function (MF) and biological process (BP) for each target protein was determined using UniProt. dCellular localization of pathogen targets was performed using CELLO. eKEGG was used to find the role of these targets in different cellular pathways. fPAIDB was used to check if the putative targets are involved in the pathogen’s virulence. gDS = Drug Score.

Prioritization parameters for drug targets and vaccine candidates

There are several factors that can aid in determining potential therapeutic targets [30]. For vaccine candidates, the information about subcellular localization is important: Proteins that contain transmembrane motifs are favored [24, 30, 38, 39]. The 23 essential proteins have a low molecular weight and all are localized in the cytoplasmic compartment of C. diphtheriae (Tables ). After the druggability evaluation using DoGSiteScorer [32] for both essential non-host and host homologous conserved targets from C. diphtheriae, we could predict at least one druggable cavity for each Cd target. The host homologous proteins as therapeutic targets could adversely affect the host. Therefore, the first step in numerous in silico drug target identification approaches are filtering proteins homologous to host proteome. Thus, we only consider the eight pathogen-essential non host homologs for the docking studies [13, 15, 40]. For the eight pathogen-essential non host homologs () glpX, nusB, rpsH, hisE, DIP1084, DIP0983, smpB, and bioB 3, 0, 1, 0, 2, 0, 1 and 3 cavities with score > 0.80 were predicted. The cavity of each protein exhibiting the highest druggability score was subjected to docking analyses. The numbers of predicted cavities with their respective druggability scores are given in Tables . The identified eight non-host homologous and essential Cd proteins could be novel therapeutic targets for Corynebacterium diphtheriae. As per our knowledge, glpX, hisE and bioB proteins have been reported as potential drug target in Mtb. Protein nusB is a member of Nus-transcription Factor family that help bacteria in the process of elongation, transcription: translation coupling and termination. Some members of this family (nusG) has already been reported as drug target. Furthermore, rpsH and smpB are also reported as potential drug target by Folador et al., 2016 in their in silico study [41]. Protein DIP1084 is Putative iron transport membrane protein (FecCD-family) and DIP0983 is uncharacterized Hypothetical Protein that need to be characterized experimentally. Hence, these protein could be a good therapeutic target against Cd.

Virtual screening and molecular docking

For each target protein (glpX, nusB, rpsH, hisE, DIP1084, DIP0983, smpB, and bioB) four different libraries were separately screened. A total of 28 molecules from natural compounds library and 31 compounds from the derivatives of Di-terpenoid Iso-steviol library were docked. Furthermore, top 200 drug-like molecules from virtual screening analyses of two large libraries (ZINC drug-like molecules, ZINC Natural Product) were examined one-by-one for the selection of the final set of promising molecules that showed favorable interactions with the ENH targets. The biological importance and an analysis of the predicted protein-ligand interaction/s for each target are described here. The molecule names, ZINC codes and MolDock scores for the selected ligands, as well as the number of predicted hydrogen bonds with the protein cavity residues involved in these interactions, are shown below (Tables ) for each target protein. The predicted binding modes of selected ligands are also shown for each pathogen target in Figs

Validation of docking protocol

To validate the accuracy of MolDock program (MVD), the co-crystallized ligand of Biotin synthase, bioB (PDB ID; 1R30) was extracted and then re-docked into the binding pocket of receptor protein. The RMSD between docked and co-crystallized ligand was found to be 1.81 A˚, which shows that the adopted docking protocol is valid and can be used to correctly predict the binding pose of the ligands [35, 42]. The superposition of co-crystallized ligands and docked is shown in NP_939302.1 (glpX, Fructose 1, 6-bisphosphatase II) is a key enzyme of gluconeogenesis and catalyzes the hydrolysis of fructose 1, 6-bisphosphate to form fructose 6-phosphate and orthophosphate. A reverse reaction catalyzed by phosphofructokinase in glycolysis, and the product, fructose 6-phosphate, is an important precursor in various biosynthetic pathways [43]. In all organisms, gluconeogenesis is an important metabolic pathway that allows the cells to synthesize glucose from non-carbohydrate precursors, such as organic acids, amino acids and glycerol. FBPases are members of the large superfamily of lithium sensitive phosphatases, which includes three families of inositol phosphatases and FBPases (the phosphoesterase clan CL0171, 3167 sequences, Pfam data base). The FBPases are already reported as targets for the development of drugs for the treatment of noninsulin dependent diabetes [44, 45]. Based on a comparison with a crystallographic structure of the glpX template (PDB ID: 1NI9, GlpX from Escherichia coli), none of the active site residues were identified. The docking analysis was performed utilizing the highest scoring pocket obtained from DoGSiteScorer. shows a set of 10 promising ligands according to their minimum energy values and the maximum number of hydrogen bond interactions from the four aforementioned libraries. Compounds ZINC67912153, ZINC13142972, Jacarandic Acid and 16-hydrazonisosteviol are shown in . A-I: 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939302.1 (glpX, Fructose 1,6-bisphosphatase II) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structures of Jacarandic Acid with glpX protein. Figs B-I, II, C-I, II & D-I, II represent same information for compounds 16-hydrazonisosteviol, ZINC13142972 and ZINC67912153 respectively, for the same protein cavity. *NP = Natural Product (http://zinc.docking.org/catalogs/acdiscnp) NP_939692.1 (nusB, Transcription antitermination protein NusB) is a prokaryotic transcription factor involved in antitermination processes, during which it interacts with the mRNA nut site at boxA portion. The crystal structure of M. tuberculosis and E. coli NusB proteins suggest that the basic N-terminal region of the molecule associates with the rRNA BoxA. Hypothetically, this is indicative of the so-called arginine rich RNA binding motif (ARM) in the bacteriophage N protein, HIV tat and HIV rev. This suggestion is supported by the presence of a phosphate-binding site at the N-terminal end of α-A in each NusB protomer that includes a pair of conserved arginines, Arg10 and Arg14 [46]. The bismuth-dithiol solutions have been shown to selectively inhibit Escherichia coli rho transcription termination factor [47]. A comparison between the crystallographic structures of the NusB template (PDB ID: 1EYV, NusB from M. tuberculosis) and our modeled structure reveals that the conserved arginines were located at position 12 and 16 (Arg12 and Arg16) and are likely to contribute in the interactions. Although none of these residues are predicted to form hydrogen bonds with selected docked ligands, these molecules were predicted to interact with other residues in the pocket. shows the 8 selected ligands from all the four libraries according to their minimum energy values and the number of hydrogen bond interactions. The compounds ZINC15043210, ZINC00053531 Jacarandic Acid and 16-hydrazonisosteviol are shown in (. A decent binding mode and good shape complementarity was observed in these complexes. A-I: 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939692.1 (nusB, Transcription antitermination protein NusB) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structures of Jacarandic Acid with nusB protein. Figs B-I, II, C-I, II and D-I, II represent same information for compounds 16-hydrazonisosteviol, ZINC00053531 and ZINC15043210 respectively, for the same protein cavity. NP_938900.1 (rpsH, 30S ribosomal protein S8) is an important RNA-binding protein that inhabits a central position within the small ribosomal subunit. It widely interacts with 16S rRNA and is vital for the correct folding of the central domain of the rRNA. The protein rpsH S8 also controls the synthesis of numerous ribosomal proteins by binding to mRNA. It binds exactly to very similar sites in the two RNA molecules. It is a ribosomal protein that has medium-size, and its role as a significant primary RNA-binding protein in the 30S subunit is discovered recently. The S8 mutations within the protein have been shown to result in defective ribosome assembly. In Escherichia coli, the S8-binding site within 16S rRNA has been investigated independently by a number of techniques including nuclease protection, RNA–protein crosslinking, RNA modification, hydroxyl-radical footprinting and chemical probing. The rpsH S8 protein is also one of the principal regulatory elements that control ribosomal protein synthesis by the translational feedback inhibition mechanism discovered by Nomura and colleagues [48]. It regulates the expression of the spc operon that encodes, in order, the ten ribosomal proteins L14, L24, L5, S14, S8, L6, L18, S5, L30 and L15 [49]. The active site residues of rpsH, based on a comparison with its template structure were Arg86, Tyr88, Ser107, Ser109, Gly124, Gly125 and Glu126. However, none of the molecules interacts with these residues (); nonetheless they are predicted to interact with other residues of the binding cavity predicted by DoGSiteScorer. The predicted binding mode of best scoring compounds each library ZINC35457686, ZINC15221730, Jacarandic Acid and 17-hydroxyisosteviol are shown in A-I 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_938900.1 (rpsH, 30S ribosomal protein S8) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structures of Jacarandic Acid with rpsH protein. Figs B-I, II, C-I, II and D-I, II represent same information for compounds 17-hydroxyisosteviol ZINC15221730 and ZINC35457686 respectively, for the same cavity. NP_938502.1 (bioB, Biotin synthase) catalyzes the final step in the biotin biosynthetic pathway by converting dethiobiotin (DTB) to biotin. This reaction uses organic radical chemistry for inserting sulfur atom between non activated carbons C6 and C9 of DTB. BioB is a member of the “radical SAM” or “AdoMet radical” superfamily, which is categorized by the presence of a conserved CxxxCxxC sequence motif (C, Cys; x, any amino acid) that synchronizes an essential Fe4S4 cluster, as well as by the use of S-adenosyl-Lmethionine (SAM or AdoMet) for radical generation. AdoMet radical enzymes act on a wide variety of biomolecules. For example, BioB and lipoyl-acyl carrier protein synthase (LipA) are involved in vitamin biosynthesis; lysine 2,3-aminomutase (LAM) facilitates the fermentation of lysine; class III ribonucleotide reductase (RNR) and pyruvate formate lyase (PFL) catalyze the formation of glycyl radicals in their respective target proteins; and spore photoproduct lyase repairs ultraviolet light-induced DNA damage [50]. The protein bioB was reported as putative drug target in C. diphtheriae by Barh et al., 2011 in their in silico study [15]. A comparison between our modeled protein and template structures suggest Cys86, Cys90, Cys93 and Arg291 as the active residues. Although, only Cys86, Cys90 and Cys93 were found to interact with the compounds from our prepared libraries, the molecules were predicted to interact with other residues in the pocket. The binding mode of compounds with active site residues and low scores suggest a set of 10 molecules () as promising leads from our four libraries. The predicted binding modes of Jacarandic Acid, 16-oxime, 17-hydroxyisosteviol, ZINC16952914 and ZINC77269615 are shown in . A-I 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_938502.1 (bioB, Biotin synthase) with Rhein (CID 10168). A-II: 3D surface representation of the docking analyses for the structure of Rhein with bioB protein. Figs B-I, II, C-I, II & D-I, II represent same information for compounds 16-oxime, 17-hydroxyisosteviol, ZINC16952914 and ZINC77269615 respectively, for the same protein cavity. NP_939612.1 (hisE, Phosphoribosyl-ATP pyrophosphatase) is the second enzyme in the histidine-biosynthetic pathway, hydrolyzing irreversibly phosphoribosyl-ATP to phosphoribosyl-AMP and pyrophosphate. It is encoded by the hisE gene, which is present as a separate gene in many bacteria and archaea but is fused to hisI in other bacteria, fungi and plants. As it is essential for growth as seen in in vitro experiments, HisE is a potential drug target for tuberculosis [51]. A comparison of template and target protein structures here showed that there was no reported information about ligand-residue/s association in the active site cavity. Hence, the cavity chosen for virtual screening was simply the one that presented the highest DogSiteScorer druggability score (>80). A list of best dock molecules is shown below (). The binding patterns of Jacarandic Acid, 16–17 dihydroxyisosteviol, ZINC05809437 and ZINC67913372 are shown in A-1 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939612.1 (hisE, Phosphoribosyl-ATP pyrophosphatase) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structure of Jacarandic Acid with hisE protein. Figs B-I, II, C-I, II & D-I, II represent same information for compounds 16–17 dihydroxyisosteviol, ZINC05809437 and ZINC67913372 respectively, for the same protein cavity. NP_939123.1 (smpB, SsrA-binding protein) is a small protein B (SmpB), which is very useful for biological functions of tmRNA. In bacteria, a hybrid RNA molecule that combines the functions of both messenger and transfer RNAs rescues stalled ribosomes, and targets aberrant, partially synthesized proteins for proteolytic degradation. The flexible RNA molecule adopts an open L-shaped conformation and SmpB binds to its elbow region, stabilizing the single-stranded D-loop in an extended conformation. The most prominent feature of the structure of tmRNA is a 90o rotation of the TѰC-arm around the helical axis. Because of this important conformation, the SmpB–tmRNA D-complex positioned into the A-site of the ribosome orients SmpB towards the small ribosomal subunit, and directs tmRNA towards the elongation-factor binding region of the ribosome. The tmRNA–SmpB rescue system is ubiquitous in bacteria, and is also found in some chloroplasts and mitochondria [52]. In this case the template structure (PDB ID: 1P6V) did not contain any ligand, and no reported information was found about the ligand-residue interaction in their cavities. Therefore, amongst the cavities identified by MVD, the best cavity for docking analysis was chosen in consensus with highest druggability score from the DogSiteScorer. ZINC31168211 was found to form the network of 12 hydrogen bonds with Asn9, Ser16, Val49, Ser50, Thr52, Asp53, Ser54, Thr109. lists top compounds from respective libraries selected for this target while the binding modes of Rhein, 16-hydroxyisosteviol, ZINC01414475 and ZINC31168211 are also shown ( A-I 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939123.1 (smpB, SsrA-binding protein) with Rhein (CID 10168). A-II: 3D surface representation of the docking analyses for the structure of Rhein with smpB protein. Figs B-I, II, C-I, II & D-I, II represent same information for compounds 16-hydroxyisosteviol ZINC01414475 & ZINC31168211 respectively, for the same protein cavity. NP_939445.1 (DIP1084, Putative iron transport membrane protein, FecCD-family) The Pfam search for the protein showed that it has two main components, FecCD and ABC_trans. The FecCD is a subfamily of bacterial binding-protein-dependent transport systems family constituting transport system permease proteins involved in the transport of numerous compounds through the membrane. These transporters tend to catalyze the thermodynamically unfavorable translocation of substrates against a transmembrane concentration gradient through the coupling to a second, energetically favorable process. ABC systems can be categorized in three functional groups, as follows. Importers mediate the uptake of nutrients in prokaryotes. The nature of the substrates that are transported is very wide, including mono- and oligosaccharides, organic and inorganic ions, amino acids, peptides, iron-siderophores, metals, polyamine cations, opines, and vitamins [53]. Exporters are involved in the secretion of various molecules, such as peptides, lipids, hydrophobic drugs, polysaccharides, and proteins, including toxins such as hemolysin. The third category of systems is apparently not involved in transport, with some members being involved in translation of mRNA and in DNA repair. shows a set of 11 high scoring compounds against the proposed target. Compound ZINC70454922 from ZINC NP library was predicted to form ten hydrogen bonds with relatively low docking score (). A-I 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939445.1 (DIP1084, Putative iron transport membrane protein, FecCD-family) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structure of Jacarandic Acid with DIP1084, Putative iron transport membrane protein. Figs B-I, II, C-I, II & D-1, II D represent same information for compounds 16-hydrazonisosteviol ZINC13142972 and ZINC70454922 respectively, for the same protein cavity. NP_939345.1 (DIP0983, Hypothetical protein DIP0983) is a conserved hypothetical protein. It is annotated as a possible lysine decarboxylase (LDC) in the Pfam database (PF03641) [54] due to the presence of the highly conserved PGGxGTxxE motif. Some enzymes i:e “Lonely Guy” LOG are often mis-annotated as lysine decarboxylases enzymes; it is apparently responsible for catalyzing L-lysine decarboxylation to produce the polyamine metabolite cadaverine [55]. Conversely, this annotation is not supported by any biochemical or functional data in any of the PGGxGTxxE motif containing LDC identified so far. This motif is highly conserved among a vast number of proteins with unknown function, predicted from bacterial, yeast, and plant; in Arabidopsis thaliana, all the genome-annotated LOG proteins are identified as LDC like proteins by protein family. Based on sequence BLAST against the PDB, LOG from Claviceps purpurea shares more than 30% identical residues with crystal structures of LDC-like proteins of unknown function, whose structures are already determined. Recently, lysine decarboxylase has been reported as a therapeutic target by Lohinai et al., 2015 for Periodontal Inflammation [56]. Here we listed 12 compounds showing good potency against our target tabulated in . Four of the compounds with promising docking results are shown in A-1: 3D cartoon representation of the docking analyses for the most druggable protein cavity of NP_939345.1 (DIP0983, Hypothetical protein DIP0983) with Jacarandic Acid (CID 73645). A-II: 3D surface representation of the docking analyses for the structure of Jacarandic Acid with Hypothetical protein DIP0983. Figs B-I, II, C-I, II & D-I, II represent same information for compounds 17-hydroxyisosteviol, ZINC00211173 and ZINC67911471 respectively, for the same protein cavity. Among the drug-like molecule ZINC13142972 (1-[(2S, 3S, 4S, 5R)-3,4-dihydroxy-5-(hydroxymethyl) oxolan-2-yl]imidazo[1,2-b]pyrazole-7-carbonitrile) was predicted to show good results against two of our targets NP_939302.1 (glpX, Fructose 1,6-bisphosphatase II) and NP_939445.1 (DIP1084, Putative iron transport membrane protein, FecCD-family). It has been reported that at present 50% of drug molecules are either from natural source or their derivatives [57]. Interestingly, the compounds from second library of ZINC (Natural Product) showed better energy scores among all the libraries. Furthermore, from the library of natural compounds (28 molecules), Jacarandic Acid and Rhein were identified as the top ranked molecules and in silico analysis of the library (derivatives of diterpenoid isosteviol) suggest that compounds 16-hydroxyisosteviol, 16-hydrazonisosteviol, 17-hydroxyisosteviol, 16–17 dihydroxyisosteviol and 16-oxime, 17-hydroxyisosteviol were top ranked molecules, however, with much higher energy scores (less negative) than the top compounds from the ZINC libraries (ZINC drug-like molecules, ZINC Natural Product).

Conclusion

We utilized a bioinformatics pipeline for determining the conserved proteome of 13 strains of C. diphtheriae, and subsequently exploit 3D structural information, resulting in a small set of prioritized putative drug/vaccine targets, of which eight proteins are pathogen-essential, non-host homologous and 15 are pathogen-essential, host-homologs. After a detailed structural comparison between host and pathogen proteins, we suggest that eight of the non -host homologs could be considered for antimicrobial chemotherapy in future studies on anti-diphtheriae drugs and vaccines. Moreover, the strategy described herein is of general nature and can also be employed to other pathogenic microorganisms.

Structural information of the Di-terpenoid Iso-steviol derivatives.

(DOCX) Click here for additional data file.

Information of templates used for 8 essential non host homologous targets.

(DOCX) Click here for additional data file.

Common conserved proteins with their templates.

(XLS) Click here for additional data file.
Table 1

Strains of C. diphtheriae employed in the pan-modelome study with information on genomes statistics, disease prevalence and location of isolation.

StrainGPIDNCBI AccessionGenome Size (Mb)ProteinsGC%LocationBiovar
31APRJNA42399NC_016799.12.53535225853.60Brazil
241PRJNA42407NC_016782.12.42655224553.40Brazil
BH8PRJNA42423NC_016800.12.48552222353.60Brazil
C7PRJNA42401NC_016801.12.49919223053.50USA
CDCE8392PRJNA42405NC_016785.12.43333213553.60USAMitis
HC01PRJNA42409NC_016786.12.42715216253.40BrazilMitis
HC02PRJNA42411NC_016802.12.46861217953.70BrazilMitis
HC03PRJNA42415NC_016787.12.47836218653.50BrazilMitis
HC04PRJNA42417NC_016788.12.48433219053.50BrazilGravis
INCA402PRJNA42419NC_016783.12.44907216353.70BrazilBelfanti
PW8PRJNA42403NC_016789.12.53068220053.70USA
VA01PRJNA42421NC_016790.12.39544210053.40BrazilGravis
NCTC13129PRJNA87NC_002935.22.48863218553.50UKGravis
Table 2

Drug and/or vaccine target prioritization parameters and functional annotation of the eight essential non-host homologous putative targets.

Gene and protein codesOfficial full nameCavities with DSa > 0.80Cavities with DSa,g > 0.60 and < 0.80Mol. Wtb(KDa)FunctionscCellular componentdPathwayseVirulencef
NP_939692.1, nusBTranscription antitermination protein NusB/ N utilization substance protein B homolog0220.382MF: RNA binding.BP: DNA-templated transcription, termination, regulation of transcription, DNA-templated.CytoplasmunknownNo
NP_939612.1, hisEPhosphoribosyl-ATP pyrophosphatase019.877MF: RNA binding, phosphoribosyl-ATP diphosphatase activityBP: histidine biosynthetic processCytoplasmBiosynthesis of amino acidsYes
NP_939445.1,DIP1084Iron ABC transporter membrane protein/ Putative iron transport membrane protein, FecCD-family2335.470MF: Transporter activityBP: TransportMembraneThe ATP-binding cassette (ABC) transporters form one of the largest known protein familiesYes
NP_939345.1,DIP0983Hypothetical protein DIP0983/ Uncharacterized protein0428.193MF: possible lysine decarboxylases (Pfam)/52% sequence identity with PDB Template 1WEK.BP: A pyridoxal-phosphate protein. Also acts on 5-hydroxy-L-lysine (IUBMB Comments)CytoplasmunknownYes
NP_939302.1, glpXFructose 1,6-bisphosphatase II3235.589MF: fructose 1,6-bisphosphate 1-phosphatase activity, metal ion binding.BP: gluconeogenesis, glycerol metabolic processCarbohydrate MetabolismNo
NP_939123.1, smpBSsrA-binding protein1218.784MF: RNA bindingCytoplasmunknownYes
NP_938900.1, rpsH30S ribosomal protein S81114.292MF: rRNA binding, structural constituent of ribosomeBP: TranslationExtracellula/CytoplasmunknownNo
NP_938502.1, bioBBiotin synthase3138.224MF: 2 iron, 2 sulfur cluster binding, 4 iron 4 sulfur cluster binding, biotin synthase activity, iron ion bindingBP: biotin biosynthetic processCytoplasmBiotin metabolismYes

aDruggability predicted with DoGSiteScorer software. A druggability score above 0.60 is considered to be good, but a score above 0.80 is favored [32].

bMolecular weight was determined using ProtParam tool (http://web.expasy.org/protparam/).

cMolecular function (MF) and biological process (BP) for each target protein was determined using UniProt.

dCellular localization of pathogen targets was performed using CELLO.

eKEGG was used to find the role of these targets in different cellular pathways.

fPAIDB was used to check if the putative targets are involved in the pathogen’s virulence.

gDS = Drug Score

Table 3

Drug and/or vaccine target prioritization parameters and functional annotation of the fifteen essential host homologous putative targets.

Gene and protein codesOfficial full nameCavities with DSa > 0.80Cavities with DSa,g > 0.60 and < 0.80Mol. Wtb.(KDa)FunctionscCellulard componentPathwayseVirulencef
NP_938651.1RecRRecombination protein RecR0223.901MF: DNA binding, metal ion bindingBP: DNA recombination, DNA repairCytoplasmHomologous recombinationYes
NP_938792.1DIP0411Putative electron transport related protein0219.950MF: Antioxidant activity, oxidoreductase activityCytoplasm/MembraneYes
NP_938922.1rpsM30S ribosomal protein S130113.777MF: rRNA binding, structural constituent of ribosome, RNA bindingBP: TranslationCytoplasmRibosomeNo
NP_939046.1DIP0672Putative uptake hydrogenase small subunit2043.949MF: 3 iron, 4 sulfur cluster binding, 4 iron, 4 sulfur cluster binding, ferredoxin hydrogenase activity, metal ion bindingCytoplasmMicrobial metabolism in diverse environmentsYes
NP_939341.1dapD, DIP0979Tetrahydropicolinate succinylaseEC 2.3.1.1171133.780MF: 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase activity, magnesium ion bindingBP: diaminopimelate biosynthetic process, lysine biosynthetic process via diaminopimelateCytoplasmBiosynthesis of amino acids Yes
NP_939343.1DIP0981Putative succinyltransferaseEC 2.3.1.1171133.039MF: 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase activityCytoplasmBiosynthesis of amino acids Yes
NP_939460.1ilvH, DIP1099Acetolactate synthase small subunitEC 2.2.1.61319.063MF: acetolactate synthase activity, amino acid bindingBP: branched-chain amino acid biosynthetic processCytoplasm/Membrane2-Oxocarboxylic acid metabolismYes
NP_939590.1cobMPrecorrin-4 C11-methyltransferaseEC 2.1.1.1331227.181MF: precorrin-2 dehydrogenase activity, precorrin-4 C11-methyltransferase activityBP: cobalamin biosynthetic process, porphyrin-containing compound biosynthetic processCytoplasmPorphyrin and chlorophyll metabolismYes
NP_939786.1DIP1438Putative transport membrane protein4344.215MF: Transporter activityBP: transmembrane transportMembraneThe ATP-binding cassette (ABC) transportersYes
NP_939832.1DIP1484Putative uroporphyrinogen III methyltransferase3228.296MF: Methyltransferase activityBP: oxidation-reduction processCytoplasmPorphyrin and chlorophyll metabolismYes
NP_939958.1aroH, DIP1616Phospho-2-dehydro-3-deoxyheptonate aldolaseEC 2.5.1.542350.805MF: 3-deoxy-7-phosphoheptulonate synthase activityBP: aromatic amino acid family biosynthetic processCytoplasmBiosynthesis of amino acidsYes
NP_940018.1DIP1680Putative GTP cyclohydrolase 1 type 2 EC 3.5.4.162140.657MF: GTP binding, GTP cyclohydrolase I activity, metal ion bindingBP: 7,8-dihydroneopterin 3'-triphosphate biosynthetic processCytoplasmYes
NP_940228.1cysE, DIP1891Serine acetyltransferase EC 2.3.1.301020.208MF: serine O-acetyltransferase activityBP: cysteine biosynthetic process from serineCytoplasmCarbon metabolismYes
NP_940284.1DIP1952Putative pyruvate dehydrogenase3162.497MF: Catalytic activity, magnesium ion binding, thiamine pyrophosphate bindingCytoplasm(PYRUVATE METABOLISM) Nicotinate and nicotinamide metabolismYes
NP_940605.1DIP2303Putative DNA protection during starvation protein0118.223MF: Ferric iron binding, oxidoreductase activity, oxidizing metal ionsBP: cellular iron ion homeostasis, response to stressCytoplasmYes

aDruggability predicted with DoGSiteScorer software. A druggability score above 0.60 is usually considered, but a score above 0.80 is favored [32].

b Molecular weight was determined using ProtParam tool (http://web.expasy.org/protparam/).

c Molecular function (MF) and biological process (BP) for each target protein was determined using UniProt.

dCellular localization of pathogen targets was performed using CELLO.

eKEGG was used to find the role of these targets in different cellular pathways.

fPAIDB was used to check if the putative targets are involved in the pathogen’s virulence.

gDS = Drug Score.

Table 4

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected best-ranked molecules against NP_939302.1 (glpX, Fructose 1,6-bisphosphatase II).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-64.12653/ Val95, Asp197
Jacarandic Acid-62.06584/ Asp96, Thr98, Asp197, Pro221
Derivative of diterpenoid isosteviol16-hydrazonisosteviol-64.21075/ Asp93, Val95, Asp96, Val198
16-oxime, 17-hydroxyisosteviol-69.68244/ Asp93, Asp96, Thr98
Benzyl ester isosteviol lactone-69.84643/ Asp93, Asp197, Glu222
ZINC CompoundsZINC00042420-106.973/ Arg175, Arg197, Val198
ZINC13142972-109.6483/ Asp93, Asp197, Glu222
ZINC *NP CompoundsZINC67912153-135.11113/Gly64, Glu65, Asp93, Val95, Gly97, Thr98, Phe127, Arg175, Glu222
ZINC67902753-121.7628/Glu65, Val95, Thr98, Glu222
ZINC38143633-123.15011/Lys37, Asp93, Val95, Asp96, Gly97, Thr98

*NP = Natural Product (http://zinc.docking.org/catalogs/acdiscnp)

Table 5

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_939692.1 (nusB, Transcription antitermination protein NusB).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-78.06521/ Asp36
Jacarandic Acid-68.19133/ Asp36, Glu91, Arg98
Derivative of diterpenoid isosteviol16-hydrazonisosteviol-92.79112/ Glu31, Leu73
ZINC CompoundsZINC00053531-99.47163/ Asp34, Asp36, Ala69
ZINC19899354-114.9663/ Asp36, Ala69, Asp77
ZINC NP CompoundsZINC67911826-131.2887/Ala30, Glu31, Asp34, Ile35, Arg102
ZINC15043210-124.5807/ Glu31, Ile35, Leu73 Glu91, Arg102
ZINC31168395-131.4426/ Glu31, Asp34, Ala69, Leu73, Asp77, Arg102
Table 6

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_938900.1 (rpsH, 30S ribosomal protein S8).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-48.88033/ Asp5, Arg15
Jacarandic Acid-49.35064/ Ser12, Val77
Derivative of diterpenoid isosteviol16-hydrazonisosteviol-68.24463/ Asp5, Arg13, Arg79
17-hydroxyisosteviol-64.58555/ Asp5, Arg79
16–17 dihydroxyisosteviol-56.38684/ Asp5, Arg79
16-oxime, 17-hydroxyisosteviol-65.19954/ Ser26, Ser29, Ser30
ZINC CompoundsZINC15221730-103.6365/ Ala8, Arg15, Arg79
ZINC71913776-87.14745/ Arg15, Arg79
ZINC72333100-104.8073/ Arg15, Arg79, Val80
ZINC NP CompoundsZINC35457686-107.09110/Ala8, Ser12, Arg15, Val77, Arg78, Arg79, Lys82
ZINC67903079-131.21010/ Asp5, Ser12, Arg15, Arg78, Arg79, Lys82
ZINC31163223-100.6847/ Ala8, Asp9, Arg78, Arg79, Lys82
Table 7

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_938502.1 (bioB, Biotin synthase).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-72.29183/ Tyr183, Asn184, His185
Jacarandic Acid-98.01692/ Ala132, Glu188
Derivative of diterpenoid isosteviol16-hydrazonisosteviol-107.554/ Gly165, Tyr183, Asn184, Glu188
17-hydroxyisosteviol-92.21414/ Cys93, Ala132, Val134, Tyr183
16-oxime, 17-hydroxyisosteviol-98.95925/ Glu95, Val134, Glu188, Arg206, Thr323
Benzyl ester isosteviol lactone-89.88815/ Ala132, Val134, Gly165, Asn184
ZINC CompoundsZINC16952914-119.3545/ Val134, Ile164, Gly165, Asn184, Asn186
ZINC NP CompoundsZINC77269615-164.85317/Cys86, Phe92, Cys93, Ser94, Gln95, Ala132, Val134, Ser163, Asn184, Asn186, Arg206
ZINC04098512-162.05014/ Cys86, Cys90, Cys93, Ala132, Val134, Asn184, Asn186, His201, Arg206, Asp256, Thr323
ZINC15112225-142.08010/ Phe92, Cys93, Ser94, Gln95, Asn186, Arg206, Asn253
Table 8

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_939612.1 (hisE, Phosphoribosyl-ATP pyrophosphatase).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-54.95561/ Tyr84
Jacarandic Acid-61.02413/ Leu87
Derivative of diterpenoid isosteviol16–17 dihydroxyisosteviol-70.84963/ Leu65, Tyr84
ZINC CompoundsZINC05809437-89.67811/ Asp5
ZINC NP CompoundsZINC38143703-99.4994/ Thr79, Ile83, Leu87
ZINC67913372-97.9975/ Gly36, Ile62, Ile83, Tyr84
Table 9

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_939123.1 (smpB, SsrA-binding protein).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-67.6983/ Ser16, Ser114, Leu115
Jacarandic Acid-52.36891/ Asn69
Derivative of diterpenoid isosteviol16-hydroxyisosteviol-53.21414/ Ser50, Thr52, Asn69
16-hydrazonisosteviol-64.62033/ Thr52, Asn67
16–17 dihydroxyisosteviol-59.73643/ Ser16, Lys19, Val49
ZINC CompoundsZINC01414475-86.79443/ Thr52, Asn67, Asn69
ZINC17128469-74.53493/ Ser16, Leu51, Thr52
ZINC NP CompoundsZINC31168211-158.05612/ Asn9, Ser16, Val49, Ser50, Thr52, Asp53, Ser54, Thr109
ZINC33832449-134.97410/ Asn9, Ser16, Asn17, Val49, Ser50, Thr52, Asp53
ZINC04096316-137.6139/ Asn9, Ser10, Ser16, Asn17, Lys19, Val49, Ser50, Thr52
Table 10

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_939445.1 (DIP1084, Putative iron transport membrane protein, FecCD-family).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-66.44061/ Ser164
Jacarandic Acid-77.59811/ Gly265
Derivative of diterpenoid isosteviol16-hydrazonisosteviol-96.39454/ Tyr97, Ser164, Ile266
17-hydroxyisosteviol-90.14884/ Tyr97, Ser164, Ile266
Benzyl ester isosteviol lactone-71.47334/ Tyr97, Ser164
ZINC CompoundsZINC01645563-95.71165/ Tyr97, Ser102, Ser164, Ile266
ZINC13142972-111.1855/ Ser103, Ser106, Thr110, Ser164, Ser168
ZINC62023045-103.5424/ Tyr97, Ser102, Ser106, Ser164
ZINC NP CompoundsZINC70454922-155.66710/ Tyr97, Ser102, Ser106, Ser168, Ser172, Met191, Gly319
ZINC31167925-135.53510/ Tyr97, Ser164, Ser168, Met191, Gly265, Ile266, Thr322
ZINC04963990-127.6718/ Asp95, Tyr97, Ser106, Met191, Ile266, Phe268
Table 11

Compounds/Libraries name, MolDock scores and predicted hydrogen bonds for the selected molecules against NP_939345.1 (DIP0983, hypothetical protein DIP0983).

CompoundsMolDock ScoreH-Bond/Residues
Plant derived natural compoundsRhein-55.78193/ Cys13, Leu17, Asp177
Jacarandic Acid-80.82943/ Cys13, Gly14
Derivative of diterpenoid isosteviol17-hydroxyisosteviol-95.90252/ His11, Cys13
16–17 dihydroxyisosteviol-83.72262/ His11, Cys13
ZINC CompoundsZINC00114311-125.4236/ Arg10, Arg155, Gly172
ZINC00211173-98.60647/ Arg10, Gly106, Arg155, Gly174, Asp177
ZINC01427915-112.226/ Arg10, Ala77, Arg78, Gly172, Gly174
ZINC04836994-136.8475/ Arg10, Gly106, Ile131, Glu132
ZINC32004947-146.725/ Arg10, Gly106, Ile131, Glu132
ZINC NP CompoundsZINC67911471-176.09113/ Asn8, Cys13, Ser76, Ala77, Arg78 Arg150, Arg155, Gly174, Thr175
ZINC31163223-162.90812/ Asn8, Arg10, His11, Arg150, Arg155, Gly172, Thr175
ZINC04096393-148.42310/ Glu9, Arg10, Ala77, Arg78, Arg155, Lys156, Thr175
  55 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Structural evidence for specific S8-RNA and S8-protein interactions within the 30S ribosomal subunit: ribosomal protein S8 from Bacillus stearothermophilus at 1.9 A resolution.

Authors:  C Davies; V Ramakrishnan; S W White
Journal:  Structure       Date:  1996-09-15       Impact factor: 5.006

3.  A novel comparative genomics analysis for common drug and vaccine targets in Corynebacterium pseudotuberculosis and other CMN group of human pathogens.

Authors:  Debmalya Barh; Neha Jain; Sandeep Tiwari; Bibhu Prasad Parida; Vivian D'Afonseca; Liwei Li; Amjad Ali; Anderson Rodrigues Santos; Luís Carlos Guimarães; Siomar de Castro Soares; Anderson Miyoshi; Atanu Bhattacharjee; Amarendra Narayan Misra; Artur Silva; Anil Kumar; Vasco Azevedo
Journal:  Chem Biol Drug Des       Date:  2011-05-25       Impact factor: 2.817

4.  Biofilm Lysine Decarboxylase, a New Therapeutic Target for Periodontal Inflammation.

Authors:  Zsolt Lohinai; Beata Keremi; Eva Szöko; Tamás Tábi; Csaba Szabo; Zsolt Tulassay; John C DiCesare; Carole A Davis; Lindsay M Collins; Martin Levine
Journal:  J Periodontol       Date:  2015-06-25       Impact factor: 6.993

5.  Selective tight binding inhibitors of trypanosomal glyceraldehyde-3-phosphate dehydrogenase via structure-based drug design.

Authors:  A M Aronov; C L Verlinde; W G Hol; M H Gelb
Journal:  J Med Chem       Date:  1998-11-19       Impact factor: 7.446

Review 6.  Genomic-scale prioritization of drug targets: the TDR Targets database.

Authors:  Fernán Agüero; Bissan Al-Lazikani; Martin Aslett; Matthew Berriman; Frederick S Buckner; Robert K Campbell; Santiago Carmona; Ian M Carruthers; A W Edith Chan; Feng Chen; Gregory J Crowther; Maria A Doyle; Christiane Hertz-Fowler; Andrew L Hopkins; Gregg McAllister; Solomon Nwaka; John P Overington; Arnab Pain; Gaia V Paolini; Ursula Pieper; Stuart A Ralph; Aaron Riechers; David S Roos; Andrej Sali; Dhanasekaran Shanmugam; Takashi Suzuki; Wesley C Van Voorhis; Christophe L M J Verlinde
Journal:  Nat Rev Drug Discov       Date:  2008-10-17       Impact factor: 84.694

7.  Comparative genomics allowed the identification of drug targets against human fungal pathogens.

Authors:  Ana Karina R Abadio; Erika S Kioshima; Marcus M Teixeira; Natalia F Martins; Bernard Maigret; Maria Sueli S Felipe
Journal:  BMC Genomics       Date:  2011-01-27       Impact factor: 3.969

8.  Proteome scale comparative modeling for conserved drug and vaccine targets identification in Corynebacterium pseudotuberculosis.

Authors:  Syed Shah Hassan; Sandeep Tiwari; Luís Carlos Guimarães; Syed Babar Jamal; Edson Folador; Neha Barve Sharma; Siomar de Castro Soares; Síntia Almeida; Amjad Ali; Arshad Islam; Fabiana Dias Póvoa; Vinicius Augusto Carvalho de Abreu; Neha Jain; Antaripa Bhattacharya; Lucky Juneja; Anderson Miyoshi; Artur Silva; Debmalya Barh; Adrian Gustavo Turjanski; Vasco Azevedo; Rafaela Salgado Ferreira
Journal:  BMC Genomics       Date:  2014-10-27       Impact factor: 3.969

9.  Comparative genomics analysis of Mycobacterium ulcerans for the identification of putative essential genes and therapeutic candidates.

Authors:  Azeem Mehmood Butt; Izza Nasrullah; Shifa Tahir; Yigang Tong
Journal:  PLoS One       Date:  2012-08-13       Impact factor: 3.240

10.  Identification and characterization of potential therapeutic candidates in emerging human pathogen Mycobacterium abscessus: a novel hierarchical in silico approach.

Authors:  Buvaneswari Shanmugham; Archana Pan
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

View more
  12 in total

1.  Putative vaccine candidates and drug targets identified by reverse vaccinology and subtractive genomics approaches to control Haemophilus ducreyi, the causative agent of chancroid.

Authors:  Alissa de Sarom; Arun Kumar Jaiswal; Sandeep Tiwari; Letícia de Castro Oliveira; Debmalya Barh; Vasco Azevedo; Carlo Jose Oliveira; Siomar de Castro Soares
Journal:  J R Soc Interface       Date:  2018-05       Impact factor: 4.118

2.  Reverse vaccinology and subtractive genomics reveal new therapeutic targets against Mycoplasma pneumoniae: a causative agent of pneumonia.

Authors:  Thaís Cristina Vilela Rodrigues; Arun Kumar Jaiswal; Alissa de Sarom; Letícia de Castro Oliveira; Carlo José Freire Oliveira; Preetam Ghosh; Sandeep Tiwari; Fábio Malcher Miranda; Leandro de Jesus Benevides; Vasco Ariston de Carvalho Azevedo; Siomar de Castro Soares
Journal:  R Soc Open Sci       Date:  2019-07-31       Impact factor: 2.963

3.  Network-Based Metabolism-Centered Screening of Potential Drug Targets in Klebsiella pneumoniae at Genome Scale.

Authors:  Müberra Fatma Cesur; Bushra Siraj; Reaz Uddin; Saliha Durmuş; Tunahan Çakır
Journal:  Front Cell Infect Microbiol       Date:  2020-01-14       Impact factor: 5.293

4.  In Silico Identification of New Targets for Diagnosis, Vaccine, and Drug Candidates against Trypanosoma cruzi.

Authors:  Rafael Obata Trevisan; Malú Mateus Santos; Chamberttan Souza Desidério; Leandro Gomes Alves; Thiago de Jesus Sousa; Letícia de Castro Oliveira; Arun Kumar Jaiswal; Sandeep Tiwari; Weslley Guimarães Bovi; Mariana de Oliveira-Silva; Juliana Cristina Costa-Madeira; Lúcio Roberto Cançado Castellano; Marcos Vinicius Silva; Vasco Azevedo; Virmondes Rodrigues Junior; Carlo José Freire Oliveira; Siomar de Castro Soares
Journal:  Dis Markers       Date:  2020-12-10       Impact factor: 3.434

5.  Reverse vaccinology and subtractive genomics approaches for identifying common therapeutics against Mycobacterium leprae and Mycobacterium lepromatosis.

Authors:  Arun Kumar Jaiswal; Sandeep Tiwari; Syed Babar Jamal; Letícia de Castro Oliveira; Helioswilton Sales-Campos; Leonardo Eurípedes Andrade-Silva; Carlo Jose Freire Oliveira; Preetam Ghosh; Debmalya Barh; Vasco Azevedo; Siomar C Soares; Virmondes Rodrigues Rodrigues; Marcos Vinicius da Silva
Journal:  J Venom Anim Toxins Incl Trop Dis       Date:  2021-04-09

6.  Genome-Based Drug Target Identification in Human Pathogen Streptococcus gallolyticus.

Authors:  Nosheen Afzal Qureshi; Syeda Marriam Bakhtiar; Muhammad Faheem; Mohibullah Shah; Ahmed Bari; Hafiz M Mahmood; Muhammad Sohaib; Ramzi A Mothana; Riaz Ullah; Syed Babar Jamal
Journal:  Front Genet       Date:  2021-03-25       Impact factor: 4.599

7.  Proteome based mapping and molecular docking revealed DnaA as a potential drug target against Shigella sonnei.

Authors:  Farah Shahid; Youssef Saeed Alghamdi; Mutaib Mashraqi; Mohsin Khurshid; Usman Ali Ashfaq
Journal:  Saudi J Biol Sci       Date:  2021-10-01       Impact factor: 4.219

8.  The Druggable Pocketome of Corynebacterium diphtheriae: A New Approach for in silico Putative Druggable Targets.

Authors:  Syed S Hassan; Syed B Jamal; Leandro G Radusky; Sandeep Tiwari; Asad Ullah; Javed Ali; Paulo V S D de Carvalho; Rida Shams; Sabir Khan; Henrique C P Figueiredo; Debmalya Barh; Preetam Ghosh; Artur Silva; Jan Baumbach; Richard Röttger; Adrián G Turjanski; Vasco A C Azevedo
Journal:  Front Genet       Date:  2018-02-13       Impact factor: 4.599

9.  Identification of novel drug targets in bovine respiratory disease: an essential step in applying biotechnologic techniques to develop more effective therapeutic treatments.

Authors:  Meena Kishore Sakharkar; Karthic Rajamanickam; Ramesh Chandra; Haseeb A Khan; Abdullah S Alhomida; Jian Yang
Journal:  Drug Des Devel Ther       Date:  2018-05-07       Impact factor: 4.162

10.  An Integrated Database of Small RNAs and Their Interplay With Transcriptional Gene Regulatory Networks in Corynebacteria.

Authors:  Mariana Teixeira Dornelles Parise; Doglas Parise; Flavia Figueira Aburjaile; Anne Cybelle Pinto Gomide; Rodrigo Bentes Kato; Martin Raden; Rolf Backofen; Vasco Ariston de Carvalho Azevedo; Jan Baumbach
Journal:  Front Microbiol       Date:  2021-06-17       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.