| Literature DB >> 26677339 |
Shakhinur Islam Mondal1, Sabiha Ferdous2, Nurnabi Azad Jewel2, Arzuba Akter3, Zabed Mahmud2, Md Muzahidul Islam2, Tanzila Afrin4, Nurul Karim5.
Abstract
Bacterial enteric infections resulting in diarrhea, dysentery, or enteric fever constitute a huge public health problem, with more than a billion episodes of disease annually in developing and developed countries. In this study, the deadly agent of hemorrhagic diarrhea and hemolytic uremic syndrome, Escherichia coli O157:H7 was investigated with extensive computational approaches aimed at identifying novel and broad-spectrum antibiotic targets. A systematic in silico workflow consisting of comparative genomics, metabolic pathways analysis, and additional drug prioritizing parameters was used to identify novel drug targets that were essential for the pathogen's survival but absent in its human host. Comparative genomic analysis of Kyoto Encyclopedia of Genes and Genomes annotated metabolic pathways identified 350 putative target proteins in E. coli O157:H7 which showed no similarity to human proteins. Further bio-informatic approaches including prediction of subcellular localization, calculation of molecular weight, and web-based investigation of 3D structural characteristics greatly aided in filtering the potential drug targets from 350 to 120. Ultimately, 44 non-homologous essential proteins of E. coli O157:H7 were prioritized and proved to have the eligibility to become novel broad-spectrum antibiotic targets and DNA polymerase III alpha (dnaE) was the top-ranked among these targets. Moreover, druggability of each of the identified drug targets was evaluated by the DrugBank database. In addition, 3D structure of the dnaE was modeled and explored further for in silico docking with ligands having potential druggability. Finally, we confirmed that the compounds N-coeleneterazine and N-(1,4-dihydro-5H-tetrazol-5-ylidene)-9-oxo-9H-xanthene-2-sulfon-amide were the most suitable ligands of dnaE and hence proposed as the potential inhibitors of this target protein. The results of this study could facilitate the discovery and release of new and effective drugs against E. coli O157:H7 and other deadly human bacterial pathogens.Entities:
Keywords: DNA polymerase III alpha; E. coli O157:H7; KEGG metabolic pathways; homology modeling; novel and broad-spectrum antibiotic targets
Year: 2015 PMID: 26677339 PMCID: PMC4677596 DOI: 10.2147/AABC.S88522
Source DB: PubMed Journal: Adv Appl Bioinform Chem ISSN: 1178-6949
Figure 1A schematic representation of the workflow of computational drug target identification and prediction of putative inhibitors of the selected target.
Abbreviations: BLASTP, Protein Basic Local Alignment Search Tool; TTD, Therapeutic Target Database.
Host–pathogen common and pathogen-specific pathways from KEGG database
| Pathway IDs | Pathway names | Pathway IDs | Pathway names |
|---|---|---|---|
| ecs00010 | Glycolysis/Gluconeogenesis | ecs00860 | Porphyrin and chlorophyll metabolism |
| ecs00020 | Citrate cycle (TCA cycle) | ecs00900 | Terpenoid backbone biosynthesis |
| ecs00030 | Pentose phosphate pathway | ecs00920 | Sulfur metabolism |
| ecs00040 | Pentose and glucuronate interconversions | ecs00970 | Aminoacyl-tRNA biosynthesis |
| ecs00051 | Fructose and mannose metabolism | ecs01040 | Biosynthesis of unsaturated fatty acids |
| ecs00052 | Galactose metabolism | ecs02010 | ABC transporters |
| ecs00053 | Ascorbate and aldarate metabolism | ecs03010 | Ribosome |
| ecs00061 | Fatty acid biosynthesis | ecs03018 | RNA degradation |
| ecs00071 | Fatty acid metabolism | ecs03020 | RNA polymerase |
| ecs00130 | Ubiquinone and other terpenoid-quinone biosynthesis | ecs03030 | DNA replication |
| ecs00190 | Oxidative phosphorylation | ecs03060 | Protein export |
| ecs00230 | Purine metabolism | ecs03410 | Base excision repair |
| ecs00240 | Pyrimidine metabolism | ecs03420 | Nucleotide excision repair |
| ecs00250 | Alanine, aspartate and glutamate metabolism | ecs03430 | Mismatch repair |
| ecs00260 | Glycine, serine and threonine metabolism | ecs03440 | Homologous recombination |
| ecs00270 | Cysteine and methionine metabolism | ecs04122 | Sulfur relay system |
| ecs00280 | Valine, leucine and isoleucine degradation | ecs00561 | Glycerolipid metabolism |
| ecs00290 | Valine, leucine and isoleucine biosynthesis | ecs00562 | Inositol phosphate metabolism |
| ecs00300 | Lysine biosynthesis | ecs00564 | Glycerophospholipid metabolism |
| ecs00310 | Lysine degradation | ecs00590 | Arachidonic acid metabolism |
| ecs00330 | Arginine and proline metabolism | ecs00592 | alpha-Linolenic acid metabolism |
| ecs00340 | Histidine metabolism | ecs00600 | Sphingolipid metabolism |
| ecs00350 | Tyrosine metabolism | ecs00620 | Pyruvate metabolism |
| ecs00360 | Phenylalanine metabolism | ecs00630 | Glyoxylate and dicarboxylate metabolism |
| ecs00380 | Tryptophan metabolism | ecs00640 | Propanoate metabolism |
| ecs00400 | Phenylalanine, tyrosine and tryptophan biosynthesis | ecs00650 | Butanoate metabolism |
| ecs00410 | beta-Alanine metabolism | ecs00670 | One carbon pool by folate |
| ecs00430 | Taurine and hypotaurine metabolism | ecs00730 | Thiamine metabolism |
| ecs00450 | Selenocompound metabolism | ecs00740 | Riboflavin metabolism |
| ecs00460 | Cyanoamino acid metabolism | ecs00750 | Vitamin B6 metabolism |
| ecs00471 | D-Glutamine and D-glutamate metabolism | ecs00760 | Nicotinate and nicotinamide metabolism |
| ecs00480 | Glutathione metabolism | ecs00770 | Pantothenate and CoA biosynthesis |
| ecs00500 | Starch and sucrose metabolism | ecs00780 | Biotin metabolism |
| ecs00511 | Other glycan degradation | ecs00785 | Lipoic acid metabolism |
| ecs00520 | Amino sugar and nucleotide sugar metabolism | ecs00790 | Folate biosynthesis |
| ecs00281 | Geraniol degradation | ecs00627 | Aminobenzoate degradation |
| ecs00361 | Chlorocyclohexane and chlorobenzene degradation | ecs00633 | Nitrotoluene degradation |
| ecs00362 | Benzoate degradation | ecs00642 | Ethylbenzene degradation |
| ecs00363 | Bisphenol degradation | ecs00660 | C5-Branched dibasic acid metabolism |
| ecs00364 | Fluorobenzoate degradation | ecs00680 | Methane metabolism |
| ecs00401 | Novobiocin biosynthesis | ecs00903 | Limonene and pinene degradation |
| ecs00440 | Phosphonate and phosphinate metabolism | ecs00910 | Nitrogen metabolism |
| ecs00473 | D-Alanine metabolism | ecs00930 | Caprolactam degradation |
| ecs00521 | Streptomycin biosynthesis | ecs01053 | Biosynthesis of siderophore group non-ribosomal peptides |
| ecs00523 | Polyketide sugar unit biosynthesis | ecs01110 | Biosynthesis of secondary metabolites |
| ecs00540 | Lipopolysaccharide biosynthesis | ecs01120 | Microbial metabolism in diverse environments |
| ecs00550 | Peptidoglycan biosynthesis | ecs02020 | Two-component system |
| ecs00621 | Dioxin degradation | ecs02030 | Bacterial chemotaxis |
| ecs00622 | Xylene degradation | ecs02040 | Flagellar assembly |
| ecs00623 | Toluene degradation | ecs02060 | Phosphotransferase system |
| ecs00624 | Polycyclic aromatic hydrocarbon degradation | ecs03070 | Bacterial secretion system |
| ecs00625 | Chloroalkane and chloroalkene degradation | ecs05130 | Pathogenic |
| ecs00626 | Naphthalene degradation | ||
Abbreviation: KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 2Comparative subcellular localization of proteins from the common host-pathogen pathways and pathogen-specific pathways.
Figure 3Percentage distribution of novel drug targets involved in different metabolic pathways or biological processes.
Figure 4Homology modeled structure of the Escherichia coli O157:H7 Sakai strand DNA polymerase III alpha, modeled with ESyPred3D server.
Figure 5Structure validation and energy minimization.
Notes: (A) Result of PROCHECK verification program, showing number and percentages of residues in most favored regions (red); additional allowed regions (yellow); generously allowed regions (creamy white); and in disallowed regions (white). Based on an analysis of 118 structures of resolution of at least 2.0 angstroms and R-factor no greater than 20%, a good quality model would be expected to have over 90% in the most favored regions. (B) Result of the 3D structure verification tool ANOLEA. This figure shows residues in favorable energy environment (green) and residues in unfavorable energy (red).
Abbreviation: ANOLEA, atomic non-local environment assessment.
Figure 6Active site residues (shown in green) of the Escherichia coli O157:H7 Sakai strand DNA polymerase III alpha.
Note: Figure prepared by CASTp server.
Lowest docking energies and important residues of the binding site observed to be interactive with the ligands from BindingDB database
| No | Compounds from BindingDB | Important amino acid residues involved | Docking energy (Kcal/mol) |
|---|---|---|---|
| 1 | ZINC 5117079 | SER365, PHE392, ARG391, ARG397, MET400, ASP402, ASP404, ARG711 | −8.8 |
| 2 | CID 9809878 | SER365, ARG391, ARG711, PHE392, ASP402, ASP404, GLY364, MET400, ARG397, SER545, GLY56, VAL52, LYS30, LYS53, ALA57, TYR549 | −8.7 |
| 3 | ZINC 28356629 | SER365, PHE392, ARG391, ARG397, MET400, ASP402, ASP404, ARG711, PHE757, ASN758, HIS761 | −8.5 |
Figure 7Three-dimensional representation.
Notes: Three-dimensional representation of the interactive residues on the binding site of the protein when it interacts with (A) active inhibitors (ligands) respectively with CID 9809878; ZINC 5117079 and ZINC 28356629 and (B) top binding affinity molecules DB04118 and DB04698. The color indicator on the left side shows the types of interaction of particular residues.
Lowest docking energies, important residues of the binding site observed to be interactive with the ligands from DrugBank, percentage of human intestinal absorption and plasma protein binding, Caco-2 cell permeability and carcinogenicity in rats
| No | DrugBank compounds | Important amino acid residues involved in interactions | Docking energy (Kcal/mol) | Human intestinal absorption % | Plasma protein binding % | Caco-2 cell permeability (nm/second) | Carcinogenicity (Rats) |
|---|---|---|---|---|---|---|---|
| 1 | DB04118 | LYS33, SER365, ARG391, ARG397, VAL398, ASP402, MET400, GLU548, TYR549, SER545 | −9.8 | 95.36 | 99.28 | 17.4462 | Negative |
| 2 | DB04698 | SER365, PHE392, ARG391, ARG397, MET400, ASP402, GLU548, TYR549, LYS554, ARG711 | −9.9 | 88.10 | 100.00 | 0.362186 | Positive |
Figure 8Structure of top hit compounds by in silico screening.
Notes: (A) DB04118 (N-Coeleneterazine), (B) DB04698 (N-(1,4-Dihydro-5H-tetrazol-5-ylidene)-9-oxo-9H-xanthene-2-sulfonamide).