| Literature DB >> 28324524 |
Mohd Shahbaaz1, Faizan Ahmad2, Md Imtaiyaz Hassan3.
Abstract
Haemophilus influenzae, a Gram-negative bacterium and a member of the family Pasteurellaceae, causes chronic bronchitis, bacteremia, meningitis, etc. The H. influenzae is the first organism whose genome was completely sequenced and annotated. Here, we have extensively analyzed the genome of H. influenzae using available proteins structure and function analysis tools. The objective of this analysis is to assign a precise function to hypothetical proteins (HPs) whose functions are not determined so far. Function prediction of these proteins is helpful in precise understanding of mechanisms of pathogenesis and biochemical pathways important for selecting novel therapeutic target. After an extensive analysis of H. Influenzae genome we have found 13 HPs showing high level of sequence and structural similarity to the enzyme isomerase. Consequently, the structures of HPs have been modeled and analyzed to determine their precise functions. We found these HPs are alanine racemase, lysine 2, 3-aminomutase, topoisomerase DNA-binding C4 zinc finger, pseudouridine synthase B, C and E (Rlu B, C and E), hydroxypyruvate isomerase, nucleoside-diphosphate-sugar epimerase, amidophosphoribosyltransferase, aldose-1-epimerase, tautomerase/MIF, Xylose isomerase-like, have TIM barrel domain and sedoheptulose-7-phosphate isomerase like activity, signifying their corresponding functions in the H. influenzae. This work provides a better understanding of the role HPs with isomerase activities in the survival and pathogenesis of H. influenzae.Entities:
Keywords: Haemophilus influenza; Hypothetical protein; Isomerase activity; Structure analysis; Structure prediction; Structure–function relationship
Year: 2014 PMID: 28324524 PMCID: PMC4569619 DOI: 10.1007/s13205-014-0274-1
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
List of HPs with isomerase activity from H. influenzae Rd KW
| S.no | Accession no | Gene id | Protein product | Uniprot id | Protein name |
|---|---|---|---|---|---|
| 1. | NC_000907.1 | 950992 | NP_438263.1 | P44506 | HP HI0090 |
| 2. | NC_000907.1 | 949459 | NP_438493.1 | P44641 | HP HI0329 |
| 3. | NC_000907.1 | 949423 | NP_438817.1 | P46494 | HP HI0656.1 |
| 4. | NC_000907.1 | 950211 | NP_438854.1 | P44827 | HP HI0694 |
| 5. | NC_000907.1 | 950733 | NP_439174.1 | Q57151 | HP HI1013 |
| 6. | NC_000907.1 | 950006 | NP_439175.1 | P44094 | HP HI1014 |
| 7. | NC_000907.1 | 950150 | NP_439355.1 | P45104 | HP HI1199 |
| 8. | NC_000907.1 | 950157 | NP_439364.1 | P71373 | HP HI1208 |
| 9. | NC_000907.1 | 950209 | NP_439468.1 | P44160 | HP HI1317 |
| 10. | NC_000907.1 | 950703 | NP_439541.1 | O86237 | HP HI1388.1 |
| 11. | NC_000907.1 | 950784 | NP_439587.1 | Q57152 | HP HI1436 |
| 12. | NC_000907.1 | 950455 | NP_439742.1 | P44268 | HP HI1600 |
| 13. | NC_000907.1 | 950796 | NP_439799.1 | P52606 | HP HI1657 |
List of sequence-based predicted function of HPs with isomerase activity and Motif discovered using MEME of H. influenzae strain Rd KW20
| S.NO | Cluster# | UNIPROT ID | MEME results | Consensusa function | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Motif 1 | Motif 2 | Motif 3 | MAST function prediction | |||||||
| Start | Site | Start | Site | Start | Site | |||||
| 1. | Cluster 102 | P44506 | 151 | ENLPHLCLRGLM | 209 | PSAIKCGSTMV | 76 | EWHFIG | UPF0001 protein | Alanine racemase |
| 2. | Cluster 170 | P44641 | 120 | GCAVNC | 236 | IFAHAM | 88 | GFSTDP |
| lysine 2,3-aminomutase |
| 3. | Cluster 152 | P46494 | 76 | FGMFIGCSHYPECDFVV | 1 | MNQSLFHH | 115 | RRGRQGKIFY | No result | Topoisomerase DNA-binding C4 zinc finger |
| 4. | Cluster 80 | P44827 | 84 | VYAAGRLDRDSEGLLILTNNGELQHRLADPKFKTEKTYWVQVEGI | 51 | TKVVLFNKPFDVLTQFTDEQGRATLKD | 178 | WLEIKISEGRNRQVRRMTAHIGFP | Ribosomal large subunit pseudouridine synthase E | ribosomal large subunit pseudouridine synthase E |
| 5. | Cluster 128 | Q57151 | 99 | CPNVHIM | 71 | WGGSAI | 178 | DYFHAQ | Putative hydroxypyruvate isomerase | hydroxypyruvate isomerase |
| 6. | Cluster 162 | P44094 | 149 | MCELLINDYSRKGFVDGIVVRLPTICIRPGKPNKAASSFVSSIMREPLHG | 55 | CPVSEE | 291 | QALALGFKV | No result | Nucleoside-diphosphate-sugar epimerase |
| 7. | Cluster 80 | P45104 | 176 | WIAVGRLDINTSGLLLFTTDGELANRLMHPSREVEREYSVRVFGQ | 140 | CRVLMYYKPEGELCTRSDPEGRATVFD | 256 | WYDVTLMEGRNREVRRLWESQGIQ | Ribosomal large subunit pseudouridine synthase B | ribosomal large subunit pseudouridine synthase B |
| 8. | Cluster 113 | P71373 | 209 | DHSECRGAFNFAAPKSIKQH | 284 | DCENYL | 268 | VVPEKLLNAGFQFQY | Epimerase family protein HI_1208 | Amidophosphoribosyltransferase (Epimerase) |
| 9. | Cluster 38 | P44160 | 86 | QPAHGT | 75 | PICYPW | 29 | CGWNTKNFPC | Putative glucose-6-phosphate 1-epimerase | Aldose 1-epimerase |
| 10. | Cluster 38 | O86237 | 104 | QPAHCW | 48 | DFYYPF | 34 | KGKHAIRFLC | No result | Tautomerase/MIF |
| 11. | Cluster 114 | Q57152 | 51 | WVFIPRM | 72 | AISPYI | 38 | FSIDTM | No result | RNA pseudouridine synthase C |
| 12. | Cluster 196 | P44268 | 249 | KGTVWD | 99 | CECEGH | 35 | ENWSKM | No result | Xylose isomerase-like, TIM barrel domain |
| 13. | Cluster 141 | P52606 | 97 | ELYCHQ | 32 | QMVMQC | 1 | MLQKVK | No result | Sedoheptulose 7-phosphate isomerase |
aConsensus result form on the basis of values present in Table S3 and S4
List of structure-based predicted function and validation of HP with isomerase activity in H. influenzae strain Rd KW20
| S. no. | Uniprot id | Template | Identity (%) | RMSD | TM score | Ramachandran plot | Proposed function |
|---|---|---|---|---|---|---|---|
| 1. | P44506 | PLP-binding protein, 1W8G | 61 | 0.223 | 0.6229 | 99.5 % (94.4 % core 5.1 % allow 0.0 % gener 0.5 % disall) | Alanine racemase |
| Pyridoxal phosphate-binding protein, 3SY1 | 57 | 0.243 | 0.5130 | ||||
| Alanine racemase, 4A3Q (HHpred result) | N/A | 3.997 | 0.2651 | ||||
| 2. | P44641 | Lysine-2,3-aminomutase, 2A5H | 34 | 0.241 | 0.3718 | 99 % (90.8 % core 8.2 % allow 0.0 % gener 1.0 % disall) | Lysine 2,3-aminomutase |
| 3. | P46494 (Robetta server) | Rosetta de novo protocol (no template used) | N/A | N/A | N/A | 100 % (86.7 % core 13.3 % allow 0.0 % gener 0.0 % disall) | DNA Topoisomerase,type IA,Zn finger |
| 4. | P44827 | pseudouridine synthase Rlu E, 2OLW | 66 | 0.233 | 0.73306 | 98.1 % (87.3 % core 9.9 % allow 0.9 % gener 1.9 % disall) | Pseudouridine synthase Rlu E |
| pseudouridine synthase Rlu E, 2OML | 66 | 0.604 | 0.74206 | ||||
| Ribosomal small subunit pseudouridine synthase A, 1KSK | 31 | 1.662 | 0.71557 | ||||
| 5. | Q57151 | AP endonuclease, family 2, 3NGF | 54 | 0.206 | 0.8329 | 99.6 % (92.4 % core 7.1 % allow 0.0 % gener 0.4 % disall) | hydroxypyruvate isomerase/D-tagatose 3-epimerase |
| Putative Oxygenase, 1K77 | 60 | 0.751 | 0.9661 | ||||
| L-ribulose 3-epimerase, 3VYL | 26 | 1.759 | 0.4682 | ||||
| 6. | P44094 | Nucleoside-diphosphate-sugar epimerase, 2HRZ | 41 | 0.194 | 0.1688 | 99.6 % (92.8 % core 6.1 % allow 0.7 % gener 0.4 % disall) | Nucleoside-diphosphate-sugar epimerase |
| 7. | P45104 (ITASSER) | Ribosomal large subunit pseudouridine synthase F, 3DH3 | 29 | 0.657 | 0.66428 | 97.8 % (81.9 % core 12.4 % allow 3.5 % gener 2.2 % disall) | Ribosomal large subunit pseudouridine synthase F |
| 8. | P71373 | Nucleoside-diphosphate sugar epimerase (SulA family), 3OH8 | 33 | 0.655 | 0.84610 | 99.6 % (93.2 % core 5.7 % allow 0.8 % gener 0.4 % disall) | Nucleoside-diphosphate sugar epimerase |
| Epimerase family protein SDR39U1, 4B4O | 32 | 0.572 | 0.92240 | ||||
| 9. | P44160 | Crystal Structure Analysis of HI1317, 1JOV | 93 | 0.227 | 0.97926 | 99.6 % (88.8 % core 9.5 % allow 1.2 % gener 0.4 % disall) | Galactose mutarotase (Aldose 1-epimerase) |
| putative mutarotase (YeaD), 2HTA | 33 | 0.640 | 0.94691 | ||||
| Hexose-6-phosphate mutarotase, 2CIR | 26 | 1.583 | 0.84898 | ||||
| 10. | Q57152 (ITASSER) | Solution NMR Structure of protein YqcC, 2HGK | 47 | 0.876 | 0.97784 | 96.9 % (85.4 % core 10.4 % allow 1.0 % gener 3.1 % disall) | Beta-fructofuranosidase/invertase inhibitor |
| 11. | P44268 | Crystal structure of a DUF692 family protein, 3BWW | 73 | 0.478 | 0.79170 | 98.9 % (86.5 % core 11.3 % allow 1.1 % gener 1.1 % disall) | Xylose isomerase-like |
| L-ribulose 3-epimerase, 3VYL | 50 | 3.912 | 0.73310 | ||||
| 12. | P52606 | Crystal structure of Escherichia coli DiaA, 2YVA | 35 | 0.364 | 0.98496 | 99.4 % (92.7 % core 5.6 % allow 1.1 % gener 0.6 % disall) | Sedoheptulose 7-phosphate isomerase |
| Phosphoheptose isomerase, 1X92 | 32 | 0.341 | 0.98528 | ||||
| Phosphoheptose isomerase, 3BJZ | 32 | 0.447 | 0.87983 | ||||
| Phosphoheptose isomerase 1, 1TK9 | 27 | 0.495 | 0.92181 | ||||
| Phosphoheptose Isomerase, 2I2 W | 28 | 0.521 | 0.86321 |
Fig. 1Representation of model structure of HP P44506. a Showing characteristic TIM barrel domain. b Secondary structure prediction of HPs using their three-dimensional structural framework by STRIDE, where α-helix, β-strands, loops, 310 helix and β-bridges are represented in red, green, yellow, blue and pink, respectively (this illustration is applicable for all figures). c Residues present in the active site pocket are illustrated in stick
Fig. 2Representation of model structure of HP P44641. a Overall structure is represented in cartoon. b Secondary structure. c Predicted active site residues shown in stick
Fig. 3Representation of model structure of HP P46494. a Cartoon model showing overall structure with non-bonded sphere describing the membrane. b Secondary structure of P46494. c A detailed description of P46494 active site
Fig. 4Representation of model structure of HP P44827. a Cartoon diagram showing mixed alpha/beta fold. Collection of non-bonded spheres represents membrane. b Secondary structure evaluation using predicted three-dimensional structure. c Stick representation of P44827 active site, with Asp91 residue is proposed to be involve in nucleic acid binding
Fig. 5Representation of model structure of HP Q57151. a Cartoon model showing overall topology described by predicted structure. b Description of secondary structure prediction in the HP. c A detailed description of active site
Fig. 6Representation of model structure of HP P44094. a Overall structure of P44094 shown in cartoon model with membrane is represented as non-bonded spheres. b Secondary structure of HP P44094. c Representation of the active site residues of P44094 in stick model
Fig. 7Representation of model structure of HP P45104. a Three-dimensional structure represented in cartoon model with membrane represented as non-bonded spheres. b Representation of secondary structure predicted using STRIDE. c Representation of the active site residues of HP P45104 in stick model
Fig. 8Representation of model structure of HP P71373. a Cartoon model representation of overall structure in which membrane is represented by non-bonded atoms. b Predicted secondary structure. c Representation of the active site residues of HP P71373 in stick model
Fig. 9Representation of model structure of HP P44160. a Showing characteristic β-sandwich topology. b Detailed description of secondary structure using STRIDE. c The active site pocket is illustrated in stick representation
Fig. 10Representation of model structure of HP O86237. a Overall structure is represented in cartoon form. b Secondary structure of HP O86237. c Detailed description of active site of O86237
Fig. 11Representation of model structure of HP Q57152. a Describing all alpha helix topology in carton form. b Secondary structure prediction and assignment of STRIDE. c Description of active site residues in stick form
Fig. 12Representation of model structure of HP P44268. a Showing characteristic TIM barrel topology. b Secondary structure prediction of HP P44268. c Representation of active site residues in stick
Fig. 13Representation of model structure of HP P52606. a Illustration of beta sandwich topology of P52606. b Predicted Secondary structural elements in P52606. c Stick representation of active site residues