| Literature DB >> 28324295 |
Mohd Shahbaaz1, Faizan Ahmad2, Md Imtaiyaz Hassan3.
Abstract
Haemophilus influenzae is a small pleomorphic Gram-negative bacteria which causes several chronic diseases, including bacteremia, meningitis, cellulitis, epiglottitis, septic arthritis, pneumonia, and empyema. Here we extensively analyzed the sequenced genome of H. influenzae strain Rd KW20 using protein family databases, protein structure prediction, pathways and genome context methods to assign a precise function to proteins whose functions are unknown. These proteins are termed as hypothetical proteins (HPs), for which no experimental information is available. Function prediction of these proteins would surely be supportive to precisely understand the biochemical pathways and mechanism of pathogenesis of Haemophilus influenzae. During the extensive analysis of H. influenzae genome, we found the presence of eight HPs showing lyase activity. Subsequently, we modeled and analyzed three-dimensional structure of all these HPs to determine their functions more precisely. We found these HPs possess cystathionine-β-synthase, cyclase, carboxymuconolactone decarboxylase, pseudouridine synthase A and C, D-tagatose-1,6-bisphosphate aldolase and aminodeoxychorismate lyase-like features, indicating their corresponding functions in the H. influenzae. Lyases are actively involved in the regulation of biosynthesis of various hormones, metabolic pathways, signal transduction, and DNA repair. Lyases are also considered as a key player for various biological processes. These enzymes are critically essential for the survival and pathogenesis of H. influenzae and, therefore, these enzymes may be considered as a potential target for structure-based rational drug design. Our structure-function relationship analysis will be useful to search and design potential lead molecules based on the structure of these lyases, for drug design and discovery.Entities:
Keywords: Function prediction; Gene annotation; Haemophilus influenzae; Hypothetical protein; Sequence analysis; Structure analysis; Structure–function relationship
Year: 2014 PMID: 28324295 PMCID: PMC4434415 DOI: 10.1007/s13205-014-0231-z
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
List of hypothetical protein with lyase activity in H. influenzae strain Rd KW20
| S. no. | Accession no | Gene ID | Protein product | Uniprot ID | Protein name |
|---|---|---|---|---|---|
| 1 | NC_000907.1 | 949660 | NP_438613.1 | P44717 | HP HI0452 |
| 2 | NC_000907.1 | 950684 | NP_438775.1 | P44782 | HP HI0617 |
| 3 | NC_000907.1 | 950339 | NP_439586.2 | P44197 | HP HI1435 |
| 4 | NC_000907.1 | 950454 | NP_439740.1 | P45267 | HP HI1598 |
| 5 | NC_000907.1 | 950030 | NP_439212.1 | Q57498 | HP HI1053 |
| 6 | NC_000907.1 | 949991 | NP_439177.1 | P44095 | HP HI1016 |
| 7 | NC_000907.1 | 950004 | NP_439172.1 | P44093 | HP HI1011 |
| 8 | NC_000907.1 | 950653 | NP_438618.1 | P44720 | HP HI0457 |
List of sequence-based predicted function of HPs with lyase activity and motif discovered using MEME of H. influenzae strain Rd KW20
| S. no. | Clustera | Uniprot ID | MEME results | Consensusb function | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Motif 1 | Motif 2 | Motif 3 | MAST function prediction | |||||||
| Start | Site | Start | Site | Start | Site | |||||
| 1 | Cluster 118 | P44717 | 262 | GYIESH | 199 | TQEHYL | 405 | YGKYKF | UPF0053 protein | Cystathionine-beta-synthase |
| 2 | Cluster 187 | P44782 | 152 | VKLTPVTGRSHQLRLHMLALGHPILGDKFY | 56 | FCEPAHRLDMATSGIIVFALSKAADRELKRQFREREPKKHYQAIVWGH | 18 | YQDNHLCVVNKPSG | Ribosomal large subunit pseudouridine synthase A | RNA pseudouridylate synthase rluA (lyase) |
| 3 | Cluster 187 | P44197 | 152 | VKLIPHTGRKHXLRXHMKHVFHPIXGDTQY | 46 | HVFPIHRLDRPTSGVLLFALSSEIANLMCEQFEQKYVQKSYLAVVRGY | 6 | YQDGFLVAVNKPAG | tRNA pseudouridine synthase C | Ribosomal large subunit pseudouridine synthase C rluC |
| 4 | Cluster 114 | P45267 | 335 | RVYFERM | 14 | AISPQI | 253 | FSQDFM | No result | CYTH-like adenylate cyclase |
| 5 | Cluster 196 | Q57498 | 1 | MFTDWK | 59 | TRCESC | 21 | KQYPKM | No result | Carboxymuconolactone decarboxylase |
| 6 | Cluster 131 | P44095 | 128 | GIWFPCTW | 21 | PNHCGTHM | 11 | TPFSSF | No result | Cyclase family protein |
| 7 | Cluster 45 | P44093 | 68 | WLKENGCTQFYFKYCST | 281 | IHNENYIE | 151 | NLMRLM | No result | D-tagatose-1,6-bisphosphate aldolase |
| 8 | Cluster 89 | P44720 | 315 | DGSGGH | 123 | WRKDLENAPH | 338 | RWYRSQ | UPF0755 protein | Aminodeoxychorismate lyase |
aCLUSS predicted clusters
bConsensus result form on the basis of values present in supplementary Table S2 & S3
List of structure-based predicted function and validation of HP with lyase activity in H. influenzae strain Rd KW20
| S. no. | Uniprot ID | Template | Identity (%) | RMSD | Ramachandran | Proposed function |
|---|---|---|---|---|---|---|
| 1 | P44717 (robetta model) | Magnesium and cobalt efflux protein CorC, 4HG0 CorC/HlyC transporter-associated domain of a CBS domain protein, 2PLS | 23 60 | 3.400 0.598 | 99 % (91.7 % core 6.5 % allow 0.8 % gener 1.0 % disall) | CBS domain associated CorC/HlyC transporter |
| 2 | P44782 (sparksx model) | Template-pseudouridine synthase RluA, 2I82 | 60 | 0.174 | 99.5 % (91.1 % core 8.4 % allow 0.0 % gener 0.5 % disall) | Pseudouridine synthase RluA |
| 3 | P44197 (Modeller structure) | Pseudouridine synthase RluC, 1XPI Pseudouridine synthase RluC, 1V9K | 34 33 | 0.379 0.334 | 99.5 % (88.2 % core 8.8 % allow 2.5 % gener 0.5 % disall) | Pseudouridine synthase RluC |
| 4 | P45267 (Sparksx model) | Putative adenylate cyclase, 2GFG | 39 | 0.918 | 98.8 % (88.9 % core 9.6 % allow 0.3 % gener 1.2 % disall) | Adenylate cyclase |
| 5 | Q57498 (Sparksx model) | Template—Carboxymuconolactone decarboxylase family protein, 1VKE | 31 | 0.300 | 99 % (94.2 % core 2.9 % allow 1.9 % gener 1.0 % disall) | Carboxymuconolactone decarboxylase |
| 6 | P44095 (I-TASSERmodel) | Manganese dependent isatin hydrolase, 4J0N (A) Metal-dependent hydrolase with cyclase activity, 1R61 | 54 27 | 2.752 1.139 | 98.3 % (88.4 % core 9.1 % allow 0.8 % gener 1.7 % disall) | Metal-dependent hydrolase with cyclase activity |
| 7 | P44093 (Raptorx model) | Putative tRNA synthase, 3DQQ | 56 | 0.215 | 99.7 % (93.1 % core 6.6 % allow 0.0 % gener 0.3 % disall) | Putative tRNA synthase |
| 8 | P44720 (Modeller) | Predicted aminodeoxychorismate lyase protein, 2R1F | 49 | 0.378 | 99.7 % (89.8 % core 8.3 % allow 1.6 % gener 0.3 % disall) | Aminodeoxychorismate lyase protein |
Fig. 1Representation of model structure of HP P44717. a Showing CBS, CorC_HlyC and cytochrome c oxidase domains. b Diagram showing the monomer of CBSX with CBS1 and CBS2 domain. c Residues present in the active site pocket are illustrated in stick
Fig. 2Representation of model structure of HP P44782. a Overall structure is represented in cartoon. b Superposition of the RluA–RNA complex (PDB ID: 2I82; red) with free RluA (P44782; dark green). The forefinger and thumb loops that “grasp” the RNA are indicated with motif III loop, and contrast the absence of the thumb loop in P44782 structure. c Multiple sequence alignment of P44782 and Rlu A (PDB ID: 2I82) with red color indicates residues that are conserved in two proteins. Motifs I, II, and III are as defined by Koonin (1996) highlighted in blue boxes and conserved active site residues in black boxes. d Predicted active site residues shown in stick
Fig. 3Representation of model structure of HP P44197. a Cartoon model showing overall structure. b Multiple sequence alignment showing conserved residues in red color, with a catalytic domain of P44197, Rlu C and E. coli Rlu A (PDB ID: 2I82) distinguished residues in black box and motifs I and II in blue box. c A detailed description of P44197 active site in which Asp54 is surrounded by arginine residues from both the sides namely Arg52 and Arg164
Fig. 4Representation of model structure of HP P45267. a Cartoon diagram showing N-terminal part is shown in blue while C-terminal in red. Collection of non-bonded spheres represents membrane. b Stick representation of P45267 active site with Glu10 residue is proposed to be involve in cAMP binding
Fig. 5Representation of model structure of HP Q57498. a Cartoon model showing overall structure with active site residues. b A description of active site with His68 residue represents the citrate-binding site
Fig. 6Representation of model structure of HP P44095. a Overall structure of P44095 shown in cartoon model with N-terminal is shown in blue and C-terminal in red. b Representation of the active site residues of P44095 in stick model
Fig. 7Representation of model structure of HP P44093. a Three-dimensional structure represented in cartoon model with membrane represented as non-bonded spheres. b Representation of the active site residues of P44093 in stick model
Fig. 8Representation of model structure of HP P44720. a Cartoon model representation of overall structure in which membrane is represented by non-bonded atoms. b Representation of the active site residues of P44720 in stick model