| Literature DB >> 28324558 |
Kundan Kumar1, Amresh Prakash1, Farah Anjum2, Asimul Islam1, Faizan Ahmad1, Md Imtaiyaz Hassan3.
Abstract
Candida dubliniensis is an emerging pathogenic yeast in humans and infections are usually restricted to mucosal parts of the body. However, its presence in specimens of immunocompromised individuals, especially in HIV-positive patients, is of major medical concern. There is a large fraction of genomes of C. dubliniensis in the database which are uncharacterized for their biochemical, biophysical, and/or cellular functions, and are identified as hypothetical proteins (HPs). Function annotation of Candida genome is, therefore, essentially required to facilitate the understanding of mechanisms of pathogenesis and biochemical pathways important for selecting novel therapeutic target. Here, we carried out an extensive analysis to explain the functional properties of genome, using available protein structure and function analysis tools. We successfully modeled the structures of eight HPs for which a template with moderate sequence similarity was available in the protein data bank. All modeled structures were analyzed and we found that these proteins may act as transporter, kinase, transferase, ketosteroid, isomerase, hydrolase, oxidoreductase, and binding targets for DNA and RNA. Since these unique HPs of Candida showed no homologs in humans, these proteins are expected to be a potential target for future antifungal therapy.Entities:
Keywords: Candida dubliniensis; Domains and motifs; Functional annotation; Functional genomics; Homology modeling; Hypothetical protein; Sequence analysis
Year: 2014 PMID: 28324558 PMCID: PMC4522726 DOI: 10.1007/s13205-014-0256-3
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
List of bioinformatics tools and databases used for sequence and structure-based function annotation
| S. no. | Name of Web server | URL | Uses | References |
|---|---|---|---|---|
| 1. Sequence similarity search tool | ||||
| BLAST: basic local alignment search tool |
| To find a similar sequence in the database | Mount ( | |
| 2. Biophysical and chemical characterization | ||||
| ProtParam |
| To calculate physical and chemical properties | Wilkins et al. ( | |
| 3. Sub-cellular localization of the protein | ||||
| I. | SOSUI |
| To predict the transmembrane domain | Hirokawa et al. ( |
| II. | TMHMM |
| Used to predict the transmembrane topology | Krogh et al. ( |
| III. | Psort II |
| To predict sub-cellular localization | Nakai and Horton ( |
| IV. | SignalP |
| To predict cleavage site of signal protein | Petersen et al. ( |
| V. | HMMTOP |
| To predict the transmembrane helix | Tusnady and Simon ( |
| 4. Functional analysis tool | ||||
| I. | Conserved domain |
| To search conserved domain in the sequences | Marchler-Bauer et al. ( |
| II. | InterProScan |
| To find the motif in the sequences | Quevillon et al. ( |
| III. | Interpro |
| To categorize by predicting domains and important sites | Apweiler et al. ( |
| IV. | SMART |
| To Identify and annotate domains in protein | Letunic et al. ( |
| V. | CATH |
| To use hierarchical domain classification of PDB structures | Orengo et al. ( |
| VI. | Pfam |
| To collect protein families, based on multiple sequence alignments and HMM | Finn et al. ( |
| 5. Predicting the fold pattern | ||||
| I. | PFP-FunDSeqE |
| To find the type of protein fold in the protein sequences | Shen and Chou ( |
| II. | HHpred |
| Used for homology detection | Kalev and Habeck ( |
| III. | Dali server |
| For searching similar 3D structure | Holm and Rosenstrom ( |
| 6. Virulence prediction | ||||
| FungalRV |
| In adhesin prediction | Chaudhuri et al. ( | |
| 7. Structure prediction | ||||
| I. | MODELLER |
| To model three-dimensional structures | Webb and Sali ( |
| II. | SWISS-MODEL |
| Homology modeling server | Biasini et al. ( |
| III. | Phyre2 |
| Ab initio method for structure prediction | Kelley and Sternberg ( |
| 9. Structure validation | ||||
| SAVES |
| To validate protein structures | Laskowski et al. ( | |
| 10. Docking analysis | ||||
| AutoDock |
| Sandeep et al. ( | ||
Validation report of all predicted models
| S. no. | Uniprot ID | Score | Tools used | Structure selected | Template | RMSD with template | ||
|---|---|---|---|---|---|---|---|---|
| MODELLER | SWISS-MODEL | Phyre | ||||||
| 1 | B9WFH2 | LG Score | 2.027 | 0.958 | 2.061 | MODELLER | 4GBZ | 2.419 |
| MaxSub | 0.116 | 0.065 | 0.145 | |||||
| Procheck (%) | 83.60 | 74.40 | 84.60 | |||||
| 2 | B9WFH4 | LG Score | 2.253 | 3.359 | 3.576 | SWISS-MODEL | 3T5P | 2.389 |
| MaxSub | 0.152 | 0.285 | 0.298 | |||||
| Procheck (%) | 85.00 | 83.80 | 79.60 | |||||
| 3 | B9WFR9 | LG Score | 0.618 | 2.461 | 3.357 | SWISS-MODEL | 3UBM | 0.178 |
| MaxSub | 0.020 | 0.216 | 0.298 | |||||
| Procheck (%) | 80.80 | 87.30 | 79.60 | |||||
| 4 | B9WFS0 | LG Score | 2.341 | 1.878 | 2.667 | MODELLER | 2RGQ | 0.460 |
| MaxSub | 0.309 | 0.255 | 0.335 | |||||
| Procheck | 85.80 | 84.60 | 85.80 | |||||
| 5 | B9WFS1 | LG Score | 5.560 | 5.353 | 5.137 | MODELLER | 1VA4 | 0.296 |
| MaxSub | 0.533 | 0.495 | 0.446 | |||||
| Procheck (%) | 86.60 | 79.60 | 87.80 | |||||
| 6 | B9WFS6 | LG Score | 0.599 | 1.935 | 5.206 | Phyre | 1K3R | 0.000 |
| MaxSub | 0.012 | 0.128 | 0.367 | |||||
| Procheck (%) | 63.10 | 74.10 | 85.10 | |||||
| 7 | B9WFU3 | LG Score | 1.123 | 1.125 | 1.622 | MODELLER | 3IDV | 1.877 |
| MaxSub | 0.104 | 0.168 | 0.174 | |||||
| Procheck (%) | 76.90 | 79.00 | 70.50 | |||||
| 8 | B9WFW8 | LG Score | 1.564 | Phyre | 1SXJ | 0.334 | ||
| MaxSub | 0.111 | |||||||
| Procheck (%) | 77.20 | |||||||
Function of HPs of C. dubliniensis
| S. no. | Gene ID | Uniprot ID | Protein product | Protein function |
|---|---|---|---|---|
| 1 | 8047379 | B9WFH2 | XP_002419776.1 | Transporter activity |
| 2 | 8047381 | B9WFH4 | XP_002419778.1 | Kinase activity |
| 3 | 8047468 | B9WFR9 | XP_002419873.1 | Transferase activity |
| 4 | 8047469 | B9WFS0 | XP_002419874.1 | Ketosteroid isomerase activity |
| 5 | 8047470 | B9WFS1 | XP_002419875.1 | Hydrolase activity |
| 6 | 8047474 | B9WFS6 | XP_002419880.1 | RNA binding |
| 7 | 8047491 | B9WFU3 | XP_002419897.1 | Oxidoreductase activity |
| 8 | 8047669 | B9WFW8 | XP_002419922.1 | DNA binding |
Fig. 1The structure of HP B9WFH2 bounded to BGC. a The overall structure of HP B9WFH2 comprised 15 transmembrane helices (green), 2 small extracellular helices (pink) and 4 intracellular helices interconnected (orange) with loops. b BGC docked in the HP B9WFH2. The active site residues are shown in stick and BGC in ball and stick model. The hydroxyl groups of BGC are involved in polar interaction (black dotted lines) with residues Lys21 and His20, Aromatic residues Phe489 and Trp456 present in the vicinity of BGC may involve in regulation glucose transport. c Showing GLUT inhibitor (5-(4-hydroxy-3-trifluoromethyl benzylidene)-3-[4,4,4-trifluoro-2-methyl-2-(2,2,2-trifluoroethyl) butyl] thiazolidine-2,4-dione) binds HP B9WFH2 at the same region where BGC binds to the HP B9WFH2. d Residues interacts with the GLUT inhibitors are shown in stick model
Sequence and structural motifs present in the HPs
| S. no. | Uniprot ID | Sequence motif | Structure motif |
|---|---|---|---|
| 1 | B9WFH2 | MFS_gen_substrate_transporter | Val48-Ser51 |
| Sugar transport | Val152-Ile155 | ||
| 2 | B9WFH4 | Seg | Asn146- Leu148 |
| DAGK_cat | Gly209- His211 | ||
| Sphingosine kinase | Gln344- Arg346 | ||
| DAGK | Thr95- Ile97 | ||
| Leu152-Ile154 | |||
| Ser167- Lys169 | |||
| 3 | B9WFR9 | CoA-transferase family III (CaiB/BaiF) | Tyr274- Ala276 |
| Alpha Methylacyl-Coa racemase | Gly334- Ile336 | ||
| 4 | B9WFS0 | – | – |
| 5 | B9WFS1 | α/β-Hydrolase | Leu86-Leu88 |
| 6 | B9WFS6 | Methyltrn_RNA_3 Nucleic acid-binding proteins | Ile212-Glu214 |
| 7 | B9WFU3 | Thioredoxin | Asp145-Lys147 |
| 8 | B9WFW8 | Rad17 | Gly101-Ser105 |
| Asn70-Thr72 | |||
| Glu193-Glu195 | |||
| Gln17-Asp179 | |||
| Thr597-Gly599 |
Fig. 2Overall structure of HP B9WFH4. a The structure of HP B9WFH4 adopts two domains architectures, CTD designated for sphingosine binding, whereas ADP binds in the NTD. The hydrophobic pocket of CTD comprises residues Val205, Ile228 and Leu264 which are involved in hydrophobic interaction with polycarbon tail of sphingosine. The polar head of sphingosine is spatially fitted at the cleft of CTD and NTD. Residues Asp117, Thr144 and Ser225 are involved in polar interactions (shown in red dots). b Docked pose of ADP at NTD. The important residues of HP B9WFH4 are shown in stick and H-bond interactions are denoted by black dashed lines. c Showing sphingosine kinase inhibitor [(S)-1-(4-(4-(3-(2-Cyclohexylethyl) phynyl)oxazol-2-yl)benzoyl) pyrrolidine-2carboximidamide] binds HP B9WFH4 at the same region where sphingosine binds to HP B9WFH4. d Residues that interact with the sphingosine kinase inhibitor are shown as stick model
Fig. 3Overall structure of HP B9WFR9. Structure of B9WFR9 docked with coenzyme A (CoA). Predicted active site residues are shown in stick and polar interaction are represented with dashed line (black)
Fig. 4a Overall structure of HP B9WFS0 docked with estradiol-17 β-hemisuccinate. b Superimposed structure of HP B9WFS0 with its template (PDB id: 2RGQ)
Fig. 5a Overall structure of HP B9WFS1. b Superimposed structure of HP B9WFS1 with template structure showing conserved residues
Fig. 6a Overall structure of B9WFS6 showing TIM barrel domain and β-barrel auxiliary domain. b Superimposed structure of HP B9WFS6 with template structure
Fig. 7Overall structure of HP B9WFU3 having two units of thioredoxin fold; the N-terminal unit having conserved residues may be the site of oxidoreductase activity
Fig. 8Overall structure of HP B9WFU3. Line shown in the figure is the line across which DNA winds, and three domains (label I, II and III) form complex for DNA binding, having similar topology to the RFC monomer unit