| Literature DB >> 32458979 |
Md Amran Gazi1, Sultan Mahmud2, Shah Mohammad Fahim1, Md Rezaul Islam3, Subhasish Das1, Mustafa Mahfuz1,4, Tahmeed Ahmed1.
Abstract
Campylobacter jejuni (C. jejuni) is considered to be one of the most frequent causes of bacterial gastroenteritis globally, especially in young children. The genome of C. jejuni contains many proteins with unknown functions termed as hypothetical proteins (HPs). These proteins might have essential biological role to show the full spectrum of this bacterium. Hence, our study aimed to determine the functions of HPs, pertaining to the genome of C. jejuni. An in-silico work flow integrating various tools were performed for functional assignment, three-dimensional structure determination, domain architecture predictors, subcellular localization, physicochemical characterization, and protein-protein interactions (PPIs). Sequences of 267 HPs of C. jejuni were analyzed and successfully attributed the function of 49 HPs with higher confidence. Here, we found proteins with enzymatic activity, transporters, binding and regulatory proteins as well as proteins with biotechnological interest. Assessment of the performance of various tools used in this analysis revealed an accuracy of 95% using receiver operating characteristic (ROC) curve analysis. Functional and structural predictions and the results from ROC analyses provided the validity of in-silico tools used in the present study. The approach used for this analysis leads us to assign the function of unknown proteins and relate them with the functions that have already been described in previous literature.Entities:
Keywords: Campylobacter; NCBI; gastroenteritis; hypothetical protein; in silico
Mesh:
Substances:
Year: 2020 PMID: 32458979 PMCID: PMC7284324 DOI: 10.1042/BSR20193939
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Figure 1Flow chart showing the overall design of the study
HPs functionally annotated from C. jejuni
| No. | Protein IDs | Protein function |
|---|---|---|
| 1 | WP_002868767.1 | Curli production assembly, transport component CsgG |
| 2 | WP_002854524.1 | Chemotaxis phosphatase CheX |
| 3 | WP_009882162.1 | SprA-related family |
| 4 | WP_010790856.1 | Pyridoxamine 5′-phosphate oxidase |
| 5 | WP_009882239.1 | Hemagglutination activity domain |
| 6 | WP_002854991.1 | FxsA cytoplasmic membrane protein, FxsA |
| 7 | WP_002855029.1 | DNA replication regulator, HobA |
| 8 | WP_002868905.1 | GDSL-like lipase |
| 9 | WP_002869356.1 | Divergent polysaccharide deacetylase |
| 10 | WP_002856929.1 | C4-type zinc ribbon domain |
| 11 | WP_002869028.1 | Esterase-like activity of phytase |
| 12 | WP_011812736.1 | Domain of unknown function DUF234 |
| 13 | WP_002868809.1 | Ankyrin repeats, Ank_2 |
| 14 | WP_002869368.1 | Type-1V conjugative transfer system mating pair stabilization, TraN |
| 15 | WP_009882583.1 | NLPC_P60 stabilizing domain |
| 16 | WP_002853389.1 | Jag, N-terminal domain superfamily |
| 17 | WP_009882608.1 | Adhesin from |
| 18 | WP_002856369.1 | Putative β-lactamase-inhibitor-like |
| 19 | WP_079254190.1 | β-1,4-N-acetylgalactosaminyltransferase (CgtA) |
| 20 | WP_002856180.1 | Heavy-metal-associated domain |
| 21 | WP_002831611.1 | Transcription factor zinc-finger |
| 22 | WP_002790076.1 | Methyl-accepting chemotaxis protein (MCP) signaling domain |
| 23 | WP_002853792.1 | Plasminogen-binding protein pgbA N-terminal |
| 24 | WP_002869072.1 | Putative S-adenosyl- |
| 25 | WP_002869097.1 | MaoC-like dehydratase domain |
| 26 | WP_002869326.1 | Metallo-carboxypeptidase |
| 27 | WP_002869139.1 | Pyruvate phosphate dikinase, PEP |
| 28 | WP_002869195.1 | Anti-sigma-28 factor |
| 29 | WP_002856630.1 | PD-(D/E)XK nuclease superfamily |
| 30 | WP_002855458.1 | MgtE intracellular N domain |
| 31 | WP_002797496.1 | Flagellar FliJ protein |
| 32 | WP_024088174.1 | Nitrate reductase chaperone |
| 33 | WP_009883030.1 | ATPase, AAA-type, core |
| 34 | WP_002824979.1 | Putative NADH-ubiquinone oxidoreductase chain E |
| 35 | WP_002869225.1 | DMSO reductase anchor subunit (DmsC) |
| 36 | WP_002856602.1 | Putative β-lactamase-inhibitor-like |
| 37 | WP_002868888.1 | Tetratricopeptide repeat, TPR_2 |
| 38 | WP_002868880.1 | ABC-type transport auxiliary lipoprotein component |
| 39 | WP_009883121.1 | Flagellar FLiS export co-chaperone |
| 40 | WP_002860117.1 | Menaquinone biosynthesis |
| 41 | WP_002779704.1 | T-antigen specific domain |
| 42 | WP_011187233.1 | Toprim domain |
| 43 | WP_011187235.1 | AAA domain, AAA_25 |
| 44 | WP_002809111.1 | TrbM superfamily |
| 45 | WP_011117548.1 | Bacterial virulence protein VirB8 |
| 46 | WP_011117549.1 | Conjugal transfer protein |
| 47 | WP_011117575.1 | Type IV secretion system proteins,T4SS |
| 48 | WP_011799393.1 | TrbM superfamily |
| 49 | WP_011117588.1 | mRNA interferase PemK-like |
Figure 2Functional classification of 49 HPs into various groups
ROC results of various tools used in the present study
| No. | Software | Accuracy (%) | Sensitivity (%) | Specificity (%) | ROC area |
|---|---|---|---|---|---|
| 1 | PFAM | 95% | 94.7% | 100% | 0.97 |
| 2 | SMART | 95% | 94.9% | 100% | 0.97 |
| 3 | MOTIF | 95% | 94.9% | 100% | 0.97 |
| 4 | INTERPROSCAN | 95% | 94.9% | 100% | 0.97 |
| 5 | CDART | 97.5% | 97.4% | 100% | 0.99 |
| 6 | SUPERFAMILY | 95% | 94.1% | 100% | 0.97 |
| 7 | SVMprot | 90% | 88.9% | 100% | 0.94 |
| 8 | Average |