| Literature DB >> 36228026 |
Md Atikur Rahman1, Uzma Habiba Heme2, Md Anowar Khasru Parvez3.
Abstract
Members of the Bacillus genus are industrial cell factories due to their capacity to secrete significant quantities of biomolecules with industrial applications. The Bacillus paralicheniformis strain Bac84 was isolated from the Red Sea and it shares a close evolutionary relationship with Bacillus licheniformis. However, a significant number of proteins in its genome are annotated as functionally uncharacterized hypothetical proteins. Investigating these proteins' functions may help us better understand how bacteria survive extreme environmental conditions and to find novel targets for biotechnological applications. Therefore, the purpose of our research was to functionally annotate the hypothetical proteins from the genome of B. paralicheniformis strain Bac84. We employed a structured in-silico approach incorporating numerous bioinformatics tools and databases for functional annotation, physicochemical characterization, subcellular localization, protein-protein interactions, and three-dimensional structure determination. Sequences of 414 hypothetical proteins were evaluated and we were able to successfully attribute a function to 37 hypothetical proteins. Moreover, we performed receiver operating characteristic analysis to assess the performance of various tools used in this present study. We identified 12 proteins having significant adaptational roles to unfavorable environments such as sporulation, formation of biofilm, motility, regulation of transcription, etc. Additionally, 8 proteins were predicted with biotechnological potentials such as coenzyme A biosynthesis, phenylalanine biosynthesis, rare-sugars biosynthesis, antibiotic biosynthesis, bioremediation, and others. Evaluation of the performance of the tools showed an accuracy of 98% which represented the rationality of the tools used. This work shows that this annotation strategy will make the functional characterization of unknown proteins easier and can find the target for further investigation. The knowledge of these hypothetical proteins' potential functions aids B. paralicheniformis strain Bac84 in effectively creating a new biotechnological target. In addition, the results may also facilitate a better understanding of the survival mechanisms in harsh environmental conditions.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36228026 PMCID: PMC9560612 DOI: 10.1371/journal.pone.0276085
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Workflow representing the overall design of the study.
The tasks listed in the green outlined boxes were applied only after the analyzed HPs showed the same function in at least three different bioinformatics tools.
Fig 2The gene ontology of all the 414 HPs.
(A) The distribution of the HPs among the three gene ontology categories. (B) Graph of the cellular components. (C) Graph of the biological processes. (D) Graph of the molecular functions. Here, the distribution of GO terms is presented on the Y axis and the area of the bubbles is relative to the number of proteins found in each category.
Fig 3A & B. Tertiary structures analysis. Three-dimensional structures were modeled by the SWISS-MODEL server reliably using the templates with higher coverage, more than 30% of identity, and higher GMQE scores along with Ramachandran Favored percentages ≥90%. Only the templates determined by the X-ray crystallography with high resolution were used. The known proteins and the modeled structures are indicated in red and blue colors respectively. The proteins are orientated using the Chimera MatchMaker according to the optimal superposition of the matching residues.
Hypothetical proteins functionally annotated from the B. paralicheniformis strain Bac84.
| No. | HP ID | Inferred function |
|---|---|---|
|
| WP_158700706.1 | Metal-dependent hydrolase |
|
|
| Catalytic core DNA breaking-rejoining enzymes |
|
|
| RNA polymerase sporulation sigma factor SigK |
|
|
| YhzD-like protein |
|
|
| Response regulator aspartate phosphatase |
|
|
| The YqzH-like protein family |
|
|
| The YgaB-like protein family |
|
|
| Inner membrane protein YiaA-like |
|
|
| YqaH-like protein |
|
|
| Bacteriophage A118-like, holin |
|
|
| Metal-responsive transcriptional regulator |
|
|
| Sigma-M inhibitor protein YhdK |
|
|
| Streptogramin lyase |
|
|
| RlpA-like domain superfamily |
|
|
| Phenylalanyl-tRNA synthetase |
|
|
| Flavin-phosphopantothenoylcysteine decarboxylase/Flavin prenyltransferase |
|
|
| Pathogenicity locus—Putative mitomycin resistance proteins |
|
|
| YetA-like protein |
|
|
| ESAT-6-like superfamily |
|
|
| YkyB-like protein |
|
|
| Transcription regulator DksA-related |
|
|
| Nudix_Hydrolase super family |
|
|
| YhzD-like protein |
|
|
| Heat Shock protein (Hsp20 proteins) |
|
|
| HesB-like domain superfamily |
|
|
| YqfQ-like protein |
|
|
| RmlC-like cupin superfamily |
|
|
| Chromosome segregation protein SMC |
|
|
| Response regulator aspartate phosphatase |
|
|
| Putative phage metallopeptidase |
|
|
| Alpha/Beta hydrolase fold |
|
|
| Swarming motility protein SwrA |
|
|
| Acyl-CoA N-acyltransferase |
|
|
| BslA (Biofilm surface layer A) |
|
|
| Immunity protein WapI-like/YxiJ super family |
|
|
| Six-hairpin glycosidase superfamily |
|
|
| Prephenate dehydratase |
ROC results of the tools used in this study.
| Software | Accuracy (%) | Sensitivity (%) | Specificity (%) | ROC area |
|---|---|---|---|---|
| Pfam | 99.0 | 98.0 | 100 | 0.99 |
| InterPro | 100.0 | 100.0 | 100.0 | 1 |
| CATH | 100.0 | 100.0 | 100.0 | 1 |
| SUPERFAMILY | 96.0 | 94.7 | 100.0 | 0.99 |
| SCANPROSITE | 97.0 | 93.8 | 100.0 | 0.99 |
| SMART | 98.0 | 97.0 | 100.0 | 1 |
| CDD-BLAST | 96.0 | 65.9 | 100.0 | 0.985 |