| Literature DB >> 36035185 |
Bilal Ahmed Abbasi1, Aishwarya Dharan1, Astha Mishra1, Devansh Saraf1, Irsad Ahamad1, Prashanth Suravajhala1,2, Jayaraman Valadi1,3,4.
Abstract
Clostridium difficile (C. difficile) is a multi-strain, spore-forming, Gram-positive, opportunistic enteropathogen bacteria, majorly associated with nosocomial infections, resulting in severe diarrhoea and colon inflammation. Several antibiotics including penicillin, tetracycline, and clindamycin have been employed to control C. difficile infection, but studies have suggested that injudicious use of antibiotics has led to the development of resistance in C. difficile strains. However, many proteins from its genome are still considered uncharacterized proteins that might serve crucial functions and assist in the biological understanding of the organism. In this study, we aimed to annotate and characterise the 6 C. difficile strains using in silico approaches. We first analysed the complete genome of 6 C. difficile strains using standardised approaches and analysed hypothetical proteins (HPs) employing various bioinformatics approaches coalescing, including identifying contigs, coding sequences, phage sequences, CRISPR-Cas9 systems, antimicrobial resistance determination, membrane helices, instability index, secretory nature, conserved domain, and vaccine target properties like comparative homology analysis, allergenicity, antigenicity determination along with structure prediction and binding-site analysis. This study provides crucial supporting information about the functional characterization of the HPs involved in the pathophysiology of the disease. Moreover, this information also aims to assist in mechanisms associated with bacterial pathogenesis and further design candidate inhibitors and bona fide pharmaceutical targets.Entities:
Keywords: annotation; clostridium difficile; essential genes; function abbreviations C. difficile-clostridium difficile CDI-C. difficile infection; uncharacterized proteins
Year: 2022 PMID: 36035185 PMCID: PMC9403866 DOI: 10.3389/fgene.2022.878012
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Methodology used for characterization of HPs for Clostridium strains. Briefly, we have jotted down the HPs using python script and characterised the six strains of C. difficile using a cohort of tools (Please see methodology). The subsequent annotation is tabulated and analysed.
Characteristics of selected genomes of Clostriodioles strains.
| S. No | Description | Accession ID | Size (Mb) | Proteins (HP) | GC% | CDS |
|---|---|---|---|---|---|---|
| 01 |
| CP019870.1 | 4,124,384 | 3,547 (356) | 28.7 | 3,683 |
| 02 |
| NZ_CP029423.1 | 4,204,902 | 3,647 (409) | 28.9 | 3,802 |
| 03 |
| NC_017,173.1 | 4,159,517 | 3,587 (413) | 28.5 | 3,797 |
| 04 |
| FN665653.1 | 4,047,729 | 3,446 (404) | 28.7 | 3,697 |
| 05 |
| NC_013,315.1 | 4,110,554 | 3,552 (374) | 28.6 | 3,715 |
| 06 |
| NC_017,178.1 | 4,179,867 | 3,614 (377) | 28.7 | 3,806 |
FIGURE 2Overview of Subsystem Category Distribution for six strains (A) C. difficile strain CF5. (B) C. difficile strain BR81 (C) C. difficile strain R20291 (D) C. difficile strain 196 (E) C. difficile strain 2,007,855 (F) C. difficile strain M120. Identification of Prophage and CRISPR-Cas9 systems.
Intact Prophage region identified in Clostriodium difficile strains.
| Genome | Intact Region | Region length (Kb) | Score | Total protein | Position | Common phage | GC% |
|---|---|---|---|---|---|---|---|
|
| 1 | 55.9 | 140 | 71 | 1,684,408–1,740,383 | PHAGE_Clostr_phiMMP01_NC_028,883 (32) | 28.64 |
|
| 1 | 56.2 | 130 | 74 | 1,707,633–1,763,916 | PHAGE_Clostr_phiMMP03_NC_028,959 (30) | 29.02 |
|
| 1 | 57.7 | 140 | 72 | 1,673,218–1,730,955 | PHAGE_Clostr_phiMMP01_NC_028,883 (31) | 28.48 |
|
| 1 | 55.9 | 140 | 71 | 1,666,638–1,722,613 | PHAGE_Clostr_phiMMP01_NC_028,883 (32) | 28.64 |
AMR genes determination in WGS strains of C. difficile 2,007,855 and M120 using ResFinder tool.
| Strain | Resistance Gene | Identity% | Position in Contig | Alignment length | Phenotype | Accession No. |
|---|---|---|---|---|---|---|
|
|
| 100 | 3,136,169..3,136,906 | 738 | Macrolide resistance | U18931 |
|
|
| 96.65 | 3,810,802..3,811,338 | 537 | Aminoglycoside resistance | AF337947 |
|
|
| 98.67 | 3,811,382..3,812,281 | 900 | Aminoglycoside resistance | KF652098 |
|
|
| 100 | 480,747..481,604 | 858 | Aminoglycoside resistance | FN594949 |
|
|
| 100 | 468,126..468,989 | 864 | Aminoglycoside resistance | KF421157 |
|
|
| 98.85 | 2,175,639..2,177,558 | 1920 | Tetracycline resistance | EU182585 |
|
|
| 98.02 | 478,510..480,432 | 1923 | Tetracycline resistance | NZ_ABDU01000081 |
Subcellular localisation of C. difficile strains hypothetical proteins determined by PSORTb.
| Strain | Total HPs | Subcellular Location as Given Be PSORTb | |||
|---|---|---|---|---|---|
| Cytoplasmic | Cytoplasmic membrane | Extracellular | Unknown | ||
|
| 356 | 122 (34.27%) | 86 (24.16%) | 10 (2.81%) | 138 (38.76%) |
|
| 409 | 142 (34.72%) | 99 (24.21%) | 9 (2.20%) | 159 (38.88%) |
|
| 413 | 155 (37.53%) | 96 (23.24%) | 8 (1.94%) | 154 (37.29%) |
|
| 404 | 130 (32.18%) | 110 (27.23%) | 4 (0.99%) | 160 (39.60%) |
|
| 374 | 130 (34.76%) | 88 (25.53%) | 8 (2.14%) | 148 (39.57%) |
|
| 377 | 133 (35.28%) | 90 (23.87%) | 9 (2.39%) | 145 (38.46%) |
FIGURE 3Identification of active site pockets (highlighted in red cavity) for the shortlisted antigenic proteins (A) WP_104,732,835.1 (B) WP_009,906,007.1 (C) WP_003,429,932.1 (D) WP_021,396,478.1 (E) WP_021,389,778.1, and (F) WP_003,423,063.1.