| Literature DB >> 35980906 |
Zeshan Mahmud Chowdhury1, Arittra Bhattacharjee1, Ishtiaque Ahammad1, Mohammad Uzzal Hossain1, Abdullah All Jaber2, Anisur Rahman1, Preonath Chondrow Dev1, Md Salimullah3, Chaman Ara Keya2.
Abstract
Streptococcus pneumoniae (S. pneumoniae), the major etiological agent of community-acquired pneumonia (CAP) contributes significantly to the global burden of infectious diseases which is getting resistant day by day. Nearly 30% of the S. pneumoniae genomes encode hypothetical proteins (HPs), and better understandings of these HPs in virulence and pathogenicity plausibly decipher new treatments. Some of the HPs are present across many Streptococcus species, systematic assessment of these unexplored HPs will disclose prospective drug targets. In this study, through a stringent bioinformatics analysis of the core genome and proteome of S. pneumoniae PCS8235, we identified and analyzed 28 HPs that are common in many Streptococcus species and might have a potential role in the virulence or pathogenesis of the bacteria. Functional annotations of the proteins were conducted based on the physicochemical properties, subcellular localization, virulence prediction, protein-protein interactions, and identification of essential genes, to find potentially druggable proteins among 28 HPs. The majority of the HPs are involved in bacterial transcription and translation. Besides, some of them were homologs of enzymes, binding proteins, transporters, and regulators. Protein-protein interactions revealed HP PCS8235_RS05845 made the highest interactions with other HPs and also has TRP structural motif along with virulent and pathogenic properties indicating it has critical cellular functions and might go under unconventional protein secretions. The second highest interacting protein HP PCS8235_RS02595 interacts with the Regulator of chromosomal segregation (RocS) which participates in chromosome segregation and nucleoid protection in S. pneumoniae. In this interacting network, 54% of protein members have virulent properties and 40% contain pathogenic properties. Among them, most of these proteins circulate in the cytoplasmic area and have hydrophilic properties. Finally, molecular docking and dynamics simulation demonstrated that the antimalarial drug Artenimol can act as a drug repurposing candidate against HP PCS8235_RS 04650 of S. pneumoniae. Hence, the present study could aid in drugs against S. pneumoniae.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35980906 PMCID: PMC9387852 DOI: 10.1371/journal.pone.0272945
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1A schematic representation of the workflow involved to identify potential anti-streptococci drug targets.
Fig 2Circular representation of the S. pneumoniae genome and related species.
The brown bar of the circular plot represents the core genes.
Fig 3Categories of annotated S. pneumoniae hypothetical proteins classified according to their biological, molecular, and cellular functions.
Proteins involved in molecular functions were relatively higher compared to others.
Annotated products for HPs in S. Pneumoniae as determined by GO FEAT platform.
| Seq no. | Locus Tag | Product | Gene ontology | BlastP |
|---|---|---|---|---|
| PCS8235_RS00650 | Uncharacterized protein | MULTISPECIES: cell division protein ZapA [ | ||
| PCS8235_RS01335 | W1WQE4_9ZZZZ YlxR domain-containing protein OS = human gut metagenome OX = 408170 GN = Q604_UNBC18436G0004 PE = 4 SV = 1 | Cytosolic protein YlxR [ | ||
| PCS8235_RS01340 | Ribosomal protein HS6-type | GO:0003723; GO:0005840; | L7A family ribosomal protein [ | |
| PCS8235_RS01935 | Uncharacterized protein | DUF910 family protein [ | ||
| PCS8235_RS02405 | Q8DQG2_STRR6 PC4 domain-containing protein OS = | GO:0003677; GO:0006355 | YdbC family protein [ | |
| PCS8235_RS02560 | Uncharacterized protein | DUF1827 family protein [ | ||
| PCS8235_RS02595 | Uncharacterized protein | Protein of uncharacterised function (DUF1797) [ | ||
| PCS8235_RS02800 | Uncharacterized protein | MULTISPECIES: DUF2969 domain-containing protein [ | ||
| PCS8235_RS02820 | GAF domain-containing protein | Structure of fRMsr [ | ||
| PCS8235_RS03050 | Membrane protein puative | GO:0005886; GO:0016021 | permease [ | |
| PCS8235_RS03260 | M5KND5_STREE UPF0346 protein PCS8203_00654 OS = | YozE family protein [ | ||
| PCS8235_RS03345 | M5KNF4_STREE Rqc2 homolog RqcH OS = | GO:0000049; GO:0043023; GO:0072344 | Rqc2 family fibronectin-binding protein PavA [ | |
| PCS8235_RS03470 | A0A3R9LSP5_STROR DUF536 domain-containing protein OS = | MULTISPECIES: chromosome segregation protein RocS [ | ||
| PCS8235_RS03660 | Pneumococcal vaccine antigen A | pneumococcal vaccine antigen A [ | ||
| PCS8235_RS03915 | DNA repair protein RadC | GO:0046872; GO:0008237 | MULTISPECIES: DNA repair protein RadC [ | |
| PCS8235_RS04650 | Haloacid dehalogenase | GO:0016787 | Cof-type HAD-IIB family hydrolase [ | |
| PCS8235_RS04815 | Putative ACR | DUF177 domain-containing protein [ | ||
| PCS8235_RS05215 | M5KML3_STREE UPF0342 protein PCS8203_00480 OS = | YlbF/YmcA family competence regulator [ | ||
| PCS8235_RS05845 | Tetratricopeptide repeat protein | tetratricopeptide repeat protein [ | ||
| PCS8235_RS06025 | Methyltransferase | GO:0008168; GO:0003676; GO:0032259 | tRNA1(Val) (adenine(37)-N6)-methyltransferase [ | |
| PCS8235_RS06945 | M5MZN5_STREE tRNA(Met) cytidine acetate ligase OS = | GO:0000049; GO:0005524; GO:0005737; GO:0006400; GO:0016879 | nucleotidyltransferase [ | |
| PCS8235_RS07090 | Protein of uncharacterized function (DUF3013) | MULTISPECIES: DUF3013 family protein [ | ||
| PCS8235_RS07595 | A0A3R9LRS7_STROR UPF0154 protein D8800_03695 OS = | GO:0005886; GO:0016021 | YneF family protein | |
| PCS8235_RS08090 | A0A1E9G3W9_9STRE UPF0297 protein HMPREF2766_08835 OS = | MULTISPECIES: IreB family regulatory phosphoprotein [ | ||
| PCS8235_RS08125 | Uncharacterized protein | GO:0016021 | MULTISPECIES: DUF1129 domain-containing protein [ | |
| PCS8235_RS08375 | M5K7W4_STREE UPF0356 protein PCS8203_00243 OS = | MULTISPECIES: DNA-dependent RNA polymerase auxiliary subunit epsilon family protein [ | ||
| PCS8235_RS10555 | J1NSH0_STREE UPF0374 protein AMCSP13_002284 OS = | UPF0374 protein SSU05 [ | ||
| PCS8235_RS10805 | Uncharacterized protein | GO:0016021 | HlyC/CorC family transporter [ |
Physicochemical properties of HPs obtained using ProtParam tool.
| Locus tag | Protein length | Molecular weight | Isoelectric point | Negatively charged residues | Positively charged residues | extinction coefficient | instability index | Aliphatic index | GRAVY |
|---|---|---|---|---|---|---|---|---|---|
| RS00650 | 100 | 11568.13 | 4.82 | 20 | 13 | 1615 | 63.16 | 85.9 | -0.559 |
| RS01335 | 97 | 11240.04 | 9.62 | 16 | 22 | 4470 | 32.66 | 32.66 | -0.663 |
| RS01340 | 99 | 10867.78 | 9.87 | 9 | 16 | 2980 | 26.31 | 110.3 | -0.035 |
| RS01935 | 73 | 8838.32 | 5.8 | 14 | 13 | 8940 | 56.25 | 100.14 | -0.44 |
| RS02405 | 79 | 9136.44 | 6.73 | 13 | 13 | 11000 | 35.8 | 58.1 | -0.765 |
| RS02560 | 99 | 11257.94 | 6.83 | 12 | 12 | 4470 | 36.29 | 106.36 | -0.255 |
| RS02595 | 76 | 8837.87 | 4.37 | 17 | 7 | 5960 | 29.95 | 96.18 | -0.333 |
| RS02800 | 76 | 8463.82 | 7.88 | 12 | 13 | 4470 | 25.25 | 102.63 | -0.283 |
| RS02820 | 165 | 18421.04 | 4.41 | 25 | 13 | 14565 | 38.13 | 95.7 | 0.045 |
| RS03050 | 301 | 33319.99 | 8.58 | 15 | 18 | 24660 | 33.83 | 135.05 | 1.045 |
| RS03260 | 71 | 8462.34 | 4.68 | 14 | 7 | 13980 | 34.76 | 56.48 | -0.686 |
| RS03345 | 560 | 64392.58 | 8.43 | 74 | 77 | 29465 | 46.28 | 91.93 | -0.535 |
| RS03470 | 163 | 18970.57 | 4.88 | 36 | 28 | 2980 | 53.18 | 89.69 | -0.852 |
| RS03660 | 204 | 22915.15 | 9.16 | 16 | 21 | 27850 | 21.54 | 87.01 | -0.199 |
| RS03915 | 226 | 25535.53 | 6.64 | 26 | 24 | 7450 | 49.12 | 111.77 | -0.165 |
| RS04650 | 264 | 29481.57 | 4.64 | 42 | 25 | 21890 | 18.59 | 94.51 | -0.102 |
| RS04815 | 180 | 20398.72 | 3.93 | 39 | 12 | 14440 | 56.59 | 99.61 | -0.353 |
| RS05215 | 112 | 12471.19 | 4.83 | 16 | 12 | 5960 | 52.41 | 86.43 | -0.271 |
| RS05845 | 403 | 46780.9 | 4.2 | 87 | 28 | 57190 | 43.96 | 95.29 | -0.434 |
| RS06025 | 249 | 28367.53 | 5.75 | 35 | 30 | 12170 | 43.62 | 96.71 | -0.326 |
| RS06945 | 365 | 41269.4 | 7.12 | 43 | 43 | 39880 | 25.92 | 94.05 | -0.243 |
| RS07090 | 156 | 18064.09 | 3.92 | 38 | 10 | 22920 | 38.15 | 81.86 | -0.222 |
| RS07595 | 82 | 9085.76 | 10.07 | 6 | 13 | 2980 | 29.12 | 115.49 | -0.084 |
| RS08090 | 88 | 10227.41 | 4.9 | 15 | 12 | 11920 | 21.41 | 82.95 | -0.749 |
| RS08125 | 227 | 25799 | 6.42 | 21 | 20 | 32890 | 43.44 | 98.9 | 0.146 |
| RS08375 | 77 | 9258.34 | 4.58 | 18 | 11 | 8940 | 66.6 | 83.64 | -0.779 |
| RS10555 | 177 | 21338.22 | 6.31 | 29 | 27 | 49850 | 41.1 | 80.9 | -0.692 |
| RS10805 | 443 | 49621.74 | 4.36 | 70 | 35 | 34380 | 34.35 | 110.45 | 0.071 |
| Average | 186.14 | 21228.26 | 6.20464 | 27.78 | 20.5357 | 17152.3 | 39.2060 | 92.167 | -0.3113 |
Essential proteins as identified by the DEG database.
More than half of the proteins were found to be essential.
| Query ID | Subject Id | E-value | Bit score |
|---|---|---|---|
|
| DEG10100355 | 0.37 | 28.5 |
|
| DEG10070031 | 1.44E-66 | 194 |
|
| DEG10580251 | 1.75E-52 | 159 |
|
| DEG10350401 | 0.82 | 26.6 |
|
| DEG10580070 | 1.45E-41 | 129 |
|
| DEG10570129 | 3.2 | 25.8 |
|
| DEG10050604 | 1.1 | 26.6 |
|
| DEG10580106 | 8.13E-31 | 102 |
|
| DEG10070045 | 4.36E-121 | 337 |
|
| DEG10470184 | 5.09E-80 | 244 |
|
| DEG10580325 | 6.62E-35 | 112 |
|
| DEG10500070 | 1.11E-08 | 57 |
|
| DEG10540130 | 4.16E-78 | 229 |
|
| DEG10480049 | 0.5 | 30.4 |
|
| DEG10070165 | 6.91E-166 | 456 |
|
| DEG10580145 | 6.13E-62 | 195 |
|
| DEG10070175 | 9.05E-129 | 358 |
|
| DEG10080304 | 1.1 | 27.3 |
|
| DEG10290281 | 0.15 | 33.1 |
|
| DEG10500005 | 4.06E-19 | 82.8 |
|
| DEG10230304 | 0.37 | 32 |
|
| DEG10320245 | 2.9 | 26.6 |
|
| DEG10070224 | 7.65E-55 | 164 |
|
| DEG10070008 | 9.91E-61 | 179 |
|
| DEG10210140 | 0.29 | 31.6 |
|
| DEG10070005 | 4.49E-50 | 151 |
|
| DEG10620252 | 1.79E-77 | 228 |
|
| DEG10390037 | 7.16E-50 | 174 |
Fig 4Venn diagram representing the number of hypothetical proteins with virulence factor pathogenicity.
Fig 5String results of PPI revealed several HPs that interact with pneumococcal vaccine antigen A and Ribosomal silencing factor; PCS8235_RS05845 and PCS8235_RS02595 have the highest nodes.
Prediction of subcellular localization and unconventional protein secretion in HPs by CELLO, PSORTb, OutCyte 1.0 and SecretomeP 2.0.
| Seq ID | CELLO | PSORTb 3.0 | Outcyte 1.0 Class | SecretomeP 2.0 score |
|---|---|---|---|---|
| RS00650 | cytoplasmic | Cytoplasmic | UPS | 0.075661 |
| RS01335 | cytoplasmic | Cytoplasmic | Intracellular | 0.444115 |
| RS01340 | cytoplasmic | Cytoplasmic Membrane | UPS | 0.074983 |
| RS01935 | cytoplasmic | Cytoplasmic | UPS | 0.047038 |
| RS02405 | cytoplasmic | Cytoplasmic | UPS | 0.702673 |
| RS02560 | cytoplasmic + extracellular | Cytoplasmic | UPS | 0.353897 |
| RS02595 | cytoplasmic | Cytoplasmic | Intracellular | 0.049599 |
| RS02800 | cytoplasmic | Cytoplasmic | UPS | 0.762960 |
| RS02820 | cytoplasmic | unknown | UPS | 0.071492 |
| RS03050 | membrane | Cytoplasmic Membrane | Transmembrane | 0.970836 |
| RS03260 | cytoplasmic | unknown | UPS | 0.215002 |
| RS03345 | cytoplasmic | cytoplasmic | Intracellular | 0.200677 |
| RS03470 | cytoplasmic | cytoplasmic | UPS | 0.089014 |
| RS03660 | extracellular | Cytoplasmic Membrane | Signal peptide | 0.495995 |
| RS03915 | membrane | cytoplasmic | UPS | 0.122882 |
| RS04650 | cytoplasmic | cytoplasmic | UPS | 0.05868 |
| RS04815 | cytoplasmic | cytoplasmic | UPS | 0.132968 |
| RS05215 | cytoplasmic | cytoplasmic | UPS | 0.269975 |
| RS05845 | cytoplasmic | cytoplasmic | Intracellular | 0.153385 |
| RS06025 | cytoplasmic | cytoplasmic | UPS | 0.10396 |
| RS06945 | cytoplasmic | cytoplasmic | Intracellular | 0.098969 |
| RS07090 | cytoplasmic | cytoplasmic | UPS | 0.104500 |
| RS07595 | membrane | Cytoplasmic Membrane | Signal peptide | 0.933010 |
| RS08090 | cytoplasmic | unknown | UPS | 0.829525 |
| RS08125 | membrane | Cytoplasmic Membrane | UPS | 0.961769 |
| RS08375 | cytoplasmic | cytoplasmic | Intracellular | 0.090707 |
| RS10555 | cytoplasmic | cytoplasmic | UPS | 0.235292 |
| RS10805 | membrane | Cytoplasmic Membrane | Signal peptide | 0.720349 |
*UPS- unconventional protein secretion
Homologs of selected HPs found interacting with existing drugs in Drug Bank.
| Sequence ID | Homologous compound | E value | Bit score: | Interactions with drug |
|---|---|---|---|---|
| PCS8235_RS02820 | Free methionine-R-sulfoxide reductase | 4.90E-37 | 124.02 | 2-(N-morpholino) ethanesulfonic acid |
| PCS8235_RS04650 | Haloacid dehalogenase-like hydrolase | 1.21E-11 | 62.003 | Artenimol |
| PCS8235_RS04650 | Sugar phosphatase YbiV | 5.74E-31 | 114.775 | Aspartate beryllium trifluoride |
| PCS8235_RS10805 | Metal transporter CNNM2 | 4.16E-13 | 70.0922 | Magnesium carbonate |
Fig 6Interaction between the best drug for the HPs in S. pneumoniae and the active site amino acid residues of each of the HPs (a) PCS8235_RS04650- Artenimol (b) PCS8235_RS04650- Aspartate beryllium trifluoride (c) PCS8235_RS02820-2-(N-morpholino) ethanesulfonic acid.
Fig 7RMSD analysis of (A) PCS8235_RS02820 (green) and PCS8235_RS02820-2-(N-morpholino) ethanesulfonic acid complex (purple) (B) PCS8235_RS04650 (red) and PCS8235_RS04650 –Artenimol (purple) and PCS8235_RS04650-Aspartate beryllium trifluoride (gray) at 100ns.
Fig 8RMSF analysis of (A) PCS8235_RS02820 (brown) and PCS8235_RS02820-2-(N-morpholino) ethanesulfonic acid complex (blue) (B) PCS8235_RS04650 (blue) and PCS8235_RS04650 –Artenimol (golden) and PCS8235_RS04650-Aspartate beryllium trifluoride (green) at 100 ns.
Fig 9SASA calculation of (A) PCS8235_RS02820 (golden) and PCS8235_RS02820-2-(N-morpholino) ethanesulfonic acid complex (green) (B) PCS8235_RS04650 (blue) and PCS8235_RS04650 –Artenimol (green) and PCS8235_RS04650-Aspartate beryllium trifluoride (red) at 100ns.
Fig 10Rg measurement of (A) PCS8235_RS02820 (green) and PCS8235_RS02820-2-(N-morpholino) ethanesulfonic acid complex (red) (B) PCS8235_RS04650 (green) and PCS8235_RS04650 –Artenimol (red) and PCS8235_RS04650-Aspartate beryllium trifluoride (yellow) at 100 ns.