| Literature DB >> 29082265 |
Arafat Rahman Oany1, Tahmina Pervin2, Mamun Mia1, Motaher Hossain3, Mohammad Shahnaij4, Shahin Mahmud1, K M Kaderi Kibria1.
Abstract
Shigellosis, a bacillary dysentery, is closely associated with diarrhoea in human and causes infection of 165 million people worldwide per year. Casein-degrading serine protease autotransporter of enterobacteriaceae (SPATE) subfamily protein SigA, an outer membrane protein, exerts both cytopathic and enterotoxic effects especially cytopathic to human epithelial cell type-2 (HEp-2) and is shown to be highly immunogenic. In the present study, we have tried to impose the vaccinomics approach for designing a common peptide vaccine candidate against the immunogenic SigA of Shigella spp. At first, 44 SigA proteins from different variants of S. flexneri, S. dysenteriae, S. boydii, and S. sonnei were assessed to find the most antigenic protein. We retrieved 12 peptides based on the highest score for human leukocyte antigen (HLA) supertypes analysed by NetCTL. Initially, these peptides were assessed for the affinity with MHC class I and class II alleles, and four potential core epitopes VTARAGLGY, FHTVTVNTL, HTTWTLTGY, and IELAGTLTL were selected. From these, FHTVTVNTL and IELAGTLTL peptides were shown to have 100% conservancy. Finally, IELAGTLTL was shown to have the highest population coverage (83.86%) among the whole world population. In vivo study of the proposed epitope might contribute to the development of functional and unique widespread vaccine, which might be an operative alleyway to thwart dysentery from the world.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29082265 PMCID: PMC5610819 DOI: 10.1155/2017/6412353
Source DB: PubMed Journal: J Immunol Res ISSN: 2314-7156 Impact factor: 4.818
Figure 1Flow diagram of the methodology.
T-cell epitopes of SigA protein predicted by the NetCTL server on the basis of the combined score. Here, epitopes for all the 12 different HLA supertypes have been presented.
| Supertypes | Epitope | Start position | Combined score |
|---|---|---|---|
| A1 | VTARAGLGY | 645 | 2.8475 |
| A2 | SMYNTLWRV | 279 | 1.4558 |
| A3 | KSYMSGNYK | 432 | 1.5812 |
| A24 | NYAWVNGNI | 219 | 1.6813 |
| A26 | HTTWTLTGY | 411 | 2.0947 |
| B7 | APKGSNKEI | 378 | 1.1981 |
| B8 | YMSGNYKAF | 434 | 1.4879 |
| B27 | ARAGLGYQF | 647 | 1.7245 |
| B39 | FHTVTVNTL | 313 | 2.6855 |
| B44 | IELAGTLTL | 178 | 1.9586 |
| B58 | IASFSQPDW | 43 | 1.9421 |
| B62 | YMSGNYKAF | 434 | 1.4814 |
Epitopes for CD8+ T-cell along with their interacting MHC class I alleles with affinity < 250 nM.
| Epitope | Interacting MHC-I allele (IC50) on the nM scale |
|---|---|
| VTARAGLGY | HLA-A∗29:02 (40.62), HLA-A∗01:01 (224.16), HLA-A∗30:02 (168.45), HLA-B∗15:02 (81.70), HLA-C∗03:03 (103.66) |
| FHTVTVNTL | HLA-B∗39:01 (9.02), HLA-B∗38:01 (62.10), HLA-B∗15:02 (21.84), HLA-C∗03:03 (5.48), HLA-C∗08:02 (211.91), HLA-C∗14:02 (92.18), HLA-C∗12:03 (78.92) |
| HTTWTLTGY | HLA-A∗26:01 (162.41), HLA-A∗25:01 (219.32), HLA-B∗15:02 (153.54), HLA-C∗14:02 (122.92), HLA-C∗12:03 (33.90), HLA-C∗03:03 (144.75) |
| IELAGTLTL | HLA-E∗01:01 (152.83), HLA-B∗40:01 (35.47), HLA-B∗15:02 (55.75), HLA-C∗03:03 (29.28), HLA-C∗12:03 (109.70) |
| IASFSQPDW | HLA-B∗58:01 (9.34), HLA-B∗57:01 (42.06), HLA-B∗53:01 (61.22), HLA-C∗03:03 (2.35), HLA-C∗08:02 (187.57), HLA-C∗15:02 (164.60), HLA-C∗12:03 (11.95), HLA-C∗14:02 (248.10) |
| ARAGLGYQF | HLA-C∗12:03, HLA-C∗03:03 (53.53), HLA-B∗15:02 (107.95), HLA-B∗27:05 (97.18) |
| NYAWVNGNI | HLA-A∗68:02 (188.19), HLA-C∗14:02 (21.26), HLA-C∗03:03 (220.60), HLA-C∗12:03 (60.56) |
| KSYMSGNYK | HLA-C∗15:02 (6.94), HLA-A∗30:01 (9.18), HLA-A∗11:01 (11.28), HLA-A∗03:01 (20.22), HLA-A∗31:01 (54.63), HLA-C∗14:02 (40.80), HLA-C∗03:03 (61.75), HLA-C∗12:03 (31.71) |
| YMSGNYKAF | HLA-B∗15:01 (61.09), HLA-B∗35:01 (124.62), HLA-C∗14:02 (17.24), HLA-B∗15:02 (61.69), HLA-C∗03:03 (13.14), HLA-C∗12:03 (151.43) |
| SMYNTLWRV | HLA-A∗02:01 (6.70), HLA-A∗02:06 (13.79), HLA-C∗14:02 (105.59), HLA-C∗12:03 (30.21) |
| APKGSNKEI | HLA-B∗07:02 (199.48), HLA-C∗12:03 (7.82), HLA-C∗03:03 (37.81) |
Figure 2Cluster analysis of the HLA alleles for both MHC molecules through heat map representation. (a) Representing the cluster of the MHC-I. (b) Representing the cluster of MHC-II molecules. Epitopes are clustered on the basis of interaction with HLA and shown as red colour indicating strong interaction with appropriate annotation. Yellow zone indicates the weaker interaction. Here, all the available alleles are shown only.
The potential CD4+ T-cell epitopes along with their interacting MHC class II alleles with affinity (IC50) < 100 nM and respective predivac scores.
| Epitope | Interacting MHC-II allele (IC50) on the nM scale | Number of alleles | Predivac score (binding core) |
|---|---|---|---|
| VTARAGLGYQFDLFA | HLA-DRB1∗04:01, HLA-DRB1∗09:01, HLA-DRB1∗04:05, HLA-DRB3∗01:01, HLA-DQA1∗05:01 | 5 | 68.76 (LGYQFDLFA) |
| NSGFHTVTVNTLDAT | HLA-DRB1∗01:01 (53), HLA-DRB1∗01:21 (17.87), HLA-DRB1∗01:17 (77.94), HLA-DRB1∗01:16 (98.93), HLA-DRB1∗01:13 (61.53), HLA-DRB1∗01:29 (83.08), HLA-DRB1∗01:24 (62.89), HLA-DRB1∗01:10 (28.18), HLA-DRB1∗01:11 (45.39), HLA-DRB1∗01:19 (39.99), HLA-DRB1∗01:12 (39.99), HLA-DRB1∗01:31 (39.99), HLA-DRB1∗01:32 (39.99), HLA-DRB1∗01:08 (39.99), HLA-DRB1∗01:05 (39.99), HLA-DRB1∗01:07 (39.99), HLA-DRB1∗01:27 (39.99), HLA-DRB1∗01:25 (39.99), HLA-DRB1∗01:22 (39.99), HLA-DRB1∗01:28 (39.99), HLA-DRB1∗01:14 (47.44), HLA-DRB1∗01:09 (37.12), HLA-DRB1∗01:15 (71.58), HLA-DRB1∗01:18 (44.25), HLA-DRB1∗01:06 (35.78), HLA-DRB1∗01:26 (38.4), HLA-DRB1∗01:20 (41.15), HLA-DRB1∗01:23 (48.69), HLA-DRB1∗01:04 (48.54), HLA-DRB1∗07:06 (65.25), HLA-DRB1∗07:05 (65.06), HLA-DRB5∗02:05 (46.61) | 32 | 71.56 (FHTVTVNTL) |
| KAIELAGTLTLTGTP | HLA-DRB1∗01:01 (82), HLA-DRB1∗01:21 (23.85), HLA-DRB1∗01:17 (98.16), HLA-DRB1∗01:24 (58.9), HLA-DRB1∗01:10 (29.05), HLA-DRB1∗01:11 (49.8), HLA-DRB1∗01:19 (44.72), HLA-DRB1∗01:12 (44.72), HLA-DRB1∗01:31 (44.72), HLA-DRB1∗01:32 (44.72), HLA-DRB1∗01:08 (44.72), HLA-DRB1∗01:05 (44.72), HLA-DRB1∗01:07 (44.72), HLA-DRB1∗01:27 (44.72), HLA-DRB1∗01:25 (44.72), HLA-DRB1∗01:22 (44.72), HLA-DRB1∗01:28 (44.72), HLA-DRB1∗01:03 (91.98), HLA-DRB1∗01:14 (54.39), HLA-DRB1∗01:09 (31.87), HLA-DRB1∗01:15 (59.62), HLA-DRB1∗01:18 (40.48), HLA-DRB1∗01:06 (14.26), HLA-DRB1∗01:26 (18.43), HLA-DRB1∗01:20 (17.57), HLA-DRB1∗01:23 (24.27), HLA-DRB1∗01:04 (18.63), HLA-DRB1∗01:01 (82), HLA-DRB5∗02:05 (70.95) | 29 | 71.70 (IELAGTLTL) |
| NNQIASFSQPDWESR | 0 | 55.02 (FSQPDWESR) | |
| VTARAGLGYQFDLFA | 0 | 68.76 (LGYQFDLFA) | |
| AQNYAWVNGNIKSDK | HLA-DRB5∗01:01 (62), HLA-DRB5∗02:04 (93.07), HLA-DRB5∗02:05 (24.55) | 3 | 78.78 (YAWVNGNIK) |
| AAKSYMSGNYKAFLT | HLA-DRB1∗08:05 (65.65), HLA-DRB1∗12:03 (94.66), HLA-DRB1∗01:09 (57.66), HLA-DRB1∗01:10 (98.95), HLA-DRB1∗01:19 (99.23), HLA-DRB1∗01:12 (99.23), HLA-DRB1∗01:31 (99.23), HLA-DRB1∗01:32 (99.23), HLA-DRB1∗01:08 (99.23), HLA-DRB1∗01:05 (99.23), HLA-DRB1∗01:07 (99.23), HLA-DRB1∗01:27 (99.23), HLA-DRB1∗01:25 (99.23), HLA-DRB1∗01:22 (99.23), HLA-DRB1∗01:28 (99.23), HLA-DRB1∗01:21 (73.89), HLA-DRB1∗01:06 (52.24), HLA-DRB1∗01:23 (86.01), HLA-DRB1∗01:04 (87.14), HLA-DRB1∗01:26 (83.15), HLA-DRB1∗01:20 (85.84), HLA-DRB5∗02:05 (12.01), HLA-DRB5∗02:02 (67.2), HLA-DRB5∗02:04 (62.02) | 24 | 60.30 (KSYMSGNYK) |
| SYMSGNYKAFLTEVN | HLA-DRB1∗01:21 (56.34), HLA-DRB1∗01:09 (89.52), HLA-DRB1∗01:10 (81.52), HLA-DRB1∗01:26 (98.98), HLA-DRB1∗01:06 (88.25), HLA-DRB1∗04:05 (38), HLA-DRB5∗02:05 (41.74) | 7 | 69.98 (YKAFLTEVN) |
| ASMYNTLWRVNGQSA | HLA-DRB1∗08:05 (35.19), HLA-DRB1∗13:01 (49.45), HLA-DRB1∗12:05 (52.01), HLA-DRB1∗12:02 (60.35), HLA-DRB1∗12:03 (50.74), HLA-DRB1∗01:23 (62.08), HLA-DRB1∗01:26 (66.37), HLA-DRB1∗01:20 (74.19), HLA-DRB1∗01:06 (73.79), HLA-DRB5∗02:05 (44.37), HLA-DRB1∗11:01 (70) | 11 | 82.45 (LWRVNGQSA) |
Conservancy analysis of all the epitopes with appropriate length.
| Epitope | Conservancy | Length | Epitope | Conservancy | Length |
|---|---|---|---|---|---|
| VTARAGLGY | 84.09% | 9 |
| 84.09% | 15 |
| SMYNTLWRV | 100% | 9 | A | 100% | 15 |
| KSYMSGNYK | 97.73% | 9 | AA | 97.73% | 15 |
| NYAWVNGNI | 100% | 9 | AQN | 97.73% | 15 |
| HTTWTLTGY | 97.73% | 9 | No prediction | Undetected | 15 |
| APKGSNKEI | 100% | 9 | No prediction | Undetected | 15 |
| YMSGNYKAF | 97.73% | 9 | S | 97.73% | 15 |
| ARAGLGYQF | 84.09% | 9 | VT | 84.09% | 15 |
| FHTVTVNTL | 100% | 9 | NSG | 100% | 15 |
| IELAGTLTL | 100% | 9 | KA | 100% | 15 |
| IASFSQPDW | 97.73% | 9 | NNQ | 97.73% | 15 |
| YMSGNYKAF | 97.73% | 9 | S | 97.73% | 15 |
Figure 3MSA-based location identification of the different epitopes within the SPATE proteins of Shigella and their homologue in E. coli. In this figure, gi|647302223|, gi|446956855|, gi|844758686|, and gi|446956853| represent the S. flexneri, S. sonnei, S. boydii, and S. dysenteriae, respectively. E. coli represented by gi|693049347| and gi|699401135|.
Figure 4The three-dimensional model of SPATE subfamily protein SigA with the proposed epitopes VTARAGLGY (magenta), FHTVTVNTL (yellow), HTTWTLTGY (green), and IELAGTLTL (red). The superficial localities of the epitopes indicate their surface accessibility.
Figure 5The configurational frustration index of the predicted model of the SigA. (a) This analysis detects the stability and energy differences of the 3D structure of the protein. Colours are in accordance with their frustration index. The red colour regions are highly frustrated and the green colour regions are not frustrated. The frustrated residues are able to change their identity and also displace the location in any favourable conditions. (b) The locations of our proposed epitopes are described by different colours. The epitopes HTTWTLTGY (cyan) and IELAGTLTL (blue) are well outside of the frustrated regions and securing their stability. On the other hand, the epitopes VTARAGLGY (yellow) and FHTVTVNTL (orange) are in the frustrated regions and unable to secure their stability.
Figure 6Population coverage analysis for the top predicted epitope based on the HLA interaction. Here, the whole world populations are assessed for the proposed epitope. The combined prediction for both of the MHC has been shown. Here, the number 1 bar for all the analyses represents out-predicted epitope. Notes: in the graphs, the line (-o-) represents the cumulative percentage of population coverage of the epitopes; the bars represent the population coverage for each epitope.
Figure 7Docking analysis of the predicted epitope IELAGTLTL and HLA-E allele. (a) Representing the oriented view of the interaction and assuring the perfect binding. (b) Representing the cartoon view. (c) Embodying the interacted residues with the peptide.
Figure 8Docking analysis of the predicted epitope KAIELAGTLTLTGTP and HLA-DQA1 allele. (a) Representing the oriented view of the interaction and assuring the perfect binding. (b) Representing the cartoon view. (c) Embodying the interacted residues with the peptide.
Figure 9B-cell epitope prediction. (a) Kolaskar and Tongaonkar antigenicity prediction of the proposed epitope with a threshold value of 1.037. (b) Emini surface accessibility prediction of the proposed epitope, with a threshold value of 1.0. (c) Parker hydrophilicity prediction of the epitope, with a threshold of −0.352. Notes: the x-axis and y-axis represent the sequence position and antigenic propensity, respectively. The regions above the threshold are antigenic (desired), shown in yellow.