| Literature DB >> 26681959 |
Satarudra Prakash Singh1, Kriti Roopendra2, Bhartendu Nath Mishra2.
Abstract
Despite the wealth of information regarding genetics of the causative parasite and experimental immunology of the cutaneous leishmaniasis, there is currently no licensed vaccine against it. In the current study, a two-level data mining strategy was employed, to screen the Leishmania major genome for promising vaccine candidates. First, we screened a set of 25 potential antigens from 8312 protein coding sequences, based on presence of signal peptides, GPI anchors, and consensus antigenicity predictions. Second, we conducted a comprehensive immunogenic analysis of the 25 antigens based on epitopes predicted by NetCTL tool. Interestingly, results revealed that candidate antigen number 1 (LmjF.03.0550) had greater number of potential T cell epitopes, as compared to five well-characterized control antigens (CSP-Plasmodium falciparum, M1 and NP-Influenza A virus, core protein-Hepatitis B virus, and PSTA1-Mycobacterium tuberculosis). In order to determine an optimal set of epitopes among the highest scoring predicted epitopes, the OptiTope tool was employed for populations susceptible to cutaneous leishmaniasis. The epitope (127SLWSLLAGV) from antigen number 1, found to bind with the most prevalent allele HLA-A⁎0201 (25% frequency in Southwest Asia), was predicted as most immunogenic for all the target populations. Thus, our study reasserts the potential of genome-wide screening of pathogen antigens and epitopes, for identification of promising vaccine candidates.Entities:
Year: 2015 PMID: 26681959 PMCID: PMC4670862 DOI: 10.1155/2015/709216
Source DB: PubMed Journal: J Trop Med ISSN: 1687-9686
Figure 1Flowchart depicting the steps adopted for genome-wide screening of potential antigens and their epitopes optimization.
List of predicted 25 L. major candidate antigens and 5 control antigens along with their prediction probabilities using SOLpro and TMHMM program.
| Antigen number | GeneDB ID/UniProt accession number | Number of amino acids | Function | Protein solubility upon overexpression prediction probabilities | Number of TM regions |
|---|---|---|---|---|---|
| 1 | LmjF.03.0550 | 1316 | Hypothetical protein, conserved | 0.947197 | 5 |
| 2 | LmjF.04.0130 | 531 | Hypothetical protein, conserved in | 0.814123 | 0 |
| 3 | LmjF.04.0180 | 504 | Surface antigen-like protein | 0.819598 | 1 |
| 4 | LmjF.04.0190 | 709 | Surface antigen-like protein | 0.931571 | 1 |
| 5 | LmjF.04.0200 | 182 | Surface antigen-like protein | 0.769865 | 0 |
| 6 | LmjF.04.0210 | 277 | Surface antigen-like protein | 0.913911 | 2 |
| 7 | LmjF.04.0910 | 248 | Hypothetical protein, conserved | 0.586919 | 0 |
| 8 | LmjF.06.0380 | 401 | Hypothetical protein, conserved | 0.76955 | 0 |
| 9 | LmjF.09.0850 | 335 | Ras family protein-like protein | 0.612666 | 0 |
| 10 | LmjF.12.0710 | 108 | Hypothetical protein, conserved | 0.556695 | 1 |
| 11 | LmjF.12.1000 | 385 | Promastigote surface antigen protein 2, PSA2 | 0.802331 | 2 |
| 12 | LmjF.12.0870 | 547 | Surface antigen protein 2, putative | 0.696157 | 2 |
| 13 | LmjF.12.0740 | 760 | Surface antigen protein, putative | 0.534869 | 2 |
| 14 | LmjF.13.0480 | 522 | Hypothetical protein, conserved | 0.785913 | 0 |
| 15 | LmjF.14.0770 | 396 | Hypothetical protein, unknown function | 0.705749 | 0 |
| 16 | LmjF.16.0620 | 1136 | Hypothetical protein, unknown function | 0.682847 | 0 |
| 17 | LmjF.17.1350 | 179 | Hypothetical protein, conserved | 0.625526 | 1 |
| 18 | LmjF.22.0470 | 426 | Hypothetical protein, conserved | 0.743393 | 0 |
| 19 | LmjF.22.1260 | 1087 | Hypothetical protein, conserved | 0.689942 | 0 |
| 20 | LmjF.23.0225 | 221 | Hypothetical protein, conserved | 0.870557 | 1 |
| 21 | LmjF.24.1520 | 1152 | Hypothetical protein, conserved | 0.588925 | 0 |
| 22 | LmjF.26.0340 | 342 | Hypothetical protein, conserved | 0.671966 | 1 |
| 23 | LmjF.28.2565 | 264 | Hypothetical protein, conserved | 0.607781 | 0 |
| 24 | LmjF.32.0510 | 363 | Hypothetical protein, conserved | 0.825034 | 2 |
| 25 | LmjF.33.1890 | 556 | Hypothetical protein, conserved | 0.558399 | 1 |
| CSP | P02893 | 412 | Circumsporozoite protein | 0.857803 | 0 |
| M1 | P36347 | 252 | Matrix protein 1 | 0.655264 | 0 |
| NP | P03466 | 498 | Nucleoprotein | 0.528252 | 0 |
| Core | CAA59535.1 | 185 | HBV core | 0.816707 | 0 |
| PSTA1 | P9WG10 | 304 | Phosphate transport system permease protein 1 | 0.654434 | 6 |
Figure 2Graphical representation of CTL epitopes predicted by NetCTL for the 25 potential L. major antigens and 5 control antigens (CSP, M1, core, PSTA1, and NP), which bind with at least one allele in the HLA-A (A1, A2, A3, A24, and A26) and HLA-B (B7, B8, B27, B39, B44, B58, and B62) supertypes.
NetCTL predicted top scoring CTL epitopes along with start position and processing score for the HLA-A supertypes in 25 L. major candidate antigens.
| Antigens | A1 | A2 | A3 | A24 | A26 |
|---|---|---|---|---|---|
| 1 | NTDNFFFML | SLWSLLAGV | NLAAGQSLK | LYLLLPFLL | YTISLNPLL |
|
| |||||
| 2 | MSSTSFDDY | ALVSINVPL | SLFRVANCK | VWCTVPDCI | SVVDPMQNY |
|
| |||||
| 3 | CTQCAPNYY | LLTSFAMHL | SSYSCVSQK | GYAKDSNGF | YVVDSYDGL |
|
| |||||
| 4 | FIDANTAIY | MLPDMTCSL | SSYSCVSQK | GYIVVDKYF | YVVDSYDGL |
|
| |||||
| 5 | TTSTTTNTV | LMAAMLVAV | TMPTAPSSK | GYMPTASFK | ETASTTSTT |
|
| |||||
| 6 | VSAQTIDDY | YLCDRTTAA | VSYTCIPRK | GYPNINTYL | QTIDDYPPV |
|
| |||||
| 7 | TAAVKPLSY | TLASHPHYL | RVAEFLVVK | HYLHEANVF | EVPICSLEF |
|
| |||||
| 8 | SIMSLQIRY | FLFSPTDTL | RIKRNWQKK | IFMRLEDTI | SIIEKATRY |
|
| |||||
| 9 | YREILNEFY | FVAKYIPTI | LMMSCWSAR | LYTPALPPF | EVIEDLVVW |
|
| |||||
| 10 | SASNHKEFY | RMDVIGATV | EFYIYYLAK | QWTRRMHLI | SASNHKEFY |
|
| |||||
| 11 | LTDEKTCLK | FLTDEKTCL | QAFGRAIPK | TYAGTLPEM | YVSGISPTY |
|
| |||||
| 12 | LTDERTCLV | FLTDERTCL | RIQQLVLRK | LYIWNMPLL | TTITSTTKL |
|
| |||||
| 13 | LTDERTCLV | MLSAENLQL | KSLTNLYLK | EWSRVTSLL | EMKSLTNLY |
|
| |||||
| 14 | DLEEEVEEY | TLLEQYASL | LLEQYASLK | SFPPSPSLL | QVKELKVSY |
|
| |||||
| 15 | QTRVHPGLY | YLLDGDQLI | RSAPHHSRR | LFGAFLFAF | DVKESNAHV |
|
| |||||
| 16 | LVDTTAWRY | MLWETVAAL | RTATARLHK | LFQRVLAPI | EAQDHSCFY |
|
| |||||
| 17 | KADTYVEEF | VLAVVVLLV | HLRGAATGK | DMATVFAYF | STVRLLVSF |
|
| |||||
| 18 | ATSNAASRY | TMADVLLYA | ATMADVLLY | QFLINSSSI | ATMADVLLY |
|
| |||||
| 19 | FTSGEISFY | YMNLISQSI | LLYCRESRK | AYLRELFPV | FTSGEISFY |
|
| |||||
| 20 | NTTTAVRGY | ILMWSFAAL | ALFVVMAMY | NWWILMWSF | NTTTAVRGY |
|
| |||||
| 21 | LASLLSSKY | ALARYPLPV | ALASLLSSK | VYILLTEFL | HVARQLASY |
|
| |||||
| 22 | YMDPGAAGY | TLFPIDVTV | ALYTSIPVR | LFLLVIYAF | EAAHFLMAY |
|
| |||||
| 23 | CTGSSPSVY | VLIDYLLSM | TLASSAAVK | VYFTLPTAV | VLIDYLLSM |
|
| |||||
| 24 | LTAPVYMQY | LMFSLSQSL | RLTPFFQNY | LYRIDGTLI | ATGDQVSGY |
|
| |||||
| 25 | ISDTQVLLA | VLVGVVLGV | LVHAGIAGK | AYFVVPLEM | STVLRLFSF |
The distribution of fractional immunogenicity for the 60 optimal epitopes against the population of Southwest Asia.
| S. number | Epitope | Fraction of overall immunogenicity | Covered alleles |
|---|---|---|---|
| 1 |
| 0.07 | A |
| 2 | LPRLFLAFL | 0.06 | B |
| 3 | YLLDGDQLI | 0.05 | A |
| 4 | GYPNINTYL | 0.05 | A |
| 5 | LYLLLPFLL | 0.05 | A |
| 6 | YMDPGAAGY | 0.05 | A |
| 7 | TPRIPLDSL | 0.05 | B |
| 8 | SPTPLLAAL | 0.04 | B |
| 9 | SFDDYTMVL | 0.04 | A |
| 10 | GYIVVDKYF | 0.03 | A |
| 11 | VLVGVVLGV | 0.03 | A |
| 12 | TLFPIDVTV | 0.03 | A |
| 13 | FIDANTAIY | 0.03 | A |
| 14 | FLFSPTDTL | 0.02 | A |
| 15 | LMAAMLVAV | 0.02 | A |
| 16 | APNYYLTPL | 0.02 | B |
| 17 | SLFRVANCK | 0.02 | A |
| 18 | LFLLVIYAF | 0.02 | A |
| 19 | MLPDMTCSL | 0.02 | A |
| 20 | ELRRRGQEV | 0.02 | B |
| 21 | LFGAFLFAF | 0.02 | Cw |
| 22 | LVHAGIAGK | 0.02 | A |
| 23 | FPISKGAAL | 0.02 | B |
| 24 | RPRLFARAI | 0.01 | B |
| 25 | STQPPVSSW | 0.01 | B |
| 26 | GYMPTASFK | 0.01 | A |
| 27 | APAAPHSPL | 0.01 | B |
| 28 | LTSAINQFW | 0.01 | B |
| 29 | IHPERTVAL | 0.01 | B |
| 30 | VESGALFSF | 0.01 | B |
| 31 | YQLTGTPVL | 0.01 | A |
| 32 | GTFAAPLRW | 0.01 | B |
| 33 | YVVDSYDGL | 0.01 | A |
| 34 | APALYTISL | 0.01 | B |
| 35 | APAHGSVSL | 0.01 | B |
| 36 | AYFVVPLEM | 0.01 | A |
| 37 | KHLLMAAML | 0.01 | B |
| 38 | FVRVWDRSL | 0.01 | B |
| 39 | AHAPQNAAL | 0.01 | B |
| 40 | ALYTSIPVR | 0.01 | A |
| 41 | MSLFVSTLF | 0.01 | B |
| 42 | TPSSARLSM | 0.01 | B |
| 43 | STVLRLFSF | 0.01 | B |
| 44 | VSGGSTVSF | 0.01 | B |
| 45 | SAAAPYSLW | 0.01 | B |
| 46 | NLAAGQSLK | <0.01 | A |
| 47 | TMPTAPSSK | <0.01 | A |
| 48 | CESGYALTV | <0.01 | B |
| 49 | VSAQTIDDY | <0.01 | Cw |
| 50 | YLCDRTTAA | <0.01 | A |
| 51 | RELQSVYLL | <0.01 | B |
| 52 | RIKRNWQKK | <0.01 | A |
| 53 | HVARQLASY | <0.01 | Cw |
| 54 | SVVDPMQNY | <0.01 | Cw |
| 55 | AMKDPYTNY | <0.01 | Cw |
| 56 | LLPRLFLAF | <0.01 | A |
| 57 | REWYSADVL | <0.01 | B |
| 58 | YTISLNPLL | <0.01 | Cw |
| 59 | VRAPFTIQL | <0.01 | Cw |
| 60 | KSSALAHKL | <0.01 | Cw |
The distribution of fractional immunogenicity for the 45 optimal epitopes against the population of North Africa.
| S. number | Epitope | Fraction of overall immunogenicity | Covered alleles |
|---|---|---|---|
| 1 |
| 0.09 | A |
| 2 | YMDPGAAGY | 0.09 | A |
| 3 | YLLDGDQLI | 0.07 | A |
| 4 | TPRIPLDSL | 0.05 | B |
| 5 | LPRLFLAFL | 0.05 | B |
| 6 | FIDANTAIY | 0.05 | A |
| 7 | VESGALFSF | 0.04 | B |
| 8 | VLVGVVLGV | 0.04 | A |
| 9 | TLFPIDVTV | 0.04 | A |
| 10 | ELRRRGQEV | 0.04 | B |
| 11 | GYPNINTYL | 0.03 | A |
| 12 | LYLLLPFLL | 0.03 | A |
| 13 | LMAAMLVAV | 0.03 | A |
| 14 | FPISKGAAL | 0.03 | B |
| 15 | FLFSPTDTL | 0.03 | A |
| 16 | GYIVVDKYF | 0.03 | A |
| 17 | MLPDMTCSL | 0.02 | A |
| 18 | SLFRVANCK | 0.02 | A |
| 19 | APAAPHSPL | 0.02 | B |
| 20 | RPRLFARAI | 0.01 | B |
| 21 | ALYTSIPVR | 0.01 | A |
| 22 | APALYTISL | 0.01 | B |
| 23 | APNYYLTPL | 0.01 | B |
| 24 | APAHGSVSL | 0.01 | B |
| 25 | FVRVWDRSL | 0.01 | B |
| 26 | STQPPVSSW | 0.01 | B |
| 27 | LVHAGIAGK | 0.01 | A |
| 28 | LTSAINQFW | 0.01 | B |
| 29 | GTFAAPLRW | 0.01 | B |
| 30 | AYFVVPLEM | 0.01 | A |
| 31 | SPTPLLAAL | 0.01 | B |
| 32 | TPSSARLSM | 0.01 | B |
| 33 | MSLFVSTLF | 0.01 | B |
| 34 | GYMPTASFK | 0.01 | A |
| 35 | STVLRLFSF | 0.01 | B |
| 36 | YLCDRTTAA | 0.01 | A |
| 37 | NLAAGQSLK | <0.01 | A |
| 38 | TMPTAPSSK | <0.01 | A |
| 39 | VSGGSTVSF | <0.01 | B |
| 40 | SAAAPYSLW | <0.01 | B |
| 41 | RELQSVYLL | <0.01 | B |
| 42 | SFDDYTMVL | <0.01 | A |
| 43 | LFLLVIYAF | <0.01 | A |
| 44 | RIKRNWQKK | <0.01 | A |
| 45 | LLPRLFLAF | <0.01 | A |
The distribution of fractional immunogenicity for the 34 optimal epitopes against the population of South America.
| S. number | Epitope | Fraction of overall immunogenicity | Covered alleles |
|---|---|---|---|
| 1 |
| 0.1 | A |
| 2 | ALYTSIPVR | 0.1 | A |
| 3 | GYPNINTYL | 0.08 | A |
| 4 | YLLDGDQLI | 0.08 | A |
| 5 | LYLLLPFLL | 0.08 | A |
| 6 | GYIVVDKYF | 0.05 | A |
| 7 | VLVGVVLGV | 0.05 | A |
| 8 | TLFPIDVTV | 0.04 | A |
| 9 | SFDDYTMVL | 0.04 | A |
| 10 | VSAQTIDDY | 0.04 | Cw |
| 11 | SPTPLLAAL | 0.03 | Cw |
| 12 | LMAAMLVAV | 0.03 | A |
| 13 | FLFSPTDTL | 0.03 | A |
| 14 | HVARQLASY | 0.03 | Cw |
| 15 | MLPDMTCSL | 0.02 | A |
| 16 | LFLLVIYAF | 0.02 | A |
| 17 | LFGAFLFAF | 0.02 | Cw |
| 18 | AYFVVPLEM | 0.02 | A |
| 19 | KHLLMAAML | 0.02 | B |
| 20 | IHPERTVAL | 0.02 | B |
| 21 | LPRLFLAFL | 0.02 | Cw |
| 22 | TPRIPLDSL | 0.01 | Cw |
| 23 | LVHAGIAGK | 0.01 | A |
| 24 | AHAPQNAAL | 0.01 | B |
| 25 | APNYYLTPL | 0.01 | Cw |
| 26 | SVVDPMQNY | 0.01 | Cw |
| 27 | AMKDPYTNY | 0.01 | Cw |
| 28 | YLCDRTTAA | 0.01 | A |
| 29 | LLPRLFLAF | 0.01 | B |
| 30 | SQQETSSLY | 0.01 | B |
| 31 | YQLTGTPVL | <0.01 | B |
| 32 | VRAPFTIQL | <0.01 | B |
| 33 | KRGAAAVLL | <0.01 | B |
| 34 | SIMSLQIRY | <0.01 | B |
| Antigens | B7 | B8 | B27 | B39 |
|---|---|---|---|---|
| 1 | APALYTISL | PLRWRFRAV | RRAKRGIQK | YQLTGTPVL |
|
| ||||
| 2 | TPSSARLSM | FPISKGAAL | VRVDTQSSF | SFDDYTMVL |
|
| ||||
| 3 | APNYYLTPL | YSLWVAAAV | SRAILIAVL | SRAILIAVL |
|
| ||||
| 4 | APNYYLTPL | FVRVWDRSL | LRVSHSSVK | VRAPFTIQL |
|
| ||||
| 5 | APAHGSVSL | VKHLLMAAM | DRLGQCMVV | KHLLMAAML |
|
| ||||
| 6 | SPTPLLAAL | NPHKRGAAA | KRGAAAVLL | HKRGAAAVL |
|
| ||||
| 7 | APSPCVPPL | AAYRSYAAV | MQVLLGADF | SHDGKHVIL |
|
| ||||
| 8 | LPRLFLAFL | LLPRLFLAF | GRIKRNWQK | IHPERTVAL |
|
| ||||
| 9 | SARARTLSL | YPRIKLLVI | ARARTLSLY | YEAAQGVLL |
|
| ||||
| 10 | QPTTFKNPI | RMHLIGTAV | KQWTRRMHL | HKEFYIYYL |
|
| ||||
| 11 | TPRTTTEPL | MSKARSLQL | RRLVLAATL | QRTNTLAVL |
|
| ||||
| 12 | MPYLRGVSL | MPYLRGVSL | RRLVLAATL | YRHVMIREL |
|
| ||||
| 13 | MPRLRLVGL | MPRLRLVGL | RRLVLAATL | TAAQRTHTL |
|
| ||||
| 14 | MPAPPLNPF | KEKERHKAV | RRLMPAPPL | YESNTVSAL |
|
| ||||
| 15 | TPRIPLDSL | SSHRKHKAM | RRMRAGSSH | AHAPQNAAL |
|
| ||||
| 16 | MPRKRGRPL | MPRKRGRPL | RRTLQAQQL | DHAQGVAAL |
|
| ||||
| 17 | VPHHPGGDV | YVEEFYQAA | KRVMAPSDR | FHDPSTVRL |
|
| ||||
| 18 | HPSGAAVAI | RLYVEDMVL | RRAEKEKAK | NHSAHTEVL |
|
| ||||
| 19 | KPSAVMTAF | EPSRRTVQF | RRWAAQNTF | FRVDGADAL |
|
| ||||
| 20 | AARQRIMTM | ILMWSFAAL | RRAPTGLYE | QLDDNWWIL |
|
| ||||
| 21 | APAAPHSPL | ELRRRGQEV | RRLLAASPF | NQATTSLAL |
|
| ||||
| 22 | YPAHRSKIV | DAQVRQTAL | QRRDVVIGM | SHLVSVDKL |
|
| ||||
| 23 | TPRVGCSVA | FFRRYTRVF | RRYTRVFPA | YFTLPTAVL |
|
| ||||
| 24 | SPLSVSAVF | YMQYRLTPF | FRYDHINSY | YASQKFVQL |
|
| ||||
| 25 | RPRLFARAI | RLPRRLQAM | RRLLVHAGI | HSEAATSSL |
| Antigens | B44 | B58 | B62 |
|---|---|---|---|
| 1 | RELQSVYLL | GTFAAPLRW | SQQETSSLY |
|
| |||
| 2 | VESGALFSF | VSGGSTVSF | LVVDASSLF |
|
| |||
| 3 | AECDTGYSL | SAAAPYSLW | VINSAAAPY |
|
| |||
| 4 | AECDTGYSL | SAAAPYSLW | AMKDPYTNY |
|
| |||
| 5 | PEQSKNAAL | AMASDASSW | SGYMPTASF |
|
| |||
| 6 | CESGYALTV | VAATVACVM | SGYALTVSY |
|
| |||
| 7 | HEANVFGDL | ITLASHPHY | MQVLLGADF |
|
| |||
| 8 | EEHKFHEQL | STQPPVSSW | QFIQGRCPY |
|
| |||
| 9 | YEAAQGVLL | QSFAALQSW | REHGCAAYY |
|
| |||
| 10 | KEFYIYYLA | TAVGVAICW | RMHLIGTAV |
|
| |||
| 11 | PEWGSMTSL | ISGSVPPEW | YVSGISPTY |
|
| |||
| 12 | LEGLTSLTL | ITGPLPPQW | YVRVISTTY |
|
| |||
| 13 | SEMKSLTSL | GSLPSEWSW | TQVSGTLPL |
|
| |||
| 14 | GEFSDIRQL | AAVADAEVW | QVKELKVSY |
|
| |||
| 15 | AQTRVHPGL | MSLFVSTLF | FVSTLFGAF |
|
| |||
| 16 | GEARNPHRL | AASAPSFQW | RLAAEAQGF |
|
| |||
| 17 | EFYQAAGHL | KADTYVEEF | QDMATVFAY |
|
| |||
| 18 | GEDEEQVSL | SAAKAQVSY | IVSGLVESY |
|
| |||
| 19 | AEHRRGTQL | CASTATHVF | MMSQSLSTY |
|
| |||
| 20 | AEMQRNIDR | LMWSFAALF | RGYTRGIPY |
|
| |||
| 21 | TESVQFLKL | LTSAINQFW | SMMLPAGDF |
|
| |||
| 22 | FEAPLGEML | VAFACYFLF | VLFTDGTPY |
|
| |||
| 23 | QEAKARTTV | TGSSPSVYF | AMHDDQLRF |
|
| |||
| 24 | KEPGHKIPL | LTAPVYMQY | SQSLTAPVY |
|
| |||
| 25 | REWYSADVL | KSSALAHKL | RVVKQSLCF |