Literature DB >> 26681959

Genome-Wide Prediction of Vaccine Candidates for Leishmania major: An Integrated Approach.

Satarudra Prakash Singh1, Kriti Roopendra2, Bhartendu Nath Mishra2.   

Abstract

Despite the wealth of information regarding genetics of the causative parasite and experimental immunology of the cutaneous leishmaniasis, there is currently no licensed vaccine against it. In the current study, a two-level data mining strategy was employed, to screen the Leishmania major genome for promising vaccine candidates. First, we screened a set of 25 potential antigens from 8312 protein coding sequences, based on presence of signal peptides, GPI anchors, and consensus antigenicity predictions. Second, we conducted a comprehensive immunogenic analysis of the 25 antigens based on epitopes predicted by NetCTL tool. Interestingly, results revealed that candidate antigen number 1 (LmjF.03.0550) had greater number of potential T cell epitopes, as compared to five well-characterized control antigens (CSP-Plasmodium falciparum, M1 and NP-Influenza A virus, core protein-Hepatitis B virus, and PSTA1-Mycobacterium tuberculosis). In order to determine an optimal set of epitopes among the highest scoring predicted epitopes, the OptiTope tool was employed for populations susceptible to cutaneous leishmaniasis. The epitope (127SLWSLLAGV) from antigen number 1, found to bind with the most prevalent allele HLA-A⁎0201 (25% frequency in Southwest Asia), was predicted as most immunogenic for all the target populations. Thus, our study reasserts the potential of genome-wide screening of pathogen antigens and epitopes, for identification of promising vaccine candidates.

Entities:  

Year:  2015        PMID: 26681959      PMCID: PMC4670862          DOI: 10.1155/2015/709216

Source DB:  PubMed          Journal:  J Trop Med        ISSN: 1687-9686


1. Introduction

Leishmaniases are a group of complex diseases caused by protozoan parasites of the genus Leishmania and transmitted to humans by hematophagous sandflies [1]. There are at least 20 species of the parasite, which vary according to geographical location and cause a variety of clinical manifestations, ranging from self-limiting cutaneous lesions to potentially fatal infection of the viscera [2, 3]. It is a disease of tropical and subtropical areas, with more than 12 million cases in 88 countries and 2 million new cases annually including 1.5 million cases of cutaneous leishmaniasis (CL) and 0.5 million cases of visceral leishmaniasis (VL). The cutaneous disease is particularly prevalent in Afghanistan, Algeria, Brazil, Iran, Peru, Saudi Arabia, and Syria, accounting for 90% of the global CL burden [4]. Although high-cost chemotherapeutics are available, they show high toxicity and are prone to drug resistance development due to prolonged treatment periods [5]. Despite substantial effort spent in developing effective vaccines, there is currently no licensed vaccine against human leishmaniasis [6]. A large number of proof-of-principle studies have clearly demonstrated that different vaccine formulations, ranging from killed/live-attenuated parasites to recombinant DNA/protein vaccines, can provide significant protection against infection with Leishmania spp. in a variety of animal models [7, 8]. However, the efficacy of these prophylactic or therapeutic vaccines remains partial, and it is therefore necessary to develop novel and effective vaccines [1]. In this regard, antigen identification represents the most important roadblock in vaccine development against any pathogen, as it is usually achieved through rather empirical, time-consuming, and labour-intensive in vivo and in vitro experiments. Efforts have thus been devoted to the development of novel strategies for a more rational and faster identification of antigens among large number of pathogen proteins [9]. In recent development, reverse vaccinology approach can be used to predict those antigens that are most likely to be vaccine candidates using the pathogen genomic sequence [10]. Moreover, the genomic information, which contains the sequences of all known and unknown potential antigens of each pathogen, has enabled the prediction and analysis of the entire repertoire of potential cytotoxic T lymphocytes (CTL) epitopes using bioinformatics tools. This strategy allows the development of vaccines that were previously difficult or impossible to make and can lead to the discovery of unique antigens that may improve existing vaccines [11]. The recent genomic sequence completion of L. major, L. braziliensis, L. infantum, and L. donovani and the availability of immunoinformatics tools have opened new opportunities for the identification of novel vaccine targets against CL [9]. Additionally, the presence of genetically stable but highly conserved antigens among most of the species, including antigens with little or no homology to human proteins, offers hope for the development of a single vaccine for multiple disease indications [12]. As depicted in the literature, effective vaccines must invoke a strong response from both T and B cells; therefore, CTL epitope mapping is crucial in any vaccine designing strategy. Many immunoinformatics algorithms and resources have been available to predict T and B cell immune epitopes for peptide based vaccine design and development [13]. Thus, the approach of T cell epitopes prediction and their in vitro/in vivo validations appeared to be a very powerful strategy in rational antigen identification, particularly for a pathogen with a large genome such as Leishmania [9]. Hence, the current study deals with the analysis of L. major genome (33.6 Mb), considered to express about 8300 proteins, all of which are potential antigens containing effective CTL epitopes with respect to susceptible population [14].

2. Materials and Methods

2.1. Retrieval of Proteome Sequence Dataset

The complete proteome of L. major (strain Friedlin), consisting of 8312 protein coding sequences, was extracted from database GeneDB [15]. We also retrieved five well-characterized control antigens (CSP-401GLIMVLSFL from Plasmodium falciparum, M1-58GILGFVFTL and NP- 265ILRGSVAHK from Influenza A virus, core-141STLPETTVV from Hepatitis B virus, and PSTA1- 41FVVALIPLV from Mycobacterium tuberculosis) from database AntigenDB in order to compare and validate the prediction results. These known antigens have been previously tested and verified in various experimental studies and reported as capable of eliciting CTL responses [16].

2.2. Methodology Used for Prediction and Characterization of Candidate Antigens/Epitopes

Initially, the L. major proteome (8312 proteins) was screened for the presence of both signal peptide and GPI anchors using SignalP [17] and DGPI [18], respectively, and then consensus antigenicity predictions were done using VaxiJen [19] and ANTIGENpro [20] programs. Finally selected candidate antigens were further characterized using TMHMM [21], SCRATCH protein predictor [22], and BetaWrap program [23]. Thereafter, these candidate antigens were searched for potential sequence similarity with other closely related species and human and/or mouse proteins, using OrthoMCL database [24]. Furthermore, CTL epitopes prediction was carried out using NetCTL1.2 [25, 26] tool integrating predictions of proteasomal cleavage, TAP transport efficiency, and 12 MHC class I supertypes' binding. Finally, OptiTope (http://etk.informatik.uni-tuebingen.de/optitope) was used to determine good vaccine epitopes called the optimal set of epitopes from top scoring naturally processed T cell epitopes, for each population susceptible to cutaneous leishmaniasis (Figure 1) [27].
Figure 1

Flowchart depicting the steps adopted for genome-wide screening of potential antigens and their epitopes optimization.

The tool OptiTope requires the following input data from the user: (i) sequences of known/predicted antigens, (ii) a target human population, that is, MHC alleles and corresponding frequency, and (iii) the epitope set to be optimized. The input given by the user is transformed into an optimization problem. If it is feasible, OptiTope will return an optimal set of epitopes along with fraction of immunogenicity contributing to overall immunogenicity. Otherwise, program will propose changes to the user's input that might yield a feasible optimization problem. The information related to MHC alleles frequency in susceptible human populations and geographic areas is retrieved from dbMHC database (http://www.ncbi.nlm.nih.gov/gv/mhc). A good vaccine displays a high overall immunogenicity that means it is capable of inducing potent immunity in a large fraction of the target population including high mutation tolerance as well as a certain degree of allele and antigen coverage. Furthermore, the finally selected epitopes should display a high probability of passing through the natural antigen processing pathway. From all possible epitope combinations, the ones with a maximum overall immunogenicity will be called “optimal” (there may be more than one optimal epitope combination). Hence, the search for an optimal epitope set for an good vaccine can be considered as an optimization problem: out of a given set of epitopes, choose a subset which, out of all subsets meeting the other input requirements, displays maximum overall immunogenicity I, which can be derived mathematically (1) as a weighted sum over immunogenicities of epitopes E with respect to the given MHC alleles A: where p is the frequency of allele a in the target population and i measure the immunogenicity of epitope e when bound to allele a (either predicted or experimentally determined).

3. Results and Discussion

The present study was divided into two major steps: (i) we utilized the L. major genome consisting of 8312 protein coding sequences and predicted 25 antigens (Table 1), through successive screening and consensus antigenicity predictions; (ii) we conducted a comprehensive analysis of the epitopes predicted from these 25 candidate antigens (Figure 1). The present strategy is similar to the reverse vaccinology approach adopted by John et al. [28], for identifying common vaccine candidates from L. major and L. infantum genomes. Additionally, Singh et al. [29, 30] also utilized the similar approach of MHC supertype based epitope identification, as a strategy to mine proteomic data for identification of novel CTL epitopes, in Plasmodium falciparum.
Table 1

List of predicted 25 L. major candidate antigens and 5 control antigens along with their prediction probabilities using SOLpro and TMHMM program.

Antigen numberGeneDB ID/UniProt accession numberNumber of amino acidsFunctionProtein solubility upon overexpression prediction probabilitiesNumber of TM regions
1LmjF.03.05501316Hypothetical protein, conserved0.947197Insoluble5
2LmjF.04.0130531Hypothetical protein, conserved in Leishmania 0.814123Insoluble0
3LmjF.04.0180504Surface antigen-like protein0.819598Soluble1
4LmjF.04.0190709Surface antigen-like protein0.931571Insoluble1
5LmjF.04.0200182Surface antigen-like protein0.769865Soluble0
6LmjF.04.0210277Surface antigen-like protein0.913911Insoluble2
7LmjF.04.0910248Hypothetical protein, conserved0.586919Soluble0
8LmjF.06.0380401Hypothetical protein, conserved0.76955Insoluble0
9LmjF.09.0850335Ras family protein-like protein 0.612666Insoluble0
10LmjF.12.0710108Hypothetical protein, conserved0.556695Insoluble1
11LmjF.12.1000385Promastigote surface antigen protein 2, PSA20.802331Insoluble2
12LmjF.12.0870547Surface antigen protein 2, putative0.696157Soluble2
13LmjF.12.0740760Surface antigen protein, putative 0.534869Soluble2
14LmjF.13.0480522Hypothetical protein, conserved0.785913Soluble0
15LmjF.14.0770396Hypothetical protein, unknown function0.705749Insoluble0
16LmjF.16.06201136Hypothetical protein, unknown function0.682847Soluble0
17LmjF.17.1350179Hypothetical protein, conserved0.625526Insoluble1
18LmjF.22.0470426Hypothetical protein, conserved0.743393Soluble0
19LmjF.22.12601087Hypothetical protein, conserved0.689942Insoluble0
20LmjF.23.0225221Hypothetical protein, conserved0.870557Insoluble1
21LmjF.24.15201152Hypothetical protein, conserved0.588925Insoluble0
22LmjF.26.0340342Hypothetical protein, conserved0.671966Insoluble1
23LmjF.28.2565264Hypothetical protein, conserved0.607781Insoluble0
24LmjF.32.0510363Hypothetical protein, conserved0.825034Insoluble2
25LmjF.33.1890556Hypothetical protein, conserved0.558399Insoluble1
CSPP02893412Circumsporozoite protein0.857803Soluble0
M1P36347252Matrix protein 10.655264Insoluble0
NPP03466498Nucleoprotein 0.528252Soluble 0
CoreCAA59535.1185HBV core0.816707Insoluble0
PSTA1P9WG10304Phosphate transport system permease protein 10.654434Insoluble6

3.1. Screening of L. major Genome for Identification and Characterization of Antigens

The previous studies revealed that surface-exposed proteins such as secretory/outer membrane proteins are ideal targets for vaccine development, with respect to those pathogens against which a strong B cell response (for antibody production) is critical. However, for vaccine development against those pathogens where T cell response is critical, subcellular localization is not an issue since a T cell response could be directed to any protein target [31]. In addition, GPI anchored proteins are abundantly expressed in the infective and intracellular stages of Trypanosoma cruzi (another kinetoplastid protozoan) and have been recognized as antigenic targets by both the humoral and cellular immunity [32]. Herein, the entire protein repertoire of L. major, consisting of 8312 protein coding sequences, was screened for presence of signal peptides and GPI anchors. Out of these, 265 proteins were predicted as GPI anchored proteins, using DGPI tool [18], and 1798 proteins were found to contain signal peptides/signal anchors, using SignalP3.0 tool [17]. However, 151 proteins were predicted to contain both signal peptides/signal anchors and GPI anchors (data not shown). Further screening of these 151 proteins, based on consensus antigenicity predictions using VaxiJen [19] and ANTIGENpro [20] tools, above a predefined threshold of 0.6, provided a set of 27 antigenic proteins (data not shown). Interestingly, three candidate antigens (GeneDB id: LmjF.04.0130, LmjF.04.0140, and LmjF.04.0170) were found to share a high sequence similarity (more than 99.6%) and thus the latter two antigens were excluded from further analysis. Finally, 25 candidate antigens were screened for further characterization as vaccine candidates (Table 1). Protein insolubility has been known to be a major obstacle for many experimental studies. Thus, we used SOLpro tool (of SCRATCH protein predictor [22]) to predict the propensity of a protein to be soluble upon overexpression. Out of the 25 antigens, 8 (numbers 3, 5, 7, 12, 13, 14, 16, and 18) were predicted to be soluble upon overexpression while control antigens M1, core, and PSTA1 were predicted to be insoluble upon overexpression (Table 1). Similarly, proteins with more than one transmembrane (TM) region have been found to be difficult to clone, express, and purify. Thus, we predicted TM regions using TMHMM web server. Out of 25 predicted antigens, 19 antigens were found to contain less than two; 5 antigens (numbers 6, 11, 12, 13, and 24) were found to contain two each while antigen number 1 was found to contain five TM regions. On the other hand, PSTA1 was found to contain 6 TM regions (Table 1). Through literature analysis, it has also been observed that many bacterial and fungal proteins such as toxins, virulence factors, adhesins, and surface proteins have parallel beta-helices which play important role in human infectious disease [33]. Therefore, BetaWrap program [23] was used to predict the super secondary structural motif in primary amino acid sequences of 25 antigens. A total 9 candidate antigens (numbers 2–6, 9, 11, 13, and 24) were predicted to contain right-handed parallel beta-helix. Besides, heterologous immunity may exist to cross-reactive epitopes in other strains of the same organism. Thus, we identified the potential orthologs in the available Leishmania genomes annotations using OrthoMCL database [24] through BLASTP homology prediction program. All the selected 25 candidate antigens showed orthologs in other related species, namely, L. braziliensis, L. infantum, and L. mexicana except antigen number 11. One of the greatest barriers in vaccine development is the possibility that a particular vaccine may cross-react between host and parasite antigens [34]. Thus, vaccine candidates showing sequence similarity with the hosts (e.g., human or mouse) proteins are likely to cause autoimmunity in the host and should be discarded to avoid potential autoimmunity. Out of 25 antigens, 3 (numbers 3, 9, and 18) showed orthologs in human as well as mouse.

3.2. Epitope Based Analysis of the Selected Antigens Using NetCTL

For elicitation of T cell responses, the subcellular location and function of target protein are less important than the presence of appropriate MHC binding epitopes in the protein sequences [35]. In past 15 years, significant efforts have been made toward generation of procedures/algorithms for accurate prediction of MHC binding affinity and T cell epitopes. Utilizing the clustering method, majority of HLA molecules have been classified in relatively few HLA supertypes on the basis of their peptide binding specificities [36, 37]. One approach to identifying targets of CTL responses in an antigen is based on prediction of high affinity MHC class I restricted T cell epitopes using computerized algorithms [38]. It is also demonstrated that peptides that possess in vitro binding affinity (IC50) values of ≤ 500 nM are more immunogenic in vivo [39]. Thus, in the current study, the immunogenicity screening was limited to the predicted peptides that were able to bind HLA class I supertypes, with binding affinities (IC50) ≤500 nM [25]. Furthermore, it is important to consider whether each MHC binding peptide is being correctly processed from the native antigen and subsequently displayed on the surface of antigen presenting cells. At present, it is possible to predict the naturally processed peptides using NetCTL algorithm above a combined epitope processing score of 0.75, which includes predictions of proteasomal cleavage, TAP binding, and HLA binding [26]. Thus, in order to identify candidate CD8+ T cell epitopes, 25 candidate antigens selected from L. major were screened using NetCTL. The tool NetCTL1.2 provides a comprehensive prediction about epitopes binding to the 12 different HLA class I supertypes: HLA-A1, HLA-A2, HLA-A3, HLA-A24, and HLA-A26 and HLA-B7, HLA-B8, HLA-B27, HLA-B39, HLA-B44, HLA-B58, and HLA-B62. A total of 3756 putative CTL epitopes were predicted, including 1373 HLA-A (230-A1, 429-A2, 269-A3, 207-A24, and 238-A26) and 2383 HLA-B (542-B7, 283-B8, 318-B27, 288-B39, 157-B44, 304-B58, and 491-B62) supertype binding peptides (Figure 2). Predictions for the 5 control antigens showed that CSP had 28-HLA-A, 40-HLA-B, M1 had 33-HLA-A, 55-HLA-B, core had 21-HLA-A, 48-HLA-B, PSTA1 had 83-HLA-A, 104-HLA-B, and NP had 59-HLA-A, 126-HLA-B restricted CTL epitopes. Apart from this, their experimentally validated CTL epitopes were also predicted by NetCTL ranked in top five. From the analysis, antigen number 1 was found to have largest number of predicted CTL epitopes for HLA-A and HLA-B supertypes while antigens numbers 4, 16, 19, and 21 had higher number of CTL epitopes for HLA-A supertypes and antigens numbers 13, 16, 19, and 21 had higher number of CTL epitopes for HLA-B supertypes in comparison to the control antigens. Overall, test antigen number 1 showed highest number of supertype epitopes and was thus predicted as best antigen. From among the predicted CTL epitopes, the epitopes which displayed the top processing score for the respective MHC supertypes are presented in Tables 2 and 3. These 624 potential CTL epitopes were also checked for their potential similarity with human proteins, using Human Protein Reference Database (http://www.hprd.org/). However, none of the epitopes were found similar to any human proteins [40].
Figure 2

Graphical representation of CTL epitopes predicted by NetCTL for the 25 potential L. major antigens and 5 control antigens (CSP, M1, core, PSTA1, and NP), which bind with at least one allele in the HLA-A (A1, A2, A3, A24, and A26) and HLA-B (B7, B8, B27, B39, B44, B58, and B62) supertypes.

Table 2

NetCTL predicted top scoring CTL epitopes along with start position and processing score for the HLA-A supertypes in 25 L. major candidate antigens.

AntigensA1A2A3A24A26
1NTDNFFFML(228: 2.3073)SLWSLLAGV(127: 1.4042)NLAAGQSLK(489: 1.4921)LYLLLPFLL(19: 1.9727)YTISLNPLL(512: 1.6485)

2MSSTSFDDY(38: 3.3876)ALVSINVPL(224: 1.3414)SLFRVANCK(238: 1.4763)VWCTVPDCI(421: 1.4489)SVVDPMQNY(409: 2.3630)

3CTQCAPNYY(308: 2.8811)LLTSFAMHL(495: 1.0227)SSYSCVSQK(469: 1.6062)GYAKDSNGF(175: 1.5045)YVVDSYDGL(351: 1.4847)

4FIDANTAIY(122: 3.1585)MLPDMTCSL(390: 1.4053)SSYSCVSQK(674: 1.6062)GYIVVDKYF(93: 1.4189)YVVDSYDGL(556: 1.4847)

5TTSTTTNTV(68: 1.4293)LMAAMLVAV(7: 1.2898)TMPTAPSSK(43: 1.1211)GYMPTASFK(142: 0.8997)ETASTTSTT(64: 0.5879)

6VSAQTIDDY(32: 2.4784)YLCDRTTAA(123: 1.1971)VSYTCIPRK(241: 1.5973)GYPNINTYL(116: 1.1822)QTIDDYPPV(35: 1.4284)

7TAAVKPLSY(18: 1.8789)TLASHPHYL(208: 1.2381)RVAEFLVVK(198: 1.3425)HYLHEANVF(214: 1.6364)EVPICSLEF(185: 1.0192)

8SIMSLQIRY(127: 1.1734)FLFSPTDTL(12: 1.3828)RIKRNWQKK(38: 1.3737)IFMRLEDTI(185: 1.4289)SIIEKATRY(200: 2.2008)

9YREILNEFY(162: 1.6879)FVAKYIPTI(96: 1.3345)LMMSCWSAR(3: 1.1799)LYTPALPPF(17: 1.7705)EVIEDLVVW(321: 1.7072)

10SASNHKEFY(11: 1.8707)RMDVIGATV(52: 1.1546)EFYIYYLAK(17: 0.7897)QWTRRMHLI(29: 1.3589)SASNHKEFY(11: 0.9535)

11LTDEKTCLK(346: 1.7725)FLTDEKTCL(345: 1.2608)QAFGRAIPK(51: 1.2646)TYAGTLPEM(90: 1.0075)YVSGISPTY(83: 1.6128)

12LTDERTCLV(508: 2.3650)FLTDERTCL(507: 1.2308)RIQQLVLRK(230: 1.3906)LYIWNMPLL(112: 1.9046)TTITSTTKL(445: 1.1733)

13LTDERTCLV(721: 2.3650)MLSAENLQL(469: 1.2642)KSLTNLYLK(422: 1.3473)EWSRVTSLL(200: 1.6263)EMKSLTNLY(396: 1.8138)

14DLEEEVEEY(115: 1.8614)TLLEQYASL(292: 1.2313)LLEQYASLK(293: 1.3401)SFPPSPSLL(2: 1.4148)QVKELKVSY(182: 1.2390)

15QTRVHPGLY(116: 2.0525)YLLDGDQLI(71: 1.4042)RSAPHHSRR(226: 1.3101)LFGAFLFAF(388: 1.2731)DVKESNAHV(48: 1.1458)

16LVDTTAWRY(1027: 3.6396)MLWETVAAL(290: 1.3564RTATARLHK(248: 1.5070)LFQRVLAPI(982: 1.1324)EAQDHSCFY(628: 1.9956)

17KADTYVEEF(82: 1.3354)VLAVVVLLV(10: 1.1790)HLRGAATGK(74: 1.4033)DMATVFAYF(151: 1.2555)STVRLLVSF(163: 1.4330)

18ATSNAASRY(119: 3.2566)TMADVLLYA(135: 1.2997)ATMADVLLY(134: 1.3320)QFLINSSSI(2: 1.2187)ATMADVLLY(134: 1.7501)

19FTSGEISFY(21: 3.4722)YMNLISQSI(1056: 1.2709)LLYCRESRK(1039: 1.6445)AYLRELFPV(702: 1.3356)FTSGEISFY(21: 2.3098)

20NTTTAVRGY(27: 1.7205)ILMWSFAAL(204: 1.3025)ALFVVMAMY(211: 1.2263)NWWILMWSF(201: 1.5092)NTTTAVRGY(27: 1.5857)

21LASLLSSKY(1096: 1.7120)ALARYPLPV(58: 1.3281)ALASLLSSK(1095: 1.5764)VYILLTEFL(1138: 1.6493)HVARQLASY(980: 2.0702)

22YMDPGAAGY(201: 3.0231)TLFPIDVTV(220: 1.3367)ALYTSIPVR(288: 1.4227)LFLLVIYAF(35: 1.7091)EAAHFLMAY(155: 2.1321)

23CTGSSPSVY(8: 2.2878)VLIDYLLSM(252: 1.4781)TLASSAAVK(184: 1.5850)VYFTLPTAV(15: 1.3233)VLIDYLLSM(252: 1.3831)

24LTAPVYMQY(106: 3.2540)LMFSLSQSL(98: 1.2637)RLTPFFQNY(115: 1.2825)LYRIDGTLI(181: 1.3091)ATGDQVSGY(155: 1.9446)

25ISDTQVLLA(264: 1.5818)VLVGVVLGV(539: 1.2310)LVHAGIAGK(486: 1.3287)AYFVVPLEM(356: 1.1485)STVLRLFSF(24: 1.4346)

3.3. Selection of Optimal Epitopes Set Based on Population Coverage Analysis

MHC is highly polymorphic; hence each individual possesses a set of MHC molecules of differing specificities; that is, different patients typically bind different repertoires of peptides. Thus, a crucial step in the design of effective peptide based vaccine is the selection of the good epitopes set which yields the best immune response in a given population or individual. Furthermore, the frequency of an MHC allele to occur within the target human population directly affects the allele's contribution to the overall immunogenicity. Sette et al. have also demonstrated a correlation between immunogenicity and MHC class I binding affinity [39]. It is therefore reasonable to use MHC class I binding affinity prediction methods for calculation of the overall immunogenicity [40]. Hence, the present study employed OptiTope (using BIMAS method [41]) to determine the optimal set of epitopes from the selected epitopes, which calculate the best immune response in the susceptible target populations of cutaneous leishmaniasis, namely, Southwest Asia, North Africa, and South America. Initially, the top scoring epitope sets predicted using NetCTL from each of the 25 candidate antigens were tried to optimize by OptiTope and screened the BIMAS based HLA nonbinders (negative) for the target populations. Out of these, 10 epitope sets (from antigen numbers 1, 2, 4, 5, 6, 8, 15, 21, 22, and 25) that were predicted HLA binders (positive) are clubbed together to form a set of 120 candidate epitopes. However, when this combined epitope set was further optimized for the three different target populations, no optimization solution was obtained for any population. Therefore, epitope set of antigen number 21 was randomly excluded from the combined set and got the optimized results with the 108 combined epitopes set (derived from the 9 positive antigens) for the different target populations. For the target population of Southwest Asia, out of the 108 candidate epitopes set, OptiTope selected a subset of 60 epitopes restricted by the 19 MHC class I alleles covering 96.58 % human population. The most immunogenic epitope, 127SLWSLLAGV, from antigen number 1, was found to bind with allele HLA-A0201, contributing 7%, while the least immunogenic epitope, 246KSSALAHKL, from antigen number 25, contributed < 1% to the overall immunogenicity (Table 4).
Table 4

The distribution of fractional immunogenicity for the 60 optimal epitopes against the population of Southwest Asia.

S. numberEpitopeFraction of overall immunogenicityCovered alleles
1 SLWSLLAGV 0.07 A0201
2LPRLFLAFL0.06 B0702 B0801 B3501 Cw0401 Cw0602
3YLLDGDQLI0.05 A0201 A0205
4GYPNINTYL0.05 A2402 Cw0401
5LYLLLPFLL0.05 A2402 Cw0401
6YMDPGAAGY0.05 A0101
7TPRIPLDSL0.05 B0702 B0801 B3501 Cw0401
8SPTPLLAAL0.04 B0702 B3501 Cw0401 Cw0602 Cw0702
9SFDDYTMVL0.04 A2402 B3801 Cw0401
10GYIVVDKYF0.03 A2402
11VLVGVVLGV0.03 A0201
12TLFPIDVTV0.03 A0201
13FIDANTAIY0.03 A0101
14FLFSPTDTL0.02 A0201 A0205
15LMAAMLVAV0.02 A0201
16APNYYLTPL0.02 B0702 B3501 Cw0401
17SLFRVANCK0.02 A0301
18LFLLVIYAF0.02 A2402 Cw0401
19MLPDMTCSL0.02 A0201 A0205
20ELRRRGQEV0.02 B0801
21LFGAFLFAF0.02 Cw0401 Cw0702
22LVHAGIAGK0.02 A1101 A6801
23FPISKGAAL0.02 B0702 B0801 B3501
24RPRLFARAI0.01 B0702 B3501 B5101
25STQPPVSSW0.01 B5801
26GYMPTASFK0.01 A1101
27APAAPHSPL0.01 B0702 B3501
28LTSAINQFW0.01 B5801
29IHPERTVAL0.01 B3801
30VESGALFSF0.01 B4403
31YQLTGTPVL0.01 A0205 B5201
32GTFAAPLRW0.01 B5801
33YVVDSYDGL0.01 A0205
34APALYTISL0.01 B0702 B3501
35APAHGSVSL0.01 B0702 B3501
36AYFVVPLEM0.01 A2402
37KHLLMAAML0.01 B3801 Cw0602
38FVRVWDRSL0.01 B0702 B0801
39AHAPQNAAL0.01 B3801
40ALYTSIPVR0.01 A0301
41MSLFVSTLF0.01 B5801
42TPSSARLSM0.01 B0702 B3501
43STVLRLFSF0.01 B5801
44VSGGSTVSF0.01 B5801
45SAAAPYSLW0.01 B5801
46NLAAGQSLK<0.01 A0301
47TMPTAPSSK<0.01 A0301
48CESGYALTV<0.01 B4006
49VSAQTIDDY<0.01 Cw0702
50YLCDRTTAA<0.01 A0201
51RELQSVYLL<0.01 B4006 B4403
52RIKRNWQKK<0.01 A1101
53HVARQLASY<0.01 Cw0702
54SVVDPMQNY<0.01 Cw0702
55AMKDPYTNY<0.01 Cw0702
56LLPRLFLAF<0.01 A0301
57REWYSADVL<0.01 B4006
58YTISLNPLL<0.01 Cw0602
59VRAPFTIQL<0.01 Cw0602
60KSSALAHKL<0.01 Cw0602
Similarly, for the target population of North Africa, out of the 108 candidate epitopes' set, OptiTope selected a subset of 45 epitopes restricted by the 13 MHC class I alleles covering 88.48 % human population. Here again, the epitope 127SLWSLLAGV, from antigen number 1, was most immunogenic, covered the allele HLA-A0201, and contributes 9%, and the least immunogenic epitope, 4LLPRLFLAF, from antigen number 8, contributed <1% to the overall immunogenicity (Table 5).
Table 5

The distribution of fractional immunogenicity for the 45 optimal epitopes against the population of North Africa.

S. numberEpitopeFraction of overall immunogenicityCovered alleles
1 SLWSLLAGV 0.09 A0201
2YMDPGAAGY0.09 A0101
3YLLDGDQLI0.07 A0201
4TPRIPLDSL0.05 B0702 B0801 B3501
5LPRLFLAFL0.05 B0702 B0801 B3501
6FIDANTAIY0.05 A0101
7VESGALFSF0.04 B4403
8VLVGVVLGV0.04 A0201
9TLFPIDVTV0.04 A0201
10ELRRRGQEV0.04 B0801
11GYPNINTYL0.03 A2402
12LYLLLPFLL0.03 A2402
13LMAAMLVAV0.03 A0201
14FPISKGAAL0.03 B0702 B0801 B3501
15FLFSPTDTL0.03 A0201
16GYIVVDKYF0.03 A2402
17MLPDMTCSL0.02 A0201
18SLFRVANCK0.02 A0301
19APAAPHSPL0.02 B0702 B3501
20RPRLFARAI0.01 B0702 B3501 B5101
21ALYTSIPVR0.01 A0301 A3101
22APALYTISL0.01 B0702 B3501
23APNYYLTPL0.01 B0702 B3501
24APAHGSVSL0.01 B0702 B3501
25FVRVWDRSL0.01 B0702 B0801
26STQPPVSSW0.01 B5801
27LVHAGIAGK0.01 A1101 A6801
28LTSAINQFW0.01 B5801
29GTFAAPLRW0.01 B5801
30AYFVVPLEM0.01 A2402
31SPTPLLAAL0.01 B0702 B3501
32TPSSARLSM0.01 B0702 B3501
33MSLFVSTLF0.01 B5801
34GYMPTASFK0.01 A1101
35STVLRLFSF0.01 B5801
36YLCDRTTAA0.01 A0201
37NLAAGQSLK<0.01 A0301
38TMPTAPSSK<0.01 A0301
39VSGGSTVSF<0.01 B5801
40SAAAPYSLW<0.01 B5801
41RELQSVYLL<0.01 B4403
42SFDDYTMVL<0.01 A2402
43LFLLVIYAF<0.01 A2402
44RIKRNWQKK<0.01 A1101
45LLPRLFLAF<0.01 A0301
Also, for the target population of South America, out of the 108 candidate epitopes' set, OptiTope selected a subset of 34 epitopes restricted by the 9 MHC class I alleles covering 98.97 % human population. For a third time, the epitope 127SLWSLLAGV, from antigen number 1, which binds to the allele HLA-A0201, was predicted most immunogenic and contributed 10%, while the least immunogenic epitope, 127SIMSLQIRY, from antigen number 8, was found to contribute <1% to the overall immunogenicity (Table 6).
Table 6

The distribution of fractional immunogenicity for the 34 optimal epitopes against the population of South America.

S. numberEpitopeFraction of overall immunogenicityCovered alleles
1 SLWSLLAGV 0.1 A0201
2ALYTSIPVR0.1 A3101
3GYPNINTYL0.08 A2402 Cw0401
4YLLDGDQLI0.08 A0201
5LYLLLPFLL0.08 A2402 Cw0401
6GYIVVDKYF0.05 A2402
7VLVGVVLGV0.05 A0201
8TLFPIDVTV0.04 A0201
9SFDDYTMVL0.04 A2402 B3901 Cw0401
10VSAQTIDDY0.04 Cw0702
11SPTPLLAAL0.03 Cw0401 Cw0702
12LMAAMLVAV0.03 A0201
13FLFSPTDTL0.03 A0201
14HVARQLASY0.03 Cw0702
15MLPDMTCSL0.02 A0201
16LFLLVIYAF0.02 A2402 Cw0401
17LFGAFLFAF0.02 Cw0401 Cw0702
18AYFVVPLEM0.02 A2402
19KHLLMAAML0.02 B3901
20IHPERTVAL0.02 B3901
21LPRLFLAFL0.02 Cw0401
22TPRIPLDSL0.01 Cw0401
23LVHAGIAGK0.01 A6801
24AHAPQNAAL0.01 B3901
25APNYYLTPL0.01 Cw0401
26SVVDPMQNY0.01 Cw0702
27AMKDPYTNY0.01 Cw0702
28YLCDRTTAA0.01 A0201
29LLPRLFLAF0.01 B1501
30SQQETSSLY0.01 B1501
31YQLTGTPVL<0.01 B5201
32VRAPFTIQL<0.01 B3901
33KRGAAAVLL<0.01 B3901
34SIMSLQIRY<0.01 B1501
Thus, overall, it was found that 6 antigens (numbers 1, 4, 13, 16, 19, and 21) had larger number of predicted CTL epitopes as compared to control antigens which could be tested in vivo for validation. Similarly, the epitope 127SLWSLLAGV, from antigen number 1, binds to HLA-A0201 molecule and was predicted as most immunogenic for all the three populations' susceptibility to leishmaniasis [42, 43]. Hence, these antigens/peptides may be considered as suitable candidates for vaccine and diagnostics design [44]. Further, these protective epitopes conform to the anchor-based MHC binding motifs' concept used for T cell epitope identification by many researchers such as Sette et al. [45] and Rötzschke et al. [46].

4. Conclusions

The current study aimed to mine L. major genome for antigens selection and characterization as vaccine components based on criteria such as presence of transmembrane domains and orthologs analysis. Furthermore, the immunogenic epitopes predicted from these antigens can be analyzed for HLA-supertype binding and optimization of good vaccine epitopes against susceptible human populations to L. major infection. In light of the results obtained, it can be concluded that the combined use of reverse vaccinology and immunoinformatics along with in vitro/in vivo validation strategies has emerged as the most promising approach in designing successful vaccine against tropical diseases. In future, it would be helpful to use modeling and simulation system where critical experiments may be performed in a computer in order to predict the effects of experimental modifications on the immune system and thus offer a criterion for the selection of the most likely meaningful experimental tests to be conducted in vivo or in vitro.
(a)
AntigensB7B8B27B39
1APALYTISL(508: 1.7598)PLRWRFRAV(957: 2.1122)RRAKRGIQK(908: 1.8727)YQLTGTPVL(439: 2.2884)

2TPSSARLSM(83: 1.7256)FPISKGAAL(203: 2.2300)VRVDTQSSF(195: 1.4558)SFDDYTMVL(42: 2.2280)

3APNYYLTPL(312: 1.7308)YSLWVAAAV(486: 1.0021)SRAILIAVL(3: 1.3691)SRAILIAVL(3: 1.8971)

4APNYYLTPL(517: 1.7308)FVRVWDRSL(215: 1.8335)LRVSHSSVK(224: 1.3540)VRAPFTIQL(155: 1.5796)

5APAHGSVSL(33: 1.7709)VKHLLMAAM(3: 1.4045)DRLGQCMVV(109: 0.5955)KHLLMAAML(4: 1.0764)

6SPTPLLAAL(256: 1.6602)NPHKRGAAA(9: 1.8533)KRGAAAVLL(12: 1.3311)HKRGAAAVL(11: 1.2133)

7APSPCVPPL(175: 1.5944)AAYRSYAAV(144: 0.9391)MQVLLGADF(237: 0.8288)SHDGKHVIL(31: 2.6870)

8LPRLFLAFL(5: 1.6144)LLPRLFLAF(4: 1.5126)GRIKRNWQK(37: 1.4947)IHPERTVAL(294: 2.2073)

9SARARTLSL(9: 1.8057)YPRIKLLVI(69: 2.0337)ARARTLSLY(10: 1.2183)YEAAQGVLL(170: 1.5760)

10QPTTFKNPI(80: 1.1417)RMHLIGTAV(33: 1.4672)KQWTRRMHL(28: 1.5430)HKEFYIYYL(15: 1.4474)

11TPRTTTEPL(274: 1.7685)MSKARSLQL(181: 1.3726)RRLVLAATL(6: 1.9702)QRTNTLAVL(42: 1.4593)

12MPYLRGVSL(300: 1.8440)MPYLRGVSL(300: 2.1953)RRLVLAATL(6: 1.9702)YRHVMIREL(104: 1.7846)

13MPRLRLVGL(493: 1.7353)MPRLRLVGL(493: 2.2824)RRLVLAATL(6: 1.9702)TAAQRTHTL(39: 1.5626)

14MPAPPLNPF(414: 1.6955)KEKERHKAV(94: 1.8613)RRLMPAPPL(411: 1.7237)YESNTVSAL(317: 2.1937)

15TPRIPLDSL(187: 1.5747)SSHRKHKAM(162: 1.6542)RRMRAGSSH(250: 1.6225)AHAPQNAAL(138: 2.5007)

16MPRKRGRPL(237: 1.8778)MPRKRGRPL(237: 2.3735)RRTLQAQQL(546: 1.5424)DHAQGVAAL(877: 2.1234)

17VPHHPGGDV(134: 0.9985)YVEEFYQAA(86: 0.7269)KRVMAPSDR(37: 1.1717)FHDPSTVRL(159: 2.8191)

18HPSGAAVAI(409: 1.4883)RLYVEDMVL(284: 1.2511)RRAEKEKAK(215: 1.7438)NHSAHTEVL(147: 2.1507)

19KPSAVMTAF(919: 1.8285)EPSRRTVQF(667: 1.7157)RRWAAQNTF(77: 2.1177)FRVDGADAL(675: 2.1322)

20AARQRIMTM(156: 1.6084)ILMWSFAAL(204: 1.7177)RRAPTGLYE(71: 0.9324)QLDDNWWIL(197: 1.3196)

21APAAPHSPL(153: 1.7675)ELRRRGQEV(1113: 1.9415)RRLLAASPF(877: 2.0769)NQATTSLAL(504: 1.9553)

22YPAHRSKIV(163: 1.5878)DAQVRQTAL(281: 1.8406)QRRDVVIGM(75: 1.3697)SHLVSVDKL(331: 1.5360)

23TPRVGCSVA(63: 1.2802)FFRRYTRVF(46: 2.1834)RRYTRVFPA(48: 1.7379)YFTLPTAVL(16: 1.6593)

24SPLSVSAVF(27: 1.5060)YMQYRLTPF(111: 2.0206)FRYDHINSY(61: 1.7109)YASQKFVQL(311: 1.2960)

25RPRLFARAI(256: 1.7812)RLPRRLQAM(301: 2.1397)RRLLVHAGI(483: 1.8779)HSEAATSSL(72: 1.7315)
(b)
AntigensB44B58B62
1RELQSVYLL(1293: 1.9593)GTFAAPLRW(952: 2.0156)SQQETSSLY(802: 1.4851)

2VESGALFSF(276: 1.5005)VSGGSTVSF(299: 1.2478)LVVDASSLF(232: 1.2364)

3AECDTGYSL(383: 1.7888)SAAAPYSLW(481: 1.7104)VINSAAAPY(478: 1.2841)

4AECDTGYSL(588: 1.7888)SAAAPYSLW(686: 1.7104)AMKDPYTNY(355: 1.5075)

5PEQSKNAAL(153: 0.9583)AMASDASSW(20: 1.2946)SGYMPTASF(141: 1.1587)

6CESGYALTV(233: 1.5835)VAATVACVM(269: 1.3253)SGYALTVSY(235: 1.2956)

7HEANVFGDL(217: 1.4597)ITLASHPHY(207: 1.8535)MQVLLGADF(237: 1.4107)

8EEHKFHEQL(234: 1.7745)STQPPVSSW(375: 1.3179)QFIQGRCPY(279: 1.2654)

9YEAAQGVLL(170: 1.8058)QSFAALQSW(186: 1.9574)REHGCAAYY(301: 1.0743)

10KEFYIYYLA(16: 1.2817)TAVGVAICW(62: 1.8196)RMHLIGTAV(33: 1.0529)

11PEWGSMTSL(152: 1.5995)ISGSVPPEW(146: 1.9830)YVSGISPTY(83: 1.4703)

12LEGLTSLTL(131: 1.7189)ITGPLPPQW(241: 1.9075)YVRVISTTY(83: 1.4798)

13SEMKSLTSL(323: 1.9322)GSLPSEWSW(171: 1.9009)TQVSGTLPL(336: 1.2966)

14GEFSDIRQL(25: 1.9010)AAVADAEVW(458: 1.6950)QVKELKVSY(182: 1.3565)

15AQTRVHPGL(115: 0.9806)MSLFVSTLF(381: 1.5159)FVSTLFGAF(384: 1.2037)

16GEARNPHRL(724: 1.7322)AASAPSFQW(501: 1.9979)RLAAEAQGF(516: 1.3449)

17EFYQAAGHL(89: 0.5235)KADTYVEEF(82: 1.5709)QDMATVFAY(150: 1.1423)

18GEDEEQVSL(71: 1.8499)SAAKAQVSY(196: 1.3995)IVSGLVESY(299: 1.3837)

19AEHRRGTQL(978: 1.4654)CASTATHVF(601: 1.7234)MMSQSLSTY(1: 1.4755)

20AEMQRNIDR(60: 0.5443)LMWSFAALF(205: 1.0739)RGYTRGIPY(33: 1.3338)

21TESVQFLKL(968: 1.5363)LTSAINQFW(140: 1.8100)SMMLPAGDF(797: 1.4153)

22FEAPLGEML(132: 1.6939)VAFACYFLF(26: 1.6804)VLFTDGTPY(57: 1.4757)

23QEAKARTTV(193: 1.3216)TGSSPSVYF(9: 1.0291)AMHDDQLRF(38: 1.2996)

24KEPGHKIPL(256: 1.9127)LTAPVYMQY(106: 1.7822)SQSLTAPVY(103: 1.4574)

25REWYSADVL(525: 1.8122)KSSALAHKL(246: 1.5564)RVVKQSLCF(128: 1.1964)
  41 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

Review 2.  Cutaneous leishmaniasis.

Authors:  Richard Reithinger; Jean-Claude Dujardin; Hechmi Louzir; Claude Pirmez; Bruce Alexander; Simon Brooker
Journal:  Lancet Infect Dis       Date:  2007-09       Impact factor: 25.071

Review 3.  Vaccines for leishmaniasis in the fore coming 25 years.

Authors:  Clarisa B Palatnik-de-Sousa
Journal:  Vaccine       Date:  2008-01-30       Impact factor: 3.641

4.  Identification of Plasmodium falciparum antigens by antigenic analysis of genomic and proteomic data.

Authors:  Denise L Doolan; Scott Southwood; Daniel A Freilich; John Sidney; Norma L Graber; Lori Shatney; Lolita Bebris; Laurence Florens; Carlota Dobano; Adam A Witney; Ettore Appella; Stephen L Hoffman; John R Yates; Daniel J Carucci; Alessandro Sette
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-28       Impact factor: 11.205

Review 5.  Reverse vaccinology: developing vaccines in the era of genomics.

Authors:  Alessandro Sette; Rino Rappuoli
Journal:  Immunity       Date:  2010-10-29       Impact factor: 31.745

Review 6.  Emerging vaccine informatics.

Authors:  Yongqun He; Rino Rappuoli; Anne S De Groot; Robert T Chen
Journal:  J Biomed Biotechnol       Date:  2011-06-15

7.  HLA class I supertypes: a revised and updated classification.

Authors:  John Sidney; Bjoern Peters; Nicole Frahm; Christian Brander; Alessandro Sette
Journal:  BMC Immunol       Date:  2008-01-22       Impact factor: 3.615

8.  Identification and characterization of merozoite surface protein 1 epitope.

Authors:  Satarudra Prakash Singh; Bhartendu Nath Mishra
Journal:  Bioinformation       Date:  2009-08-17

9.  OptiTope--a web server for the selection of an optimal set of peptides for epitope-based vaccines.

Authors:  Nora C Toussaint; Oliver Kohlbacher
Journal:  Nucleic Acids Res       Date:  2009-05-06       Impact factor: 16.971

Review 10.  Complexities of assessing the disease burden attributable to leishmaniasis.

Authors:  Caryn Bern; James H Maguire; Jorge Alvar
Journal:  PLoS Negl Trop Dis       Date:  2008-10-29
View more
  3 in total

1.  Vaccinomics Approach for Designing Potential Peptide Vaccine by Targeting Shigella spp. Serine Protease Autotransporter Subfamily Protein SigA.

Authors:  Arafat Rahman Oany; Tahmina Pervin; Mamun Mia; Motaher Hossain; Mohammad Shahnaij; Shahin Mahmud; K M Kaderi Kibria
Journal:  J Immunol Res       Date:  2017-09-07       Impact factor: 4.818

Review 2.  Post-Genomics and Vaccine Improvement for Leishmania.

Authors:  Negar Seyed; Tahereh Taheri; Sima Rafati
Journal:  Front Microbiol       Date:  2016-04-06       Impact factor: 5.640

3.  VianniaTopes: a database of predicted immunogenic peptides for Leishmania (Viannia) species.

Authors:  Alejandro Llanes; Carlos Mario Restrepo; Ricardo Lleonart
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.