Literature DB >> 19944381

Computational analysis of cysteine proteases (Clan CA, Family Cl) of Leishmania major to find potential epitopic regions.

Babak Saffari1, Hassan Mohabatkar.   

Abstract

Leishmania is associated with a broad spectrum of diseases, ranging from simple cutaneous to invasive visceral leishmaniasis. Here, the sequences of ten cysteine proteases of types A, B and C of Leishmania major were obtained from GeneDB database. Prediction of MHC class I epitopes of these cysteine proteases was performed by NetCTL program version 1.2. In addition, by using BcePred server, different structural properties of the proteins were predicted to find out their potential B cell epitopes. According to this computational analysis, nine regions were predicted as B cell epitopes. The results provide useful information for designing peptide-based vaccines.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19944381      PMCID: PMC5054412          DOI: 10.1016/S1672-0229(08)60037-6

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

Leishmania (Order: Kinetoplastida, Family: Trypanasomatidae) is an obligate intracellular parasite responsible for a broad spectrum of diseases, ranging from simple cutaneous to invasive visceral leishmaniasis (. Protozoan parasites of the genus Leishmania present two forms in their life cycle: promastigote, which multiplies in the mid gut of the sand fly vector, and amastigote, the obligate intracellular form that lives within phagolysosomes of the vertebrate host 2., 3.. Three major types of leishmaniasis, namely cutaneous, mucocutaneous and visceral, occur in humans depending on the species of Leishmania. Infection by species such as L. major, L. tropica and L. mexicana may cause localized cutaneous lesions, resulting in lifelong immunity. Infection by L. braziliensis and L. panamensis initially presents as cutaneous lesions that may then spread or metastasize causing mucocutaneous lesions. Infection by L. donovani, L. infantum and L. chagasi may result in a chronic disseminating visceral disease in the liver and spleen (. It has been recognized for many years that proteases of pathogenic organisms may modulate the host’s defense mechanisms (. Proteases are grouped into clans and families on the basis of the architecture of their catalytic dyad or triad (. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan. L. major has cysteine proteases (CPs) of eight families within clan CA. Family C1 contains CPA and CPB, which are both cathepsin L-like in terms of primary amino acid sequence, and CPC, which is cathepsin B-like. CPB is unusual in that it has a 100-amino acid C-terminal extension in comparison with most CPs of the group, and exists as multiple isoenzymes, which are encoded by a tandem array of similar CPB genes located in a single locus (the arrays comprise eight genes in L. major). The CPBs of L. mexicana are stage-regulated and the isoforms present differences in their substrate specificity and catalytic properties (. Although the exact roles of CPs in Leishmania pathogenesis are unclear, it has been demonstrated that Leishmania cannot grow within macrophages in the presence of CP inhibitors (. These observations provide evidence of the importance of these molecules in the survival of both promastigote and amastigote forms of these parasites (. Despite trypanosomatid CPs may be instrumental in modulating the host’s immune response to favor parasite survival and proliferation, they are themselves immunogenic. L. mexicana CP is a T cell immunogen, resulting in the development of potentially protective Thl cell lines (. This finding suggests that the CP itself is a vaccine candidate and that homologous enzymes in other species may also be so. A CP of L. pifanoi, however, provided rather little protection for the host against infection with the parasite (, although more recently a similar L. amazonensis CP provided some protection against subsequent challenge, apparently through inducing a Thl-associated response (. These differing results presumably reflect the complexity of the immune response to the active parasite enzymes and how the response may be determined by the precise immunization conditions. It is therefore encouraging that a CP-rich fraction of L. major was shown to be a strong inducer of a primed human immune response and may have protective function (. These observations suggest that trypanosomatid CPs have potential as vaccines although attempts to exploit them are really just beginning 14., 15.. In addition, it has also been proven that infected dendritic cells are the critical antigen-presenting cells responsible for T cell priming in Leishmania infections. Amastigotes, but not the infectious promastigotes, are the main targets for phagocytosis activity of dendritic cells 16., 17.. A key step in the design of subunit vaccines is the identification of epitopes from overlapping synthetic peptides. This method decreases the possibility of missed epitopes, but lots of peptides need to be synthesized at a high cost. New developments in immunoinformatics and other computational methodologies, combined with the broad versatility in the design and synthesis of genetic (DNA) vaccines, underlay new strategies for the novel design of antigen-specific, epitope-based vaccines against many pathogens that currently have proven refractive to conventional vaccine therapy (. Epitopes are selected by prediction with software, which saves the expense of synthetic peptides and working time 19., 20.. Basically, the recognition of antigenic epitopes by the immune system, either small discrete T cell epitopes or large conformational epitopes recognized by B cells and soluble antibodies, is the key molecular event at the heart of the immune response to pathogens (. The objective of this bioinformatics-based study is to enhance the optimal selection of epitopic regions of clan CA, C1 family of cysteine proteases as potential targets of immune response. Consensus sequence methodology was used to identify sequences of 9 amino acids or longer with complete conservation in 80% or more of C1 families of cysteine proteases. These conserved sequences were further analyzed to identify targets for candidate epitope-based T cell vaccine formulations against L. major. Furthermore, concerning the activation of human humoral immunity by leishmaniasis, B cell epitopes were also predicted based on propensity scales for each of the 20 amino acids. Utilizing bioinformatics servers for vaccine candidates is a time-saving approach that could significantly help to increase our information about various aspects of pathogens in molecular biology. These theoretical predictions can then be tested by using experimental and complementary methods.

Results

Prediction of MHC class I binding peptides

For the prediction of major histocompatibility complex (MHC) class I binding peptides of C1 family of cysteine proteases, one sequence for each of the cysteine proteases A, B and C was selected. In the case of CPB, due to the high similarity between these cysteine proteases, only one out of eight sequences was chosen (LmjF08.1050). This sequence is a good candidate of CPB because of the complete identity of its sequence with the consensus sequence of CPB. All overlapping nonamer peptides were generated from this dataset and were screened for potential T cell antigens using the NetCTL algorithm, from which 716 peptides were short-listed. Most of the peptides were found to exhibit mono-supertype specificity, meaning that they bind to a single supertype. Some of them, however, appeared to bind to multiple supertypes; the highest number of supertypes a given nonamer could bind is 5 out of the 12 supertypes tested. In fact, out of the 716 human leukocyte antigen (HLA)-binding nonamers, one peptide binds to 5 supertypes, 4 bind to 4 supertypes, 9 bind to 3 supertypes and 44 bind to 2 supertypes. Sequences of binding peptides to the 4 and 5 supertypes and the name of proteins they belong to are summarized in Table 1.
Table 1

Number of supertypes that a given nonamer could bind

Peptide sequenceProteinNo. of binding to the supertypes
339-HVSQSPTPG-347CPB5
9-FAIVVTILF-17CPA4
270-FMSYHSGVL-278CPB4
42-AEVNSKAKG-50CPC4
258-LTMQVYSDF-266CPC4
Knowing the number of binding peptides of each of the analyzed proteins is important, considering the polymorphic nature of HLA and its diversity in populations of different geographical regions. Therefore, a good T cell antigen should have peptides recognized by many HLA alleles. The analysis revealed that CPB has the maximum number of binding peptides, followed by CPC and CPA, respectively (Table 2).
Table 2

Generated and binding peptides from selected cysteine proteases

Systematic name in GeneDBProteinFeaturesNo. of amino acidNo. of generated peptidesNo. of binding peptidesPercent of binding peptidesNo. of binding peptides with score >1.5
LmjF19.1420CPAcathepsin L-like3543469126.3%8
LmjF08.1050CPBcathepsin L-like34834010330.3%9
LmjF29.0820CPCcathepsin B-like3403329628.9%15
For short-listing potential vaccine candidates, it is important to analyze the binding profiles from a supertype perspective. Of the 12 supertypes studied, the largest number of nonamers was found to be recognized by the allele B62 (53), followed by B58 (38), A2 (36), A24 (24), A1, B8 and B39 (21), B44 (20), A3 and B7 (19), B27 (17) and A26 (15), as illustrated in Figure 1.
Figure 1

Observed variation in the binding of predicted peptides to the 12 different HLA class I supertypes studied. The figure depicts the proportion of binding peptides to individual supertypes.

The score with which a peptide binds to HLA ranges from 0.75 to 3.1949. In general, the binding score of CPC peptides to A1 locus supertypes is higher compared with CPB, CPA and other supertypes (Table 3).
Table 3

Features of binding peptides with high scores

No.ProteinSupertypePeptideScore
1CPCA152-WTASADNGY-603.1949
2CPCA1308-NTDWGDKGY-3162.6687
3CPCA119-LATTVSGLY-272.3211
4CPAA177-NTHNPHAHY-852.0979
5CPCA1131-AVEAISDRY-1391.9722
6CPCA176-VTDMSTEAV-841.9262
7CPCA24204-KYPPCPSTI-2121.9244
8CPCB44279-GEFLGGHAV-2871.9117
9CPCA2652-WTASADNGY-601.8531
10CPCA1224-RSEMDLVKY-2321.8397
Putative promiscuous T cell epitopes may be localized in clusters, as reported in studies of HIV-1 22., 23., 24., 25., the outer membrane of Chlamydia trachomatis (, and among others 27., 28.. The clusters are also ideal for developing epitope-based vaccines because they contain multiple promiscuous epitopes. The number of immunogenic hotspots for CPA, CPB and CPC is 4, 5 and 0, respectively, as shown in Table 4.
Table 4

Features of immunogenic hotspots of CPA and CPB

CPSupertypePositionScoreThresholdLength
CPA
A21–3957.385539
A2174–22456.205551
A361–9345.464533
A3230–27945.214550

CPBA21–3855.655538
A2188–22155.045534
A318–13846.5145121
A3151–19345.194543
A3228–27045.424543
The identification of conserved sequences is very important to design peptide vaccines, because vaccines that are developed on the basis of the conserved segments among candidate proteins can be used against a large majority of pathogen’s variants. In Figure 2, three segments (I, II and III) with identity ≥90% and have ≥9 amino acids in length are shown as conserved regions. Obviously the epitopes predicted in these regions are very significant. For immunological applications, a minimum conserved sequence length of 9 amino acids is required because this represents the typical length of peptides that bind to HLA molecules (. The features of potential epitopes located in conserved regions with maximum scores are summarized in Table 5.
Figure 2

Alignment of C1 family of clan CA of cysteine proteases. Gray rectangular under each sequence is the corresponding regions of that sequence, which is determined with Pfam database as conserved sequences. Sequences shown in boxes are predicted B cell epitopes of CPB (LmjF08.1050). Regions that have identity ≥90% are shown with Roman numerals. The most conserved residues that appear in the consensus sequence are indicated by “*”.

Table 5

Epitopes with highest scores located in conserved regions

Conserved regionEpitope sequenceSupertypeScoreCP
I133-REKGAVTPV-141B441.2763CPB
II148-GMCGSCWAF-156B621.2696CPA
III283-GEQLNHGVL-291B441.7068CPB

B cell epitope prediction

Before the prediction of B cell epitope of CPB (LmjF08.1050), signal peptide of this protein predicted by SignalP 3.0 hidden Markov model (HMM) (signal peptide probability 0.999, signal anchor probability 0.001, with cleavage site probability 0.760 between residues 27 and 28) was excluded. Hydrophilicity, flexibility, accessibility, turns, exposed surface, polarity and antigenic propensity scales were applied to predict B cell epitopes. These parameters were correlated with the location of continuous epitopes. As a result, 9 regions were predicted to be B cell epitopes (Figure 2).

Discussion

The aim of this investigation was to apply bioinformatics methods to study the B and T cell epitopic sites of C1 family cysteine proteases of L. major. To help the development of vaccines, understanding the structural basis for the cell-mediated immune response is necessary (. The perfect bioinformatics prediction of T cell epitopes can to a great extent reduce the experimental cost in candidate epitope identification (. In the present study, NetCTL program has been used to predict MHC class I of cysteine proteases A, B and C of L. major (. CP proteins are immunogenic and are potential vaccine candidates. Efficient processing and presentation of vaccine antigens by class I and/or class II MHC are essential for a good T cell response. Since humans carry only a limited number of co-dominant HLA alleles in their genome (2 each for A, B and C loci), out of hundreds of polymorphic alleles present in the population, it becomes important that a candidate vaccine must generate peptides that bind to a wide range of HLA molecules to provide good population coverage. In this work we found that generated peptides bind in larger numbers to B supertypes. However, almost all of the peptides with the highest binding score belong to CPC. In other words, CPC is the major source of peptides that bind to HLA loci with more affinity. These observations suggest that greater emphasis must be placed on cytotoxic T lymphocyte (CTL) response generated by the presentation of antigen by B alleles and should design epitope-based vaccine directed towards these HLA. T cell epitopes specific to multiple HLA supertypes are advantageous for vaccine design because they effectively increase the number of epitopes to which an individual can respond, and provide much more extensive coverage of the population (. The peptides binding to more than one HLA are termed promiscuous and such peptides are of prime interest for vaccine design because of their relevance in coverage of higher proportions of human populations. In silico approaches would help to predict some of the HLA-binding motifs, which could act as promiscuous epitopes (. Most of the generated nonameric peptides in this work are mono-allelic binders. To cover majority of the population, it is essential to have vaccine candidates that have multi-binding behavior. Consequently, peptides with the binding ability of ≥4 supertypes were taken as promiscuous epitopes. Note that each supertype consists of multiple HLA alleles, and peptides that can bind to ≥4 supertypes have a great potentiality to activate the most proportion of T cell population. It is generally recognized that conserved protein sequences represent important functional domains (, for which mutations would be detrimental to the survival of the pathogen. The functions of conserved sequences can be elucidated by databases that comprise data on protein families, domains and functional sites, such as the Pfam database (www.sanger.ac.uk/Software/Pfam) (. In Figure 2, in addition to the ClustalW consensus sequence, the results of the Pfam database and the highly conserved regions that have ≥9 amino acids in length have also been shown. It is clear that the predicted epitopes located in the conserved segments have more validity. Eventually, identification of proteins with peptides binding to larger number of alleles, assessment of alleles or supertypes of MHC that bind large number of peptides than others have great importance in determining epitopes as a candidate vaccine. In addition, allelic variation in binding affinity, immunological hotspots, HLA distribution analysis and similarity of epitopes to the self proteins play a key role in identification of these epitopes 34., 35., 36.. In proteins, turns are located on the surface; these parts are accessible and hydrophilic but the core regions are mostly devoid of water molecules (. Antigenic determinants lie in regions that are hydrophilic, exposed and polar, while accessibility and flexibility of these segments are high. This has led to the rules that would allow the position of B cell epitopes to be predicted from these features of the protein sequence 37., 38.. In conclusion, recognizing epitopes on proteins is essential for developing synthetic vaccines and can facilitate immunotherapy of leishmaniasis and many other infectious diseases. In the present work, employing a bioinformatics approach, a set of peptides has been identified, which can be used in either a natural or a synthetic vaccine cocktail. This approach could be extended to the entire proteome of L. major to identify newer sets of potentially antigenic proteins and yet reducing the number of T and B cell antigens for experimental verification. These kinds of researches can be applied for omitting non-functional sequences of proteins, which would help in designing new immunological methods.

Materials and Methods

Amino acid sequences

The sequences of ten cysteine proteases (clan CA, family C1) of L. major were obtained from GeneDB database (www.genedb.org) (. Sequences included one CPA (Systematic ID: LmjF19.1420), one CPC (LmjF29.0820) and eight CPBs (LmjF08.1010, LmjF08.1020, LmjF08.1030, LmjF08.1040, LmjF08. 1050, LmjF08.1060, LmjF08.1070 and LmjF08.1080). NetCTL program version 1.2 (http://www.cbs.dtu.dk/services/NetCTL/) ( predicts peptides restricted to 12 HLA class I supertypes (A1, A2, A3, A24, A26, B7, B8, B27, B39, B44, B58 and B62), integrated with predictions of HLA binding, proteasomal C-terminal cleavage and transport efficiency by the transporter associated with antigen processing (TAP) molecules. HLA binding and proteasomal cleavage predictions were performed by an artificial neural networks (ANN) method and TAP transport efficiency was predicted using a weight matrix method. The parameters used for NetCTL prediction were: 0.15 weight on C terminal cleavage, 0.05 weight on TAP transport efficiency, and 0.75 threshold for HLA supertype binding. The final scores are the predicted MHC class I affinities in form of –logIC50 and IC50 values. All prediction calculations were based on propensity scales for each of the 20 amino acids. Sequence of each protein was read as a moving window. In order to compare the profiles obtained by different methods, various scales were normalized where the original values of each scale were set between +3 and –3. Hydrophilicity (, flexibility (, accessibility (, turns (, exposed surface (, polarity ( and antigenic propensity ( scales were applied to predict B cell epitopes by BcePred server (http://www.imtech.res.in/raghava/bcepred) with default threshold.

Signal peptide prediction

Due to the elimination of signal peptides of CPBs before secretion to the outer space of the infected cells, this region must be excluded from the entire sequence of the protein for exerting the prediction analysis on it. Signal peptide prediction was achieved using SignalP 3.0 HMM (http://www.cbs.dtu.dk/services/SignalP-3.0/) (.

Immunogenic hotspot prediction

Putative promiscuous T cell epitopes may be localized in clusters that are also ideal for developing epitope-based vaccines because they contain multiple promiscuous epitopes. For determining the immunogenic hotspots, MULTIPRED server (http://antigen.i2r.a-star.edu.sg/multipred/) ( was utilized.

Alignment

Sequences were aligned using ClustalW program ( from the BioEdit v5.0.9 package (.

Authors’ contributions

BS conceived the study and carried out the computational analysis. HM supervised the study. BS and HM prepared the manuscript. Both authors read and approved the final manuscript.

Competing interests

The authors have declared that no competing interests exist.
  49 in total

1.  Localization of CD4+ T cell epitope hotspots to exposed strands of HIV envelope glycoprotein suggests structural influences on antigen processing.

Authors:  S Surman; T D Lockey; K S Slobod; B Jones; J M Riberdy; S W White; P C Doherty; J L Hurwitz
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-03       Impact factor: 11.205

Review 2.  Epitope clusters in the major outer membrane protein of Chlamydia trachomatis.

Authors:  S K Kim; R DeMars
Journal:  Curr Opin Immunol       Date:  2001-08       Impact factor: 7.486

Review 3.  Molecular determinants of Leishmania virulence.

Authors:  K P Chang; G Chaudhuri; D Fong
Journal:  Annu Rev Microbiol       Date:  1990       Impact factor: 15.500

Review 4.  Predicting location of continuous epitopes in proteins from their primary structures.

Authors:  J L Pellequer; E Westhof; M H Van Regenmortel
Journal:  Methods Enzymol       Date:  1991       Impact factor: 1.600

5.  New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites.

Authors:  J M Parker; D Guo; R S Hodges
Journal:  Biochemistry       Date:  1986-09-23       Impact factor: 3.162

6.  Correlation between the location of antigenic sites and the prediction of turns in proteins.

Authors:  J L Pellequer; E Westhof; M H Van Regenmortel
Journal:  Immunol Lett       Date:  1993-04       Impact factor: 3.685

7.  Conformation of amino acid side-chains in proteins.

Authors:  J Janin; S Wodak
Journal:  J Mol Biol       Date:  1978-11-05       Impact factor: 5.469

8.  Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide.

Authors:  E A Emini; J V Hughes; D S Perlow; J Boger
Journal:  J Virol       Date:  1985-09       Impact factor: 5.103

9.  SARS coronavirus nucleocapsid immunodominant T-cell epitope cluster is common to both exogenous recombinant and endogenous DNA-encoded immunogens.

Authors:  Vandana Gupta; Tani M Tabiin; Kai Sun; Ananth Chandrasekaran; Azlinda Anwar; Kun Yang; Priya Chikhlikar; Jerome Salmon; Vladimir Brusic; Ernesto T A Marques; Srinivasan N Kellathur; Thomas J August
Journal:  Virology       Date:  2006-01-04       Impact factor: 3.616

10.  Uptake of Leishmania major amastigotes results in activation and interleukin 12 release from murine skin-derived dendritic cells: implications for the initiation of anti-Leishmania immunity.

Authors:  E von Stebut; Y Belkaid; T Jakob; D L Sacks; M C Udey
Journal:  J Exp Med       Date:  1998-10-19       Impact factor: 14.307

View more
  5 in total

Review 1.  Understanding Leishmania parasites through proteomics and implications for the clinic.

Authors:  Shyam Sundar; Bhawana Singh
Journal:  Expert Rev Proteomics       Date:  2018-05-02       Impact factor: 3.940

2.  Experimental Validation of Multi-Epitope Peptides Including Promising MHC Class I- and II-Restricted Epitopes of Four Known Leishmania infantum Proteins.

Authors:  Maria Agallou; Evita Athanasiou; Olga Koutsoni; Eleni Dotsika; Evdokia Karagouni
Journal:  Front Immunol       Date:  2014-06-10       Impact factor: 7.561

3.  Identification of Potential MHC Class-II-Restricted Epitopes Derived from Leishmania donovani Antigens by Reverse Vaccinology and Evaluation of Their CD4+ T-Cell Responsiveness against Visceral Leishmaniasis.

Authors:  Manas Ranjan Dikhit; Akhilesh Kumar; Sushmita Das; Budheswar Dehury; Ajaya Kumar Rout; Fauzia Jamal; Ganesh Chandra Sahoo; Roshan Kamal Topno; Krishna Pandey; V N R Das; Sanjiva Bimal; Pradeep Das
Journal:  Front Immunol       Date:  2017-12-14       Impact factor: 7.561

Review 4.  Post-Genomics and Vaccine Improvement for Leishmania.

Authors:  Negar Seyed; Tahereh Taheri; Sima Rafati
Journal:  Front Microbiol       Date:  2016-04-06       Impact factor: 5.640

5.  Design of multi-epitope peptides containing HLA class-I and class-II-restricted epitopes derived from immunogenic Leishmania proteins, and evaluation of CD4+ and CD8+ T cell responses induced in cured cutaneous leishmaniasis subjects.

Authors:  Sarra Hamrouni; Rachel Bras-Gonçalves; Abdelhamid Kidar; Karim Aoun; Rym Chamakh-Ayari; Elodie Petitdidier; Yasmine Messaoudi; Julie Pagniez; Jean-Loup Lemesre; Amel Meddeb-Garnaoui
Journal:  PLoS Negl Trop Dis       Date:  2020-03-16
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.