| Literature DB >> 31824493 |
Lucas Michel-Todó1, Pedro Antonio Reche2, Pascal Bigey3,4, Maria-Jesus Pinazo1, Joaquim Gascón1, Julio Alonso-Padilla1.
Abstract
Trypanosoma cruzi infection causes Chagas disease, which affects 7 million people worldwide. Two drugs are available to treat it: benznidazole and nifurtimox. Although both are efficacious against the acute stage of the disease, this is usually asymptomatic and goes undiagnosed and untreated. Diagnosis is achieved at the chronic stage, when life-threatening heart and/or gut tissue disruptions occur in ~30% of those chronically infected. By then, the drugs' efficacy is reduced, but not their associated high toxicity. Given current deficiencies in diagnosis and treatment, a vaccine to prevent infection and/or the development of symptoms would be a breakthrough in the management of the disease. Current vaccine candidates are mostly based on the delivery of single antigens or a few different antigens. Nevertheless, due to the high biological complexity of the parasite, targeting as many antigens as possible would be desirable. In this regard, an epitope-based vaccine design could be a well-suited approach. With this aim, we have gone through publicly available databases to identify T. cruzi epitopes from several antigens. By means of a computer-aided strategy, we have prioritized a set of epitopes based on sequence conservation criteria, projected population coverage of Latin American population, and biological features of their antigens of origin. Fruit of this analysis, we provide a selection of CD8+ T cell, CD4+ T cell, and B cell epitopes that have <70% identity to human or human microbiome protein sequences and represent the basis toward the development of an epitope-based vaccine against T. cruzi.Entities:
Keywords: B cell epitopes; CD4 T cell; CD8 T cell; Chagas disease; Trypanosoma cruzi; epitope-based; vaccine
Year: 2019 PMID: 31824493 PMCID: PMC6882931 DOI: 10.3389/fimmu.2019.02698
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Conserved T. cruzi-specific CD8+ T cell epitopes from IEDB.
| GVSGVINAL | Paraflagellar rod component | AAC32021.1 | A*02:05, C*01:02, C*03:04 | 28.5 | SJM29146.1 (77.8) | KWZ91528.1 (77.8) |
| VPEVTDVTL | Major paraflagellar rod protein | AAA30221.1 | B*51:02, C*01:02 | 8.9 | NP_001254479.2 (77.8) | EHM52480.1 (77.8) |
| KLEKIEDEL | Major paraflagellar rod protein | AAA30221.1 | A*02:02, A*02:07 | 1.4 | XP_016860742.1 (77.8) | EGN34163.1 (77.8) |
Antigen name and ID number at NCBI database.
Predicted HLA I binding profile obtained at IEDB (.
Average PPC for populations from Latin American origin obtained at .
Blast hits in the NCBI human non-redundant protein collection and the Human Microbiome Project database are identified by their Accession reference number at NCBI; the percentage of identity (%Id, number of identical residues per queried length) of each epitope to its corresponding hit is shown in parenthesis.
Conserved T. cruzi-specific B cell epitopes at IEDB from antigens predicted to be exposed.
| GTD | Lipophosphoglycan biosynthetic protein/HSP90 superfamily | Q4DW89 | S (2) | XP_024308866 (66.7%) | EHN62501.1 (44.4%) | Y/Y |
| ATNGDTHLGGEDFDN | Heat shock 70 kDa protein | P05456 | S (1) | AAI12964.1 (93.0%) | EEX50322.1 (100%) | N/A |
The underlined sequence is shared by four epitopes whose fusion yields this 18-amino-acid-long peptide.
Antigen name and antigen ID were obtained from UniProtKB database.
Subcellular localization of antigens was predicted with TargetP1.1 (.
Blast hits in the NCBI human non-redundant protein collection and Human Microbiome Project database are identified by their Accession reference number at NCBI; the percentage of identity (% Id, number of identical residues per queried length) of each epitope to its corresponding hit is shown in parenthesis.
Evidence of expression in mammalian infective stages: existence (Y) or not (N) of evidence of expression by transcriptomics and/or proteomics as described in TriTrypDB search by Gene ID (.
Selection of predicted T. cruzi-specific CD8+ T cell epitopes.
| VYGRFYYRF | Phosphoglycerate mutase | XP_807293.1 | 52.5 | NP_001316055.1 (55.6) | ETN46456.1 | Y/Y |
| RFFPSVFWR | Hypothetical protein | XP_811816.1 | 46.2 | EAW98638.1 (66.7) | EFR45113.1 | Y/N |
| FPFCWLPTY | Hypothetical protein | XP_806973.1 | 39.1 | NP_005305.1 (66.7) | ETN41230.1 | Y/N |
| MPAFQGWAF | Hypothetical protein C3747_101g9 | PWV07436.1 | 36.0 | SJM35294.1 (66.7) | ERS97746.1 | Y/N |
| KICHVVFFR | Hypothetical protein | XP_807600.1 | 34.7 | NP_060595.3 (66.7) | EEG88082.1 | Y |
| RTYHMIWNR | Hypothetical protein C3747_25g140 | PWV16110.1 | 34.7 | AIE56486.1 (55.6) | EGG52539.1 | Y/N |
| RVFFWKVQR | Putative SNF2 DNA repair protein | PWV13412.1 | 34.7 | AAN08628.1 (55.6) | EPH21217.1 | Y/N |
| MTFVFEARR | Hypothetical protein | XP_817046.1 | 31.3 | NP_653173.1 (66.7) | EFK35250.1 | Y/N |
| KMWQRTFTR | Chaperonin HSP60, mitochondrial precursor | PWV11883.1 | 19.2 | CCQ43203.1 (66.7) | EGJ45866.1 | – |
| RLWRWRCMR | Hypothetical protein C3747_25g140 | PWV16110.1 | 19.2 | XP_011528872.1 (66.7) | EKY00389.1 | Y/N |
| RINFCFYVR | Phosphatase-like protein | XP_809619.1 | 19.1 | NP_001275914.1 (55.6) | EGF09027.1 | Y/Y |
| RQRAILMYR | Hypothetical protein C3747_84g29 | PWV08907.1 | 18.2 | XP_005256351.1 (66.7) | EFV17694.1 | Y/N |
| KMRVWRHQR | Hypothetical protein | XP_811814.1 | 15.3 | XP_006723890.1 (66.7) | EGC78513.1 | Y/N |
| RMNLITWHR | Hypothetical protein TCDM_03859 | ESS67427.1 | 15.3 | 5WC2 (66.7) | EFC97415.1 | Y/Y |
| FHDQTIFCL | Hypothetical protein | XP_805332.1 | 15.2 | BAG10460.1 (55.6) | EUB31300.1 | Y/N |
| MHDHYCFVL | Leucine carboxyl methyltransferase | XP_811822.1 | 15.2 | NP_620154.2 (66.7) | KGF29870.1 | Y |
| IPMRRRRSL | Hypothetical protein | XP_818330.1 | 15.0 | NP_579877.1 (66.7) | EKU83050.1 | Y/Y |
| FHFCITFCL | Hypothetical protein | XP_807567.1 | 11.3 | EAW88842.1 (66.7) | EEG29914.1 | Y/N |
Antigen name and ID accession number at NCBI database.
PPC for Latin America calculated at IEDB Population Coverage tool (.
Percentage of identity of all microbiome hits blasted was 66.7%.
Evidence of expression in mammalian infective stages: existence (Y) or not (N) of evidence of expression by transcriptomics and/or proteomics as described in TriTrypDB search by Gene ID (;
the transcript expression value was higher in epimastigotes than in mammalian infective stages as described by Minning et al. (.
Selection of predicted T. cruzi-specific CD4+ T cell epitopes.
| DDELFHYFLWTFFFIDLLYAVM | Hypothetical protein | XP_816368.1 | 95.9 | BAC87213.1 (52.3) | ERH31153.1 (43.91) | Y/N |
| YIFIECFQIMRAFRLRGASFF | Co-chaperone GrpE | XP_804093.1 | 38.4 | SJM30315.1 (28.6) | EJN84302.1 (47.6) | Y |
Antigen name and ID number at NCBI.
PPC for Latin America calculated at IEDB Population Coverage tool (.
Percentage of identity is given in parenthesis and stands for the number of identical residues per queried length of each epitope to its corresponding blasted hit.
Evidence of expression in mammalian infective stages: existence (Y) or not (N) of evidence of expression by transcriptomics and/or proteomics as described in TriTrypDB search by Gene ID (;
the transcript expression value was higher in epimastigotes than in mammalian infective stages as described by Minning et al. (.
Figure 1Structure-predicted B cell epitope in T. cruzi KMP11 antigen. (A) Alignment of T. cruzi KMP11 sequence with the BepiPred2.0 prediction of the likelihood of each residue of being part of an epitope (E). (B) PyMOL rendered picture of the T. cruzi KMP11 model with the predicted epitope QNKKFFADKPDESTL shown as sticks and the KMP11 background featured as pale yellow ribbon. The N-term and C-term extremes of the protein are indicated.
Selection of predicted T. cruzi-specific B cell epitopes.
| QNKKFFADKPDESTL | Kinetoplastid membrane protein 11 (KMP11) | XP_810488.1 | – (1) | AAH42613.1 (56.0%) | EOQ36117.1 (62.5%) | Y |
| EGESRHRTRSGSARHHRRHHRNEAGG | Hypothetical protein, conserved | XP_814578.1 | −4 | XP_006721247.1 (57.7) | EFX53962.1 (57.7) | Y/Y |
| RRRRHSRSKRGEEDGGET | Hypothetical protein, conserved | XP_814578.1 | −4 | XP_016883994.1 (55.6) | EEO54893.1 (61.1) | Y/Y |
| GTPSRTTGRSTSTTRGVSRPTNGVTPSTSLAHRASTPGRTGTRSTTPSRSSVLS | Associated kinase of Tb14-3-3, putative | XP_819464.1 | S (1) | – | XP_016882982.1 (24.1) | Y/Y |
| GVCTSAEPRDLLDPVALCMPYPGAERIIG | Associated kinase of Tb14-3-3, putative | XP_819464.1 | S (1) | AAI12922.1 (44.8) | EET82433.1 (37.9) | Y/Y |
| NSQETPDQQKTGITRV | Associated kinase of Tb14-3-3, putative | XP_819464.1 | S (1) | AAY14773.1 (37.5) | KZA05920.1 (68.8) | Y/Y |
| REARGTSTPRRAETPSGGSRVRGA | Associated kinase of Tb14-3-3, putative | XP_819464.1 | S (1) | NP_573568.1 (37.5) | OFQ81576.1 (54.2) | Y/Y |
| KRSRSHNDGPARKRRRKDNRP | Methyl-transferase, putative | XP_803963 | S (5) | CAD38887.1 (61.9) | EFD49634.1 (47.6) | Y/N |
| NPSASPEASWQLNQSWNPL | Hypothetical protein, conserved | XP_809003.1 | S (5) | EAW82987.1 (52.6) | EHS88047.1 (63.2) | Y/N |
Antigen name and Accession reference obtained at NCBI database upon BLASTP analysis of the corresponding ORF_AA sequence as detailed in Materials and Methods.
Antigen subcellular localization was predicted with TargetP1.1 (.
Percentage of identity is given in parenthesis and stands for the number of identical residues per queried length of each epitope to its corresponding blasted hit.
Evidence of expression in mammalian infective stages: existence (Y) or not (N) of evidence of expression by transcriptomics and/or proteomics as described in TriTrypDB search by Gene ID (;
the transcript expression value was higher in epimastigotes than in mammalian infective stages as described by Minning et al. (.
Proposed epitope vaccine ensemble for T. cruzi.
| VYGRFYYRF, RFFPSVFWR, FPFCWLPTY, MPAFQGWAF, KICHVVFFR, RTYHMIWNR, RVFFWKVQR, MTFVFEARR, KMWQRTFTR, RLWRWRCMR, RINFCFYVR, RQRAILMYR, KMRVWRHQR, RMNLITWHR, FHDQTIFCL, MHDHYCFVL, IPMRRRRSL, FHFCITFCL | See | See | P |
| DDELFHYFLWTFFFIDLLYAVM | Hypothetical protein | XP_816368.1 | P |
| YIFIECFQIMRAFRLRGASFF | Co-chaperone GrpE | XP_804093.1 | P |
| GTDEGLLLPVDNDGDESS | Lipophosphoglycan biosynthetic protein/HSP90 superfamily | XP_818651.1 | E |
| QNKKFFADKPDESTL | Kinetoplastid membrane protein 11 (KMP11) | XP_810488.1 | P |
| EGESRHRTRSGSARHHRRHHRNEAGG | Hypothetical protein, conserved | XP_814578.1 | P |
| RRRRHSRSKRGEEDGGET | Hypothetical protein, conserved | XP_814578.1 | P |
| GTPSRTTGRSTSTTRGVSRPTNGVTPSTSLAHRASTPGRTGTRSTTPSRSSVLS | Associated kinase of Tb14-3-3, putative | XP_819464.1 | P |
| GVCTSAEPRDLLDPVALCMPYPGAERIIG | Associated kinase of Tb14-3-3, putative | XP_819464.1 | P |
| NSQETPDQQKTGITRV | Associated kinase of Tb14-3-3, putative | XP_819464.1 | P |
| REARGTSTPRRAETPSGGSRVRGA | Associated kinase of Tb14-3-3, putative | XP_819464.1 | P |
| KRSRSHNDGPARKRRRKDNRP | Methyl-transferase, putative | XP_803963 | P |
| NPSASPEASWQLNQSWNPL | Hypothetical protein, conserved | XP_809003.1 | P |
Antigen name and ID reference obtained at NCBI database upon BLASTP analysis of the corresponding ORF_AA sequence as detailed in Materials and Methods.
Src, source of the epitopes: P, de novo predicted; E, experimentally validated retrieved from IEDB.
Figure 2Flowchart summary of the epitope identification and prioritization steps followed in our strategy. (A) depicts the steps followed to arrive at the selected CD8+ and CD4+ T cell epitopes included in the ensemble proposed, whereas (B) shows the strategies undertaken to select the B cell epitopes finally included. The larger boxes show the steps; triangles show IEDB-derived T. cruzi-specific unique epitopes, and the hexagons contain the finally selected epitopes. This figure was made at https://www.lucidchart.com.
Figure 3Schematic diagram of an mRNA vaccine construct. (A) Schematic diagram of a candidate mRNA construct. (B) Optimal organization and ordering of CD8+ T cell epitopes part of the vaccine, as well as optimal linker sequences as determined with OptiVac tool (68).
Figure 4NetChop predictions of proteasome processing in the full peptide sequence of the vaccine ensemble. S, spacers; P, NetChop predictions.
Figure 5RaptorX modeling and accessibility calculations of the epitope-containing ensemble. (A) 3D model of the ensemble, respectively, encompassing from N-term to C-term the listed CD8+ T cell epitopes (blue), B cell epitopes* (orange), and CD4+ T cell epitopes (red). Linkers are shown in gray. *, B cell epitopes QNKKFFADKPDESTL, EGESRHRTRSGSARHHRRHHRNEAGG, RRRRHSRSKRGEEDGGET, GTPSRTTGRSTSTTRGVSRPTNGVTPSTSLAHRASTPGRTGTRSTTPSRSSVLS, and KRSRSHNDGPARKRRRKDNRP were not modeled by RaptorX as they were considered disordered. (B) Full peptide sequence of the vaccine ensemble (Ens) aligned with RaptorX-provided solvent accessibility (Acc) per position. CD8+ T cell epitopes are shaded light blue; B cell epitopes are highlighted in yellow; and CD4+ T cell epitopes are in red. B, buried; M, medium (light gray); E, exposed (gray).