| Literature DB >> 20625421 |
Guglielmo Lucchese1, Angela Stufano, Darja Kanduc.
Abstract
Using the currently available proteome databases and based on the concept that a rare sequence is a potential epitope, epitopic sequences derived from Mycobacterium tuberculosis were examined for similarity score to the proteins of the host in which the epitopes were defined. We found that: (i) most of the bacterial linear determinants had peptide fragment(s) that were rarely found in the host proteins and (ii) the relationship between low similarity and epitope definition appears potentially applicable to T-cell determinants. The data confirmed the hypothesis that low-sequence similarity shapes or determines the epitope definition at the molecular level and provides a potential tool for designing new approaches to prevent, diagnose, and treat tuberculosis and other infectious diseases.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20625421 PMCID: PMC2896900 DOI: 10.1155/2010/832341
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Similarity analysis of M. tuberculosis B-cell epitopes to the host proteome.
| Epitope Source Name | IEDB ID1 | Epitope Sequence2 | Number of Matches3 | Host Proteome4 | Refs. |
|---|---|---|---|---|---|
| 10 kDa chaperonin | 3293 | wdedGEKRIpldvae | 4 | H | [ |
| 10 kDa chaperonin | 6508 | PDTAKekpqegtvva5 | 3 | H | [ |
| 10 kDa chaperonin | 3294 | ldvaegdtvIYSKY5 | 2 | H | [ |
| 10 kDa chaperonin | 1011756 | gtvvavgpGRWDEdge5 | 3 | H | [ |
| 10 kDa chaperonin | 1011758 | GRWDEdgekripldva | 3 | H | [ |
| 10 kDa chaperonin | 3290 | etttasglviPDTAK5 | 3 | H | [ |
| 10 kDa chaperonin | 3289 | dkilvQANEAettta | 1 | H | [ |
| 6 kDa ESAT5 | 8105 | yqgvQQKWD5 | 1 | M | [ |
| 6 kDa ESAT | 8086 | eQQWNFagieaaa5 | 0 | M | [ |
| 6 kDa ESAT | 8090 | DEGKQsltk5 | 2 | M | [ |
| 6 kDa ESAT | 8106 | aaAWGGSgsEAYQGvQQKWData5 | 3, 1, 1 | M | [ |
| DNAK | 1052559 | lvyQTEKF | 4 | H | [ |
| DNAK | 1052560 | vyQTEKFv | 4 | H | [ |
| DNAK | 1052558 | yQTEKFvk | 4 | H | [ |
| DNAK | 1052561 | qTEKFVke | 3 | H | [ |
| DNAK | 1052562 | TEKFVkeq | 3 | H | [ |
| DNAK | 1052583 | lNKVDAav | 4 | H | [ |
| DNAK | 1052584 | NKVDAava | 4 | H | [ |
| DNAK | 1052589 | aeaEGGTW | 3 | H | [ |
| DNAK | 1052591 | aegGTWRI | 2 | H | [ |
| DNAK | 1052593 | ggTWRIGY | 0 | H | [ |
| DNAK | 1052594 | WRIGYfgh | 0 | H | [ |
| DNAK | 1052595 | riGYFGHq | 2 | H | [ |
| DNAK | 1052596 | igYFGHQv | 1 | H | [ |
| DNAK | 1052597 | GYFGHqvg | 2 | H | [ |
| DNAK | 1052598 | YFGHQvgd | 1 | H | [ |
| DNAK | 24658 | HQVGDgea | 1 | H | [ |
| DNAK | 1052627 | lrsSSGCV | 5 | H | [ |
| DNAK | 1052628 | rssSGCVT | 4 | H | [ |
| DNAK | 1052629 | sssGCVTG | 4 | H | [ |
| DNAK | 1052630 | ssgCVTGH | 1 | H | [ |
| DNAK | 1052631 | cvTGHWRc | 0 | H | [ |
| DNAK | 1052632 | VTGHWrcp | 0 | H | [ |
| DNAK | 1052633 | TGHWRCpp | 0 | H | [ |
| DNAK | 1052549 | HWRCPprr | 1 | H | [ |
| DNAK | 1052550 | WRCPPrrr | 1 | H | [ |
| ESAT-6-like protein ESXB | 6090 | laqeagNFERIsgdl5 | 2 | H | [ |
| ESAT-6-like protein ESXB | 6090 | laqeagNFERIsgdl | 1 | M | [ |
| Immunogenic protein MPT64 | 1043665 | nitsATYQSaipprgtqavv | 1, 0 | H | [ |
| Immunogenic protein MPT64 | 1043668 | hptttyKAFDWdqayrkpit | 0 | H | [ |
| Lipoprotein LPQH | 1034449 | vtgsvvCTTAAgnvniaigg5 | 2 | M | [ |
| Lipoprotein LPQH | 1107625 | vtGSVVC5 | 5 | M | [ |
| Lipoprotein LPQH | 1107627 | VNKSFei | 1 | M | [ |
| Lipoprotein LPQH | 1052656 | VKRGLtvavagaailvagls | 2 | M | [ |
| Lipoprotein LPQH | 1034455 | taspgaasgpkvvidGKDQN | 4 | M | [ |
| Lipoprotein LPQH | 1018364 | dMANPMspvnksfeievtcs | 0 | M | [ |
| MTP40 protein | 1013654 | gprlyGEMTMqgtrkprpsgp | 2 | H | [ |
| MTP40 protein | 1011740 | ttlGMHCGsfgsapsng | 0 | H | [ |
| MTP40 protein | 1011751 | mlgtGTPNRarINFNC | 3, 3 | H | [ |
| MTP40 protein | 1011736 | MLGNApsvvpnTTLGM | 4, 3 | H | [ |
| MTP40 protein | 1011754 | INFNCevWSNVSetisgprly | 3, 2 | H | [ |
| Phosphate-binding protein 1 | 1127065 | lfnlWGPAFherypnvtita | 1 | M | [ |
| Phosphate-binding protein 1 | 1019382 | kqdpegWGKSPgfgttvdfp | 1 | M | [ |
| Phosphate-binding protein 1 | 1107628 | GPAFHer | 1 | M | [ |
| Phosphate-binding protein 1 | 1123943 | gnggMVTGCaetpgcvayig | 1 | M | [ |
| Phosphate-binding protein 1 | 1107630 | gfasKTPANqais | 1 | M | [ |
1IEDB ID Reference Dataset of Mycobacterial Immune Epitopes: http://iedb.zendesk.com/entries/18171-reference-datasets-of-mycobacterial-immune-epitopes.
2Epitope sequence with amino acid sequence in one-letter code. Low-similarity pentapeptides in capital letters.
3Number of times the low-similarity pentapeptide fragment(s) occur(s) in the set of proteins that comprehensively form the host proteome (see under Methods and [17–27]).
4Host proteome: H, human; M, murine.
5Sequences entirely or partially contained in M. tuberculosis T-cell epitopes [36, 49–52].
6ESAT, early secretory antigenic target.
Figure 1Similarity profile of the mycobacterial DNAK carboxy-terminus domain versus the human proteome at the pentapeptide level. The columns indicate the number of DNAK pentapeptide occurrences in the human proteome. DNAK corresponds to hsp70 protein, UniProt accession: P0A5B9; length: 609 aa. The numbered arrows indicate the location of mycobacterial peptide sequences immunoassayed with human sera [41]. Black arrows refer to the immuno-negative peptide sequences corresponding to IEDB ID: (6) 1212577 (aa FVKEQREA); (7) 1212579 (aa VKEQREAE); (8) 1212581 (aa KEQREAEG); (9) 1212582 (aa EQREAEGG); (10) 1212585 (aa QREAEGGS); (11) 1212586 (aa REAEGGSK); (12) 1212589 (aa EAEGGSKV); (13) 1212590 (aa AEGGSKVP); (14) 1212593 (aa EGGSKVPE); (15) 1212595 (aa GGSKVPED). Red arrows refer to the positive epitopic sequences corresponding to IEDB ID: (1) 1052559; (2) 1052560; (3) 1052558; (4) 1052561; (5) 1052562; (16) 1052583; (17) 1052584; (18) 1052589; (19) 1052591; (20) 1052593; (21) 1052594; (22) 1052595; (23) 1052596; (24) 1052597; (25) 1052598; (26) 24658; (27) 1052627; (28) 1052628; (29) 1052629; (30) 1052630; (31) 1052631; (32) 1052632; (33) 1052633; (34) 1052549; (35) 1052550. See Table 1 for amino acid sequence details. IEDB ID Reference Dataset of Mycobacterial Immune Epitopes: http://iedb.zendesk.com/entries/18171-reference- datasets-of-mycobacterial-immune-epitopes.
Figure 2The immunogenic impact of a single amino acid substitution in the five core residues (AGNVN, aa 71–75) of the immunodominant p61–80/PT19 mycobacterial epitope (VTGSVVCTTAAGNVNIAIGG). The substitution of N73A impairs T immunogenicity for the target epitope and changes the proteomic similarity level of the five core residues (aa 71–75 underlined) from one match (aa sequence AGNVN) to eight matches (aa sequence AGAVN). (a) The immunogenic MHC-peptide complex containing the central low-similarity pentapeptide AGNVN (in white); (b) The nonimmunogenic MHC-peptide complex containing the higher similarity pentapeptide AGAVN (in gray). Immunoreactivity and mutagenesis experiments are described in [58]. The figures are modified versions of the original drawings from the website http://www.iayork.com/ by kind authorization of Professor Ian York, University of Michigan.