| Literature DB >> 26493839 |
Marie-Cécile Mortier1, Erik Jongert2, Pascal Mettens3, Jean-Louis Ruelle4.
Abstract
BACKGROUND: Requisites for an efficacious tuberculosis (TB) vaccine are a minimal genomic diversity among infectious Mycobacterium tuberculosis strains for the selected antigen, and the capability to induce robust T-cell responses in the majority of human populations. A tool in the identification of putative T-cell epitopes is in silico prediction of major histocompatibility complex (MHC)-peptide binding. Candidate TB vaccine antigen Mtb72F and its successor M72 are recombinant fusion proteins derived from Mtb32A and Mtb39A (encoded by Rv0125 and Rv1196, respectively). Adjuvanted Mtb72F and M72 candidate vaccines were shown to induce CD4(+) T-cell responses in European, US, African and Asian populations.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26493839 PMCID: PMC4619029 DOI: 10.1186/s12865-015-0119-7
Source DB: PubMed Journal: BMC Immunol ISSN: 1471-2172 Impact factor: 3.615
Similarity of Mtb32A, Mtb39A, Mtb72F and M72 among selected Mycobacterium strains
| Lineagea | Strains | NCBI Accession number | Mtb39A | Mtb39A | Mtb32A | Mtb72F | M72 | |||
|---|---|---|---|---|---|---|---|---|---|---|
| No. of aa changes | No. of aa/nt, position (pos) | % identity | ||||||||
| Insertions | Deletions | Frameshifts | ||||||||
| 1 | T17 | PRJNA55273 | 3 | 99.23 | 94.37 | 96.79 | 96.65 | |||
| T46 | PRJNA55875 | 3 | 99.23b | - | - | - | ||||
| T92 | PRJNA55099 | 3 | 99.23 | 100 | 99.58 | 99.44 | ||||
| EAS054 | PRJNA55133 | 117 | 1 nt, pos 811 | 1, pos 270-391 | 68.92 | 100 | 84.37 | 84.23 | ||
| 2 | 94_M4241A | PRJNA55095 | 0 | 100 | 100 | 100 | 99.86 | |||
| 210 (Mtb Beijing | PRJNA42617 | 30 | 1 aa, pos 274 | 91.3 | 100 | 95.4 | 95.26 | |||
| 02_1987 | PRJNA55097 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| T85 | PRJNA55131 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| W-148 | PRJNA182020 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| 3 | - | - | - | - | - | |||||
| 4 | CDC1551 | PRJNA57775 | 0 | 100 | 100 | 100 | 99.86 | |||
| C | PRJNA54359 | 0 | 100b | - | - | - | ||||
| F11 | PRJNA58417 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| GM1503 | PRJNA55271 | - | - | - | - | - | 100b | - | - | |
| H37Rac | PRJNA58853 | 62 | 1 nt, pos 922 | 1, pos 309–391 | 82.28 | 100 | 90.56 | 90.42 | ||
| Haarlem | PRJNA54453 | 0 | 100 | 100 | 100 | 99.86 | ||||
| KZN1435 | PRJNA59069 | 0 | 100 | 100 | 100 | 99.86 | ||||
| 98R604INHRIFEM | PRJNA55399 | 0 | 100 | 100 | 100 | 99.86 | ||||
| 5 | CPHL_A ( | PRJNA55877 | 4 | 1 aa, pos 274 | 98.72 | 100 | 99.3 | 99.16 | ||
| 6 | K85 ( | PRJNA55879 | 3 | 2 aa, pos 162 | 1 aa, pos 274 | 98.48 | 100 | 99.17 | 99.03 | |
| n.d. | SUMu001 | PRJNA51927 | 8 | 1 aa, pos 274 | 97.70 | 100 | 98.75 | 98.61 | ||
| SUMu002 | PRJNA51925 | - | - | - | - | - | 100b | - | - | |
| SUMu003 | PRJNA51931 | 2 | 99.49 | 100 | 99.72 | 99.58 | ||||
| SUMu004 | PRJNA51933 | 2 | 1 aa, pos 274 | 99.23 | 100 | 99.58 | 99.44 | |||
| SUMu005 | PRJNA51935 | 2 | 1 aa, pos 274 | 99.23 | 100 | 99.58 | 99.44 | |||
| SUMu006 | PRJNA91937 | 2 | 99.49 | 100 | 99.72 | 99.58 | ||||
| SUMu008 | PRJNA51941 | - | - | - | - | - | 100b | - | - | |
| SUMu010 | PRJNA51945 | - | - | - | - | - | 100b | - | - | |
| KZN605 | PRJNA54947 | 0 | 100 | 100 | 100 | 99.86 | ||||
| KZN4207 | PRJNA83619 | 0 | 100 | 100 | 100 | 99.86 | ||||
| KZNR506 | PRJNA47489 | 0 | 100 | 100 | 100 | 99.86 | ||||
| KZNV2475 | PRJNA47491 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| UT205 | PRJNA162183 | 0 | 100 | 100 | 100 | 99.86 | ||||
| BTB05-552 | PRJNA51871 | 0 | 100 | 100 | 100 | 99.86 | ||||
| BTB05-559 | PRJNA51873 | 0 | 100 | 100 | 100 | 99.86 | ||||
| S96-129 | PRJNA51869 | 0 | 100 | 100 | 100 | 99.86 | ||||
| CCDC5079 | PRJNA161943 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| CCDC5180 | PRJNA161941 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| CTRI-2 | PRJNA161997 | 0 | 100 | 100 | 100 | 99.86 | ||||
| CTRI-4 | PRJNA43175 | 3 | 99.23 | 100 | 99.58 | 99.44 | ||||
| R1207 | PRJNA46669 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| X122 (pre | PRJNA46667 | 1 | 99.74 | 100 | 99.86 | 99.72 | ||||
| NA-A0008 | PRJNA168604 | 6 | 98.47 | 100 | 99.58 | 99.44 | ||||
| NA-A0009 | PRJNA168605 | 3 | 99.23 | 100 | 99.16 | 99.02 | ||||
| HN878 | PRJNA46665 | 3 | 99.23 | 100 | 99.58 | 99.44 | ||||
| RGTB423 | PRJNA162179 | 2 | 99.23 | 85.63 | 92.48 | 92.34 | ||||
| RGTB327 | PRJNA157907 | 24 | 1 nt, pos 480 | 1 nt, pos 464 | 1, pos 139–155 | 93.91 | 100 | 96.67 | 96.53 | |
| 2 nt, pos 951 | 1 nt, pos 468 | 1, pos 314–316 | ||||||||
| 1 nt, pos 959 | 1 nt, pos 469 | |||||||||
| 1 nt, pos 470 | ||||||||||
| 1 nt, pos 475 | ||||||||||
| Affected Strains; no (%) | 28 (67) | 3 (7) | 8 (19) | 3 (7) | ||||||
| Average % identity | 98.08 | 99.55 | 98.71 | 98.57 | ||||||
aLineages as defined in ref. [30]: no. 1: The Philippines / rim of the Indian Ocean; no. 2: East Asia; no. 3: India / East Africa; no. 4: Europe / Americas; no. 5 and no. 6: West African 1 and 2, respectively. n.d., not defined. MDR, multiple drug-resistant; (pre-)XDR, (pre-)extensively drug-resistant. aa, amino acid(s); nt, nucleotide(s). bEvaluations not performed for Mtb72F/M72 as a complete sequence was only available for one of the two genes. % identity = (number of identical residues / length of alignment) x100, as calculated from multiple alignment comparisons. Mtb39A genome sequences were derived from complete genomes (chromosomes; highlighted in bold) or assembled partial genomes (scaffolds/contigs; normal font). cLaboratory strain H37Ra was used as reference
Fig. 1Numbers of predicted HLA class II-binding peptides for Mtb32A, Mtb39A and Mtb72F. HLA-II binding predictions were generated for 15-mer peptides (overlapping by 14 amino acids) of Mtb32A, Mtb39A and Mtb72F using NetMHCIIpan-3.0, for 158 DRB1 alleles. Results were normalized with respect to the different protein lengths of Mtb32A, Mtb39A and Mtb72F (i.e., 355, 391 and 729 amino acids, respectively)
MHC class II-binding peptide predictions for common DRB1 alleles in high-TB burden regions
| Region/Alleles assessed (N) | Protein | Predicted epitopes (N) | DRB1 alleles without predicted epitope | ||
|---|---|---|---|---|---|
| Total | Average/allele | N | Allele | ||
|
| Mtb32A | 8792 | 56 | 5 | *03:02, *04:07, *14:10, *14:14, *14:44 |
| ( | Mtb39A | 14675 | 93 | 2 | *03:02, *16:04 |
| Mtb72F | 22065 | 140 | 2 | *03:02, *16:04a | |
|
| Mtb32A | 5873 | 40 | 5 | *03:02, *04:07, *14:10, *14:14, *14:44 |
| ( | Mtb39A | 12794 | 88 | 2 | *03:02, *16:04 |
| Mtb72F | 19109 | 131 | 2 | *03:02, *16:04a | |
|
| Mtb32A | 2595 | 55 | 2 | *03:02, *04:07 |
| ( | Mtb39A | 4516 | 96 | 1 | *03:02 |
| Mtb72F | 6733 | 143 | 1 | *03:02a | |
|
| Mtb32A | 1087 | 43 | 0 | |
| ( | Mtb39A | 1993 | 80 | 1 | *16:04 |
| Mtb72F | 2868 | 115 | 1 | *16:04a | |
|
| Mtb32A | 2155 | 51 | 2 | *03:02, *04:07 |
| ( | Mtb39A | 3517 | 84 | 1 | *03:02 |
| Mtb72F | 5298 | 126 | 1 | *03:02a | |
|
| Mtb32A | 2807 | 37 | 2 | *03:02, *04:07 |
| ( | Mtb39A | 4634 | 61 | 2 | *03:02, *16:04 |
| Mtb72F | 6876 | 90 | 2 | *03:02, *16:04a | |
|
| Mtb32A | 4447 | 67 | 1 | *03:02 |
| ( | Mtb39A | 6977 | 106 | 1 | *03:02 |
| Mtb72F | 10737 | 163 | 1 | *03:02a | |
Prediction results represent the sum of the weak and strong binders. aAllele frequencies in the populations assessed are included in Additional file 2. bS.S. Africa, Sub-Saharan Africa, native population only
Prediction of HLA II-binding peptides for DRB3/4/5, DP and DQ alleles
| HLA molecule | Mtb32A | Mtb39A | Mtb72F | M72 |
|---|---|---|---|---|
| DRB3*01:01 | 18 | 7 | 31 | 31 |
| DRB3*02:02 | 19 | 56 | 71 | 71 |
| DRB3*03:01 | 51 | 82 | 125 | 125 |
| DRB4*01:01 | 27 | 38 | 58 | 58 |
| DRB5*01:01 | 32 | 85 | 109 | 109 |
| DRB5*01:02 | 45 | 107 | 145 | 145 |
| DQA1*05:01-DQB1*02:01a | 14 | 35 | 51 | 51 |
| DQA1*02:01-DQB1*02:01b | 0 | 0 | 0 | 0 |
| DQA1*05:01-DQB1*03:01a | 291 | 325 | 614 | 610 |
| DQA1*03:01-DQB1*03:02a | 17 | 75 | 88 | 88 |
| DQA1*04:01-DQB1*04:02a | 15 | 75 | 87 | 88 |
| DQA1*01:01-DQB1*05:01a | 8 | 0 | 1 | 1 |
| DQA1*01:02-DQB1*05:02 | 26 | 37 | 61 | 61 |
| DQA1*01:02-DQB1*06:02a | 161 | 262 | 392 | 392 |
| DPA1*02:01-DPB1*01:01c | 27 | 33 | 56 | 56 |
| DPA1*01:03-DPB1*02:01c | 20 | 23 | 39 | 39 |
| DPA1*01:03-DPB1*04:01c | 14 | 11 | 16 | 16 |
| DPA1*01:03-DPB1*04:02c | 18 | 17 | 28 | 28 |
| DPA1*02:02-DPB1*05:01b, c | 8 | 0 | 0 | 0 |
| DPA1*02:01-DPB1*14:01 | 50 | 128 | 170 | 170 |
Prediction results represent the sum of the weak and strong binders. a Allele is among the 6 HLA-DQ molecules that reportedly are present in >85 % of populations worldwide [32]. b For these alleles, no frequencies are reported in the populations assessed (Additional file 2). c Allele is among the 5 HLA-DP molecules that reportedly are present in >90 % of populations worldwide [33]
Impact of the alterations introduced for the Mtb72F construction on the peptide binding predictions
| Alteration introduced in Mtb72F | Change in the numbers of covereda alleles for Mtb72F | Explanation |
|---|---|---|
| Deletion of the Mtb32A signal sequence. | Loss of 1 covered allele containing 28 predicted epitopes | Epitopes in the Mtb32A signal sequence were predicted for 149 of the 158 alleles assessed. For 148 of the 149 alleles, epitopes were also predicted for the other parts of the protein. Only for 1 allele (DRB1*16:04), all 28 predicted epitopes were located in the Mtb32A signal sequence, and were thus not predicted for Mtb72F. |
| Splitting the Mtb32A sequence upstream and downstream of ‘TAAS’ sequence. | No changes in the number of covered alleles. | For each allele with an epitope predicted in this part of the protein there was also an epitope predicted in other parts of the protein. There was an overall loss of 14 predicted epitopes. |
| Addition of a poly-His tag (MHHHHHH) at the Mtb32A C-terminal end. | No changes in the number of covered alleles. | One epitope (MHHHHHHTAASDNFQ, binding to DRB1*08:18) was predicted for the Meth-His tag in Mtb72F. There were also other epitopes predicted for this allele. |
| Addition of 2-amino acid hinge sequences at the junction sites between Mtb32C and Mtb39A (EF), and between Mtb39A and Mtb32N (DI). | No changes in the number of covered alleles. | Adding the EF and DI sequences resulted in 13 and 5 additional predicted epitopes, binding to 43 and 31 alleles, respectively. However, the number of alleles with at least one predicted epitope did not change. |
a Covered allele: an allele for which at least one epitope was predicted
Comparison of experimentally-defined and predicted 15-mer HLA class II-binding peptides for three DRB1 alleles
| Performance algorithm | DRB1*0101 (DR1) | DRB1*1501 (DR2) | DRB1*0401 (DR4) | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
| VNEAEYGEMWAQDAA | LSQDRFADFPALPLD |
|
|
| ||||
| GSGQTYGVDVVGYDR |
| DRFADFPALPLDPSA | KTVSPHRSPISNMVS | NFQLSQGGQGFAIPI | VVWGLTVGSWIGSSA | ||||
|
|
| APAQAAPPALSQDRF | MSSLGSSLGSSGLGG | LNGHHPGDVISVTWQ | GLTVGSWIGSSAGLM | ||||
|
|
|
| LTNNHVIAGATDINA | GVAANLGRAASVGSL | |||||
|
|
| QTYGVDVVGYDRTQD | ANLGRAASVGSLSVP | ||||||
| GGQGGTPRAVPGRVV |
| VPGRVVALGQTVQAS | AAAAYETAYGLTVPP | ||||||
| QTYGVDVVGYDRTQD |
| IPIGQAMAIAGQIRS | ASAFQSVVWGLTVGS | ||||||
| FSVGSGQTYGVDVVG |
| NGARVQRVVGSAPAA | VTPAARALPLTSLTS | ||||||
|
| AIAVNEAEYGEMWAQ | INAFSVGSGQTYGVD |
| ||||||
|
|
| VAAASPYVAWMSVTA | |||||||
|
| AGQIRSGGGSPTVHI |
| |||||||
|
| GSGQTYGVDVVGYDR |
| |||||||
|
|
| AENRAELMILIATNL | |||||||
|
| IAGATDINAFSVGSG | SASLVAAAQMWDSVA | |||||||
| RVVALGQTVQASDSL |
| ||||||||
| YDRTQDVAVLQLRGA | LPPEINSARMYAGPG | ||||||||
| FSVGSGQTYGVDVVG | GQAELTAAQVRVAAA | ||||||||
| RVVGSAPAASLGIST |
| ||||||||
|
|
| ||||||||
| GFAIPIGQAMAIAGQ |
| ||||||||
| GVDVVGYDRTQDVAV |
| ||||||||
| ATDINAFSVGSGQTY | PSSKLGGLWKTVSPH | ||||||||
| PLDPSAMVAQVGPQV | VTAGQAELTAAQVRV | ||||||||
| TQDVAVLQLRGAGGL | AVQTAAQNGVRAMSS | ||||||||
| GGTPRAVPGRVVALG |
| ||||||||
| NHVIAGATDINAFSV |
| ||||||||
| GGGSPTVHIGPTAFL | MYAGPGSASLVAAAQ | ||||||||
| GTGIVIDPNGVVLTN | VRVAAAAYETAYGLT | ||||||||
| RWSWLLSVLAAVGLG |
| ||||||||
| AIAVNEAEYGEMWAQ | |||||||||
|
| |||||||||
| ASVGSLSVPQAWAAA | |||||||||
| Total binders determined | 9 | 14 | 23 | 3 | 4 | 6 | 29 | 32 | 60 |
| Total non-bindersb | 106 | 113 | 202 | 112 | 123 | 219 | 86 | 95 | 165 |
| True predicted binders (TP) | 5 | 12 | 17 | 0 | 2 | 2 | 4 | 13 | 17 |
| True predicted non-binders (TN) | 45 | 28 | 68 | 102 | 108 | 197 | 77 | 70 | 134 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Listed epitopes were derived from ref. [18] and experimentally found to bind MCH Class II molecules. Peptides predicted by NetMHCpan-3.0 in the current study are underlined and highlighted in bold. TP True positives. In ref. [18] and the current study, data were generated using 15 mer peptides overlapping by 12 mer. TN True negatives. aSequence does not appear in the Mtb72F recombinant protein. bLength of the Mtb72F sequence is 729 amino acids, containing 225 overlapping15-mer peptides. Mtb32A and Mtb39A contain 115 and 127 overlapping 15-mer peptides