| Literature DB >> 29729680 |
Tatjana Sitt1, Roger Pelle2, Maurine Chepkwony1, W Ivan Morrison3, Philip Toye1.
Abstract
The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.Entities:
Keywords: African buffalo; CD8+ cytotoxic T cell antigens; Theileria parva; epitope; genetic diversity
Mesh:
Substances:
Year: 2018 PMID: 29729680 PMCID: PMC6126638 DOI: 10.1017/S0031182018000264
Source DB: PubMed Journal: Parasitology ISSN: 0031-1820 Impact factor: 3.234
Allele and protein variants of Tp9 observed in the buffalo- and cattle-derived theileria stabilates
| Sample ID | Stabilate name | Genomic DNA | Cloned DNA | ||||
|---|---|---|---|---|---|---|---|
| Allele TP02_0895 | Protein XP_765463 | Epitope | Allele | Protein | Epitope | ||
| Buffalo-derived | |||||||
| BD1 | Mara 3 | 4 | 22 | AKFPGMKKG | 4 | ||
| BD2 | Mara 4 | – | – | AKFPGMKKG | |||
| BD3 | Mara 18 | EKFQRMGIG | AKFPGMKKG | ||||
| BD4 | Mara 30 | EKFQRMGIG | EKFQRMGIG | ||||
| BD5 | Mara 32 | – | |||||
| BD6 | Mara 42 | 4 | 22 | AKFPGMKKG | 10 | 16 | EKFQRMGIG |
| BD7 | 6998 | 18 | EKFQRMGIG | AKFPGMKKG | |||
| BD8 | 6999 | – | – | 17 | EKFQRMGIG | ||
| BD9 | 7001 | – | EKFQRMGIG | ||||
| BD10 | 7546 | – | EKFKHMGIG | ||||
| BD14 | 7014 | – | |||||
| BD15 | 7698 | – | |||||
| BD16 | 7344 | ||||||
| BD17 | Mara 41 | – | – | – | – | – | – |
| BD18 | 4740 | – | – | – | |||
| BD19 | 5012 | 1 | 1 | AKFPGMKKS | – | – | – |
| BD20 | 5489 | – | – | – | – | – | – |
| BD21 | 6666 | – | – | – | – | – | – |
| BD22 | 6807 | – | – | – | – | – | – |
| BD23 | 6818 | – | – | – | – | – | – |
| BD24 | 6819 | – | – | – | – | – | – |
| BD25 | 6998 cl. 11 | ||||||
| BD26 | 7065 | – | – | – | – | – | – |
| BD27 | 7014, st.3506 | – | – | – | 4 | 22 | AKFPGMKKG |
| BD28 | 7014, st.3570 | – | – | – | 1 | 1 | AKFPGMKKS |
| BD29 | 7014, st.3081 | – | – | – | 26 | 7 | EKFKHMRIG |
| BD30 | 7014, st.3081 | – | – | – | 25 | 11 | EKFKRMGMK |
| BD31 | 7014, st. 3081 | – | – | – | 24 | 12 | EKFKRMGMK |
| BD32 | 7014, st.3081 | – | – | – | 21 | 26 | NKFPGMKKG |
| BD33 | 7014, st.3081 | – | – | – | 23 | 25 | AKFPGMKKG |
| BD34 | 7014, st. 3081 | – | – | – | 19 | 9 | NKFKHMRIG |
| BD35 | 7752, st. 4110 | – | – | – | 27 | 13 | EKFKHMGIG |
| BD36 | 7344, st. 3591 | – | – | – | 28 | 14 | EKFKHMGIG |
| BD37 | 7065, st. 3120 | – | – | – | 20 | 21 | EKFQRMGIG |
| BD38 | 6999, st. 3123 | – | – | – | 22 | 20 | AKFPGMKKG |
| BD39 | 6998, cl.19 | – | – | – | 18 | 6 | EKFQRMGIG |
| Cattle-derived | |||||||
| CD16 | Tc 867 Bt106 | 1 | 1 | AKFPGMKKS | – | – | – |
| CD17 | Tc 841 IL57 | – | – | – | EKFQRMGIG | ||
| CD28 | Tc 120 F344 | ||||||
Genomic DNA sequences were obtained directly from purified DNA; cloned DNA sequences were obtained from plasmids into which genomic DNA had been cloned.
Reference sequence accession number.
Newly identified alleles, protein variants and epitopes are in boldface.
Comparative sequence information on TpAg1–10 observed in the buffalo- and cattle-derived theileria stabilates
| Antigen | Primers 5’ – 3’ | Predicted length of amplicon | Lengths of readable sequence | Total no. sequences obtained | Total no. SNP locations | Total no. alleles | Total no. non-synonymous SNP locations | Total no. protein variants | Total no. epitope variants |
|---|---|---|---|---|---|---|---|---|---|
| Tp1 | ATGGCCACTTCAATTGCATTTGCC | 387 (22) | 363–399 | 11 | 19 | 8 | 16 | 8 | 2 |
| Tp2 | ATGAAATTGGCCGCCAGATTA | 483 (47) | 456–480 | 9 | 220 | 7 | 92 | 7 | See |
| Tp3 | ATGAAATTAAATACTATCGCAATAG | 735 (89) | 686–726 | 24 | 26 | 13 | 12 | 9 | n/a |
| Tp4 | CCAATTGGTGATTTAGCAACA | 692 (27) | 386–422 | 24 | 36 | 13 | 5 | 6 | 1 |
| Tp5 | ATGAGGAAGCGAGTTTGGG | 238 (31) | 200–238 | 16 | 26 | 6 | 3 | 3 | 1 |
| Tp6 | ATGGCTCAGATTCCTGTTG | 1005 (92) | 569–833 | 23 | 23 | 13 | 1 | 2 | n/a |
| Tp7 | ATGGAGGCACTGCAAGCAGGC | 417 (14) | 299–332 | 25 | 33 | 9 | 4 | 3 | 1 |
| Tp8 | ATGCTTGGAAATCATGTCATGGGA CTCCTTCGCAGTGTACGCAGTTTAA | 1275 (81) | 931–1109 | 22 | 19 | 12 | 6 | 7 | 1 |
| Tp9 | ATGAATGTTCTAACTACTGG | 958 (95) | 326–1057 | 37 | >100 | 28 | >100 | 27 | 12 |
| Tp10 | GACTACGCTCTTCTTACCAC | 916 (89) | 505–824 | 22 | 24 | 13 | 7 | 6 | n/a |
The sequence length of the predicted amplicon from the reference sequence, excluding primers, is shown. The figures in brackets represent the length of the amplicon as a percentage of the length of the coding sequence from the reference gene, including introns.
SNPs within introns are not included.
Alleles or protein variants which have ambiguous residues (labelled X.1 in Table 1) are not included, unless they are unique, validated variants.
Unique, validated alleles or protein variants obtained from partial sequences are included if the constitute a variant.
Allele and protein variants of Tp1–4 observed in the buffalo- and cattle-derived T. parva stabilates
| Sample ID | Stabilate name | Tp1 | Tp2 | Tp3 | Tp4 | ||||
|---|---|---|---|---|---|---|---|---|---|
| Allele TP03_0849 | Protein XP_762973 | Allele TP01_0056 | Protein XP_765583 | Allele TP01_0868 | Protein XP_763892 | Allele TP03_0210 | Protein XP_763228 | ||
| Buffalo-derived | |||||||||
| BD1 | Mara 3 | 1 | |||||||
| BD2 | Mara 4 | – | – | ||||||
| BD3 | Mara 18 | ||||||||
| BD4 | Mara 30 | ||||||||
| BD5 | Mara 32 | ||||||||
| BD6 | Mara 42 | ||||||||
| BD7 | Mara 6998 | ||||||||
| BD8 | Mara 6999 | 1 | |||||||
| BD9 | Mara 7001 | ||||||||
| BD10 | Mara 7546 | 1 | |||||||
| BD14 | Buffalo 7014 | – | – | ||||||
| BD15 | Buffalo 7698 | ||||||||
| BD16 | Buffalo 7344 | ||||||||
| BD17 | Mara 41 | – | – | – | – | 1 | |||
| BD18 | Buffalo 4740 | 1 | 1 | 1 | 1 | ||||
| BD19 | Buffalo 5012 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| BD20 | Buffalo 5489 | 13 | |||||||
| BD21 | Buffalo 6666 | 1 | |||||||
| BD22 | Buffalo 6807 | 13 | 11 | – | – | – | – | 1.1 | |
| BD23 | Buffalo 6818 | 13 | 11 | – | – | 1 | |||
| BD24 | Buffalo 6819 | 1 | |||||||
| BD25 | 6998 cl. 11 | ||||||||
| BD26 | Buffalo 7065 | – | – | – | – | – | |||
| Cattle-derived | |||||||||
| CD16 | Tc867 BT 106 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| CD17 | Tc841 IL 57 | 2 | 2 | ||||||
| CD28 | Tc120F344 | 17 | |||||||
We have previously published (Pelle et al. 2011) the Tp1 and Tp2 results for samples BD1-BD16. They are included here in italics for completeness.
The letters-numbers in this row indicate the reference sequence in the NCBI Genbank.
Newly identified alleles and protein variants are in boldface.
Although we were unable to determine if the sequence is novel due to four unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions.
Unique, validated alleles or protein variants obtained from partial sequences which are their own variant.
A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 4 if labelled 4.1, as in BD22, Tp4).
partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis (ie: no complete consensus sequence).
Allele and protein variants of Tp5–8 and Tp10 observed in the buffalo- and cattle- derived T. parva stabilates
| Sample ID | Stabilate name | Tp5 | Tp6 | Tp7 | Tp8 | Tp10 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Allele TP02_0767 | Protein XP_765334 | Allele TP01_0188 | Protein XP_765715 | Allele TP02_0244 | Protein XP_764810 | Allele TP02_0140 | Protein XP_764709 | Allele TP04_0772 | Protein XP_764408 | ||
| Buffalo-derived | |||||||||||
| BD1 | Mara 3 | 1 | 1 | 1 | |||||||
| BD2 | Mara 4 | 1 | 1 | – | – | – | – | ||||
| BD3 | Mara 18 | 1 | 1 | 1 | – | – | 1 | ||||
| BD4 | Mara 30 | 1 | 1 | 1 | |||||||
| BD5 | Mara 32 | 1 | 1 | 1 | |||||||
| BD6 | Mara 42 | 1 | 1 | 1.1 | 1 | 1 | |||||
| BD7 | Mara 6998 | 1 | |||||||||
| BD8 | Mara 6999 | 1 | 1 | 1 | |||||||
| BD9 | Mara 7001 | 1 | 1 | 1 | – | – | 1 | ||||
| BD10 | Mara 7546 | 1 | 1 | 1 | – | – | 1 | ||||
| BD14 | Buffalo 7014 | 1 | 1 | 1 | 1.1, 4.1, 5.1, 6.1 | ||||||
| BD15 | Buffalo 7698 | 1 | 1 | ||||||||
| BD16 | Buffalo 7344 | – | – | 1 | – | – | |||||
| BD17 | Mara 41 | – | – | – | – | 1 | |||||
| BD18 | Buffalo 4740 | – | – | 1 | 1 | ||||||
| BD19 | Buffalo 5012 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| BD20 | Buffalo 5489 | – | – | 1 | 1.1, 4.1, 5.1, 6.1 | ||||||
| BD21 | Buffalo 6666 | – | – | 1 | 1.1, 4.1, 5.1, 6.1 | ||||||
| BD22 | Buffalo 6807 | 1 | 1 | – | – | 1.1, 4.1, 5.1, 6.1 | |||||
| BD23 | Buffalo 6818 | 1 | 1 | – | – | ||||||
| BD24 | Buffalo 6819 | 1 | |||||||||
| BD25 | 6998 cl. 11 | – | – | 1 | 1 | ||||||
| BD26 | Buffalo 7065 | – | – | 1 | – | – | |||||
| Cattle-derived | |||||||||||
| CD16 | Tc867 BT 106 | – | – | 1 | 1 | – | – | 1 | 1 | 1 | 1 |
| CD17 | Tc841 IL 57 | – | – | 1 | 1 | 1 | 1 | 1 | 1 | ||
| CD28 | Tc120 F344 | – | – | – | – | 1 | 1.1, 4.1, 5.1, 6.1 | ||||
The letters-numbers in this row indicate the reference sequence in the NCBI Genbank.
Newly identified alleles and protein variants are in boldface.
A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 8 if labelled 8.1, as in BD2, Tp7).
Although the nucleotide sequence had unknown (mixed) nucleotides (i.e. S = C or G, R = A or G, Y = C or T), upon translation, these nucleotides resulted in synonymous protein sequence.
Unique, validated alleles or protein variants obtained from partial sequences which are their own variant.
These sequences were shorter than the rest and could have been any of the variants noted in the table above as these sequence areas were an identical overlap with variants 1, 4, 5 and 6.
Although we were unable to determine if the sequence is novel due to unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions.
Partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis, if also labelled with *.
Diversity within the individual antigens
| TpAg | No. nucleotide variants/No. sequences obtained | No. protein variants/No. of sequences obtained. | No. protein variants/No. nucleotide variants | No. non-synonymous SNP locations/Total No. SNP locations |
|---|---|---|---|---|
| Tp1 | ||||
| Tp2 | 92/220 (0.42) | |||
| Tp3 | 13/24 (0.54) | 9/24 (0.38) | 9/13 (0.69) | 12/26 (0.46) |
| Tp4 | 13/24 (0.54) | 6/24 (0.25) | 6/13 (0.46) | 5/36 (0.14) |
| Tp5 | 6/16 (0.38) | 3/16 (0.19) | 3/6 (0.5) | 3/26 (0.12) |
| Tp6 | 13/23 (0.57) | 2/23 (0.09) | 2/13 (0.15) | 1/23 (0.04) |
| Tp7 | 9/25 (0.36) | 5/25 (0.20) | 3/9 (0.33) | 4/33 (0.12) |
| Tp8 | 12/22 (0.55) | 7/22 (0.32) | 7/12 (0.58) | 6/19 (0.32) |
| Tp9 | NA | |||
| Tp10 | 13/22 (0.59) | 6/22 (0.27) | 6/13 (0.46) | 7/24 (0.29) |
Due to extensive gene diversity, it was not feasible to conduct this analysis.
Numbers in bold indicate the top three most diverse TpAg per group.
Tp 2 epitopes observed in the buffalo- and cattle-derived theileria stabilates
| Sample ID | Epitope 1 | Epitope 2 | Epitope 3 | Epitope 4 | Epitope 5 | Epitope 6 |
|---|---|---|---|---|---|---|
| BD18 | TNEELKKLGMV (V16) | EGLDMEALF (V12) | KTSKGMTKVGR (V15) | FGQSIKCVA (V8) | QSIKCVAQH (V7) | KGDVPNPCDW (V10) |
| BD19 | SHEELKKLGML (V1) | DGFDRDALF (V1) | KSSHGMGKVGK (V1) | FAQSLVCVL (V1) | QSLVCVLMK (V1) | KTSIPNPCKW (V1) |
| BD20, 21 | SDNELDTLGLL (V4) | PDLDKNRLF (V2) | LTSHGMGKIGR (V3) | LAASIKCVS (V3) | ||
| BD24 | PDPDKQRLF (V8) | LTSKAMSTVGK (V11) | QSIYCVANN (V10) | KGDVPNPCQW (V8) | ||
| BD25 | SDDELRKLGML (V15) | DGFDRDALF (V1) | KSSHGMGKVGR (V14) | FVESIMCVI (V14) | ESIMCVIKK (V14) | KEDVPNPCDW (V3) |
| CD16 | SHEELKKLGML (V1) | DGFDRDALF (V1) | KSSHGMGKVGK (V1) | FAQSLVCVL (V1) | QSLVCVLMK (V1) | KTSIPNPCKW (V1) |
| CD17 | SDNELDTLGLL (V4) | PDLDKNRLF (V2) | LAASIKCVS (V3) | ASIKCVSHH (V3) | KPSVPNPCDW (V2) | |
| CD28 |
Newly identified epitope sequences are in boldface.
Epitope variant number is found in (). Original epitope variant numbers are defined in Pelle et al. 2011. Epitope variant numbers are separate from protein variant numbers.