Literature DB >> 29729680

Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

Tatjana Sitt1, Roger Pelle2, Maurine Chepkwony1, W Ivan Morrison3, Philip Toye1.   

Abstract

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.

Entities:  

Keywords:  African buffalo; CD8+ cytotoxic T cell antigens; Theileria parva; epitope; genetic diversity

Mesh:

Substances:

Year:  2018        PMID: 29729680      PMCID: PMC6126638          DOI: 10.1017/S0031182018000264

Source DB:  PubMed          Journal:  Parasitology        ISSN: 0031-1820            Impact factor:   3.234


Introduction

Cattle constitute an important source of livelihood for African pastoralists, whether for societal status, income generation through meat or milk production and sale, or protection against additional economic hardships (Anderson, 2003; Mwacharo and Drucker, 2005; Ilatsia et al. 2011). The implementation of preventative measures against disease is imperative to maintain herd health and productivity, including those situations where livestock share grazing habitat with wild animal populations (Grootenhuis and Olubayo, 1993; Sitt et al. 2015). Vaccination is one of the most effective methods of disease prevention. The development of new or improved vaccines is aided by a clear understanding of the molecular interactions between the host and pathogen, including potential genetic diversity in the parasite antigens that induce protective immune responses. Antigenic diversity and antigen variation are exploited by pathogens for immune evasion in diseases such as malaria, trypanosomiasis and Lyme disease (Frank, 2002). The protozoan parasite Theileria parva (T. parva) is of particular concern for pastoralists in sub-Saharan Africa. It is an apicomplexan parasite which induces lymphoproliferative disease in cattle and is transmitted by tick vectors, predominantly Rhipicephalus appendiculatus. Infections in cattle may be initiated by ticks that have acquired infection from other cattle or from African buffalo (Syncerus caffer) and these give rise to distinct clinical syndromes, referred to as East Coast fever (ECF) and Corridor disease (CD), respectively (Neitz et al. 1955; Barnett and Brocklesby, 1966). In both cases, the disease causes severe economic losses through morbidity and mortality throughout a large part of eastern, southern and central Africa (Conelly, 1998). A vaccination procedure used to protect cattle from ECF, called the Infection and Treatment Method (ITM), is successfully employed in parts of Zambia, Tanzania, Uganda, Malawi and Kenya, but generally only where pasture is grazed predominantly by cattle (Uilenberg et al. 1977; Morzaria et al. 2000). The most commonly used version of the ITM vaccine, which comprises a mixture of three T. parva isolates and is known as the Muguga cocktail, has been shown not to protect cattle introduced into an area previously grazed only by buffalo (Sitt et al. 2015). Parasite-specific CD8+ T cells, which recognize epitopes presented on the surface of T. parva-infected lymphoblasts, have been shown to play a key role in mediating protection in animals immunized by ITM (McKeever et al. 1994). Immunity induced by ITM with one parasite isolate frequently does not provide complete protection against challenge with other parasite isolates and lack of protection has been shown to correlate with parasite strain specificity of the CD8+ T cell responses (Taracha et al. 1995). Ten antigens expressed by the intra-lymphocytic schizont stage of the parasite, designated Tp1–Tp10, have been described as targets of CD8+ T cells from immune cattle, and thus far, CD8+ T cell epitope regions have been defined for eight of these antigens (Gardner et al. 2005; Graham et al. 2006, 2007; Pelle et al. 2011; Hemmink et al. 2016). Sequence analyses of the genes encoding two of these antigens, Tp1 and Tp2, have revealed extensive sequence diversity, which was particularly pronounced in buffalo-derived parasites (Pelle et al. 2011). Specific CD8+ T cell responses to both antigens in ITM-immunized cattle have been shown to be parasite strain-specific, although the role of these antigens in immune protection has not been formally demonstrated (MacHugh et al. 2009; Connelley et al. 2011). A more limited analysis of the Tp9 gene has also revealed marked sequence diversity (Hemmink et al. 2016). Diversity in the genes encoding the remaining T. parva antigens (TpAg) has not been examined in detail. In an effort to increase our understanding of the overall antigenic diversity in T. parva, we analysed the sequences of genes encoding 10 TpAg in infected cell lines derived predominantly from buffalo, which previous studies have indicated harbour greatest parasite genetic diversity. The results confirmed the high level of diversity among the Tp1, Tp2 and Tp9 antigens, suggesting that these would be most useful for parasite strain typing. On the other hand, the much more limited diversity displayed by the other antigens, especially in their known CTL epitopes, indicates that these would be the more preferable for inclusion in a subunit vaccine.

Materials and methods

Parasite isolates

The samples used in these studies are listed in Tables 1–3. Nineteen of the buffalo-derived stabilates collected from several regions in Kenya have been described previously (Conrad et al. 1987; Grootenhuis et al. 1987; Baldwin et al. 1988; Pelle et al. 2011) (Supplementary Table S2). Sample BD19 from buffalo 5012 was recovered from the ILRI biorepository, although the history of the buffalo was not found. Samples from three cattle-derived infected cell lines were also included in the analysis for comparison – samples CD16 and CD17 have been described previously (Pelle et al. 2011) and originate from Nyairo, Kenya, while the third, CD28, is from Ngong, Kenya. Tp9 analysis was conducted on DNA from an additional 12 cell lines from seven buffalo-derived field stabilates. Polymerase chain reaction (PCR) amplicons from these DNA samples and from fifteen of the stabilates described above (Table 3) were cloned and sequenced.
Table 3.

Allele and protein variants of Tp9 observed in the buffalo- and cattle-derived theileria stabilates

Sample IDStabilate nameGenomic DNAaCloned DNAa
Allele TP02_0895bProtein XP_765463bEpitopeAlleleProteinEpitope
Buffalo-derived T. parva
BD1Mara 3422AKFPGMKKG24NKFKSMGIG
BD2Mara 435AKFPGMKKG
BD3Mara 1813c15EKFQRMGIG719AKFPGMKKG
BD4Mara 301524EKFQRMGIG1315EKFQRMGIG
BD5Mara 3224NKFKSMGIG
BD6Mara 42422AKFPGMKKG1016EKFQRMGIG
BD76998186EKFQRMGIG35AKFPGMKKG
BD869991718EKFQRMGIG
BD970011423EKFQRMGIG
BD1075461627EKFKHMGIG
BD147014122NKFKSMGIG
BD15769824NKFKSMGIG
BD167344617AKFPNLKAR98EKFKRMGMK
BD17Mara 41
BD184740113NKFKSMGIG
BD19501211AKFPGMKKS
BD205489
BD216666
BD226807
BD236818
BD246819
BD256998 cl. 11617AKFPNLKAR
BD267065
BD277014, st.3506a422AKFPGMKKG
BD287014, st.3570a11AKFPGMKKS
BD297014, st.3081a267EKFKHMRIG
BD307014, st.3081a2511EKFKRMGMK
BD317014, st. 3081a2412EKFKRMGMK
BD327014, st.3081a2126NKFPGMKKG
BD337014, st.3081a2325AKFPGMKKG
BD347014, st. 3081a199NKFKHMRIG
BD357752, st. 4110a2713EKFKHMGIG
BD367344, st. 3591a2814EKFKHMGIG
BD377065, st. 3120a2021EKFQRMGIG
BD386999, st. 3123a2220AKFPGMKKG
BD396998, cl.19a186EKFQRMGIG
Cattle-derived T. parva
CD16Tc 867 Bt10611AKFPGMKKS
CD17Tc 841 IL578EKFQRMGIG
CD28Tc 120 F344510RKFQRMGIG

Genomic DNA sequences were obtained directly from purified DNA; cloned DNA sequences were obtained from plasmids into which genomic DNA had been cloned.

Reference sequence accession number.

Newly identified alleles, protein variants and epitopes are in boldface.

Allele and protein variants of Tp1–4 observed in the buffalo- and cattle-derived T. parva stabilates We have previously published (Pelle et al. 2011) the Tp1 and Tp2 results for samples BD1-BD16. They are included here in italics for completeness. The letters-numbers in this row indicate the reference sequence in the NCBI Genbank. Newly identified alleles and protein variants are in boldface. Although we were unable to determine if the sequence is novel due to four unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions. Unique, validated alleles or protein variants obtained from partial sequences which are their own variant. A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 4 if labelled 4.1, as in BD22, Tp4). partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis (ie: no complete consensus sequence). Allele and protein variants of Tp5–8 and Tp10 observed in the buffalo- and cattle- derived T. parva stabilates The letters-numbers in this row indicate the reference sequence in the NCBI Genbank. Newly identified alleles and protein variants are in boldface. A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 8 if labelled 8.1, as in BD2, Tp7). Although the nucleotide sequence had unknown (mixed) nucleotides (i.e. S = C or G, R = A or G, Y = C or T), upon translation, these nucleotides resulted in synonymous protein sequence. Unique, validated alleles or protein variants obtained from partial sequences which are their own variant. These sequences were shorter than the rest and could have been any of the variants noted in the table above as these sequence areas were an identical overlap with variants 1, 4, 5 and 6. Although we were unable to determine if the sequence is novel due to unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions. Partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis, if also labelled with *. Allele and protein variants of Tp9 observed in the buffalo- and cattle-derived theileria stabilates Genomic DNA sequences were obtained directly from purified DNA; cloned DNA sequences were obtained from plasmids into which genomic DNA had been cloned. Reference sequence accession number. Newly identified alleles, protein variants and epitopes are in boldface.

DNA extraction

Sixteen buffalo-derived DNA samples have previously been described (Pelle et al. 2011). DNA from the additional samples (see Parasite isolates) was extracted using DNeasy Blood and Tissue Kit (Qiagen, USA). An aliquot of DNA was diluted to 100 ng µL−1 with either AE buffer or RNase/DNase free water and frozen. The stock DNA was stored at −20 °C.

PCR and sequencing of stabilates

PCR amplicons were generated from genes encoding the 10 previously described T. parva antigens (TpAg). PCR master mixes comprised 10 µL 10 ×  buffer (Applied Biosystems), 10 µL 25 mm MgCl2, 1 µL 25 mm dNTP, 1 µL 100 pmole µL−1 forward and reverse primers, 0.5 µL Ampli-Taq Gold (Applied Biosystems) and H2O added to a final volume of 30ul, containing between 2.5 and 5 µL of 100 ng µL−1 gDNA. Cycling conditions can be found in Supplemental Table 1. Tp1 and Tp2 primers have previously been described (Pelle et al. 2011). TpAg information including Genebank accession numbers of references and epitope sequences can be found in Tables 1–4. Primers for Tp3, Tp4, Tp5, Tp6, Tp7, Tp8, Tp9 and Tp10 can be found in Table 5.
Table 5.

Comparative sequence information on TpAg1–10 observed in the buffalo- and cattle-derived theileria stabilates

AntigenPrimers 5’ – 3’ Forward ReversePredicted length of amplicona (%)Lengths of readable sequenceTotal no. sequences obtainedTotal no. SNP locationsbTotal no. allelesc,dTotal no. non-synonymous SNP locationsTotal no. protein variantsc,dTotal no. epitope variants
Tp1ATGGCCACTTCAATTGCATTTGCC GAAGCTCATAAATATTTCATT387 (22)363–399111981682
Tp2ATGAAATTGGCCGCCAGATTA GAAGCCTCCGGCACTTCATAG483 (47)456–48092207927See Table 2
Tp3ATGAAATTAAATACTATCGCAATAG ACCTTCACTAAGAGTCCAGACGATAAT735 (89)686–726242613129n/a
Tp4CCAATTGGTGATTTAGCAACA CTCCGTAAATCATGTTAATGGAGT692 (27)386–422243613561
Tp5ATGAGGAAGCGAGTTTGGG CGATATCGATGATTTATAA238 (31)200–23816266331
Tp6ATGGCTCAGATTCCTGTTG CTCTTACTCTCAACTGATAAATAA1005 (92)569–83323231312n/a
Tp7ATGGAGGCACTGCAAGCAGGC GTCACTGACGAGGAGGAGAAAAAG417 (14)299–33225339431
Tp8ATGCTTGGAAATCATGTCATGGGA CTCCTTCGCAGTGTACGCAGTTTAA1275 (81)931–1109221912671
Tp9ATGAATGTTCTAACTACTGG CGTAATAAACCATGGACAAAACAATAA958 (95)326–105737>10028>1002712
Tp10GACTACGCTCTTCTTACCAC CCATCCAGGAGTGTAATGCC916 (89)505–82422241376n/a

The sequence length of the predicted amplicon from the reference sequence, excluding primers, is shown. The figures in brackets represent the length of the amplicon as a percentage of the length of the coding sequence from the reference gene, including introns.

SNPs within introns are not included.

Alleles or protein variants which have ambiguous residues (labelled X.1 in Table 1) are not included, unless they are unique, validated variants.

Unique, validated alleles or protein variants obtained from partial sequences are included if the constitute a variant.

Tp 2 epitopes observed in the buffalo- and cattle-derived theileria stabilates Newly identified epitope sequences are in boldface. Epitope variant number is found in (). Original epitope variant numbers are defined in Pelle et al. 2011. Epitope variant numbers are separate from protein variant numbers. Comparative sequence information on TpAg1–10 observed in the buffalo- and cattle-derived theileria stabilates The sequence length of the predicted amplicon from the reference sequence, excluding primers, is shown. The figures in brackets represent the length of the amplicon as a percentage of the length of the coding sequence from the reference gene, including introns. SNPs within introns are not included. Alleles or protein variants which have ambiguous residues (labelled X.1 in Table 1) are not included, unless they are unique, validated variants.
Table 1.

Allele and protein variants of Tp1–4 observed in the buffalo- and cattle-derived T. parva stabilates

Sample IDStabilate nameTp1aTp2aTp3Tp4
Allele TP03_0849bProtein XP_762973Allele TP01_0056Protein XP_765583Allele TP01_0868Protein XP_763892Allele TP03_0210Protein XP_763228
Buffalo-derived T. parva
BD1Mara 31615886c421
BD2Mara 41199109
BD3Mara 18171610107634
BD4Mara 301111116434
BD5Mara 321112126495
BD6Mara 42181713138986
BD7Mara 699811141438123
BD8Mara 69991918772551
BD9Mara 7001201415154234
BD10Mara 75462119161611951
BD14Buffalo 70141120153
BD15Buffalo 769814122120109133
BD16Buffalo 73442421775334
BD17Mara 415*3*11d1
BD18Buffalo 47401144431*1*133
BD19Buffalo 5012111113*e9*11
BD20Buffalo 54893613484710*9*133
BD21Buffalo 6666403648473*8*21
BD22Buffalo 680713114.1f1.1
BD23Buffalo 6818131110*9*41
BD24Buffalo 68193735454210*9*101
BD256998 cl. 113837494538123
BD26Buffalo 70659.15
Cattle-derived T. parva
CD16Tc867 BT 10611111111
CD17Tc841 IL 572247469773
CD28Tc120F3443917464412962

We have previously published (Pelle et al. 2011) the Tp1 and Tp2 results for samples BD1-BD16. They are included here in italics for completeness.

The letters-numbers in this row indicate the reference sequence in the NCBI Genbank.

Newly identified alleles and protein variants are in boldface.

Although we were unable to determine if the sequence is novel due to four unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions.

Unique, validated alleles or protein variants obtained from partial sequences which are their own variant.

A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 4 if labelled 4.1, as in BD22, Tp4).

partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis (ie: no complete consensus sequence).

Unique, validated alleles or protein variants obtained from partial sequences are included if the constitute a variant. PCR amplicons were purified either by centrifugal precipitation in PEG8000/MgCl2 or Qiagen PCR purification kit (Qiagen, USA) and samples eluted in Elution Buffer (EB) or molecular grade water. Purification via Qiagen kit was conducted according to manufacturer's protocol. Purification via PEG8000/MgCl2 centrifugation was conducted as briefly outlined below: 175 µL of EB was added to 25μL of PCR product and mixed thoroughly with 100 µL 30% PEG8000/30 mm MgCl2. Mixtures were immediately centrifuged at 10 000  for 20 min at room temperature after which the supernatant was removed and the pellet was dissolved in 10 µL water. 1 µL was used to confirm recovery on a gel. Aliquots of PCR purified products were sequenced at the BecA-ILRI Hub, Nairobi, Kenya. Sanger sequencing was conducted on the purified PCR products using the GeneAmp® 9700 system (Applied Biosystems). Additional samples (BD33-BD44,and BD26, Table 3) were processed and sequenced after cloning into pGEM-T Easy plasmid vector (Promega) and sequenced as previously described (Pelle et al. 2011). These additional Tp9 sequences have previously been submitted to Genbank with the accessions numbers JN828553, JN828535, JN828556, JN828557, JN828558, JN828559, JN828560, JN828561, JN828562, JN828563, JN828564 and JN828565, and JQ735950.

Sequence analysis

Sequence analysis, alignments and prediction of encoded proteins were conducted using Geneious version 7.1.4 created by Biomatters (http://www.geneious.com) for all T. parva antigens, with variant alignments conducted using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) or SCSC Biology WorkBench (http://seqtool.sdsc.edu/). Additional Tp9 sequences were translated into protein sequences using EMBOSS-Transeq software (Rice et al. 2000) and alignments performed using CLUSTALW version 2.0 (Larkin et al. 2007). TpAg epitope information has been described in detail elsewhere (Graham et al. 2006, 2008; Akoolo et al. 2008; Nene et al. 2012; Svitek et al. 2014). All sequence lengths described in this paper are post-editing for removal of both primer sequence and weak or mixed trace peaks at the sequence ends-this is expected with Sanger sequencing. Sanger sequencing results were obtained in both the forward (sense) and reverse (anti-sense) directions for each sample. Consensus sequences for each sample were identified through detailed analysis of both forward and reverse sequence chromatograms and individual nucleotide trace signals. Occasionally sequence in both directions resulted in multiple traces (double peaks) at a specific nucleotide location. In these cases, the ‘mixed’ nucleotide received the corresponding IUPAC ambiguity code. These sequences are identified in Tables 1 and 2. No sequence had more than four ambiguous residues with the exception of Buffalo 5012, Tp6. Some ambiguous nucleotides were observed to be non-synonymous.
Table 2.

Allele and protein variants of Tp5–8 and Tp10 observed in the buffalo- and cattle- derived T. parva stabilates

Sample IDStabilate nameTp5Tp6Tp7Tp8Tp10
Allele TP02_0767aProtein XP_765334Allele TP01_0188Protein XP_765715Allele TP02_0244Protein XP_764810Allele TP02_0140Protein XP_764709Allele TP04_0772Protein XP_764408
Buffalo-derived T. parva
BD1Mara 33b113172127111
BD2Mara 411328.1, 2.1c2d
BD3Mara 18116132101
BD4Mara 302*e1*71222410.11
BD5Mara 321*1*817212765
BD6Mara 421113.11.12211711
BD7Mara 699842118266125
BD8Mara 69991121725672
BD9Mara 7001114172101
BD10Mara 7546115172111
BD14Buffalo 701411918211713*e1.1, 4.1, 5.1, 6.1*f
BD15Buffalo 7698311218*2*4534
BD16Buffalo 73441118266
BD17Mara 415.12d24101
BD18Buffalo 474012.1g12233101
BD19Buffalo 50121*1*9.11111111
BD20Buffalo 5489131727261.1, 4.1, 5.1, 6.1f
BD21Buffalo 666613172975*e1.1, 4.1, 5.1, 6.1*f
BD22Buffalo 68071*1*5210.1, 11.16.1, 7.19*e1.1, 4.1, 5.1, 6.1*f
BD23Buffalo 68186*e1*1318.12107
BD24Buffalo 68195*e3*e1019.1g3d11783
BD256998 cl. 111118266111
BD26Buffalo 706513*1*62117
Cattle-derived T. parva
CD16Tc867 BT 106111111
CD17Tc841 IL 57111*1*1126
CD28Tc120 F3444*e1*8741.1, 4.1, 5.1, 6.1f

The letters-numbers in this row indicate the reference sequence in the NCBI Genbank.

Newly identified alleles and protein variants are in boldface.

A variant with 0.1 after the variant number indicates a sequence which is identical to the parent variant but has 1–4 unresolved nucleotides which prevents its being regarded as completely identical to the parent allele (e.g. 8 if labelled 8.1, as in BD2, Tp7).

Although the nucleotide sequence had unknown (mixed) nucleotides (i.e. S = C or G, R = A or G, Y = C or T), upon translation, these nucleotides resulted in synonymous protein sequence.

Unique, validated alleles or protein variants obtained from partial sequences which are their own variant.

These sequences were shorter than the rest and could have been any of the variants noted in the table above as these sequence areas were an identical overlap with variants 1, 4, 5 and 6.

Although we were unable to determine if the sequence is novel due to unknown (mixed) nucleotides, the sequence is considered novel due to unique nucleotides in other sequence regions.

Partial sequence as defined by only having either the forward or reverse sequence of the allele available for analysis, if also labelled with *.

Where only one sequence from one direction was obtained, all trace peaks were required to have strong signal intensity before including the sequence in data analysis. We define these sequences as partial sequences in the paper and these partial sequences are noted with an *. Occasionally these partial sequences yielded novel single nucleotide polymorphism (SNP) variants due to SNPs. These sequences are indicated in the Tables. To confirm sequence identity, a GenBank database search (nucleotide and protein BLAST) of all the consensus nucleotide and amino acid sequences was conducted (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Additionally, the sequences were compared with available GenBank sequences via BLAST to determine if the sequences were novel or were a 100% match to previously published data. Sequences that did not have 100% identity with published sequences were considered novel. 100% identity with a sequence from BLAST was only considered if the query coverage was over 70%. A new gene allele, protein variant or epitope variant was also confirmed by alignments with previously published sequences, as described further below. Alignments also allowed for detection of sequence introns or insertions and deletions (indels) in the obtained sequences. Where applicable, introns were removed from the DNA sequences before translation into protein. All amplicons were categorized into allele and protein variants based on the following criteria; (1) the presence of at least one SNP difference from any published sequence and (2) the presence of at least one single amino acid difference from any published sequence. It is recognized that variant categorization could change if longer sequences are obtained in the future. Stabilate gene and protein sequence GenBank accession numbers can be found in Supplemental Table S2. Some Tp9 sequences had previously been deposited in GenBank, but had not undergone formal analysis in a publication and were included in this paper.

Results

A total of 23 DNA samples from infected cell lines derived from buffalo, three samples from cattle-derived (non-buffalo-associated) cultured infected cell lines and one cloned cell line obtained from a buffalo-derived stabilate were analysed. An additional 12 Tp9 sequences (from cell lines derived from buffalo) were analysed. Detailed information including primer sequence, length of PCR amplicons, edited sequence length, number of SNPs, nucleotide and protein variants have been summarized for all Tp genes in Table 5; limited information of these specific parameters is given in the text. Amplicons were not generated for each gene from every sample and not all amplicons generated readable sequences (Tables 1–3). The Tp1 and Tp2 sequences from 16 of the cell lines mentioned in this study have been published previously (Pelle et al. 2011) and are included for completeness. The nucleotide and amino acid sequences are available as sequence alignments in Supplementary Tables S3–S12. GenBank accession numbers for these sequences are given in Supplementary Table 2. A summary of the findings for the data below can be found in Tables 1–5, and 6. Table 6 also shows the percentage of the full-length gene represented by the amplicon for each gene.
Table 6.

Diversity within the individual antigens

TpAgNo. nucleotide variants/No. sequences obtainedNo. protein variants/No. of sequences obtained.No. protein variants/No. nucleotide variantsNo. non-synonymous SNP locations/Total No. SNP locations
Tp18/11 (0.73)8/11 (0.73)8/8 (1)16/19 (0.84)
Tp27/9 (0.78)7/9 (0.78)7/7 (1)92/220 (0.42)
Tp313/24 (0.54)9/24 (0.38)9/13 (0.69)12/26 (0.46)
Tp413/24 (0.54)6/24 (0.25)6/13 (0.46)5/36 (0.14)
Tp56/16 (0.38)3/16 (0.19)3/6 (0.5)3/26 (0.12)
Tp613/23 (0.57)2/23 (0.09)2/13 (0.15)1/23 (0.04)
Tp79/25 (0.36)5/25 (0.20)3/9 (0.33)4/33 (0.12)
Tp812/22 (0.55)7/22 (0.32)7/12 (0.58)6/19 (0.32)
Tp928/37 (0.76)27/37 (0.73)27/28 (0.96)NAa
Tp1013/22 (0.59)6/22 (0.27)6/13 (0.46)7/24 (0.29)

Due to extensive gene diversity, it was not feasible to conduct this analysis.

Numbers in bold indicate the top three most diverse TpAg per group.

Diversity within the individual antigens Due to extensive gene diversity, it was not feasible to conduct this analysis. Numbers in bold indicate the top three most diverse TpAg per group.

Nucleotide variant diversity

As shown in Tables 1–3, between 5 and 12 novel variants were identified in all the antigens analysed. To compare the nucleotide diversity among the antigen genes, a nucleotide diversity ratio was determined for each gene by calculating the ratio of the number of nucleotide variants to the total number of sequences obtained for that gene (Table 6). Tp7 and Tp5 had the least allele diversity, with diversity ratios of 0.36 and 0.38, respectively, whereas Tp1, Tp2 and Tp9 had the highest allele diversity ratios, ranging from 0.73 to 0.78. Tp3, Tp4, Tp6, Tp8 and Tp10 were moderately diverse with ratios between 0.54 and 0.57. We also examined the allele sequences for the presence of novel indels or introns. Only one novel Tp3 allele, v9, had the in-frame trinucleotide insertion AAA (amino acid K) found in the reference sequence and there are numerous indels throughout the Tp9 gene. Interestingly, sequencing of the cloned Tp9 PCR products revealed six different sequences from one buffalo (BD27-BD34), suggesting that the buffalo was either infected concurrently or in succession with several strains of T. parva.

Protein variant diversity

Between one and eight novel protein variants were identified in antigens analysed. As for the nucleotide diversity, we assessed the protein variant diversity by calculating the ratio of the number of protein variants to the number of sequences obtained for each antigen (Table 6). Tp1, Tp2 and Tp9 were again the most diverse with ratios between 0.73 and 0.78. On the other hand, Tp6 was by far the most conserved protein (ratio of 0.09) reflecting the fact that only one sample (BD2) expressed a Tp6 antigen which varied from the reference sequence. The remaining antigens showed moderate diversity ratios, which were all lower than the corresponding nucleotide diversity ratios. A more direct measure of this is the ratio of the number of protein variants to the number of nucleotide variants per antigen. For Tp1, Tp2 and Tp9, the ratios were 1, 1 and 0.96, as, with one exception for Tp9, every allelic variant encodes a unique protein. On the other hand, Tp6 showed a low ratio of protein to nucleotide variants of 0.15, with the remaining antigens revealing moderate ratios of 0.46–0.69. In addition to these observations, it was noted that the most common protein variant of Tp3 (v9) was found in nine of the 24 samples sequenced, while each of the five new Tp4 protein variants differed by only a single amino acid from the reference sequence.

Synonymous vs non-synonymous substitutions

As another direct comparison of diversity among the genes, we determined the total number of SNP locations and the ratio of the number of non-synonymous to synonymous SNP locations for all antigens except Tp9. Tp9 had extensive diversity for which there is a large number of SNP locations and indels distributed throughout the gene. The Tp2 gene had by far the greatest number of SNP locations (220 or 46% of the predicted amplicon length) compared with the other genes which ranged from 19 to 36 SNP locations. Tp1 showed the lowest conservation of protein sequences with a ratio of 0.84, compared with Tp4, Tp5, Tp7 and especially Tp6, all of which showed ratios of less than 0.20. Remarkably, Tp6 differed at only one amino acid residue in the single protein sequence which differed from the reference sequence. We also noted that, among the Tp5 sequences, those from BD7 and BD24 expressed the majority of SNPs and shared a high degree of similarity, while the other novel Tp5 alleles differed from the reference sequence at 1, 2 or 3 residues only.

Diversity in epitope sequences

CTL epitopes have not yet been defined for two (Tp3 and Tp6) of the 10 antigens included in this study while the recently reported epitope for Tp10 (Hemmink et al. 2016) lies outside the region sequenced in this paper. Remarkably, no epitope diversity was observed in Tp4, Tp5, Tp7, or Tp8. On the other hand, new epitope sequences were found at all six Tp2 epitope locations, with one sample (CD28) expressing new variants at all six locations (Table 4). Between 5 and 7, epitope variants were observed at each of the six epitope regions for the nine sequenced samples. For Tp9, of the 37 samples which were successfully sequenced, there were 11 variants of the single defined epitope and these were represented between 1 and 10 times within the sequences.
Table 4.

Tp 2 epitopes observed in the buffalo- and cattle-derived theileria stabilates

Sample IDEpitope 1Epitope 2Epitope 3Epitope 4Epitope 5Epitope 6
BD18TNEELKKLGMV (V16)EGLDMEALF (V12)KTSKGMTKVGR (V15)FGQSIKCVA (V8)QSIKCVAQH (V7)KGDVPNPCDW (V10)
BD19SHEELKKLGML (V1)DGFDRDALF (V1)KSSHGMGKVGK (V1)FAQSLVCVL (V1)QSLVCVLMK (V1)KTSIPNPCKW (V1)
BD20, 21SDNELDTLGLL (V4)PDLDKNRLF (V2)LTSHGMGKIGR (V3)LAASIKCVS (V3)ASIKCVSQH (V20)aKPSIPNPCDW (V19)
BD24SDSELETLGML (V26)aPDPDKQRLF (V8)LTSKAMSTVGK (V11)FAQSIYCVA (V21)QSIYCVANN (V10)KGDVPNPCQW (V8)
BD25SDDELRKLGML (V15)DGFDRDALF (V1)KSSHGMGKVGR (V14)FVESIMCVI (V14)ESIMCVIKK (V14)KEDVPNPCDW (V3)
CD16SHEELKKLGML (V1)DGFDRDALF (V1)KSSHGMGKVGK (V1)FAQSLVCVL (V1)QSLVCVLMK (V1)KTSIPNPCKW (V1)
CD17SDNELDTLGLL (V4)PDLDKNRLF (V2)LTSHGMGKIAR (V25)LAASIKCVS (V3)ASIKCVSHH (V3)KPSVPNPCDW (V2)
CD28TEEELKKMGMV (V27)EGFDKEKLF (V22)KSSKSMGIVGR (V24)FVQSIMCVI (V20)QSIMCVINK (V21)VNDIPNPCKW (20)

Newly identified epitope sequences are in boldface.

Epitope variant number is found in (). Original epitope variant numbers are defined in Pelle et al. 2011. Epitope variant numbers are separate from protein variant numbers.

In all, the results indicate that the substantial epitope variation previously observed in Tp2 and Tp9 is not a universal feature of T. parva CTL antigens.

Discussion

The studies reported in this paper were undertaken to evaluate the extent of diversity among genes encoding T. parva CD8+ T cell antigens from buffalo-derived parasites. Sequence diversity is a prominent feature of the two previously studied T. parva antigens, Tp1 and Tp2, particularly in buffalo-derived T. parva (Pelle et al. 2011). Antigenic diversity, or lack thereof, can be important in parasite immune evasion, and, for T. parva, previous studies have implicated epitope sequence variation in evasion from the CTL response (MacHugh et al. 2009; Connelley et al. 2011). Of practical significance, widespread sequence diversity among functional CD8+ T cell antigens would greatly increase the challenge of designing a broadly protective subunit vaccine against the parasite. The overall conclusion derived from the data reported here is that, although there was substantial diversity at the nucleotide level, extensive diversity in the amino acid sequences is not a feature of all antigens. In particular, no polymorphism was observed in the CD8+ T cell epitope sequences in four antigens (Tp4, 5, 7 and 8) for which the epitopes have been defined, compared with the marked variation in epitope sequences observed in Tp2 and Tp9 and to a lesser extent in Tp1. There was a considerable difference in the degree of diversity observed in the antigen sequences, as summarized in Tables 5 and 6. The findings on diversity were relatively consistent based on ratios obtained for the following parameters: (1) the number of nucleotide variants/number of sequences obtained, (2) the number of protein variants/number of sequences obtained, (3) the number of protein variants/number of nucleotide variants, and (4) the number of non-synonymous SNP locations/total number of SNP locations. Tp1, Tp2 and Tp9 were consistently the most diverse (the highest ratios) whereas Tp4, Tp5, Tp6 and Tp7 were consistently the least diverse (the lowest ratios). No diversity was observed in the epitope sequences of the Tp4, Tp5, Tp7 and Tp8 antigens, suggesting that extensive diversity is not a universal feature of T. parva CTL antigens. However, the data need to be interpreted with some caution, given that the sequences which were analysed represent different proportions of the coding region of each gene. We also detected very few novel indels and introns, suggesting that the major diversity among the antigen genes is due to SNPs and not variations in the indel or intron composition. In comparing the different antigen genes, Tp1, Tp2 and Tp9 were the most challenging to sequence, sometimes resulting in failure to obtain a PCR product or in sequences that were obviously mixed. In contrast, sequences obtained from Tp6, Tp7 and Tp8 usually resulted in good quality sequences. A likely explanation for the former inconsistency is that these genes possess sequence diversity in the region corresponding to one or both primers. Although the primers used were initially shown to generate products from a panel of laboratory isolates of T. parva, it is possible that some allelic variants are not amplified. As further genome sequence data becomes available for more T. parva isolates, it may be possible to develop improved PCR primers. However, at least for Tp2 and Tp9, this may still be challenging in view of the extensive nucleotide differences throughout these genes. The ambiguous mixed sequences are almost certainly due to infection of the cell lines with two or more strains of the parasite, resulting in several sequences in the PCR amplicon. It has been known for some time that cells infected with the closely related theilerial parasite, T. sp. buffalo, can be established in vitro from infected buffalo (Zweygarth et al. 2009). Recent research published after the current study has also shown that this parasite can infect cattle and that cell lines established from buffalo or cattle co-grazed with buffalo may be co-infected with T. parva and T. sp. buffalo (Bishop et al. 2015). To avoid the possibility of obtaining sequences from both Theileria species, we used species-specific primers to screen for the presence or absence of T. parva and T. sp. buffalo (Bishop et al. 2015) and only used cell lines that did not have a detectable infection with T.sp. buffalo It should be noted that the sequences obtained may not be strictly representative of the parasite populations in vivo, as the approach of direct sequencing of PCR amplicons may generate a bias, in the case of infections with mixed genotypes, to strains that are more abundant or whose sequences are more efficiently amplified by the PCR primers employed. Additionally, maintenance of Theileria-infected cells in culture may select for parasite-infected cells which are more suited to in vitro cell propagation, thus altering the composition of the parasites originally present in the host. Hemmink et al. (2016) have previously shown that 454 deep sequencing (Roche) can be used to reveal alleles present at a low frequency in the parasites in the Muguga cocktail ITM vaccine. In a parallel study, we have applied the same high-throughput sequencing approach to compare the sequences of six of TpAg genes (Tp1, Tp2, Tp4, Tp5, Tp6 and Tp10) examined in the current study, in samples obtained from naturally infected buffalo in two geographically distant locations; PCR amplicons of segments of these genes generated from DNA extracted directly from buffalo blood were used for sequencing. These studies generated larger sets of sequences and consequently revealed greater diversity but overall the observed patterns of diversity were very similar to those observed here, ranging from some highly polymorphic genes showing high ratios of non-synonymous:synonymous nucleotide substitution to others showing lower diversity and low ratios of non-synonymous:synonymous nucleotide substitution, with relatively conserved amino acid sequences (Hemmink et al. 2017). Our studies have identified major variation in the level of sequence diversity exhibited by the genes encoding different T. parva antigens recognized by parasite-specific CD8+ T cells. The extensive polymorphism seen in some antigens could be of great value for typing and differentiating populations of T. parva. Conversely, limited diversity is a most desirable feature of antigens that could potentially be used in a subunit vaccine against disease caused by T. parva. The epitopes recognized by CD8+ T cells from T. parva-immune cattle depend heavily on the class I MHC background of the individual animal. This adds to the complexity of the development of subunit vaccines and may well require the use of multiple antigens to consistently generate the desired immune responses. We and others have observed that the currently used Muguga cocktail live vaccine offers limited protection to cattle introduced into pasture co-grazed predominantly by buffalo (Bishop et al. 2015; Sitt et al. 2015). Early studies (Radley et al. 1979; Young, 1981) providing an indication that this is associated with greater antigenic diversity among buffalo-derived parasites have been supported by a subsequent study that showed much more extensive diversity in the amino acid sequences of the Tp1 and Tp2 CD8+ T cell antigens in buffalo isolates compared from those derived from cattle (Pelle et al. 2011). However, the studies reported here suggest that antigenic diversity is not a universal feature of the T. parva antigens identified to date, but rather that immune cattle can also respond to antigens that appear to be highly conserved among T. parva isolates, including those from cattle.
  27 in total

Review 1.  Designing bovine T cell vaccines via reverse immunology.

Authors:  Vishvanath Nene; Nicholas Svitek; Philip Toye; William T Golde; John Barlow; Mikkel Harndahl; Soren Buus; Morten Nielsen
Journal:  Ticks Tick Borne Dis       Date:  2012-01-09       Impact factor: 3.744

Review 2.  Disease research in the wildlife-livestock interface in Kenya.

Authors:  J G Grootenhuis; R O Olubayo
Journal:  Vet Q       Date:  1993-06       Impact factor: 3.320

3.  Isolation of Theileria parasites from African buffalo (Syncerus caffer) and characterization with anti-schizont monoclonal antibodies.

Authors:  P A Conrad; D A Stagg; J G Grootenhuis; A D Irvin; J Newson; R E Njamunggeh; P B Rossiter; A S Young
Journal:  Parasitology       Date:  1987-06       Impact factor: 3.234

4.  Infection of African buffalo (Syncerus caffer) and cattle with Theileria parva lawrencei after serial passage in cattle.

Authors:  J G Grootenhuis; A S Young; D A Stagg; B L Leitch; T T Dolan; P A Conrad
Journal:  Res Vet Sci       Date:  1987-05       Impact factor: 2.534

5.  Vaccines against Theileria parva.

Authors:  S Morzaria; V Nene; R Bishop; A Musoke
Journal:  Ann N Y Acad Sci       Date:  2000       Impact factor: 5.691

6.  Production objectives and management strategies of livestock keepers in south-east Kenya: implications for a breeding programme.

Authors:  J M Mwacharo; A G Drucker
Journal:  Trop Anim Health Prod       Date:  2005-11       Impact factor: 1.559

7.  Parasite strain specificity of precursor cytotoxic T cells in individual animals correlates with cross-protection in cattle challenged with Theileria parva.

Authors:  E L Taracha; B M Goddeeris; S P Morzaria; W I Morrison
Journal:  Infect Immun       Date:  1995-04       Impact factor: 3.441

8.  Characterization of the fine specificity of bovine CD8 T-cell responses to defined antigens from the protozoan parasite Theileria parva.

Authors:  Simon P Graham; Roger Pellé; Mat Yamage; Duncan M Mwangi; Yoshikazu Honda; Ramadhan S Mwakubambanya; Etienne P de Villiers; Evelyne Abuya; Elias Awino; James Gachanja; Ferdinand Mbwika; Anthony M Muthiani; Cecelia Muriuki; John K Nyanjui; Fredrick O Onono; Julius Osaso; Victor Riitho; Rosemary M Saya; Shirley A Ellis; Declan J McKeever; Niall D MacHugh; Sarah C Gilbert; Jean-Christophe Audonnet; W Ivan Morrison; Pierre van der Bruggen; Evans L N Taracha
Journal:  Infect Immun       Date:  2007-12-10       Impact factor: 3.441

9.  Limited genetic and antigenic diversity within parasite isolates used in a live vaccine against Theileria parva.

Authors:  Johanneke D Hemmink; William Weir; Niall D MacHugh; Simon P Graham; Ekta Patel; Edith Paxton; Brian Shiels; Philip G Toye; W Ivan Morrison; Roger Pelle
Journal:  Int J Parasitol       Date:  2016-04-11       Impact factor: 3.981

10.  CD8+ T-cell responses to Theileria parva are preferentially directed to a single dominant antigen: Implications for parasite strain-specific immunity.

Authors:  Niall D MacHugh; Timothy Connelley; Simon P Graham; Roger Pelle; Principia Formisano; Evans L Taracha; Shirley A Ellis; Declan J McKeever; Alison Burrells; W Ivan Morrison
Journal:  Eur J Immunol       Date:  2009-09       Impact factor: 5.532

View more
  6 in total

1.  Genetic and antigenic variation of the bovine tick-borne pathogen Theileria parva in the Great Lakes region of Central Africa.

Authors:  Gaston S Amzati; Appolinaire Djikeng; David O Odongo; Herman Nimpaye; Kgomotso P Sibeko; Jean-Berckmans B Muhigwa; Maxime Madder; Nathalie Kirschvink; Tanguy Marcotty
Journal:  Parasit Vectors       Date:  2019-12-16       Impact factor: 3.876

2.  Analysis of p67 allelic sequences reveals a subtype of allele type 1 unique to buffalo-derived Theileria parva parasites from southern Africa.

Authors:  Lubembe D Mukolwe; David O Odongo; Charles Byaruhanga; Louwtjie P Snyman; Kgomotso P Sibeko-Matjila
Journal:  PLoS One       Date:  2020-06-29       Impact factor: 3.240

3.  Systematic Determination of TCR-Antigen and Peptide-MHC Binding Kinetics among Field Variants of a Theileria parva Polymorphic CTL Epitope.

Authors:  Nicholas Svitek; Rosemary Saya; Houshuang Zhang; Vishvanath Nene; Lucilla Steinaa
Journal:  J Immunol       Date:  2022-01-14       Impact factor: 5.422

4.  Development of a Potential Yeast-Based Vaccine Platform for Theileria parva Infection in Cattle.

Authors:  Shan Goh; Jeannine Kolakowski; Angela Holder; Mark Pfuhl; Daniel Ngugi; Keith Ballingall; Kata Tombacz; Dirk Werling
Journal:  Front Immunol       Date:  2021-07-08       Impact factor: 7.561

5.  Transcriptomics reveal potential vaccine antigens and a drastic increase of upregulated genes during Theileria parva development from arthropod to bovine infective stages.

Authors:  Triza Tonui; Pilar Corredor-Moreno; Esther Kanduma; Joyce Njuguna; Moses N Njahira; Steven G Nyanjom; Joana C Silva; Appolinaire Djikeng; Roger Pelle
Journal:  PLoS One       Date:  2018-10-10       Impact factor: 3.240

6.  Capture-based enrichment of Theileria parva DNA enables full genome assembly of first buffalo-derived strain and reveals exceptional intra-specific genetic diversity.

Authors:  Nicholas C Palmateer; Kyle Tretina; Joshua Orvis; Olukemi O Ifeonu; Jonathan Crabtree; Elliott Drabék; Roger Pelle; Elias Awino; Hanzel T Gotia; James B Munro; Luke Tallon; W Ivan Morrison; Claudia A Daubenberger; Vish Nene; Donald P Knowles; Richard P Bishop; Joana C Silva
Journal:  PLoS Negl Trop Dis       Date:  2020-10-29
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.