| Literature DB >> 21559495 |
Roger Pelle1, Simon P Graham, Moses N Njahira, Julius Osaso, Rosemary M Saya, David O Odongo, Philip G Toye, Paul R Spooner, Anthony J Musoke, Duncan M Mwangi, Evans L N Taracha, W Ivan Morrison, William Weir, Joana C Silva, Richard P Bishop.
Abstract
BACKGROUND: Theileria parva causes an acute fatal disease in cattle, but infections are asymptomatic in the African buffalo (Syncerus caffer). Cattle can be immunized against the parasite by infection and treatment, but immunity is partially strain specific. Available data indicate that CD8(+) T lymphocyte responses mediate protection and, recently, several parasite antigens recognised by CD8(+) T cells have been identified. This study set out to determine the nature and extent of polymorphism in two of these antigens, Tp1 and Tp2, which contain defined CD8(+) T-cell epitopes, and to analyse the sequences for evidence of selection. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 21559495 PMCID: PMC3084734 DOI: 10.1371/journal.pone.0019015
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Neighbour-joining tree showing relationships among 79 cattle and buffalo-derived T. parva isolates.
These data are based on the sequences of 35 Tp1 gene alleles obtained in this study. T. annulata (TA17450 Strain_Ank) was used as an outgroup to root the tree. Isolates with their corresponding alleles in parentheses are indicated and colour-coded based on their geographical location in Kenya. Isolates from outside Kenya (Tanzania, Uganda, Zambia and Zimbabwe) are depicted in black. The TP03_0849 gene from the T. parva (Muguga) genome sequence was also included in the analysis. Bootstrap values >50% are shown above branches. A Major clade and minor clade with two sub-clusters is indicated.
Figure 2Multiple amino acid sequence alignment of the 30 Tp1 antigen variants obtained in this study.
(A) The single letter amino acid code is used throughout. Variants named Var1-30. Residue coordinates are listed above the alignment. The single currently identified CD8 T-cell target epitope (coordinates 40–50 bolded and boxed) and the positions of the two indels are indicated. The two polymorphic residues in the T-cell epitope are indicated by arrows. A predicted recombination breakpoint between amino acid residues 80 and 81 is indicated with a broken vertical lane. The frequency of each variant is indicated in square brackets, when larger than 1. Residues conserved in all sequences are identified below the alignment (*). The flanked PCR primers regions not included in estimations of % residues conserved are shadowed. The distribution of selective constraints across the Tp1-encoded peptide, according to model M8, is shown for cattle (B) and buffalo (C) parasites. The stacked bar chart above each amino acid residue shows the probability that the degree of selective constraint on that residue falls into each of the several ω classes indicated by the color legend.
Figure 3Neighbour-joining tree showing relationships among 80 cattle and buffalo-derived T. parva isolates.
These data are based on the sequences of 43 Tp2 alleles obtained in this study. The Tp2 homologous from T. annulata (TA19865_Strain_Ank) was used as an outgroup to root the tree. Isolates with their corresponding alleles in parentheses are indicated and colour-coded based on their geographical location in Kenya. Isolates from outside Kenya (Tanzania, Uganda, Zambia and Zimbabwe) are shown in black Bootstrap values >50% are shown above branches. Two major clades and sub-clusters within these are indicated.
Figure 4Multiple amino acid sequence alignment of the 41 full-length Tp2 antigen variants obtained in this study.
(A) The single letter amino acid code is used throughout. Variants named Var1-41. Residue coordinates are listed above the alignment. The defined epitopes that are recognized by CD8 T cells (bolded and Boxed) are overlined and numbered from 1 to 6. The conserved amino acid residues in the epitopes are indicated by arrows. The frequency of each variant is indicated in square brackets, when larger than 1. Conserved residues are identified below the alignment (*). (†) denotes the two predicted recombination breakpoints, the first at the junction of epitope 2 and 3 (between residues 48 and 49) and the second at the C-terminal margin of epitope 5 (between residues 106 and 107). The flanked PCR primers regions not included in estimations of % residues conserved are shadowed. The distribution of selective constraints across the Tp2-encoded peptide, according to model M8, is shown for parasite isolates from cattle (B) and buffalo (C); legend as in Fig. 2.
Tp2 CTL epitopes variants obtained in this study.
| Epitope Variants | Epitope 1 (Tp227–37) (25 variants) | Epitope 2 (Tp240–48) (21 variants) | Epitope 3 (Tp249–59) (23 variants) | Epitope 4 (Tp296–104) (19 variants) | Epitope 5 (Tp298–106) (19 variants) | Epitope 6 (Tp2138–147) (18 variants) |
| V1 | SHEELKKLGML 1, 5,11,14 | DGFDRDALF 1, 2, 5, 11, 14, 17, 22, 24, 28, 32, 35 | KSSHGMGKVGK 1, 2, 5, 14, 17, 24, 28, 32, 35 | FAQSLVCVL 1, 2, 5, 11, 14, 17, 24, 32, 35 | QSLVCVLMK 1, 2, 5, 11, 14, 24, 32, 35 | K |
| V2 | SDEELNKLGML 2, 34 | PDLDKNRLF 3, 4 | LTSHGMGRIGR 3 | FAASIKCVA 3 | ASIKCVAQY 3 | K |
| V3 | SDDELDTLGML 3 | DGFEKERLF 6, 8, 10, 15, 27, 39, 40 | LTSHGMGKIGR 4 | LAASIKCVS 4 | ASIKCVSHH 4 | K |
| V4 | SDNELDTLGLL 4 | PDPVKERLF 7 | KTSHSMGMIGK 6, 10, 27, 39, 40 | FGQSVVCVL 6, 10, 27, 39, 40 | QSVVCVLMR 6, 10, 27, 39, 40 | K |
| V5 | SDEELKKLGML 6, 10, 17, 24, 27, 32, 35, 39 | EGLDKDALF 9 | LTSKAMTTVGK 7 | FAQSIKCVS 7, 37 | QSIKCVSQH 7, 37 | KTNIPNPCDW 9 |
| V6 | SDEELESLGML 7 | EGFDKEALF 12 | RSSHGMGKVGK 8, 15 | FVESILCVI 8, 15 | ESILCVIKK 8, 15 | K |
| V7 | SDEELKKMGML 8, 15 | PGFDKEVLF 13, 23 | KSSKGMGKVGK 9 | FAQSIKCVA 9 | QSIKCVAQH 9, 25, 41 | N |
| V8 | SNEELKKLGMV 9, 12 | PDPDKQRLF 16 | KSSHGMGEVGK 11 | FGQSIKCVA 12, 25, 41 | QSIKCVAQK 12 | K |
| V9 | SDDELKKMGLI 13, 23 | SDPDRETLF 18 | KSSKGMTKVGK 12 | FGQSIKCVV 13, 23, 30, 36 | QSIKCVVQK 13, 23, 30, 36 | TSDVPNPCEW 18, 20 |
| V10 | SHSELETLGML 16 | PDPDKETLF 19, 20 | KSSKAMTTTGK 13, 23 | YAQSIYCVA 16 | QSIYCVANN 16, 19 | KGDVPNPCDW 21, 25, 41 |
| V11 | TPPELEALGRL 18 | EGFDREALF 21 | LTSKAMSTVGK 16 | FGASIQCVV 18, 20 | QSLVCVLMN 17 | K |
| V12 | SHEELKKMGML 19 | EGLDMEALF 25, 41 | KTSKAMTMTGR 18, 19, 20 | CFAQSIYCVA 19 | ASIQCVVKN 18, 20 | KGDAPNPCTW 29 |
| V13 | TEEELKRMGML 20 | DGFDKELLF 26 | KTSNGMTKVGK 21 | FGQSLKCVL 21 | QSLKCVLQH 21 | KEDVPNPCEW 30, 36 |
| V14 | TSEELKKLGMV 21 | DGFDRQRLF 29 | KSSHGMGKVGR 22 | FVESIMCVI 22, 28 | ESIMCVIKK 22, 28 | KPKIPNPCDW 31 |
| V15 | SDDELRKLGML 22, 28 | DNLDKDKLF 30 | KTSKGMTKVGR 25, 41 | FAQSIYCVV 26, 29 | QSIYCVVKN 26, 29 | TSDIPNPCKW 33 |
| V16 | TNEELKKLGMV 25, 41 | PHPDKERLF 31 | KSSKGMTAVGK 26, 38 | FAASIHCVS 31 | ASIHCVSNK 31 | KDKIKNPCDY 34 |
| V17 | NDDELKRMGMV 26 | DGFDRELLF 33 | LTSKSMSEVGR 29 | LAQSIVCVV 33 | QSIVCVVSK 33 | KQSILNPCNW37 |
| V18 | SDEELKELGML 29 | PDFEKEKLF 34 | MTSKAMTATGR 30 | FAQSLLCFL 34 | QSLLCFLQN 34 | KEDVPNPCKW 39 |
| V19 | TEDELKKLGMV 30 | QDPDKETLF 36 | KTSKGMTEVGK 31 | FAKSIKCVS 38 | KSIKCVSQH 38 | |
| V20 | NDEELENLGML 31 | PDPNKERLF 37 | KSSHGMGKIGR 33 | |||
| V21 | TVEELREMGMV 33 | PGFDKELLF 38 | AASHGLGKVGK 34 | |||
| V22 | SDDELNKLGML 36 | KTSKAMTATGR 36 | ||||
| V23 | TDEELENLGML 37 | LTSKSMMTVGK 37 | ||||
| V24 | TEDELKNMGLI 38 | |||||
| V25 | SDEDLKKLGML 40 |
Note: Numbers following the epitope sequences are the corresponding antigen alleles (see Table S1) carrying the epitopes. Up to 8 different amino acid residues (in bold and underlined) are used in the second amino acid position in epitope 6.
Evolution of the Tp1 locus: likelihood value (lnL) and parameter estimates for six evolution models implemented.
| Model | lnL |
| Estimated parameter values |
|
| |||
| M0 (one-ratio) | −620.2986 | 0.91 |
|
| M1a (nearly neutral) | −619.0294 | 0.55 |
|
| M2 (positive sel.) | −617.4723 | 1.07 |
|
|
| |||
| Positively selected sites (BEB): 124, 133, 138 | |||
| M3 |
| 1.07 |
|
|
| |||
| Positively selected codons (NEB): 49 | |||
| M7 (beta) | −619.0489 | 0.50 |
|
| M8 (beta & | −617.4725 | 1.07 |
|
|
| |||
| Positively selected sites (BEB): 49, 50, 124, 127, 130, 131, 132, 133, 134, 138 | |||
|
| |||
| M0 (one-ratio) | −892.7489 | 0.49 |
|
| M1a (nearly neutral) | −876.3387 | 0.29 |
|
| M2 (positive sel.) | −870.9605 | 0.59 |
|
|
| |||
| Positively selected sites (BEB): 29, 63, 110, 118 | |||
|
|
| 0.58 |
|
|
| |||
| Positively selected codons (NEB): 29, 63, 110, 118 | |||
| M7 (beta) | −876.3474 | 0.30 |
|
| M8 (beta & | −870.9584 | 0.58 |
|
|
| |||
| Positively selected sites (BEB): 29 |
Sites models of evolution. Nomenclature as in Yang et al. (2000). Model with highest likelihood for each locus is shown in italics.
Weighted average of d N/d S across all sites.
Parameters in parenthesis are inferred (probabilities add up to 1); all others are estimated from the data. Significance levels indicate probability that ω>1 for the site:
*p≥0.95;
**p≥0.99 (no asterisks signifies p<95%). Significance levels for M3 is based on a Naïve Empirical Bayes (NEB) analysis, while that of models M2 and M8 is based in a Bayes Empirical Bayes (BEB) analysis (Yang, Wong, and Nielsen 2005).
Evolution of the Tp2 locus: likelihood values (lnL) and parameter estimates for the evolutionary models implemented.
| Modela | lnL |
| Estimated parameter valuesc |
|
| |||
| M0 (one-ratio) | −1213.7784 | 0.45 |
|
| M1a (nearly neutral) | −1205.9230 | 0.45 |
|
| M2 (positive sel.) | −1201.4097 | 0.86 |
|
|
| |||
| Positively selected sites (BEB): 83, 105, 126, 130, 165* | |||
| M3 |
| 1.29 |
|
|
| |||
| Positively selected sites (NEB): 29, 34, 39, 40, 46, 49, 66, 69, 74, 77, 81, 83, 86, 91, 98, 101, 104, 105, 106, 122, 126, 130, 165 | |||
| M7 (beta) | −1206.8250 | 0.47 |
|
| M8 (beta & | −1201.5758 | 0.78 |
|
|
| |||
| Positively selected sites (BEB): 83, 105, 126, 130, 164** | |||
|
| |||
| M0 (one-ratio) | −6194.3698 | 0.50 |
|
| M1a (nearly neutral) | −5989.1350 | 0.54 |
|
| M2 (positive sel.) | −5974.5287 | 0.68 |
|
|
| |||
| Positively selected sites (BEB): 46, 73, 77, 83, 91*, 118, 130*, 139**, 146, 149, 165 | |||
|
|
| 0.57 |
|
|
| |||
| Positively selected sites (NEB): 20, 34, 38*, 39, 40, 42, 46**, 56, 65, 68, 72, 73**, 74, 77**, 78, 82, 83*, 86, 91**, 94, 101*, 104, 105, 118*, 130**, 139**, 146**, 148, 149**, 165** | |||
| M7 (beta) | −5976.5388 | 0.46 |
|
| M8 (beta & | −5962.3507 | 0.57 |
|
|
| |||
| Positively selected sites (BEB): 20, 34, 38, 39, 42, 46*, 56, 65, 69, 73, 74, 77*, 78, 82, 83, 86, 91*, 94, 101, 104, 118, 130*, 139**, 146, 149, 165 |
Note: Legend as in Table 2.
Figure 5Principal component analysis of Tp2 allelic diversity.
This diagram illustrates the relationship between buffalo-derived (Mara) and buffalo-associated, cattle-derived isolates (Marula), suggesting the isolates may belong to a single population. The proportion of variation in the dataset explained by the 1st and 2nd principal components is indicated in parenthesis.