| Literature DB >> 31842995 |
Gaston S Amzati1,2,3, Appolinaire Djikeng4,5, David O Odongo4,6, Herman Nimpaye7, Kgomotso P Sibeko8, Jean-Berckmans B Muhigwa9, Maxime Madder8, Nathalie Kirschvink10, Tanguy Marcotty10.
Abstract
BACKGROUND: Theileria parva causes East Coast fever (ECF), one of the most economically important tick-borne diseases of cattle in sub-Saharan Africa. A live immunisation approach using the infection and treatment method (ITM) provides a strong long-term strain-restricted immunity. However, it typically induces a tick-transmissible carrier state in cattle and may lead to spread of antigenically distinct parasites. Thus, understanding the genetic composition of T. parva is needed prior to the use of the ITM vaccine in new areas. This study examined the sequence diversity and the evolutionary and biogeographical dynamics of T. parva within the African Great Lakes region to better understand the epidemiology of ECF and to assure vaccine safety. Genetic analyses were performed using sequences of two antigen-coding genes, Tp1 and Tp2, generated among 119 T. parva samples collected from cattle in four agro-ecological zones of DRC and Burundi.Entities:
Keywords: Agro-ecological zones; East Coast fever; Evolutionary dynamics; Muguga cocktail vaccine; Population genetics; Rhipicephalus appendiculatus; Tick-borne diseases; Tp1; Tp2
Mesh:
Substances:
Year: 2019 PMID: 31842995 PMCID: PMC6915983 DOI: 10.1186/s13071-019-3848-2
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Fig. 1Map of the Great Lakes region showing sampling sites and their altitudes in the four agro-ecological zones of DRC and Burundi. Sampling sites located in the lowlands are indicated by squares (AEZ1: altitude < 1200 m), while circles indicate the sampling locations in midlands (AEZ2: altitude 1200–1800 m); and triangles denote sites located in the highlands (AEZ3: altitude 1600–2800 m)
PCR oligonucleotide primers used for amplification of p104, Tp1 and Tp2 genes with their corresponding annealing temperatures and amplicon sizes
| Gene locus | Primer name | Primer sequence (5′–3′) | Annealing temperature (°C) | Amplicon size (bp) | Reference |
|---|---|---|---|---|---|
| p104 | IL3231 (Fw1) | ATTTAAGGAACCTGACGTGACTGC | 60 | 496 | [ |
| IL755 (Rev1) | TAAGATGCCGACTATTAATGACACC | ||||
| IL4243 (Fw2) | GGCCAAGGTCTCCTTCAGAATACG | 55 | 277 | [ | |
| IL3232 (Rev2) | TGGGTGTGTTTCCTCGTCATCTGC | ||||
| ATGGCCACTTCAATTGCATTTGCC | 50 | 432 | [ | ||
| TTAAATGAAATATTTATGAGCTTC | |||||
| TGCATTTGCCGCTGATCCTGGATTCTG | 55 | 405 | [ | ||
| TGAGCTTCGTATACACCCTCGTATTCG | |||||
| ATGAAATTGGCCGCCAGATTA | 50 | 525 | [ | ||
| CTATGAAGTGCCGGAGGCTTC | |||||
| ATTAGCCTTTACTTTATTATTTWCATTYTAC | 54 | 504 | [ | ||
| CTATGAAGTGCCGGAGGCTTCTCCT |
Abbreviations: Fw1, forward outer; Fw2, forward inner; Rev1, reverse outer; Rev2, reverse inner
Tp1 and Tp2 sequence diversity in cattle-derived T. parva from RDC and Burundi
| Gene locus | AEZ | Sample size | Nucleotide sequences | Amino acid sequences | |||
|---|---|---|---|---|---|---|---|
| Polymorphic sitesa | No. of allelesb | Nucleotide diversity ± SD | Polymorphic sitesa | No. of antigen variantsb | |||
| DRC AEZ1 | 31 | 12 + ind | 7 | 0.008 ± 0.002 | 7 + ind | 5 | |
| DRC AEZ2 | 27 | 12 + ind | 8 | 0.007 ± 0.002 | 7 + ind | 6 | |
| DRC AEZ3 | 25 | 0 | 1 | 0 | 0 | 1 | |
| Burundi AEZ1 | 33 | 4 | 3 | 0.005 ± 0.0004 | 3 | 3 | |
| Overall | 116 | 14 + ind | 11 | 0.005 ± 0.0007 | 7 + ind | 6 | |
| DRC AEZ1 | 25 | 166 | 7 | 0.13 ± 0.025 | 82 | 7 | |
| DRC AEZ2 | 20 | 165 | 5 | 0.17 ± 0.016 | 81 | 5 | |
| DRC AEZ3 | 23 | 2 | 2 | 0.002 ± 0.0004 | 2 | 2 | |
| Burundi AEZ1 | 28 | 175 | 6 | 0.16 ± 0.017 | 85 | 5 | |
| Overall | 96 | 181 | 10 | 0.14 ± 0.01 | 88 | 9 | |
aThe insertion region (ind) was excluded for the determination of the number of polymorphic sites and the nucleotide diversity
bAlleles represent distinct nucleotide sequences diverged at least by one substitution (Additional file 4: Figure S1, Additional file 8: Figure S2), while antigen variants represent predicted distinct protein sequences (Figs. 2, 4)
Abbreviations: AEZ, agro-ecological zone; ind, indels; SD, standard deviation
Fig. 2Multiple amino acid sequence alignment of six Tp1 antigen variants in 116 T. parva samples obtained from DRC and Burundi. Antigen variants are named var-1 to var-34. The single letter amino acid code is used. The antigen variants nomenclature used in this study was first proposed by Pelle et al. [36]. Variants var-1 and var-3 were first described by Pelle et al. [36] and var-31 by Salih et al. [37]. The numbers in square brackets behind variants names indicate the number of T. parva isolates represented by each variant. The single previously identified T. parva CD8+ T cell target epitope is bolded and boxed. The polymorphic residues in the T cell epitope are coloured in red. Conserved amino acid residues are denoted by (*) below the alignment, and dashes (–) denote insertion region. Nested PCR primers used for sequencing are shaded and boxed in flanked regions. The Muguga sequence (GenBank: JF451936) was used as the reference sequence; it represents the other component of the Muguga cocktail vaccine (Serengeti-transformed and Kiambu 5) that are identical to Muguga strain sequence for the Tp1 locus. Tp1 antigen variant var-1 is found in the three Muguga vaccine strains. Corresponding gene alleles and sample characteristics are presented in Additional file 3: Table S3
Fig. 4Multiple amino acid sequence alignment of nine Tp2 antigen variants detected in 96 T. parva samples from DRC and Burundi. Amino acids are denoted by the single-letter codes. Var-1 to var-59 are variant names. The antigen variants nomenclature used in this study was first initiated by Pelle et al. [36]. Antigen variants var-1 and var-2 were described in Pelle et al. [36] and Salih et al. [37] and are, respectively, Muguga (identical to Serengeti-transformed) and Kiambu-5 strains. Reference sequences component of the Muguga cocktail live vaccine are represented by Muguga (GenBank: JF451856), Serengeti (Serengeti-transformed, GenBank: JF451862) and Kiambu-5 (GenBank: JF451880). The numbers in square brackets behind variants names indicate the number of T. parva samples represented by each variant. The six previously described epitopes (epitope1-6), that are the target of the bovine CD8+ T cells immune responses are bolded and boxed. The conserved amino acid residues in the epitopes are coloured in red. The star (*) below the alignment indicates positions of conserved amino acid residues. The shaded and boxed flanked regions denote the inner primers used for sequencing. Tp2 Antigen variants var-1 and var-2 are found in Muguga/Serengeti and Kiambu-5 strains, respectively. Corresponding gene alleles and sample characteristics are presented in Additional file 3: Table S3
Fig. 3Tp2-based sliding-window plot of Tajima’s D statistics (a) and nucleotide diversity (b) of T. parva sequences from the Great Lakes region. A window length of 100 nucleotides and a step size of 25 bp were used. The maximum nucleotide diversity and Tajima’s D values are observed between the nucleotide positions 200 and 300, containing the Tp2 epitopes 4 and 5. Abbreviation: Ep1-6, epitope1-6
Tp1 and Tp2 CD8+ T cell epitope variants identified in cattle-derived T. parva from DRC and Burundi
| Epitope 1 (Tp220–30) | Epitope 2 (Tp233–41) | Epitope 3 (Tp242–52) | Epitope 4 (Tp289–97) | Epitope 5 (Tp291–99) | Epitope 6 (Tp2131–140) | |
|---|---|---|---|---|---|---|
| VGYPKVKEEII (var-3, 31, 34) | ||||||
| VGYPKVKEEMI (var-33) | KSSHGMGKVGR (var-56, 57, 59) | |||||
| SDNELDTLGLL (var-54) | PDLDKNRLF (var-53, 54) | LTSHGMGKIGR (var-54) | LAASIKCVS (var-54) | ASIKCVSHH (var-54) | KPSVPNPCDW (var-53, 54) | |
| SDDELDTLGML (var-53) | LTSHGMGRIGR (var-53) | FAASIKCVA (var-53) | ASIKCVAQY (var-53) | |||
Notes: Epitope variants were identified using the reference amino acid positions presented in Additional file 2: Table S2. Numbers in brackets following the epitope sequences correspond to antigen variants carrying the epitopes (Figs. 2, 4). Epitope variants described for the first time are in italic and those found in the Muguga cocktail vaccine are in bold. Tp2 antigen variants var-1 and var-2 are found in Muguga (identical with Serengeti-transformed) and Kiambu-5 strains, respectively. Tp1 antigen variant var-1 is found in Muguga (identical with Serengeti-transformed and Kiambu-5)
Abbreviations: Var-1 to var-59, antigen variant names
Fig. 5Neighbor-Joining tree showing phylogenetic relationships among the 61 Tp2 gene alleles described in Africa (A01–A63). Tp2 gene alleles obtained from cattle in the present study are indicated by black diamonds. Theileria parva alleles found in cattle with no association with buffalo and in laboratory stocks are coloured in blue, while buffalo-derived and buffalo-associated alleles are depicted in Red. Bootstrap values (> 50%) are shown above branches. The Tp2 homologous sequence of T. annulata (GenBank: TA19865) was used as the outgroup. The numbers in brackets behind allele names denote the number of T. parva isolates carrying the allele. The detailed Tp2 alleles distribution and their corresponding populations/AEZs are presented in Additional file 10: Table S8. Tp2 allele A01 corresponds to isolates identical to Muguga and Serengeti-transformed strains, while Tp2 allele A02 represents isolates identical to Kiambu-5 strain
Fig. 6Median-joining network representing the phylogeographical distribution of Tp2 alleles of T. parva from cattle in sub-Saharan Africa. Each circle represents a unique allele, with colours depicting the proportion of individuals from different populations sharing the allele. Black nodes represent hypothetical unsampled alleles (or median vectors). Numbers in brackets on connecting lines indicate mutational steps between alleles. The detailed Tp2 alleles distribution and their corresponding populations/AEZs are presented in Additional file 10: Table S8. Tp2 allele A01 corresponds to samples identical to Muguga and Serengeti-transformed strains and Tp2 allele A02 represents samples identical to Kiambu-5 strain. CD, cattle-derived samples (from Kenya); LS, laboratory samples (ILRI) [36]
Pairwise estimates of genetic distance among nine T. parva populations using F statistic for nucleotide sequences of Tp2
| Country | Population | DRC AEZ1 | DRC AEZ2 | DRC AEZ3 | Burundi AEZ1 | BA | BD | CD | South Sudan | LS |
|---|---|---|---|---|---|---|---|---|---|---|
| DRC | DRC AEZ1 | – | 0.008 | < 0.001 | 0.2 | < 0.001 | 0.001 | 0.4 | < 0.001 | 0.4 |
| DRC AEZ2 | 0.17 | – | < 0.001 | 0.06 | < 0.001 | 0.004 | < 0.001 | < 0.001 | 0.07 | |
| DRC AEZ3 | 0.20 | 0.58 | – | < 0.001 | < 0.001 | < 0.001 | 0.04 | 0.3 | < 0.001 | |
| Burundi | Burundi AEZ1 | 0.02 | 0.08 | 0.3 | – | < 0.001 | 0.008 | 0.1 | < 0.001 | 0.3 |
| Kenya | BA | 0.17 | 0.19 | 0.39 | 0.15 | – | 0.3 | < 0.001 | < 0.001 | < 0.001 |
| BD | 0.17 | 0.17 | 0.45 | 0.14 | 0.002 | – | < 0.001 | < 0.001 | 0.009 | |
| CD | − 0.01 | 0.28 | 0.13 | 0.06 | 0.22 | 0.23 | – | 0.002 | 0.1 | |
| South Sudan | South Sudan | 0.29 | 0.69 | − 0.003 | 0.41 | 0.52 | 0.59 | 0.19 | – | < 0.001 |
| Laboratory samples | LS | − 0.02 | 0.08 | 0.35 | 0.009 | 0.15 | 0.14 | 0.04 | 0.48 | – |
Notes: F values below the diagonal and P-values above the diagonal; The genetic differentiation was considered as low (F between 0–0.05), intermediate (F between 0.05–0.15), great (F between 0.15–0.25) and very great (F < 0.25). The sample sizes (number of sequences) used in each population are shown in (Additional file 9: Table S7, Additional file 10: Table S8)
Abbreviations: AEZ, agro-ecological zones; BD, buffalo-derived; LS, laboratory samples; CD, cattle-derived
Tp2-based demographic structure and natural selection analyses of T. parva populations
| Country | Population | Sample size | Tajima’s | Fu and Liʼs | Fu and Liʼs |
|---|---|---|---|---|---|
| DRC | DRC AEZ1 | 25 | 1.2 | 0.97 | 0.94 |
| DRC AEZ2 | 20 | 2.5 | 1.8** | 2.1** | |
| DRC AEZ3 | 23 | 1.1 | 0.84 | 1.1 | |
| Burundi | Burundi AEZ1 | 28 | 2.2 | 1.9** | 2.1** |
| 68 | − 1.2 | 1.9** | 0.6 | ||
| 28 | 3.6** | 1.6** | 2.6** | ||
| Overall (DRC and Burundi) | 96 | 2.5 | 2.3** | 2.3** | |
| Kenya | CD | 22 | 0.17 | 1.8** | 1.4 |
| South Sudan | South Sudan | 65 | − 2.5** | − 2.7* | − 3.1* |
| Laboratory samples | LS | 17 | 2.7** | 1.7** | 2.3** |
*P < 0.05, **P < 0.01
Abbreviations: AEZ, agro-ecological zone; LS, laboratory samples; CD, cattle-derived