| Literature DB >> 26403690 |
Martin Norling1, Richard P Bishop2, Roger Pelle3, Weihong Qi4, Sonal Henson5, Elliott F Drábek6, Kyle Tretina7, David Odongo8,9, Stephen Mwaura10, Thomas Njoroge11, Erik Bongcam-Rudloff12, Claudia A Daubenberger13,14, Joana C Silva15,16.
Abstract
BACKGROUND: There are no commercially available vaccines against human protozoan parasitic diseases, despite the success of vaccination-induced long-term protection against infectious diseases. East Coast fever, caused by the protist Theileria parva, kills one million cattle each year in sub-Saharan Africa, and contributes significantly to hunger and poverty in the region. A highly effective, live, multi-isolate vaccine against T. parva exists, but its component isolates have not been characterized. Here we sequence and compare the three component T. parva stocks within this vaccine, the Muguga Cocktail, namely Muguga, Kiambu5 and Serengeti-transformed, aiming to identify genomic features that contribute to vaccine efficacy.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26403690 PMCID: PMC4583173 DOI: 10.1186/s12864-015-1910-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of SNPs identified between each T. parva stock in the Muguga Cocktail and the reference Muguga strain
| Isolate | Total number of SNPs | Synonymous | Non-synonymous | Nonsense | Indels | Intron | Number of genes with non-syn SNPs |
|---|---|---|---|---|---|---|---|
| Serengeti-transformed | 957 | 291 | 420 | 1 | 0 | 121 | 53 |
| Kiambu5 | 39,296 | 20,086 | 8,587 | 16 | 748 | 7,615 | 1,708 |
| Muguga2 | 586 | 206 | 264 | 1 | 0 | 48 | 52 |
Fig. 1Single Nucleotide Polymorphisms (SNP) in each of the Muguga Cocktail component T. parva stocks relative to the reference T. parva Muguga isolate. a) Number of base pairs in the reference T. parva Muguga genome assembly with no read coverage from each of the three isolates. SNPs cannot be identified in this fraction of the genome, respectively 0.55 % (45,837 bp), 0.29 % (24,439 bp) and 5.61 % (466,092 bp) of the genome, in the comparisons with Muguga clone 2, Serengeti-transformed and Kiambu5. b) Identified SNPs unique to, and shared among, Muguga clone 2, Kiambu5, and Serengeti-transformed isolates
Fig. 2Phylogenetic relationships of twelve T. parva isolates. Genomic distances are measured in SNPs detected within the coding region of 200 randomly selected genes. Support was obtained from 500 bootstrap replicates. Represented isolates include T. parva stocks analyzed in this study, and the additional stocks described by Hayashida et al. in [10], namely the cattle-derived T. parva parasite stocks "Entebbe", "Katumba", "Nyakizu", "ChitongoZ2", "KateteB2", "KiambuZ464", and "MandaliZ22", as well as the African Cape buffalo-derived stocks "Lawrencei", and "Z5E5". Stocks are labeled according to country of origin: northern end of the distribution -Kenya, Uganda and Rwanda (dark green); north-central region –Tanzania (teal); central region –Zambia (blue). The three vaccine isolates are shown (bold-italics font)
Fig. 3Distribution of variants in the Serengeti-transformed and Kiambu5 isolates relative to the genome of the T. parva Muguga reference isolate. For each of the four T. parva Muguga chromosomes the location of the protein coding genes in shown (purple), with genes encoded in the forward and reverse strands in the top and bottom central tracks, respectively. Genes with non-synonymous differences between Serengeti and Muguga are shown in red. Known antigens are shown in green. Variants were estimated in 2Kb-long, non-overlapping windows; for each of the isolates, the proportion of the window with read coverage (orange; min-max = 0-100 % coverage) is displayed, as is the number of SNPs per window (blue; min-max = 0-32 SNPs/Kb)
Loci encoding well-characterized peptide products, which contain SNPs between the Serengeti-transformed stock and reference Muguga strain. Variants were also found in 47 hypothetical proteins (Additional file 2: Table S2)
| Locus Tag /2005 | Locus Tag IDa | Product Name | TM Domainsb | Protein Localization / Confidencec | Serengeti-transformed SNPsd | Kiambu5 SNPse | ||
|---|---|---|---|---|---|---|---|---|
| Syn | Nsyn | Syn | Nsyn | |||||
| TP01_0011 | TpMuguga_01g00011f | ABC transporter | 11 | _ / 2 | 9 | 11 | ≥66f | ≥14f |
| TP02_0016 | TpMuguga_02g00016f | ABC transporter | 13 | _ / 2 | 2 | 2 | ≥21f | ≥7f |
| TP02_0017 | TpMuguga_02g00017f | ABC transporter | 12 | _ / 3 | 19 | 17 | ≥11f | ≥8f |
| TP02_0940 | TpMuguga_02g00940f | ABC transporter | 12 | _ / 1 | 1 | ≥0f | ≥0f | |
| TP02_0951 | TpMuguga_02g00951f | ABC transporter | 11 | _ / 1 | 44 | 37 | ≥9f | ≥2f |
| TP03_0007 | TpMuguga_03g00007f | ABC transporter | 11 | _ / 1 | 7 | 4 | ≥11f | ≥9f |
| TP03_0864 | TpMuguga_03g00864 | ABC transporter | 13 | _ / 2 | 5 | 3 | 10 | 6 |
| TP04_0021 | TpMuguga_04g00021f | ABC transporter | 13 | _ / 1 | 10 | 24 | ≥43f | ≥23f |
| TP02_0449 | TpMuguga_02g00449 | DNA replication factor CDT1 like | 0 | _ / 2 | 4 | 1 | ||
| TP02_0448 | TpMuguga_02g00448 | F-box-like/WD repeat-containing protein TBL1XR1 | 0 | _ / 1 | 1 | 1 | ||
| TP03_0280 | TpMuguga_03g00280 | Papain family cysteine protease | 1 | _ / 2 | 12 | 7 | 1 | 1 |
| TP04_0278 | TpMuguga_04g00278 | Translation initiation factor IF-2 | 1 | S / 2 | 4 | 8 | 5 | |
| TP04_0279 | TpMuguga_04g00279 | Translation initiation factor IF-2 | 1 | S / 2 | 1 | 3 | ||
| TP04_0280 | TpMuguga_04g00280f | Translation initiation factor IF-2 | 1 | S / 1 | 1 | 3 | ≥2f | ≥2f |
aLocus Tag Identifiers according to the updated whole genome gene structural re-annotation
bNumber of transmembrane domains
cProtein localization predictions based on TargetP: C = chloroplast; M = mitochondrion; S = Secretory pathway; _ = Any other location. Prediction confidence: 1 = max; 5 = min
dSNPs between the Serengeti-transformed isolate and the reference Muguga isolate. Both synonymous (Syn) and non-synonymous (Nsyn) SNP counts are shown
eSNPs between Kiambu5 and the reference Muguga isolate
fThis genes coding sequence(s) had at least one position with <10x mapping coverage in the Kiambu5 stock, and thus there is likely more variation in the actual sequence
List of known T. parva antigens described in literature. No variants were found for Serengeti-transformed in any of these loci, so only Kiambu5 nucleotide sequence variants are reported
| Antigen | Locus Tag /2005 | Locus Tag /2014 | Description | Protein Localization/ Confidence | Domains | Kiambu5 variants | Ref. | ||
|---|---|---|---|---|---|---|---|---|---|
| Syn | Non-syn | Intron | |||||||
| Tp2 | TP01_0056 | TpMuguga_01g00056 | CD8+ T cell target antigen | 0 / S | - | 2 | 2 | [ | |
| gp34 | TP01_0939 | TpMuguga_01g00939 | Hypothetical protein TP01_0939 | 1 / S | 1 TM, 1 GPI | 16 | 6 | [ | |
| p32 | TP01_1056 | TpMuguga_01g01056 | Antibody target antigen 32 kDa surface protein | 0 / S | 1 GPI | 3 | 8 | [ | |
| Tp8 | TP02_0140 | TpMuguga_02g00140 | CD8+ T cell target cysteine proteinase | 1 / _ | 1 TM | 0 | 1 | [ | |
| X88 | TP02_0148 | TpMuguga_02g00148 | T cell target heat shock protein 70 | 0 / _ | - | 16 | 0 | [ | |
| Tp7 | TP02_0244 | TpMuguga_02g00244 | CD8+ T cell target antigen heat shock protein 90 | 0 / _ | - | 0 | 0 | [ | |
| Tp5 | TP02_0767 | TpMuguga_02g00767 | CD8 + T cell target antigen translation initiation factor eIF-1A | 0 / _ | - | 1 | 0 | 6 | [ |
| Tp9 | TP02_0895 | TpMuguga_02g00895 | CD8+ T-cell target antigen Tp9 | 0 / S | - | ? | ? | [ | |
| Tp4 | TP03_0210 | TpMuguga_03g00210 | CD8 + T cell target antigen T-complex protein 1 subunit eta | 0 / _ | - | 22 | 1 | 10 | [ |
| p67 | TP03_0287 | TpMuguga_03g00287 | Antibody target antigen p67 sporozoite surface protein | 1 / S | 1 TM, 1 GPI | 0 | 0 | [ | |
| Tp1 | TP03_0849 | TpMuguga_03g00849 | CD8 + T cell target antigen apical membrane antigen 1 | 1 / S | 1 TM | 1 | [ | ||
| p150 | TP03_0861 | TpMuguga_03g00861 | Antibody target antigen p150 microsphere protein | 0 / S | - | 0 | 0 | [ | |
| PIM | TP04_0051 | TpMuguga_04g00051 | Antibody target antigen polymorphic immunodominant molecule (PIM protein) | 3 / S | 3 TM | 0 | 0 | [ | |
| p104 | TP04_0437 | TpMuguga_04g00437 | Antibody target antigen 104 kDa rhoptry protein | 1 / S | 1 TM, 1 GPI | 3 | 13 | 2 | [ |
Fig. 4Distribution of πN and πN/πS for all genes in a comparison between two strains of T. parva. Histograms of Histograms of (a) the distribution of non-synonymous polymorphism (πN) and (b) the ratio of non-synonymous to synonymous polymorphism in a comparison between the sequence of Kiambu5 and the reference Muguga isolate. Show are the mean, as well as the mean + 3*SD which is the cut-off used for Additional file 5: Table S5
Loci encoding well-characterized peptide products, with the highest 5 % πN value between the Kiambu5 stock and the Muguga reference sequence. Variants in hypothetical proteins, and those not in the highest 5 % πN can be found in Additional file 3: Table S3
| Locus Tag / 2005 | Locus Tag IDa | Product Name | Domains | Protein Localization/ Confidencec | πNd | πN/πSd | Synd | Non-synd |
|---|---|---|---|---|---|---|---|---|
| TP01_0066 | TpMuguga_01g00066 | 50S ribosomal protein L33 | - | S/ 1 | 0.013 | 0.4313 | 3 | 3 |
| TP01_0248 | TpMuguga_01g00248 | tRNAHis guanylyltransferase | 1 TM | _/ 1 | 0.0148 | 0.4679 | 8 | 10 |
| TP01_0272, TP01_0273 | TpMuguga_01g00272 | Kinase binding protein CGI-121 | - | _/ 1 | 0.0161 | 0.3261 | 7 | 8 |
| TP01_0374 | TpMuguga_01g00374 | DEAD/DEAH box helicase | - | _/ 2 | 0.0131 | 0.3503 | 13 | 15 |
| TP01_0405 | TpMuguga_01g00405 | Asparagine synthetase domain-containing protein C4F6.11c | - | _/ 1 | 0.0121 | 0.4375 | 14 | 15 |
|
|
|
| 1 GPI | S/ 2 | 0.0125 | 1.1773 | 3 | 8 |
| TP01_1165 | TpMuguga_01g01165 | ATP-dependent RNA helicase SUV3 homolog mitochondrial | - | M/ 5 | 0.015 | 0.3049 | 25 | 23 |
| TP01_1173, TP01_1174 | TpMuguga_01g01174 | Ribonuclease P protein subunit p29 | - | _/ 3 | 0.0123 | 0.255 | 7 | 7 |
| TP01_0275, TP01_0273, TP01_0274, TP01_0272 | TpMuguga_01g02345 | Phosphatidylinositol 4-phosphate 5-kinase 7 | - | _/ 4 | 0.0142 | 3 | ||
| TP01_1063 | TpMuguga_01g02455 | Uncharacterized protein C18H10.09 | - | _/ 1 | 0.0132 | 0.66 | 2 | 4 |
| TP02_0369 | TpMuguga_02g00369 | Dual specificity protein phosphatase 12 | - | _/ 2 | 0.0142 | 0.3435 | 11 | 11 |
| TP03_0010 | TpMuguga_03g00010 | Archease protein family (MTH1598/TM1083) | - | _/ 1 | 0.0143 | 0.4618 | 6 | 8 |
| TP03_0595 | TpMuguga_03g00595 | Ydr279p protein family (RNase H2 complex component) | - | _/ 2 | 0.0187 | 0.5673 | 5 | 13 |
| TP03_0601 | TpMuguga_03g00601 | CobW/HypB/UreG nucleotide-binding domain | - | _/ 3 | 0.0127 | 0.4562 | 9 | 10 |
| TP03_0270 | TpMuguga_03g02310 | Protein CyaY | - | _/ 5 | 0.0117 | 0.2878 | 36 | 35 |
| TP04_0705 | TpMuguga_04g00705 | Maf-like protein | - | _/ 3 | 0.0115 | 0.5954 | 5 | 6 |
The previously identified antigen p32 is marked in bold
Locus Tag Identifiers according to the updated whole genome gene structural re-annotation
Number of transmembrane domains
Protein localization predictions based on TargetP: C = chloroplast; M = mitochondrion; S = Secretory pathway; _ = Any other location. Prediction confidence: 1 = max; 5 = min
SNPs between Kiambu5 and the reference Muguga isolate