| Literature DB >> 29703152 |
Daniel R Bogema1, Melinda L Micallef1, Michael Liu2, Matthew P Padula2, Steven P Djordjevic2, Aaron E Darling2, Cheryl Jenkins3.
Abstract
BACKGROUND: Theileria orientalis (Apicomplexa: Piroplasmida) has caused clinical disease in cattle of Eastern Asia for many years and its recent rapid spread throughout Australian and New Zealand herds has caused substantial economic losses to production through cattle deaths, late term abortion and morbidity. Disease outbreaks have been linked to the detection of a pathogenic genotype of T. orientalis, genotype Ikeda, which is also responsible for disease outbreaks in Asia. Here, we sequenced and compared the draft genomes of one pathogenic (Ikeda) and two apathogenic (Chitose, Buffeli) isolates of T. orientalis sourced from Australian herds.Entities:
Keywords: Genome; Phylogenomics; Theileria orientalis; Vaccine
Mesh:
Substances:
Year: 2018 PMID: 29703152 PMCID: PMC5921998 DOI: 10.1186/s12864-018-4701-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genome assembly and resequencing statistics
| Robertson | Fish Creek | Goon Nure | |
|---|---|---|---|
| Resequencing | |||
| % reads mapped | 96.9% | 75.2% | 78.0% |
| % host DNA | 0.072% | 1.34% | 0.069% |
| Genome % ref. coverage (≥ 14×) | 93.1% | 74.4% | 65.0% |
| CDS % ref. coverage (≥ 14×) | 96.5% | 85.3% | 78.3% |
| Mean coverage depth | 60.5 × | 43.5 × | 47.3 × |
| Assembly | |||
| # scaffolds | 639 | 1557 | 6043 |
| N50 | 54,695 | 12,243 | 2646 |
| % GC | 41.7% | 39.3% | 39.8% |
| Predicted genes | 3677 | 3604 | 4905 |
SNV variance of Robertson, Fish Creek and Goon Nure strains compared with Shintoku reference
| Robertson | Fish Creek | Goon Nure | |
|---|---|---|---|
| Total SNVs (Density SNV/kb) | 24,132 (2.68) | 788,412 (87.8) | 676,284 (75.3) |
| Within CDS SNVs (Density SNV/kb) | 16,974 (2.75) | 637,070 (103) | 565,947 (91.6) |
| % of genome (coverage adjusted) | 0.288% | 11.8% | 11.6% |
| % of CDS (coverage adjusted) | 0.28% | 12.1% | 11.7% |
Fig. 1Venn diagram showing number of unique and shared SNVs in the Robertson, Fish Creek and Goon Nure genotypes. Low numbers of Robertson SNVs reflect the use of the Shintoku whole genome sequence as a reference
Fig. 2Recombination analysis of T. orientalis Shintoku, Ikeda, Chitose and Buffeli genotypes. In total, 83 recombination events were detected using six recombination prediction methods (Geneconv, MaxChi, RDP, SisScan, 3Seq and BootScan). Shintoku, Robertson, Fish Creek and Goon Nure chromosomes (grey) are shown aligned with probable major parent sequences shown as coloured lines. Most likely major parent sources are shown by colour (see legend)
Fig. 3Mitochondrial genomes of T. orientalis Ikeda, Chitose and Buffeli. Grey areas represent areas of sequence identity; black lines represent areas of sequence divergence. The cox1, cob and cox3 genes are structurally conserved in all three genomes
Predicted surface proteins with High dN/dS
| Protein ID | Functional classification | Homolog in other piroplasmida | Pfam | Mean dN/dS |
|---|---|---|---|---|
| XP_009690939.1 | hypothetical protein | N | 0.420013 | |
| XP_009691911.1 | uncharacterized protein | Y | Pf04385 | 0.348927 |
| XP_009690040.1 | hypothetical protein | Y | 0.329911 | |
| XP_009690607.1 | hypothetical protein | Y | 0.326084 | |
| XP_009691340.1 | hypothetical protein | Y | 0.325941 | |
| XP_009689372.1 | hypothetical protein | Y | 0.322271 | |
| XP_009689430.1 | hypothetical protein | N | 0.310138 | |
| XP_009690004.1 | hypothetical protein | Y | 0.307715 | |
| XP_009690344.1 | ToLocg1 paralog | Y | 0.272704 | |
| XP_009690269.1 | hypothetical protein | Y | 0.236663 | |
| XP_009691913.1 | hypothetical protein | N | 0.230252 | |
| XP_009689555.1 | hypothetical protein | Y | 0.228835 | |
| XP_009689383.1 | hypothetical protein | N | 0.225034 | |
| XP_009688884.1 | CD8+ T cell target antigen Tp2 | Y | 0.222906 | |
| XP_009689185.1 | hypothetical protein | Y | 0.222693 | |
| XP_009692799.1 | hypothetical protein | N | Pf04385 | 0.220597 |
| XP_009692694.1 | hypothetical protein | Y | 0.217668 | |
| XP_009690569.1 | brain protein 44-like | Y | Pf03650 | 0.214623 |
| XP_009689733.1 | hypothetical protein | Y | 0.211825 | |
| XP_009690910.1 | hypothetical protein | N | 0.210374 | |
| XP_009689120.1 | hypothetical protein | Y | 0.198483 | |
| XP_009692522.1 | thrombospondin-related anonymous protein | Y | Pf00092 | 0.196277 |
| XP_009691139.1 | putative protease | Y | Pf02517 | 0.1867 |
| XP_009690803.1 | putative apicoplast import protein | Y | Pf16166 | 0.176391 |
| XP_009692438.1 | hypothetical protein | Y | 0.17456 | |
| XP_009692777.1 | uncharacterized protein | Y | Pf04385 | 0.173405 |
| XP_009692412.1 | hypothetical protein | Y | 0.16334 | |
| XP_009692816.1 | surface protein | Y | Pf04145 | 0.160208 |
| XP_009689868.1 | hypothetical protein | Y | 0.156548 | |
| XP_009692665.1 | hypothetical protein | N | 0.154705 | |
| XP_009689845.1 | major piroplasm surface protein | Y | Pf02488 | 0.146066 |
| XP_009690016.1 | hypothetical protein | Y | Pf04385 | 0.136429 |
| XP_009692660.1 | putative lysophospholipase | Y | Pf12146 | 0.129336 |
| XP_009692759.1 | hypothetical protein | Y | 0.127301 | |
| XP_009691480.1 | putative ER oxidoreductin | Y | Pf04137 | 0.120148 |
| XP_009689697.1 | uncharacterized protein | Y | 0.116339 | |
| XP_009691437.1 | hypothetical protein | Y | 0.113952 | |
| XP_009688850.1 | hypothetical protein | Y | 0.113259 | |
| XP_009690130.1 | hypothetical protein | Y | Pf08320 | 0.1113 |
| XP_009692225.1 | uncharacterized protein | Y | Pf07691 | 0.107945 |
| XP_009690580.1 | p23 surface protein | Y | 0.104459 | |
| XP_009691349.1 | zinc transport protein | Y | Pf02535 | 0.10384 |
| XP_009689509.1 | hypothetical protein | Y | 0.10243 | |
| XP_009689195.1 | hypothetical protein | Y | 0.102029 | |
| XP_009691696.1 | hypothetical protein | Y | 0.10029 | |
| XP_009689770.1 | uncharacterized protein | Y | Pf04385 | 0.099851 |
| XP_009690399.1 | 50S ribosomal protein L9 | Y | Pf01281 | 0.096548 |
| XP_009689290.1 | hypothetical protein | Y | Pf05450 | 0.095706 |
| XP_009689754.1 | uncharacterized protein | Y | 0.086991 | |
| XP_009688894.1 | 50S ribosomal protein L33 | Y | 0.0863757 |
Fig. 4a. Multilocus phylogeny of Piroplasmida whole genome sequences. The tree was inferred by maximum-likelihood using an alignment of 654 concatenated protein sequences from single copy genes. Plasmodium falciparum str. 3D7 was included as an outgroup. Labels indicated bootstrap support (%). b. Average nucleotide identities between genomes of the Theileria genus as calculated by the method described in [92]