Literature DB >> 35647418

In silico identification of Theileria parva surface proteins.

Nitisha Gurav1, Olivia J S Macleod2, Paula MacGregor3, R Ellen R Nisbet1.   

Abstract

East Coast Fever is a devastating African cattle disease caused by the apicomplexan parasite, Theileria parva. Little is known about the cell surface, and few proteins have been identified. Here, we take an in silico approach to identify novel cell surface proteins, and predict the structure of four key proteins.
© 2022 Published by Elsevier B.V.

Entities:  

Keywords:  Apicomplexa; East Coast Fever; Parasitology; Theileria

Year:  2022        PMID: 35647418      PMCID: PMC9133732          DOI: 10.1016/j.tcsw.2022.100078

Source DB:  PubMed          Journal:  Cell Surf        ISSN: 2468-2330


Introduction

Theileria parva is the causative agent of East Coast Fever (ECF) a lethal, tick-borne disease of cattle in sub-Saharan Africa. T. parva is an apicomplexan parasite, closely related to Plasmodium, the causative agent of malaria. Mortality levels vary from 3 to 80% depending on parasite strain and cattle breed, killing over one million cattle each year (Nene et al., 2016). The only drug licenced to treat T. parva is buparvaquone, which is over 30 years old. Although there is no reported resistance to buparvaquone in T. parva, there is rising levels of resistance in Theileria annulata, a related cattle pathogen (Mhadhbi et al., 2015). The main mechanism for control is cattle dipping and vaccination. However, the current vaccination model is an ‘infection and treat’ model. Large numbers of infected ticks are produced, ground up, and frozen. The tick residue is transported in liquid nitrogen and injected to the cattle to be vaccinated. As this causes disease, the cattle are then treated with high dose of tetracycline to prevent the infection taking hold (MacGregor et al., 2021, Nene et al., 2016). This model of vaccination has significant drawbacks: the transport of the vaccine in liquid nitrogen is impractical in the context of rural Africa, it requires the use of high levels of antibiotics, thus driving the rise of antibiotic resistance, it requires the presence of a veterinarian, which significantly increases costs, regional vaccines are needed vaccinated cattle remain lifelong asymptomatic carriers of disease and cannot be exported. A modern vaccine is urgently required (Nene and Morrison, 2016). As with many eukaryotic parasites, T. parva differentiate between a series of different life-stages, each characterised by distinct morphologies and patterns of gene expression. These include distinct protein composition of the parasite cell surface between life-stages. As putative vaccine targets there are two key life-stages of interest: (i) the free-living sporozoite stage, where the parasite is released from the tick’s salivary glands and passes into the cattle bloodstream, and (ii) the schizont stage, where T. parva causes cancer-like cell proliferation within the cattle lymphocytes. An ideal vaccine would target the sporozoite and/or schizont stages, giving rise to a CD4 and CD8 T cell immune response (Morrison et al., 2021). Previous works have used in silico analysis of gene sequences from the T. parva Mugaga strain genome and proteome to identify putative surface proteins, containing both a signal peptide and putative GPI-anchor addition sequence (Nyagwange et al., 2018a). These can be considered as highly likely to reside on the cell surface, and not internal membranes, as utilization of a GPI-anchor for membrane attachment is a surface-specific feature. Here, we sought to identify a longer candidate list of putative surface proteins, using a similar theoretical approach, but to instead identify proteins that are attached to the plasma membrane with one or more transmembrane domains.

Methodology

The T. parva Mugaga strain genome (causative agent of ECF) (Hayashida et al., 2013, Tretina et al., 2020) was computationally screened for all proteins with a putative signal peptide using SignalP 5.0 (Almagro Armenteros et al., 2019) plus one or more transmembrane domain using HMMer (Finn et al., 2011). 91 protein-encoding genes with a signal peptide and a transmembrane domain were identified, Fig. 1.
Fig. 1

Flow chart showing workflow to identify putative T. parva surface membrane proteins.

Flow chart showing workflow to identify putative T. parva surface membrane proteins. Inclusion of proteins that have putative transmembrane domains will inevitably include proteins that are internal (e.g. in the endosomal pathway) and so not exposed to the external environment. We therefore removed all proteins that were predicted to localise to the mitochondria or apicoplast (with bipartide targeting) using TargetP, Fig. 1 (Armenteros et al., 2019). Proteins with a putative GPI-anchor addition sequence were also excluded as these have been considered previously (Nyagwange et al., 2018a, Pierleoni et al., 2008). This led to a final long-list of 68 proteins, shown in Supplementary Table 1. These represent a maximal list of proteins of interest.
Table 1

Putative membrane proteins. Shortlist of putative membrane proteins with signal peptide, with experimental evidence for expression in Theileria parva sporozoites. * Sporozoite proteome: figure given is emPAI, a measure of the relative protein abundance in sample.

Gene nameGene IDProduct nameNumber of TM domainsTMs wo/signal peptideSize/kDaSporozoite proteome *
TpMuguga_01g00016XP_765543.1Vacuolar protein sorting/targeting protein 1011960.71
TpMuguga_01g00326XP_765853.1emp24/gp25L/p24 family/GOLD family protein11301.2
TpMuguga_01g00509XP_766029.1putative integral membrane protein11661.69
TpMuguga_01g00620XP_766141.1putative integral membrane protein101240.16
TpMuguga_01g00921HAD ATPase P-type IC family protein11121660.91
TpMuguga_01g00939XP_766460.1putative integral membrane protein10363.65
TpMuguga_01g01013XP_766534.1Sugar efflux transporter for intercellular exchange family protein77420.4
TpMuguga_01g01069XP_766590.1Sugar (and other) transporter family protein1211520.43
TpMuguga_01g01091XP_766612.1Tp12106412.02
TpMuguga_01g01195XP_766716.1emp24/gp25L/p24 family/GOLD family protein21241.15
TpMuguga_02g00330XP_764896.1emp24/gp25L/p24 family/GOLD family protein11241.15
TpMuguga_02g00538XP_765104.1Sel1 repeat family protein111771.34
TpMuguga_02g00543XP_765109.1putative integral membrane protein33540.3
TpMuguga_02g00602XP_765168.1Thioredoxin family protein112519.43
TpMuguga_02g02055putative integral membrane protein101174.39
TpMuguga_03g00168XP_763186.1putative integral membrane protein112694.18
TpMuguga_03g00175XP_763193.1putative integral membrane protein21362.68
TpMuguga_03g00264XP_763282.1S1/P1 Nuclease family protein22451.05
TpMuguga_03g00419XP_763440.1Thioredoxin family protein106233.82
TpMuguga_04g00068putative integral membrane protein101411.32
XP_763703.1putative integral membrane protein11271.39
TpMuguga_04g00399XP_764034.1putative integral membrane protein21417.78
TpMuguga_04g00649XP_764284.1GPI transamidase subunit Gpi1611680.62
TpMuguga_04g00668XP_764304.1unspecified product11460.83
TpMuguga_04g00917XP_764554.1SVSP family protein10600.48
Putative membrane proteins. Shortlist of putative membrane proteins with signal peptide, with experimental evidence for expression in Theileria parva sporozoites. * Sporozoite proteome: figure given is emPAI, a measure of the relative protein abundance in sample. Next, we looked for proteins with evidence of expression in the sporozoite. A whole-cell proteomic dataset is available for the T. parva sporozoite stage, providing evidence for 2007 proteins, representing about 50% of the total predicted genes (Nyagwange et al., 2018b). Of the 68 proteins of interest (Suppl. Table 1), a total of 30 were present in the proteome dataset, Table 1. To further refine our search, we removed all proteins predicted to contain a single transmembrane domain where it is located within the first 30 aa; these most likely represent false positive results, where the signal peptide domain is incorrectly identified as a containing a transmembrane domain. However, we retained TpMuguga_01g01091 (Tp12), a known antigen (Morrison et al., 2015). Following analysis of localization and function of homologous proteins in other species, and selection for proteins with the largest predicted external size, we selected the five most promising candidates, plus Tp12 (TpMuguga_01g01091). For each protein, we removed the signal peptide and predicted the structure using Phyre2, Swiss-model and AlphaFold (Jumper et al., 2021, Kelley et al., 2015, Waterhouse et al., 2018). The results are shown in Table 2. TpMuguga_03g00168 was too large for AlphaFold, giving rise to no hits in Phyre2 and no hits in Swiss-model. Modelling of TpMuguga_01g01091 (Tp12) gave rise to a disordered protein with little confidence in AlphaFold. The four protein structures that were able to be modelled are shown in Fig. 2.
Table 2

Short list of the six most promising membrane proteins. The results of structure prediction by AlphaFold, Phyre2 and Swiss-model are shown, together with the final model choice for modelling in Fig. 2.

Long list of IDsPredicted mature protein (aa#)ColabFold:AlphaFold2 usingMMseqs2Region (aa#)Average pLDDTPDB Phyre top Function top hithit (confidence %)Region (aa#)PDB Swissmodel top hit(GMQE)Function top hitRegion (aa#)Finalmodel
TpMuguga_01g0092118–1450not performed (>1000aa)7OP1 (100%)cation transportingATPase233–14497OP1 (0.41)cation transportingATPase172–1428Phyre2
TpMuguga_01g01013TpMuguga_01g01069TpMuguga_01g0109116–37920–474model model16–37920–47467.990.75XPD (100%)6RW3 (100%)sugar transporter sugar transporter147–36721–4655XPD (0.38) 6 M20no modelsugar transporter144–356AlphaFold2 AlphaFold2not used
sugar transporter23–472
23–568model23–56856.3no model
TpMuguga_02g00543 19–472model19–47261.5no modelno modelAlphaFold2
TpMuguga_03g00168 19–2361not performed (>1000aa)no modelno model
Fig. 2

Predicted protein structures. 01g00921 encodes a cation transporter ATPase (red; Phyre2 model shown), 01g01013 encodes a sugar transporter (green; AlphaFold2 model shown), the function of 02g00543 is unknown (orange) and 01g01069 encodes a sugar transporter (purple, modelled with AlphaFold2).

Short list of the six most promising membrane proteins. The results of structure prediction by AlphaFold, Phyre2 and Swiss-model are shown, together with the final model choice for modelling in Fig. 2. Predicted protein structures. 01g00921 encodes a cation transporter ATPase (red; Phyre2 model shown), 01g01013 encodes a sugar transporter (green; AlphaFold2 model shown), the function of 02g00543 is unknown (orange) and 01g01069 encodes a sugar transporter (purple, modelled with AlphaFold2). Two proteins (TpMuguga_01g01013 and TpMuguga_01g01069) are predicted sugar transporters, and TpMuguga_01g00921 is predicted to be a cation transporting ATPase. The remaining membrane protein TpMuguga_03g00168 has unknown function. All four have external facing regions that may offer target epitopes for novel vaccines. To test this, we utilized the epitope prediction tool NetMHC 4.0 which predicts peptide-MHC Class 1 binding for known proteins (Jurtz et al., 2017, Nielsen et al., 2018). Each of the four proteins were analyzed for the bovine BoLA-T2a allele, with peptide length between 8 and 14 aa. The results are shown in Table 3, giving the numbers of peptides identified as strong or weak binders. These indicate that these proteins are highly likely to contain epitopes that will be candidates for vaccination.
Table 3

Epitope and post-transcriptional modification. Epitope prediction was carried out using NetMHC 4.0, with the bovine BoLA-T2a antigen. The number of strong binding (SB) epitopes (with a rating < 0.5) and weak binding (WB) epitopes (with a rating < 2) is given for each protein. Identification of putative glycosylation was carried out using NetOGlyc 4.0, and phosphorylation sites by MusiteDeep.

ProteinSBWBglycosylationsitesphosphorylationSites
Tp01g0092143206527
Tp01g01013737210
Tp02g00543205377
Tp01g01069135110
Epitope and post-transcriptional modification. Epitope prediction was carried out using NetMHC 4.0, with the bovine BoLA-T2a antigen. The number of strong binding (SB) epitopes (with a rating < 0.5) and weak binding (WB) epitopes (with a rating < 2) is given for each protein. Identification of putative glycosylation was carried out using NetOGlyc 4.0, and phosphorylation sites by MusiteDeep. As many cell surface proteins contain post-translational modifications, we used bioinformatic tools to search for glycosylation (Steentoft et al., 2013) and phosphorylation sites (Wang et al., 2020), as shown in Table 3. These indicated that all four proteins are glycosylated, while two of the four also contain phosphorylation sites. It should be noted that the training datasets for post-transcriptional modification prediction software were not obtained from T. parva, so these results will need to be validated experimentally.

Conclusions

Recent advances in Plasmodium cell biology have identified numerous surface proteins which could be part of a future multi-valent subunit vaccine against malaria. In contrast, very few surface proteins have been characterized in T. parva. However, many surface proteins that have been identified give rise to neutralizing antibodies, suggesting that they would be potential vaccine targets (Musoke et al., 1992, Nyagwange et al., 2018a). In the absence of a proteome-based experimental approach to identify further surface proteins, a recent in silico study identified 21 putative GPI-anchored surface proteins (Nyagwange et al., 2018a). Of the six expressed GPI anchored surface proteins, four gave rise to sporozoite neutralizing antibodies. Here, we took an in silico approach to identify transmembrane domain proteins on the T. parva cell surface, thus further increasing the numbers of putative surface proteins for future vaccinology attempts. Of the six most promising candidate proteins, three (TpMuguga_01g01013, TpMuguga_01g00921 and TpMuguga_01g01069) are likely to be involved in nutrient or cation uptake. This is unsurprising, as the parasite must interact with the environment. Two of these proteins (TpMuguga_01g01069 and TpMuguga_01g00921) also have high levels of expression in the schizont stage, suggesting that in addition, they play a role in nutrient acquisition during this intracellular stage (Tonui et al., 2018). The schizont stage must have very high requirement for glucose, due to the rapid rate of cell proliferation. In Plasmodium, chemical inhibition of a key hexose transporter suppresses the growth of the parasite (Jiang et al., 2020). As TpMuguga_01g01069 is the only sugar transporter predicted to have a signal peptide and thus a surface localization in T. parva, it may also be a good drug target. A fourth protein (TpMuguga_03g00168) is a clear membrane protein with no predicted function. An experimental based approach will be required to obtain structures of TpMuguga_03g00168 and TpMuguga_01g01091 (Tp12). Epitope prediction software suggests that these four proteins may be immunogenic, with predicted strong binding peptides by NetMHC (Jurtz et al., 2017). Although laboratory-based methodologies will be required to test these predictions, the results suggest that these proteins have potential as vaccine targets. One of the challenges of T. parva research is the absence of a system for stable genetic manipulation, so it is not possible to confirm if these four proteins are essential, nor confirm the localization of proteins with tagging. An alternative approach would be to take a whole-cell spatial proteomic localization technique which would give localization information for all cellular proteins (Lundberg and Borner, 2019). An analysis of the related apicomplexan parasite Toxoplasma using hyperLOPIT (hyperplexed localisation of organelle proteins by isotope tagging) identified 110 integral surface proteins and 71 peripheral surface proteins (Barylyuk et al., 2020). A similar proteomic-based approach of T. parva would transform our understanding of this parasite, especially at the cell surface, and provide a massive leap forward in the quest to develop a modern T. parva vaccine. NG was funded by a University of Nottingham Developing Solutions Masters Scholarship and by the J N Tata endowment, India.

CRediT authorship contribution statement

Nitisha Gurav: Methodology, Investigation. Olivia J.S. Macleod: Methodology, Investigation. Paula MacGregor: Conceptualization, Methodology, Writing – review & editing. R. Ellen R. Nisbet: Conceptualization, Methodology, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  26 in total

1.  MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization.

Authors:  Duolin Wang; Dongpeng Liu; Jiakang Yuchi; Fei He; Yuexu Jiang; Siteng Cai; Jingyi Li; Dong Xu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

Review 2.  Spatial proteomics: a powerful discovery tool for cell biology.

Authors:  Emma Lundberg; Georg H H Borner
Journal:  Nat Rev Mol Cell Biol       Date:  2019-05       Impact factor: 94.444

3.  SignalP 5.0 improves signal peptide predictions using deep neural networks.

Authors:  José Juan Almagro Armenteros; Konstantinos D Tsirigos; Casper Kaae Sønderby; Thomas Nordahl Petersen; Ole Winther; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Biotechnol       Date:  2019-02-18       Impact factor: 54.908

4.  HMMER web server: interactive sequence similarity searching.

Authors:  Robert D Finn; Jody Clements; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2011-05-18       Impact factor: 16.971

5.  The Phyre2 web portal for protein modeling, prediction and analysis.

Authors:  Lawrence A Kelley; Stefans Mezulis; Christopher M Yates; Mark N Wass; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2015-05-07       Impact factor: 13.491

6.  Improved Prediction of Bovine Leucocyte Antigens (BoLA) Presented Ligands by Use of Mass-Spectrometry-Determined Ligand and in Vitro Binding Data.

Authors:  Morten Nielsen; Tim Connelley; Nicola Ternette
Journal:  J Proteome Res       Date:  2017-11-14       Impact factor: 4.466

7.  Transcriptomics reveal potential vaccine antigens and a drastic increase of upregulated genes during Theileria parva development from arthropod to bovine infective stages.

Authors:  Triza Tonui; Pilar Corredor-Moreno; Esther Kanduma; Joyce Njuguna; Moses N Njahira; Steven G Nyanjom; Joana C Silva; Appolinaire Djikeng; Roger Pelle
Journal:  PLoS One       Date:  2018-10-10       Impact factor: 3.240

8.  Re-annotation of the Theileria parva genome refines 53% of the proteome and uncovers essential components of N-glycosylation, a conserved pathway in many organisms.

Authors:  Kyle Tretina; Roger Pelle; Joshua Orvis; Hanzel T Gotia; Olukemi O Ifeonu; Priti Kumari; Nicholas C Palmateer; Shaikh B A Iqbal; Lindsay M Fry; Vishvanath M Nene; Claudia A Daubenberger; Richard P Bishop; Joana C Silva
Journal:  BMC Genomics       Date:  2020-04-03       Impact factor: 3.969

9.  A Comprehensive Subcellular Atlas of the Toxoplasma Proteome via hyperLOPIT Provides Spatial Context for Protein Functions.

Authors:  Konstantin Barylyuk; Ludek Koreny; Huiling Ke; Simon Butterworth; Oliver M Crook; Imen Lassadi; Vipul Gupta; Eelco Tromer; Tobias Mourier; Tim J Stevens; Lisa M Breckels; Arnab Pain; Kathryn S Lilley; Ross F Waller
Journal:  Cell Host Microbe       Date:  2020-10-13       Impact factor: 21.023

10.  Highly accurate protein structure prediction with AlphaFold.

Authors:  John Jumper; Richard Evans; Alexander Pritzel; Tim Green; Michael Figurnov; Olaf Ronneberger; Kathryn Tunyasuvunakool; Russ Bates; Augustin Žídek; Anna Potapenko; Alex Bridgland; Clemens Meyer; Simon A A Kohl; Andrew J Ballard; Andrew Cowie; Bernardino Romera-Paredes; Stanislav Nikolov; Rishub Jain; Demis Hassabis; Jonas Adler; Trevor Back; Stig Petersen; David Reiman; Ellen Clancy; Michal Zielinski; Martin Steinegger; Michalina Pacholska; Tamas Berghammer; Sebastian Bodenstein; David Silver; Oriol Vinyals; Andrew W Senior; Koray Kavukcuoglu; Pushmeet Kohli
Journal:  Nature       Date:  2021-07-15       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.