Literature DB >> 34336254

Immunoinformatic Analysis of Calcium-Dependent Protein Kinase 7 (CDPK7) Showed Potential Targets for Toxoplasma gondii Vaccine.

Ali Taghipour1, Sanaz Tavakoli2, Mohamad Sabaghan3, Masoud Foroutan4, Hamidreza Majidiani5, Shahrzad Soltani4, Milad Badri6, Ali Dalir Ghaffari1, Sheyda Soltani4.   

Abstract

Apicomplexan parasites, including Toxoplasma gondii (T. gondii), express different types of calcium-dependent protein kinases (CDPKs), which perform a variety of functions, including attacking and exiting the host cells. In the current bioinformatics study, we have used several web servers to predict the basic features and specifications of the CDPK7 protein. The findings showed that CDPK7 protein has 2133 amino acid residues with an average molecular weight (MW) of 219085.79 D. The aliphatic index with 68.78 and grand average of hydropathicity (GRAVY) with -0.331 score were estimated. The outcomes of current research showed that the CDPK7 protein included 502 alpha-helix, 1311 random coils, and 320 extended strands with GOR4 method. Considering the Ramachandran plot, the favored region contains more than 92% of the amino acid residues. In addition, evaluation of antigenicity and allergenicity showed that CDPK7 protein has immunogenic and nonallergenic nature. The present research provides key data for more animal-model study on the CDPK7 protein to design an efficient vaccine against toxoplasmosis in the future.
Copyright © 2021 Ali Taghipour et al.

Entities:  

Year:  2021        PMID: 34336254      PMCID: PMC8295510          DOI: 10.1155/2021/9974509

Source DB:  PubMed          Journal:  J Parasitol Res        ISSN: 2090-0023


1. Introduction

Toxoplasma gondii is a prevalent intracellular protozoan, which can infect a broad spectrum of mammals (i.e., human) and birds [1, 2]. Oocysts are the potential infective form in the life cycle of the parasite. Feline species as the only definitive hosts can contaminate the environment by shedding unsporulated oocysts through feces [3]. T. gondii is transferred by water/vegetables contaminated via mature oocysts and consumption of raw or semicooked meat from infected animals, vertical transmission from infected pregnant mothers to neonates, and blood transfusion [4-7]. Approximately one-third of human society has been exposed to T. gondii, worldwide [5, 8, 9]. Often T. gondii infection among immunocompetent people is asymptomatic or demonstrates mild symptoms, whereas in immunocompromised patients, it can cause a various range of clinical symptoms [6, 9, 10]. Toxoplasmosis in immunocompromised subjects can cause repeated attacks in the brain and manifests as encephalitis [11]. Moreover, toxoplasmosis in pregnant women can cause blindness, microcephaly, and mental retardation in the infant [6, 12]. Different factors, such as host's immune system status, genetic background, age, gender, contact with infected cats, environmental conditions, and diet and cultural habits, as well as the protozoan genotype, can affect the morbidity and mortality rate of Toxoplasma infection [13, 14]. Today, treatment of toxoplasmosis with conventional drugs can just limit the proliferation of tachyzoites at the beginning of infection, while these drugs cannot eradicate cystic forms of parasites in host tissue [15, 16]. In addition, taking these medications in pregnant women can have serious side effects, such as the possibility of teratogenic effects on the fetus [17]. Hence, the discovery and design of an effective vaccine to control and prevent toxoplasmosis is very important, especially in humans and domestic animals. In this regard, various in silico-based studies suggest various antigens as suitable candidates for vaccine design [18-32]. Calcium-dependent protein kinases (CDPKs) are a class of serine/threonine kinases that express in apicomplexans, ciliates, and plants [33]. In T. gondii as a member of the apicomplexan parasites, several CDPKs have been identified involving in critical functions in the different stages of the life cycle of parasite, including gliding motility (surface translocation), entry into (invasion), and exit from (egress) of host cells [34]. The CDPK7 is a crucial enzyme for division, growth, and maintenance of structural integrity of the Toxoplasma centrosome. As a result, TgCDPK7 knockdown is suggested as an important goal in achieving the right vaccine [35]. Computer-aided evaluation of different T. gondii proteins involved in various stages of life cycle can open new doors towards recognizing potent vaccine candidates through identification of highly immunogenic, nonallergenic, and nontoxic B- and T-cell epitopes [36]. Thereby, the present in silico study was performed to evaluate the crucial biochemical features and immunogenic epitopes of the CDPK7 protein by means of different bioinformatics servers.

2. Methods

2.1. CDPK7 Sequence

For this purpose, ToxoDB online website was used to obtain the whole amino acid sequence of T. gondii CDPK7 protein.

2.2. Physicochemical Characterization

We used the Expasy ProtParam online server to predict the physicochemical parameters of CDPK7 [37].

2.3. Prediction of Posttranslational Modification (PTM) Sites

The NetPhos 3.1 online tool was applied to predict phosphorylation location, and the CSS-Palm online server was applied to predict acylation location of the CDPK7 [38, 39].

2.4. Transmembrane Domains and Subcellular Location

The transmembrane regions and subcellular localization of T. gondii CDPK7 protein were assessed utilizing the TMHMM 2.0 and PSORT II web servers, respectively [38].

2.5. Secondary and Tertiary Structures

In this study, we employed the Garnier-Osguthorpe-Robson 4 (GOR4) online tool to forecast the secondary structure of CDPK7 protein [40]. Consequently, the three-dimensional (3D) model structures was used by SWISS-MODEL [38, 41].

2.6. The 3D Modeled Structure Refinement and Validation

GalaxyRefine was selected to develop and refine the quality of the template-based protein prediction [42]. To the Ramachandran plot validated the 3D structure of the protein, the SWISS-MODEL software was applied [43]. ProSA-web was used for evaluation of the whole quality of the model [44].

2.7. Linear and Conformational B-Cell Epitopes

We used a web-based Bcepred server to predict continuous B-cell epitopes exploiting physicochemical characteristics [45]. An online server of ABCpred was applied to predict B-cell epitopes [46]. Using the immune epitope database (IEDB), hydrophilicity [47], Bepipred linear epitope prediction [48], antigenicity [49], surface accessibility [50], beta-turn [51], and flexibility [52] were predicted. Afterwards, discontinuous B-cell epitopes were appraised by ElliPro [53] from the 3D structure of protein epitopes.

2.8. MHC-I and MHC-II Epitopes

To this aim, we used the IEDB website to evaluate the half-maximal inhibitory concentration (IC50) values of peptides that bind to the main histocompatibility complex (MHC) class I and class II molecules for CDPK7 [54, 55]. All predicted epitopes were then evaluated in terms of antigenicity using the VaxiJen v2.0 server.

2.9. Cytotoxic T-Lymphocyte (CTL) Epitopes

We applied CTLpred online website according to 75.8% accuracy [56]. Next, all predicted epitopes were evaluated regarding antigenicity using the VaxiJen v2.0 server.

2.10. Antigenic and Allergenic Profiles

The antigenicity of the full CDPK7 sequence was estimated by VaxiJen v2.0 [57]. The allergenic profile of CDPK7 was predicted by the AllergenFP v1.0 and AllerTOP v2.0 servers [58, 59].

3. Results

3.1. General Information of CDPK7

The amino acid structure of CDPK7 was obtained from the ToxoDB server with accession no. TGME49_228750. Based on the ProtParam database, the CDPK7 protein entails of 2133 amino acid residues with molecular weight of 219085.79 D, whereas theoretical pI was 5.79. The overall number of negatively (Asp+Glu) charged residues was 209, and positively (Arg+Lys) charged residues was 178. There are a total number of 30441 atoms. The half-life of the CDPK7 was predictable at 30 hours, >20 hours, and >10 hours for mammalian (in vitro), yeast (in vivo), and Escherichia coli (in vivo), respectively. In addition, the instability index of the CDPK7 protein presented an unstable nature with a value of 53.28. In addition, the aliphatic index was calculated 68.78, and GRAVY of the protein was estimated -0.331.

3.2. PTM Sites of CDPK7 Protein

In the present research, the results exhibited that 269 phosphorylation sites (Thr: 64, Tyr: 13, and Ser: 192) (Figures 1(a) and 1(b)) and 30 acylation sites (Table 1) were recognized in the CDPK7, representing that the CDPK7 sequence is composed of 299 possible PTM sites.
Figure 1

NetPhos server output for CDPK7 phosphorylation sites. (a) The number of predicted sites, based on S (serine), T (threonine), and Y (tyrosine); (b) prediction diagram of CDPK7 phosphorylation sites.

Table 1

The acylation sites of CDPK7 sequence.

IDPositionPeptideScore
TGME49_228750 CDPK7 (T. gondii)34STQLSKECLKQYLKK1.129
TGME49_228750 CDPK7 (T. gondii)109FLIGIAVCCRGTKSD1.996
TGME49_228750 CDPK7 (T. gondii)110LIGIAVCCRGTKSDR5.494
TGME49_228750 CDPK7 (T. gondii)187QNLFSPQCQRTPQNG0.526
TGME49_228750 CDPK7 (T. gondii)222DEEDTGSCGSNSNFP5.293
TGME49_228750 CDPK7 (T. gondii)244YPEAALVCVSDFVPS3.693
TGME49_228750 CDPK7 (T. gondii)321SLSDVFQCFSPFDHA0.984
TGME49_228750 CDPK7 (T. gondii)524SSEASVICPQGGISP2.536
TGME49_228750 CDPK7 (T. gondii)706VDKIIEECEFFEHGK0.403
TGME49_228750 CDPK7 (T. gondii)736ILSMFTECLHEEVWG1.821
TGME49_228750 CDPK7 (T. gondii)1298AHDDPLACSGHSPRD5.591
TGME49_228750 CDPK7 (T. gondii)1309SPRDLYSCPNCCNPL1.744
TGME49_228750 CDPK7 (T. gondii)1312DLYSCPNCCNPLLLC8.015
TGME49_228750 CDPK7 (T. gondii)1313LYSCPNCCNPLLLCP7.05
TGME49_228750 CDPK7 (T. gondii)1319CCNPLLLCPFCHSRY2.719
TGME49_228750 CDPK7 (T. gondii)1322PLLLCPFCHSRYPQL2.865
TGME49_228750 CDPK7 (T. gondii)1340EGRVVMECRQCGRLG2.295
TGME49_228750 CDPK7 (T. gondii)1343VVMECRQCGRLGGSS2.929
TGME49_228750 CDPK7 (T. gondii)1395DRVEAGICVGGSSRV5.406
TGME49_228750 CDPK7 (T. gondii)1406SSRVFTRCWHCGWEL0.108
TGME49_228750 CDPK7 (T. gondii)1409VFTRCWHCGWELSKC1.424
TGME49_228750 CDPK7 (T. gondii)1416CGWELSKCAEMLKGN4.272
TGME49_228750 CDPK7 (T. gondii)1474GFMFLEGCYVELLSE1.626
TGME49_228750 CDPK7 (T. gondii)1649TVYYLHKCGIVHRDL1.164
TGME49_228750 CDPK7 (T. gondii)1683DFGLSTLCAPNEVLH1.6
TGME49_228750 CDPK7 (T. gondii)1693NEVLHQPCGTLAYVA1.927
TGME49_228750 CDPK7 (T. gondii)1828GEERTMACCPEVPTF4.139
TGME49_228750 CDPK7 (T. gondii)1829EERTMACCPEVPTFT7.362
TGME49_228750 CDPK7 (T. gondii)2080PSLAAPGCSDLSSAS3.862
TGME49_228750 CDPK7 (T. gondii)2110ARQDERACGTPAEVP6.173

3.3. Transmembrane Domains and Subcellular Location

Based on the TMHMM output, no transmembrane domain was found for CDPK7 (Figures 2(a) and 2(b)). Moreover, by PSORT II, the CDPK7 subcellular site was predicted as follows: 78.3% nuclear, 8.7% cytoplasmic, 8.7% plasma membrane, and 4.3% cytoskeletal.
Figure 2

Transmembrane domains expected in CDPK7 protein. (a) Some statistics and a list of the location of the predicted transmembrane helices and the predicted location of the intervening loop regions. Length: the length of the protein sequence; number of predicted TMHs: the number of predicted transmembrane helices; Exp number of AAs in TMHs: the expected number of amino acids in transmembrane helices. If this number is larger than 18, it is very likely to be a transmembrane protein (or have a signal peptide); Exp number, first 60 AAs: the expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein. If this number is more than a few, you should be warned that a predicted transmembrane helix in the N-term could be a signal peptide; total prob of N-in: the total probability that the N-term is on the cytoplasmic side of the membrane; (b) transmembrane domains expected in CDPK7 protein. (b) Analysis of the transmembrane domains of CDPK7.

3.4. Secondary and Tertiary Structures

The secondary structure of CDPK7 was predicted via the GOR4 online server, suggesting 502 alpha-helix, 320 extended strands, and 1311 random coils (Figures 3(a) and 3(b)). Moreover, the SWISS-MODEL analysis is shown in Figures 4(a)–4(d).
Figure 3

(a) GOR4 server results suggested that CDPK7 encompasses 502 alpha-helix, 320 extended strands, and 1311 random coils in secondary structure; (b) graphical result of the secondary structure prediction of CDPK7 using the GOR4 online server.

Figure 4

SWISS-MODEL server output. (a) Computed 3D model; (b) global quality estimate; (c) comparison with nonredundant set of PDB structures; (d) local quality estimate.

3.5. Refinement and Validation of Tertiary Structure

Protein validation by means of the SWISS-MODEL server displayed that 92.86% of residues were situated in favored regions and 1.65% in the outlier regions. According to the Ramachandran plot, there were 97.80% residues in the favored region with 0.27% residues in the outlier regions of the refined model (Figure 5).
Figure 5

Validation of 3D model of CDPK7 protein. (a) The Z-score plot for 3D structure of predicted protein before and after refinement with ProSA-web server, respectively; (b) Ramachandran plot analysis of predicted structure.

3.6. Predicted Linear and Discontinuous B-Cell Epitopes of the CDPK7 Protein

The predicted linear B-cell epitopes by the Bcepred are listed in Table 2. The outputs of the ABCpred server are tabulated in Table 3 (only the epitopes over scores of 0.75 are embedded in Table 3). The higher peptide score proposes the greater chance of being an epitope. The present server estimated 124 epitopes over 0.75 scores on the sequence, in which the linear epitope SSPPGTPASVVSPAAGAGPI (score: 0.95) had the greatest score. Four epitopes with over 0.95 scores were as follows: “SSPPGTPASVVSPAAGAGPI,” “EVPQAAQPSKGPTKSAMLLQ,” “GGVSPPPQVPPVVVRAASPR,” and “GETVSKRLLFANSAKEQREW.” The average score of antigenicity, beta-turn, flexibility, hydrophilicity, Bepipred linear epitope prediction, and surface accessibility for the CDPK7 protein using the IEDB online server was 1.026, 1.042, 1.017, 2.396, 0.350, and 1.00, respectively (Figure 6). Five discontinuous B-cell epitopes were predicted using the ElliPro server (Table 4).
Table 2

Epitopes predicted in CDPK7 protein by different parameters based on the Bcepred online server.

Prediction parameterEpitope sequence
HydrophilicityGAGGGAGGAG; KKFDSDEVEV; KGSGSVDYEE; CRGTKSDRM; AQQAHSEGSNSVGRGSHGGKEEEQNL; SPQCQRTPQNGGSSGTAGA; SPGNLDDEDDEEDTGSCGSNSN; SLDSTSSNERPRER; EQEASSSEGYGRSFDEESSGASSYSS; DHASRNP; GPEAPGQEAPGT; SSPTGEQTGAP; SASSPAGGS; DRPAGAGTGAE; RPAAGDDGDSSAGPAGGASGESAAKGAEKSPKTGT; SQQPRGG; STQSSSTQGAPGS; SGGGGSRP; PSRQSSEASV; SPRAETQENETGLEE; GEGATPGGDAGREASQKA; AGTGRGSGPLEEDEAQGNG; QPSKGPTKSA; QAEKDKTRQEQAKKNPS; IKEEKEENEQKDV; GSGREGGSGKV; SSRTSSAS; GKAGSPSSSRVGG; TNPAHSSPRRPTRD; QATGSSGAASA; ARGSGAQ; GGAGPENAGA; ETTSQASQHQTGPSGPSSPP; GVEPKQE; AGGGAGSETQPA; ASGSSPAA; SEGPATT; DPTTAGA; EETRAAGG; QGPPDGRGSAGDKV; IGEEEGERMSGSGDARDDDVYER; DSRPAPS; SGASGPGSGA; ASPSEGASAR; ARAHDDP; SGHSPRD; SSSSLSDAGAQP; AGTGANSGAGGASGSADPSGGPGAEEDRVEAG; KGNSEAA; RRKGDAKPRG; SEQVGGRQ; KGETVSKR; ANSAKEQRE; DKGKING; TDRTPNAT; EVSSSAKD; VNNGSKNID; DEVRHSTRYGEERT; AASSPSS; DPGAPTS; AARTEGDTGPVEG; DEVPESG; GGESVSDAA; AGRGEVD; TRGQGQGSTASG; TLQDGSEGRR; AEAGSPG; SSASSGTQRRGTEEPEAEPARQDERACGT; GSPGGPS
FlexibilityLAFSTQL; QYLKKFD; KALSARSPG; QKFDFKGSGS; AVCCRGTKSD; QQAHSEGSNSVGRGSHGGKEEE; QCQRTPQNGGSSGTAGAVSSSPGNLDDEDDEEDTGSCGSNS; SGLSLDSTSSNERPR; ARLEQEASSSE; RSFDEESSGA; FDHASRNPSPP; GTVSSPTGEQ; PAALSSRP; VSASSPAG; PAAGDDGDSS; GPAGGASG; SAAKGAEKSPKTGTLSQQPRGGITKTA; SAIKRTFSTQSSSTQ; PPVRGFSGGGGSRP; SVLPSRQSSE; NAPPPGSG; PIVPTSSG; PRAETQENE; GDAGREAS; FAAGTGRGS; QAAQPSKGPT; LQAEKDKTRQEQAKKNP; SLIKEEKEENEQ; ALYRSTSVQSRPS; KDPLGSGREGGSG; SKLLSSRTSSASFSSRGMGKAGSPSSSRV; NPAHSSPRRPT; APQPSRLSSSPQMQATGSS; PAARGSG; SQHQTGPSGPSSPP; TVAGGGAGSET; ASVASGSS; AVQGPPDGRGSAG; PVIGEEEGERMSGSGDA; RHWEQSR; DFIRSSH; AELPRDSR; GALSGASGPG; LACSGHSP; RQCGRLGGSSSSL; TGANSGAGGASGSADPSGG; GICVGGSS; AEMLKGNS; YYYRRKGD; VHPKGETVS; LFANSAKEQ; GKINGHE; ILLTDRT; AVWREVSSSA; RMLQPNPR; TAVVNNGS; APAASSPS; TPIRPFS; PGVSLSG; PAARTEGD; SGASLGGESV; VDLTRGQGQGSTA; LTLQDGSEGRRM; PAAGSKSV; SDLSSASSGTQRRGTE; VPAGSPGGPS
AccessibilitySTQLSKECLKQYLKKFDSDEV; VLKKVYKAL; PGIDKETFLQY; GERLFQKFDFKG; CRGTKSDRMYV; SDGYIQKSEL; NLPNLDRYMSIRKAQQAHSEG; SHGGKEEEQNLFSPQCQRTPQNGG; PGNLDDEDDEEDTGS; DSTSSNERPRERLKPYEPHPL; ARLEQEASSSEGYGRSFDEESS; DHASRNPSPPRRVSAQQPTH; PEAPGQE; SSPTGEQTGAPP; PPPVDRPA; QASPHAR; AKGAEKSPKTGTLSQQPRGGITKTASRFTSAIKRTFSTQSSSTQG; LPSRQSSEAS; SPPPQVPP; ASPRAETQENETGLEE; GREASQKA; GPLEEDEAQGN; PQAAQPSKGPTKSA; LQAEKDKTRQEQAKKNPSPVAQSLIKEEKEENEQKDVLD; FKTWLERNEG; YRSTSVQSRPSRLTA; REGGSGKVFRRSKLLS; GSPSSSR; NMQHFQKVKH; PAHSSPRRPTRDLDPATPAPQPSRLSSSPQMQ; ETTSQASQHQTGPS; GVEPKQEVTV; EETRAAG; QGPPDGRGSAGDKVVSEE; GEEEGERMS; SGDARDDDVYERIAGYRHWEQSRMSPQ; RSSHQSL; SAELPRDSRPAPSRG; RAQLPYREGELRQAD; ARAHDDPL; SGHSPRDLYS; CHSRYPQLTL; PGAEEDRVEA; VLYKKGKHLHQWQARYY; NMLYYYRRKGDAKPRGF; EQVGGRQ; KGETVSKRL; ANSAKEQREWVDT; RVATKQQALEQ; IHRATNELY; KVIDKGKINGHERELLRSE; RLLNHPN; KELLDTKETLY; LIQQNHRLPEL; VHRDLKPENI; LTDRTPNAT; MEGYNHQ; HPSFYENTPVS; RMLQPNPRRRITV; VNNGSKNID; SQLDEVRHSTRYGEERTMA; IPKNGAKPLQNHG; RPFSEST; PPAARTEGDTG; QDGSEGRRMTSAT; SKSVPSPS; SSGTQRRGTEEPEAEPARQDERAC; PSPSIEE
TurnsSCGSNSNFPG; DSTSSNE; YSCPNCCNPL; LLNHPNVI
Exposed surfaceKECLKQYLKKFDSDE; VLKKVYKAL; RGTKSDR; GKEEEQNL; DDEDDEEDT; SSNERPRERLKPYEPHP; RNPSPPRR; AEKSPKTG; LQAEKDKTRQEQAKKNPSPV; SLIKEEKEENEQKDV; KVFRRSK; QHFQKVK; SPRRPTRDL; EPKQEVT; VLYKKGKHLH; LYYYRRKGDAKPR; NSAKEQREWV; KQQALEQ; KVIDKGK; KELLDTK; HRDLKPEN; LQPNPRRRIT; QRRGTEE; EPARQDER; EEVHK
PolarityLSKECLKQYLKKFDSDEVEVLKK; GERLFQKF; RGTKSDR; RYMSIRKAQQAH; RGSHGGKEEEQNLF; GNLDDEDDEEDTGS; SSNERPRERLKPYEPHP; LARLEQEAS; GRSFDEES; AEKSPKTG; PRAETQENETGLEE; GREASQKA; GPLEEDEAQG; LQAEKDKTRQEQAKKNP; QSLIKEEKEENEQKDVLDVE; VDKIIEECEFFEHGKLSF; EFKTWLERNEGIL; FTECLHEEVWGL; REGGSGKVFRRSKLLS; QHFQKVKHLFT; AHSSPRRPTRDLD; GVEPKQEVT; AFVEETRAAG; DKVVSEE; IGEEEGERMSGSGDARDDDVYERIAGYRHWEQSRM; HSAELPRDSR; LPYREGELRQA; IARAHDDPL; EGRVVMECRQCGR; PGAEEDRVEAG; ELSKCAE; LYKKGKHLHQWQ; LYYYRRKGDAKPR; IVHPKGETVSKRL; NSAKEQREWVD; EQLGHGK; VYKGIHRATNEL; KVIDKGKINGHERELLRSEM; KELLDTKE; ELVRGGE; QNHRLPELHVNRI; HKCGIVHRDLKPEN; QPNPRRRITVA; QLDEVRHSTRYGEERTMAC; DGSEGRRMTS; TQRRGTEEPEAEPARQDERAC; PSIEEVHK
Antigenic propensityQLSKECLKQYLK; VEVLKKVYK; FLQYFPLPGL; VCCRGTK; MYVLFQVFDL; NLFSPQCQ; LVCVSDFVPSQQYV; YEPHPLL; YSSLSDVFQCFSPFDH; PSIDSLVS; GGSPVVLPPPVD; SRPVSVLPSRQS; SVICPQGG; PPPIVPTS; VSPPPQVPPVVVR; QKDVLDVEGIV; ECLHEEVW; FQKVKHLF; GPISVPVSPSVT; QEVTVSVSVVTV; PSITLQVTTTL; IVSKELVDFIRS; PRDLYSCPNCCNPLLLCPFCHSRYPQLTLLEGRVVMECRQCGRL; ICVGGSS; VFTRCWHC; IDGVLYK; RYYVLVDNML; FLEGCYVELLSEQVG; TVSKRLLF; LEQLYQV; GKFSIVYKGIH; ILRLLNHPNV; KETLYIVMELVR; LFDLIQQ; RLPELHVNRIISQLLSTVYYLHKCGIVHRD; FGLSTLC; EVLHQPCGTL; YNHQVDVWSIGVIMYLLLRGRL; LIVRMLQ; IDVYISQLD; CCPEVPTF; LRPPVSQLP; VSPSSLP; SLLNLTLQ; SVPSPSV
Table 3

The predicted B-cell epitopes via the ABCpred tool.

RankSequenceStart positionScore
1SSPPGTPASVVSPAAGAGPI9650.95
1EVPQAAQPSKGPTKSAMLLQ6380.95
1GGVSPPPQVPPVVVRAASPR5640.95
1GETVSKRLLFANSAKEQREW14970.95
2DVLDVEGIVDKIIEECEFFE6910.94
2EEDEAQGNGMLEVPQAAQPS6270.94
2GAPTSVATPVAVSISSAPPA19220.94
3KNGAKPLQNHGAPVATAGPP18390.93
4ALEQLYQVGEQLGHGKFSIV15280.92
5GAPSLAVGGATPLAGTTPPP9270.91
5FGYSASGGMIVNMQHFQKVK8210.91
5KDKTRQEQAKKNPSPVAQSL6600.91
5EDTGSCGSNSNFPGAQAQGA2170.91
5LPAAPAVASRAPAASSPSSL18680.91
5DVWSIGVIMYLLLRGRLPFP17140.91
5VYKGIHRATNELYAIKVIDK15470.91
5SAKEQREWVDTLRVATKQQA15090.91
6TPVYAVPAASAPGVSLSGGG18980.90
6LIQQNHRLPELHVNRIISQL16200.90
6SGDARDDDVYERIAGYRHWE11840.90
6VAGAPTSSAGVEPKQEVTVS10030.90
7QATGSSGAASAAAGASSVSA8770.89
7RSKLLSSRTSSASFSSRGMG7890.89
7VGSAHANAPPPGSGTPAPPP5320.89
7LYYYRRKGDAKPRGFMFLEG14540.89
8AGALAVASPVSGAPSLAVGG9160.88
8PAAGDDGDSSAGPAGGASGE4240.88
8SEAAIDGVLYKKGKHLHQWQ14240.88
9KNPSPVAQSLIKEEKEENEQ6700.87
9GGDAGREASQKAFAAGTGRG6030.87
9SAGPAGGASGESAAKGAEKS4330.87
9ASRNPSPPRRVSAQQPTHVG3280.87
9PHPLLARLEQEASSSEGYGR2810.87
9SSNERPRERLKPYEPHPLLA2670.87
9TPAEVPAGSPGGPSPSIEEV21120.87
9SSLSDAGAQPAAGTGANSGA13510.87
9KQEVTVSVSVVTVAGGGAGS10160.87
10STSVQSRPSRLTAAGLQGIF7520.86
10LIKEEKEENEQKDVLDVEGI6790.86
10LQASPHARPAAGDDGDSSAG4160.86
10AQAQGAYPEAALVCVSDFVP2310.86
10GEERTMACCPEVPTFTIPKN18210.86
10RCWHCGWELSKCAEMLKGNS14050.86
10MSPQLAVDIVSKELVDFIRS12070.86
10VASGSSPAAPGVTGVTEAVA10440.86
11TQGAPGSPPVRGFSGGGGSR4880.85
11KTASRFTSAIKRTFSTQSSS4680.85
11EESSGASSYSSLSDVFQCFS3040.85
11GSPGVSGALLSPAAGSKSVP20420.85
11GDKVVSEEAFPVIGEEEGER11610.85
12ISVPVSPSVTAVATAAVTQV9840.84
12NMQHFQKVKHLFTNPAHSSP8320.84
12ASQKAFAAGTGRGSGPLEED6100.84
12VEETRAAGGATAPGTSVTHT11250.84
12TVAGGGAGSETQPAMASVAS10270.84
13DKIIEECEFFEHGKLSFPEF7000.83
13PGIDKETFLQYFPLPGLWGE640.83
13GYRHWEQSRMSPQLAVDIVS11980.83
13TTGATTAVGGPVSEGPATTP10720.83
14QTGPSGPSSPPGTPASVVSP9580.82
14GTLSQQPRGGITKTASRFTS4560.82
14AGAVSSSPGNLDDEDDEEDT2000.82
14PCGTLAYVAPEVLTMEGYNH16920.82
14NSVGRGSHGGKEEEQNLFSP1660.82
14LLSTVYYLHKCGIVHRDLKP16390.82
14GFAIVHPKGETVSKRLLFAN14890.82
14GFMFLEGCYVELLSEQVGGR14670.82
14CRQCGRLGGSSSSLSDAGAQ13400.82
14LSGASGPGSGALASPSEGAS12490.82
14KELVDFIRSSHQSLHSAELP12180.82
14ATAAAAAFVEETRAAGGATA11170.82
14AGAAAAAATAAAAAFVEETR11100.82
15DPATPAPQPSRLSSSPQMQA8590.81
15FFEHGKLSFPEFKTWLERNE7080.81
15TGLEELGEGATPGGDAGREA5910.81
15APPAALSSRPSIDSLVSASS3690.81
15SPTGEQTGAPPAALSSRPSI3610.81
15EEPEAEPARQDERACGTPAE20960.81
15PAARTEGDTGPVEGAAVSPS19400.81
15KAQQAHSEGSNSVGRGSHGG1560.81
15VGGPVSEGPATTPSITLQVT10790.81
16SGSVDYEEFLIGIAVCCRGT940.80
16ICPQGGISPVGSAHANAPPP5230.80
16VQSTRVGAGGGAGGAGPANS40.80
16ASSSEGYGRSFDEESSGASS2920.80
16GESVSDAAPVAGRGEVDLTR19780.80
17PAPPPIVPTSSGGVPAPGGV5470.79
17GGGSRPVSVLPSRQSSEASV5030.79
17SATPPVAAEAGSPGVSGALL20320.79
17NHGAPVATAGPPAALRPPVS18470.79
17LFSPQCQRTPQNGGSSGTAG1820.79
17LLLRGRLPFPINQAFGHPSF17240.79
17ELVRGGELFDLIQQNHRLPE16100.79
17NELYAIKVIDKGKINGHERE15560.79
17AAIARAHDDPLACSGHSPRD12860.79
17EAVAVASVPGTPTTGATTAV10600.79
18PSKGPTKSAMLLQAEKDKTR6450.78
18QEAPGTVSSPTGEQTGAPPA3530.78
18APGCSDLSSASSGTQRRGTE20770.78
18ASPASLLNLTLQDGSEGRRM20110.78
18DEVRHSTRYGEERTMACCPE18120.78
18VHRDLKPENILLTDRTPNAT16520.78
18GGASGSADPSGGPGAEEDRV13710.78
19PSRQSSEASVICPQGGISPV5130.77
19AAVSPSSLPAGSLDEVPESG19540.77
19YENTPVSFDGAVWREVSSSA17440.77
19ETLYIVMELVRGGELFDLIQ16030.77
19LLSEQVGGRQYGFAIVHPKG14780.77
19GASARAQLPYREGELRQADL12660.77
20PLPGLWGERLFQKFDFKGSG760.76
20PRAETQENETGLEELGEGAT5820.76
20FGLSTLCAPNEVLHQPCGTL16770.76
20ATIKLTDFGLSTLCAPNEVL16700.76
20KHLHQWQARYYVLVDNMLYY14370.76
20CSGHSPRDLYSCPNCCNPLL12980.76
20AGGGAGGAGPANSLAFSTQL110.76
21GREGGSGKVFRRSKLLSSRT7780.75
21VCVSDFVPSQQYVATGSGLS2430.75
21AVSISSAPPAARTEGDTGPV19320.75
21KNIDVYISQLDEVRHSTRYG18020.75
21ELVAMLSNLPNLDRYMSIRK1370.75
21REGELRQADLAAIARAHDDP12760.75
21VFDLNSDGYIQKSELVAMLS1240.75
21TAPGTSVTHTATATAVQGPP11350.75
Figure 6

Propensity scale plots of CDPK7 protein. (a) Bepipred linear; (b) beta-turn; (c) surface accessibility; (d) flexibility; (e) antigenicity; (f) hydrophilicity. x-axis and y-axis represent position and score, respectively. The horizontal line indicates the threshold or the average score. Yellow colors (above the threshold) indicate favorable regions related to the properties of interest. Green color (below the threshold) indicates the unfavorable regions related to the properties of interest.

Table 4

Conformational B-cell epitopes of CDPK7 protein predicted by the ElliPro server.

ResiduesNumber of residuesScore3D structure
A:V1431, A:L1432, A:Y1433, A:K1434, A:K1435, A:G1436, A:K1437, A:H1438, A:L1439, A:H1440, A:Q1441, A:W1442, A:Q1443, A:A1444, A:R1445, A:Y1456, A:Y1457, A:R1458, A:R1459, A:K1460, A:G1461, A:D1462, A:A1463, A:K1464, A:P1465, A:R1466, A:G1467, A:F1468, A:E1477, A:L1478, A:L1479, A:S1480, A:E1481, A:Q1482, A:V1483, A:G1484, A:G1485, A:R1486, A:Q1487, A:Y1488, A:G1489, A:L1504, A:L1505, A:F1506, A:A1507, A:N1508, A:S1509, A:A1510, A:K1511, A:Q1513, A:R1514510.82
A:V1493, A:H1494, A:P1495, A:K1496, A:G1497, A:E1498, A:T1499, A:V1500, A:S1501, A:K1502, A:R1503110.755
A:A1528, A:L1529, A:E1530, A:Q1531, A:L1532, A:Y1533, A:Q1534, A:V1535, A:G1536, A:E1537, A:Q1538, A:H1541, A:I1546, A:Y1548, A:K1549, A:G1550, A:I1551, A:H1552, A:R1553, A:A1554, A:T1555, A:N1556, A:E1557, A:L1558, A:L1611, A:V1612, A:R1613, A:G1614, A:Q1623, A:N1624, A:H1625, A:L1627, A:P1628, A:E1629, A:L1630, A:H1631, A:N1633, A:R1634, A:T1664, A:D1665, A:R1666, A:T1667, A:P1668, A:N1669, A:A1670, A:V1699, A:A1700, A:P1701, A:L1704, A:T1705, A:M1706, A:L1726, A:R1727, A:G1728, A:R1729, A:L1730, A:P1731, A:F1732, A:P1733, A:I1734, A:N1735, A:Q1736, A:A1737, A:F1738, A:G1739, A:P1741, A:S1742, A:F1743, A:Y1744, A:E1745, A:N1746, A:T1747, A:P1748, A:V1749, A:S1750, A:F1751, A:D1752, A:G1753, A:A1754, A:V1755, A:W1756, A:E1758, A:V1759, A:S1760, A:S1761, A:S1762, A:A1763, A:K1764, A:D1765, A:V1768, A:R1769, A:L1771, A:Q1772, A:P1773, A:N1774, A:P1775, A:R1776, A:R1777, A:R1778990.677
A:F1544, A:D1565, A:K1566, A:G1567, A:K1568, A:I1569, A:N1570, A:G1571, A:H1572, A:E1603, A:T1604, A:Y1606120.645
A:C1683, A:A1684, A:P1685, A:N1686, A:E1687, A:V1688, A:L1689, A:Q1691, A:P1692, A:C1693100.535

3.7. MHC-Binding Epitopes

The results are listed in Tables 5 and 6. Epitopes were assessed regarding antigenicity, and those highly antigenic epitopes were finally selected.
Table 5

IC50 values for CDPK7 binding to MHC class I molecules obtained using the IEDBa.

MHC-II allelebStart-stopcPeptide sequencePercentile rankdAntigenicity
CDPK7CDPK7
H2-Db1882-1891SSPSSLPTPI0.150.4079
143-152SNLPNLDRYM0.21-0.4284
1637-1646SQLLSTVYYL0.240.9146

H2-Dd647-656KGPTKSAMLL0.180.9201
1612-1621VRGGELFDLI0.280.1770
1483-1492VGGRQYGFAI0.640.1561

H2-Kb1721-1730IMYLLLRGRL0.551.4751
798-807SSASFSSRGM1.01.3623
1643-1652VYYLHKCGIV1.20.1373

H2-Kd1474-1483CYVELLSEQV0.790.5079
1445-1454RYYVLVDNML1.150.9079
822-831GYSASGGMIV1.30.9579

H2-Kk1555-1564TNELYAIKVI0.120.2695
1084-1093SEGPATTPSI0.750.2366
694-703DVEGIVDKII1.50.1511

H2-Ld1589-1598HPNVIYMKEL3.80.2773
1729-1738RLPFPINQAF4.20.3317
279-288YEPHPLLARL4.60.1027

aThe immune epitope database (http://tools.iedb.org/mhci/). bH2-Db, H2-Dd, H2-Kb, H2-Kd, H2-Kk, and H2-Ld alleles are mouse MHC class I molecules. cTen amino acids for analysis were used each time. dLow percentile rank = high level binding; high percentile rank = low level binding; IC50 values = percentile rank. ∗ indicates potential antigenic epitopes (threshold = 0.5).

Table 6

IC50 values for CDPK7 binding to MHC class II molecules obtained using the IEDBa.

MHC-II allelebStart-stopcPeptide sequencePercentile rankdAntigenicity
CDPK7CDPK7
H2-IAb1109-1123AAGAAAAAATAAAAA0.070.8045
1108-1122AAAGAAAAAATAAAA0.080.8354
1110-1124AGAAAAAATAAAAAF0.080.7176

H2-IAd1035-1049SETQPAMASVASGSS0.130.6766
1034-1048GSETQPAMASVASGS0.150.7059
1036-1050ETQPAMASVASGSSP0.250.6536

H2-IEd1451-1465DNMLYYYRRKGDAKP0.140.6972
1452-1466NMLYYYRRKGDAKPR0.140.8298
1450-1464VDNMLYYYRRKGDAK0.190.6159

aThe immune epitope database (http://tools.immuneepitope.org/mhcii). bH2-IAb, H2-IAd, and H2-IEd alleles are mouse MHC class II molecules. cFifteen amino acids for analysis were used each time. dLow percentile rank = high level binding; high percentile rank = low level binding; IC50 values = percentile rank. ∗ indicates potential antigenic epitopes (threshold = 0.5).

3.8. CTL Epitope Prediction

The high-ranked CTL epitopes predicted by the CTLpred tool for CDPK7 protein are summarized in Table 7. Epitopes were assessed regarding antigenicity, and those highly antigenic epitopes were finally selected.
Table 7

Predicted CDPK7 epitopes by CTLpreda.

Peptide rankStart positionbSequenceScore (ANN/SVM)cAntigenicity
1280EPHPLLARL0.83/1.35910880.0131
21716WSIGVIMYL0.96/1.11208480.1711
31398GSSRVFTRC0.94/1.0685326-0.7197
41187ARDDDVYER0.65/1.34415880.3493
5715SFPEFKTWL0.98/0.953454971.0485
61763AKDLIVRML0.98/0.890308330.8096
7724ERNEGILSM0.65/1.07570750.5393
8470ASRFTSAIK0.80/0.859636891.0303
91573ERELLRSEM0.51/1.07207920.9337
101188RDDDVYERI0.85/0.730178910.0942
111666RTPNATIKL0.99/0.584816130.2323
1232KECLKQYLK0.99/0.583768561.2628
131411WELSKCAEM0.19/1.37503920.3168
141749VSFDGAVWR0.96/0.593704261.2284
15743GLQGNALYR0.99/0.544844831.4369

aCTLpred, available online at http://www.imtech.res.in/raghava/ctlpred/index.html. bNine amino acids for analysis were used. cThe default artificial neural network (ANN) and support vector machine (SVM) cut-off scores were set 0.51 and 0.36, respectively. ∗ indicates potential antigenic epitopes (threshold = 0.5).

3.9. Antigenic and Allergenic Profiles

The antigenic profile of CDPK7 was conducted by the VaxiJen web server with score of 0.7074 (threshold: 0.5). Based on AllergenFP and AllerTOP v2.0 analyses, the CDPK7 protein was appraised as possible nonallergen.

4. Discussion

Toxoplasmosis is a significant menace to human society as well as livestock industry [2, 8, 60]. Thus, the design and improvement of an efficient vaccine against T. gondii infection is still a great challenge for researchers against toxoplasmosis in domestic animals and humans [61]. Recently, bioinformatics tools are more focused for rational vaccine design, with some advantage, including the following: (i) time- and cost-effectiveness; (ii) accurately targeting, long-lasting immunity with favorable polarity in cellular components; and (iii) elimination of undesired responses through specific, epitope-based construct design. Nevertheless, the obtained in silico results are only theoretical data which must be confirmed using wet lab experiments inevitably [62]. It has been known that CDPK7 contributes to several functions in T. gondii such as gliding movement, host-cell invasion, and egress as well as other vital growth processes [34]. Here, we conducted a comprehensive analysis of TgCDPK7, a member of the CDPK family in T. gondii. The amino acid sequence of CDPK7 comprises 2133 residues with an average MW of 219085.79 D, which characterizes a suitable antigenic nature (the peptides with MW more than 10 kDa are considered as good immunogens) [63]. According to the Expasy ProtParam server, GRAVY and the aliphatic index of the CDPK7 were achieved at -0.331 and 68.78, respectively. In summary, the great value of aliphatic index means that the peptide has more stability in a broad range of various temperatures. Moreover, the low/negative value of the GRAVY factor signifies the better interaction of peptide with the molecules of water. It is efficient to identify that PTMs have a fundamental role in cell stability [64]. The acquired outcomes show that CDPK7 comprises 299 potential PTM sites (269 phosphorylation and 30 acylation positions), representing that these positions may organize protein activity. To predict the secondary structure of CDPK7, the GOR4 tool was recruited. The results of secondary structure of CDPK7 verified and included 502 (out of 2133) alpha-helix, 320 extended strands, and 1311 random coils. It is known that the key role of the proteins is related to their three-dimensional structure. As such, to comprehend the influences between both structures and functions, assessment of 3D structure is the key aim of expecting a protein's nature [65]. Humoral and cellular immunity are strongly stimulated in T. gondii infection [66, 67], in such a way that the establishment of IgG antibodies avoids the protozoan from attachment to the receptors of host cell [67]. Interferon-γ (IFN-γ), CD4+, and CD8+ T cells as the main members of T cells play a dynamic role in constraining acute and chronic infection. These major cytokines prevent the reactivation of bradyzoites in the host tissue cyst [66]. Epitope prediction has critical value to evaluate the specificity of antigen. Furthermore, epitope evaluation may reveal the pathogenesis and immune process of the pathogen in design vaccine researches [65, 68]. The strength of using in silico is the detection of the component epitopes that are critical for the interaction of antibodies and antigens. Several linear B-cell epitopes were predicted by the ABCpred server, among which those epitopes above 0.9 score were of great significance to be included in multiepitope vaccine constructs. Moreover, we applied the IEDB online server to evaluate the IC50 values of peptides that link to the MHC class I/II molecules for CDPK7. According to the obtained results from IEDB, the T-cell epitopes on CDPK7 have the capability to bind intensely to MHC class I and class II molecules. It is important to note that the lower IC50 values show the higher-level of affinity, which show an appropriate T-cell epitope. Other the main stage, CTLpred is a special approach used to predict CTL epitopes, which is important in vaccine-related studies. This tool relies on elegant machine learning methods, such as ANN and SVM. We recognized the CTL epitopes using the CTLpred online database to select the top CDPK7 epitopes. The CTLpred server utilizes consensus and combined estimates, in line with these two methods [56]. Evaluation of antigenicity and allergenicity showed that CDPK7 protein has immunogenic and nonallergenic nature.

5. Conclusion

Well antigenicity, hydrophilicity, surface accessibility, and flexibility indexes were detected for CDPK7. Hence, we recommend that a suitable vaccine should be designed and verified both in silico and in vivo by the potential B- and T-cell epitopes predicted in this study.
  63 in total

1.  Prediction of CTL epitopes using QM, SVM and ANN techniques.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Vaccine       Date:  2004-08-13       Impact factor: 3.641

2.  Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization.

Authors:  Dong Xu; Yang Zhang
Journal:  Biophys J       Date:  2011-11-15       Impact factor: 4.033

3.  Prediction of continuous B-cell epitopes in an antigen using recurrent neural network.

Authors:  Sudipto Saha; G P S Raghava
Journal:  Proteins       Date:  2006-10-01

4.  A semi-empirical method for prediction of antigenic determinants on protein antigens.

Authors:  A S Kolaskar; P C Tongaonkar
Journal:  FEBS Lett       Date:  1990-12-10       Impact factor: 4.124

Review 5.  History of the discovery of the life cycle of Toxoplasma gondii.

Authors:  J P Dubey
Journal:  Int J Parasitol       Date:  2009-07-01       Impact factor: 3.981

6.  Development of a chimeric vaccine candidate based on Toxoplasma gondii major surface antigen 1 and apicoplast proteins using comprehensive immunoinformatics approaches.

Authors:  Ali Asghari; Sadegh Shamsinia; Hassan Nourmohammadi; Hamidreza Majidiani; Mohammad Fatollahzadeh; Taher Nemati; Hamid Irannejad; Hamid Reza Nouri; Ezatollah Ghasemi; Morteza Shams
Journal:  Eur J Pharm Sci       Date:  2021-04-06       Impact factor: 4.384

7.  Toxoplasma gondii ROP38 protein: Bioinformatics analysis for vaccine design improvement against toxoplasmosis.

Authors:  MohammadReza Chaechi Nosrati; Ezatollah Ghasemi; Morteza Shams; Sadegh Shamsinia; Ali Yousefi; Hassan Nourmohammadi; Erfan Javanmardi; Bahareh Kordi; Hamidreza Majidiani; Ali Dalir Ghaffari; Farzaneh Shakarami
Journal:  Microb Pathog       Date:  2020-09-09       Impact factor: 3.738

Review 8.  Toxoplasmosis in Blood Donors: A Systematic Review and Meta-Analysis.

Authors:  Masoud Foroutan-Rad; Hamidreza Majidiani; Sahar Dalvand; Ahmad Daryani; Wesam Kooti; Jasem Saki; Faezeh Hedayati-Rad; Ehsan Ahmadpour
Journal:  Transfus Med Rev       Date:  2016-03-30

Review 9.  Rhomboid antigens are promising targets in the vaccine development against Toxoplasma gondii.

Authors:  Masoud Foroutan; Leila Zaki; Sanaz Tavakoli; Shahrzad Soltani; Ali Taghipour; Fatemeh Ghaffarifar
Journal:  EXCLI J       Date:  2019-05-21       Impact factor: 4.068

Review 10.  Environmental transmission of Toxoplasma gondii: Oocysts in water, soil and food.

Authors:  Karen Shapiro; Lillian Bahia-Oliveira; Brent Dixon; Aurélien Dumètre; Luz A de Wit; Elizabeth VanWormer; Isabelle Villena
Journal:  Food Waterborne Parasitol       Date:  2019-04-01
View more
  2 in total

Review 1.  Mining the Proteome of Toxoplasma Parasites Seeking Vaccine and Diagnostic Candidates.

Authors:  Sajad Rashidi; Javier Sánchez-Montejo; Reza Mansouri; Mohammad Ali-Hassanzadeh; Amir Savardashtaki; Mohammad Saleh Bahreini; Mohammadreza Karimazar; Raúl Manzano-Román; Paul Nguewa
Journal:  Animals (Basel)       Date:  2022-04-23       Impact factor: 3.231

Review 2.  More Than Seventy Years of Research (1948-November 2021) on Toxoplasma gondii in Iran: A Narrative Review.

Authors:  Mitra Sadeghi; Seyed Abdollah Hosseini; Shahabeddin Sarvi; Tooran Nayeri; Mehdi Sharif; Abdol Sattar Pagheh; Afsaneh Amouei; Mahbobeh Montazeri; Ahmad Daryani
Journal:  Iran J Parasitol       Date:  2022 Apr-Jun       Impact factor: 1.217

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.