| Literature DB >> 34336254 |
Ali Taghipour1, Sanaz Tavakoli2, Mohamad Sabaghan3, Masoud Foroutan4, Hamidreza Majidiani5, Shahrzad Soltani4, Milad Badri6, Ali Dalir Ghaffari1, Sheyda Soltani4.
Abstract
Apicomplexan parasites, including Toxoplasma gondii (T. gondii), express different types of calcium-dependent protein kinases (CDPKs), which perform a variety of functions, including attacking and exiting the host cells. In the current bioinformatics study, we have used several web servers to predict the basic features and specifications of the CDPK7 protein. The findings showed that CDPK7 protein has 2133 amino acid residues with an average molecular weight (MW) of 219085.79 D. The aliphatic index with 68.78 and grand average of hydropathicity (GRAVY) with -0.331 score were estimated. The outcomes of current research showed that the CDPK7 protein included 502 alpha-helix, 1311 random coils, and 320 extended strands with GOR4 method. Considering the Ramachandran plot, the favored region contains more than 92% of the amino acid residues. In addition, evaluation of antigenicity and allergenicity showed that CDPK7 protein has immunogenic and nonallergenic nature. The present research provides key data for more animal-model study on the CDPK7 protein to design an efficient vaccine against toxoplasmosis in the future.Entities:
Year: 2021 PMID: 34336254 PMCID: PMC8295510 DOI: 10.1155/2021/9974509
Source DB: PubMed Journal: J Parasitol Res ISSN: 2090-0023
Figure 1NetPhos server output for CDPK7 phosphorylation sites. (a) The number of predicted sites, based on S (serine), T (threonine), and Y (tyrosine); (b) prediction diagram of CDPK7 phosphorylation sites.
The acylation sites of CDPK7 sequence.
| ID | Position | Peptide | Score |
|---|---|---|---|
| TGME49_228750 CDPK7 ( | 34 | STQLSKECLKQYLKK | 1.129 |
| TGME49_228750 CDPK7 ( | 109 | FLIGIAVCCRGTKSD | 1.996 |
| TGME49_228750 CDPK7 ( | 110 | LIGIAVCCRGTKSDR | 5.494 |
| TGME49_228750 CDPK7 ( | 187 | QNLFSPQCQRTPQNG | 0.526 |
| TGME49_228750 CDPK7 ( | 222 | DEEDTGSCGSNSNFP | 5.293 |
| TGME49_228750 CDPK7 ( | 244 | YPEAALVCVSDFVPS | 3.693 |
| TGME49_228750 CDPK7 ( | 321 | SLSDVFQCFSPFDHA | 0.984 |
| TGME49_228750 CDPK7 ( | 524 | SSEASVICPQGGISP | 2.536 |
| TGME49_228750 CDPK7 ( | 706 | VDKIIEECEFFEHGK | 0.403 |
| TGME49_228750 CDPK7 ( | 736 | ILSMFTECLHEEVWG | 1.821 |
| TGME49_228750 CDPK7 ( | 1298 | AHDDPLACSGHSPRD | 5.591 |
| TGME49_228750 CDPK7 ( | 1309 | SPRDLYSCPNCCNPL | 1.744 |
| TGME49_228750 CDPK7 ( | 1312 | DLYSCPNCCNPLLLC | 8.015 |
| TGME49_228750 CDPK7 ( | 1313 | LYSCPNCCNPLLLCP | 7.05 |
| TGME49_228750 CDPK7 ( | 1319 | CCNPLLLCPFCHSRY | 2.719 |
| TGME49_228750 CDPK7 ( | 1322 | PLLLCPFCHSRYPQL | 2.865 |
| TGME49_228750 CDPK7 ( | 1340 | EGRVVMECRQCGRLG | 2.295 |
| TGME49_228750 CDPK7 ( | 1343 | VVMECRQCGRLGGSS | 2.929 |
| TGME49_228750 CDPK7 ( | 1395 | DRVEAGICVGGSSRV | 5.406 |
| TGME49_228750 CDPK7 ( | 1406 | SSRVFTRCWHCGWEL | 0.108 |
| TGME49_228750 CDPK7 ( | 1409 | VFTRCWHCGWELSKC | 1.424 |
| TGME49_228750 CDPK7 ( | 1416 | CGWELSKCAEMLKGN | 4.272 |
| TGME49_228750 CDPK7 ( | 1474 | GFMFLEGCYVELLSE | 1.626 |
| TGME49_228750 CDPK7 ( | 1649 | TVYYLHKCGIVHRDL | 1.164 |
| TGME49_228750 CDPK7 ( | 1683 | DFGLSTLCAPNEVLH | 1.6 |
| TGME49_228750 CDPK7 ( | 1693 | NEVLHQPCGTLAYVA | 1.927 |
| TGME49_228750 CDPK7 ( | 1828 | GEERTMACCPEVPTF | 4.139 |
| TGME49_228750 CDPK7 ( | 1829 | EERTMACCPEVPTFT | 7.362 |
| TGME49_228750 CDPK7 ( | 2080 | PSLAAPGCSDLSSAS | 3.862 |
| TGME49_228750 CDPK7 ( | 2110 | ARQDERACGTPAEVP | 6.173 |
Figure 2Transmembrane domains expected in CDPK7 protein. (a) Some statistics and a list of the location of the predicted transmembrane helices and the predicted location of the intervening loop regions. Length: the length of the protein sequence; number of predicted TMHs: the number of predicted transmembrane helices; Exp number of AAs in TMHs: the expected number of amino acids in transmembrane helices. If this number is larger than 18, it is very likely to be a transmembrane protein (or have a signal peptide); Exp number, first 60 AAs: the expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein. If this number is more than a few, you should be warned that a predicted transmembrane helix in the N-term could be a signal peptide; total prob of N-in: the total probability that the N-term is on the cytoplasmic side of the membrane; (b) transmembrane domains expected in CDPK7 protein. (b) Analysis of the transmembrane domains of CDPK7.
Figure 3(a) GOR4 server results suggested that CDPK7 encompasses 502 alpha-helix, 320 extended strands, and 1311 random coils in secondary structure; (b) graphical result of the secondary structure prediction of CDPK7 using the GOR4 online server.
Figure 4SWISS-MODEL server output. (a) Computed 3D model; (b) global quality estimate; (c) comparison with nonredundant set of PDB structures; (d) local quality estimate.
Figure 5Validation of 3D model of CDPK7 protein. (a) The Z-score plot for 3D structure of predicted protein before and after refinement with ProSA-web server, respectively; (b) Ramachandran plot analysis of predicted structure.
Epitopes predicted in CDPK7 protein by different parameters based on the Bcepred online server.
| Prediction parameter | Epitope sequence |
|---|---|
| Hydrophilicity | GAGGGAGGAG; KKFDSDEVEV; KGSGSVDYEE; CRGTKSDRM; AQQAHSEGSNSVGRGSHGGKEEEQNL; SPQCQRTPQNGGSSGTAGA; SPGNLDDEDDEEDTGSCGSNSN; SLDSTSSNERPRER; EQEASSSEGYGRSFDEESSGASSYSS; DHASRNP; GPEAPGQEAPGT; SSPTGEQTGAP; SASSPAGGS; DRPAGAGTGAE; RPAAGDDGDSSAGPAGGASGESAAKGAEKSPKTGT; SQQPRGG; STQSSSTQGAPGS; SGGGGSRP; PSRQSSEASV; SPRAETQENETGLEE; GEGATPGGDAGREASQKA; AGTGRGSGPLEEDEAQGNG; QPSKGPTKSA; QAEKDKTRQEQAKKNPS; IKEEKEENEQKDV; GSGREGGSGKV; SSRTSSAS; GKAGSPSSSRVGG; TNPAHSSPRRPTRD; QATGSSGAASA; ARGSGAQ; GGAGPENAGA; ETTSQASQHQTGPSGPSSPP; GVEPKQE; AGGGAGSETQPA; ASGSSPAA; SEGPATT; DPTTAGA; EETRAAGG; QGPPDGRGSAGDKV; IGEEEGERMSGSGDARDDDVYER; DSRPAPS; SGASGPGSGA; ASPSEGASAR; ARAHDDP; SGHSPRD; SSSSLSDAGAQP; AGTGANSGAGGASGSADPSGGPGAEEDRVEAG; KGNSEAA; RRKGDAKPRG; SEQVGGRQ; KGETVSKR; ANSAKEQRE; DKGKING; TDRTPNAT; EVSSSAKD; VNNGSKNID; DEVRHSTRYGEERT; AASSPSS; DPGAPTS; AARTEGDTGPVEG; DEVPESG; GGESVSDAA; AGRGEVD; TRGQGQGSTASG; TLQDGSEGRR; AEAGSPG; SSASSGTQRRGTEEPEAEPARQDERACGT; GSPGGPS |
| Flexibility | LAFSTQL; QYLKKFD; KALSARSPG; QKFDFKGSGS; AVCCRGTKSD; QQAHSEGSNSVGRGSHGGKEEE; QCQRTPQNGGSSGTAGAVSSSPGNLDDEDDEEDTGSCGSNS; SGLSLDSTSSNERPR; ARLEQEASSSE; RSFDEESSGA; FDHASRNPSPP; GTVSSPTGEQ; PAALSSRP; VSASSPAG; PAAGDDGDSS; GPAGGASG; SAAKGAEKSPKTGTLSQQPRGGITKTA; SAIKRTFSTQSSSTQ; PPVRGFSGGGGSRP; SVLPSRQSSE; NAPPPGSG; PIVPTSSG; PRAETQENE; GDAGREAS; FAAGTGRGS; QAAQPSKGPT; LQAEKDKTRQEQAKKNP; SLIKEEKEENEQ; ALYRSTSVQSRPS; KDPLGSGREGGSG; SKLLSSRTSSASFSSRGMGKAGSPSSSRV; NPAHSSPRRPT; APQPSRLSSSPQMQATGSS; PAARGSG; SQHQTGPSGPSSPP; TVAGGGAGSET; ASVASGSS; AVQGPPDGRGSAG; PVIGEEEGERMSGSGDA; RHWEQSR; DFIRSSH; AELPRDSR; GALSGASGPG; LACSGHSP; RQCGRLGGSSSSL; TGANSGAGGASGSADPSGG; GICVGGSS; AEMLKGNS; YYYRRKGD; VHPKGETVS; LFANSAKEQ; GKINGHE; ILLTDRT; AVWREVSSSA; RMLQPNPR; TAVVNNGS; APAASSPS; TPIRPFS; PGVSLSG; PAARTEGD; SGASLGGESV; VDLTRGQGQGSTA; LTLQDGSEGRRM; PAAGSKSV; SDLSSASSGTQRRGTE; VPAGSPGGPS |
| Accessibility | STQLSKECLKQYLKKFDSDEV; VLKKVYKAL; PGIDKETFLQY; GERLFQKFDFKG; CRGTKSDRMYV; SDGYIQKSEL; NLPNLDRYMSIRKAQQAHSEG; SHGGKEEEQNLFSPQCQRTPQNGG; PGNLDDEDDEEDTGS; DSTSSNERPRERLKPYEPHPL; ARLEQEASSSEGYGRSFDEESS; DHASRNPSPPRRVSAQQPTH; PEAPGQE; SSPTGEQTGAPP; PPPVDRPA; QASPHAR; AKGAEKSPKTGTLSQQPRGGITKTASRFTSAIKRTFSTQSSSTQG; LPSRQSSEAS; SPPPQVPP; ASPRAETQENETGLEE; GREASQKA; GPLEEDEAQGN; PQAAQPSKGPTKSA; LQAEKDKTRQEQAKKNPSPVAQSLIKEEKEENEQKDVLD; FKTWLERNEG; YRSTSVQSRPSRLTA; REGGSGKVFRRSKLLS; GSPSSSR; NMQHFQKVKH; PAHSSPRRPTRDLDPATPAPQPSRLSSSPQMQ; ETTSQASQHQTGPS; GVEPKQEVTV; EETRAAG; QGPPDGRGSAGDKVVSEE; GEEEGERMS; SGDARDDDVYERIAGYRHWEQSRMSPQ; RSSHQSL; SAELPRDSRPAPSRG; RAQLPYREGELRQAD; ARAHDDPL; SGHSPRDLYS; CHSRYPQLTL; PGAEEDRVEA; VLYKKGKHLHQWQARYY; NMLYYYRRKGDAKPRGF; EQVGGRQ; KGETVSKRL; ANSAKEQREWVDT; RVATKQQALEQ; IHRATNELY; KVIDKGKINGHERELLRSE; RLLNHPN; KELLDTKETLY; LIQQNHRLPEL; VHRDLKPENI; LTDRTPNAT; MEGYNHQ; HPSFYENTPVS; RMLQPNPRRRITV; VNNGSKNID; SQLDEVRHSTRYGEERTMA; IPKNGAKPLQNHG; RPFSEST; PPAARTEGDTG; QDGSEGRRMTSAT; SKSVPSPS; SSGTQRRGTEEPEAEPARQDERAC; PSPSIEE |
| Turns | SCGSNSNFPG; DSTSSNE; YSCPNCCNPL; LLNHPNVI |
| Exposed surface | KECLKQYLKKFDSDE; VLKKVYKAL; RGTKSDR; GKEEEQNL; DDEDDEEDT; SSNERPRERLKPYEPHP; RNPSPPRR; AEKSPKTG; LQAEKDKTRQEQAKKNPSPV; SLIKEEKEENEQKDV; KVFRRSK; QHFQKVK; SPRRPTRDL; EPKQEVT; VLYKKGKHLH; LYYYRRKGDAKPR; NSAKEQREWV; KQQALEQ; KVIDKGK; KELLDTK; HRDLKPEN; LQPNPRRRIT; QRRGTEE; EPARQDER; EEVHK |
| Polarity | LSKECLKQYLKKFDSDEVEVLKK; GERLFQKF; RGTKSDR; RYMSIRKAQQAH; RGSHGGKEEEQNLF; GNLDDEDDEEDTGS; SSNERPRERLKPYEPHP; LARLEQEAS; GRSFDEES; AEKSPKTG; PRAETQENETGLEE; GREASQKA; GPLEEDEAQG; LQAEKDKTRQEQAKKNP; QSLIKEEKEENEQKDVLDVE; VDKIIEECEFFEHGKLSF; EFKTWLERNEGIL; FTECLHEEVWGL; REGGSGKVFRRSKLLS; QHFQKVKHLFT; AHSSPRRPTRDLD; GVEPKQEVT; AFVEETRAAG; DKVVSEE; IGEEEGERMSGSGDARDDDVYERIAGYRHWEQSRM; HSAELPRDSR; LPYREGELRQA; IARAHDDPL; EGRVVMECRQCGR; PGAEEDRVEAG; ELSKCAE; LYKKGKHLHQWQ; LYYYRRKGDAKPR; IVHPKGETVSKRL; NSAKEQREWVD; EQLGHGK; VYKGIHRATNEL; KVIDKGKINGHERELLRSEM; KELLDTKE; ELVRGGE; QNHRLPELHVNRI; HKCGIVHRDLKPEN; QPNPRRRITVA; QLDEVRHSTRYGEERTMAC; DGSEGRRMTS; TQRRGTEEPEAEPARQDERAC; PSIEEVHK |
| Antigenic propensity | QLSKECLKQYLK; VEVLKKVYK; FLQYFPLPGL; VCCRGTK; MYVLFQVFDL; NLFSPQCQ; LVCVSDFVPSQQYV; YEPHPLL; YSSLSDVFQCFSPFDH; PSIDSLVS; GGSPVVLPPPVD; SRPVSVLPSRQS; SVICPQGG; PPPIVPTS; VSPPPQVPPVVVR; QKDVLDVEGIV; ECLHEEVW; FQKVKHLF; GPISVPVSPSVT; QEVTVSVSVVTV; PSITLQVTTTL; IVSKELVDFIRS; PRDLYSCPNCCNPLLLCPFCHSRYPQLTLLEGRVVMECRQCGRL; ICVGGSS; VFTRCWHC; IDGVLYK; RYYVLVDNML; FLEGCYVELLSEQVG; TVSKRLLF; LEQLYQV; GKFSIVYKGIH; ILRLLNHPNV; KETLYIVMELVR; LFDLIQQ; RLPELHVNRIISQLLSTVYYLHKCGIVHRD; FGLSTLC; EVLHQPCGTL; YNHQVDVWSIGVIMYLLLRGRL; LIVRMLQ; IDVYISQLD; CCPEVPTF; LRPPVSQLP; VSPSSLP; SLLNLTLQ; SVPSPSV |
The predicted B-cell epitopes via the ABCpred tool.
| Rank | Sequence | Start position | Score |
|---|---|---|---|
| 1 | SSPPGTPASVVSPAAGAGPI | 965 | 0.95 |
| 1 | EVPQAAQPSKGPTKSAMLLQ | 638 | 0.95 |
| 1 | GGVSPPPQVPPVVVRAASPR | 564 | 0.95 |
| 1 | GETVSKRLLFANSAKEQREW | 1497 | 0.95 |
| 2 | DVLDVEGIVDKIIEECEFFE | 691 | 0.94 |
| 2 | EEDEAQGNGMLEVPQAAQPS | 627 | 0.94 |
| 2 | GAPTSVATPVAVSISSAPPA | 1922 | 0.94 |
| 3 | KNGAKPLQNHGAPVATAGPP | 1839 | 0.93 |
| 4 | ALEQLYQVGEQLGHGKFSIV | 1528 | 0.92 |
| 5 | GAPSLAVGGATPLAGTTPPP | 927 | 0.91 |
| 5 | FGYSASGGMIVNMQHFQKVK | 821 | 0.91 |
| 5 | KDKTRQEQAKKNPSPVAQSL | 660 | 0.91 |
| 5 | EDTGSCGSNSNFPGAQAQGA | 217 | 0.91 |
| 5 | LPAAPAVASRAPAASSPSSL | 1868 | 0.91 |
| 5 | DVWSIGVIMYLLLRGRLPFP | 1714 | 0.91 |
| 5 | VYKGIHRATNELYAIKVIDK | 1547 | 0.91 |
| 5 | SAKEQREWVDTLRVATKQQA | 1509 | 0.91 |
| 6 | TPVYAVPAASAPGVSLSGGG | 1898 | 0.90 |
| 6 | LIQQNHRLPELHVNRIISQL | 1620 | 0.90 |
| 6 | SGDARDDDVYERIAGYRHWE | 1184 | 0.90 |
| 6 | VAGAPTSSAGVEPKQEVTVS | 1003 | 0.90 |
| 7 | QATGSSGAASAAAGASSVSA | 877 | 0.89 |
| 7 | RSKLLSSRTSSASFSSRGMG | 789 | 0.89 |
| 7 | VGSAHANAPPPGSGTPAPPP | 532 | 0.89 |
| 7 | LYYYRRKGDAKPRGFMFLEG | 1454 | 0.89 |
| 8 | AGALAVASPVSGAPSLAVGG | 916 | 0.88 |
| 8 | PAAGDDGDSSAGPAGGASGE | 424 | 0.88 |
| 8 | SEAAIDGVLYKKGKHLHQWQ | 1424 | 0.88 |
| 9 | KNPSPVAQSLIKEEKEENEQ | 670 | 0.87 |
| 9 | GGDAGREASQKAFAAGTGRG | 603 | 0.87 |
| 9 | SAGPAGGASGESAAKGAEKS | 433 | 0.87 |
| 9 | ASRNPSPPRRVSAQQPTHVG | 328 | 0.87 |
| 9 | PHPLLARLEQEASSSEGYGR | 281 | 0.87 |
| 9 | SSNERPRERLKPYEPHPLLA | 267 | 0.87 |
| 9 | TPAEVPAGSPGGPSPSIEEV | 2112 | 0.87 |
| 9 | SSLSDAGAQPAAGTGANSGA | 1351 | 0.87 |
| 9 | KQEVTVSVSVVTVAGGGAGS | 1016 | 0.87 |
| 10 | STSVQSRPSRLTAAGLQGIF | 752 | 0.86 |
| 10 | LIKEEKEENEQKDVLDVEGI | 679 | 0.86 |
| 10 | LQASPHARPAAGDDGDSSAG | 416 | 0.86 |
| 10 | AQAQGAYPEAALVCVSDFVP | 231 | 0.86 |
| 10 | GEERTMACCPEVPTFTIPKN | 1821 | 0.86 |
| 10 | RCWHCGWELSKCAEMLKGNS | 1405 | 0.86 |
| 10 | MSPQLAVDIVSKELVDFIRS | 1207 | 0.86 |
| 10 | VASGSSPAAPGVTGVTEAVA | 1044 | 0.86 |
| 11 | TQGAPGSPPVRGFSGGGGSR | 488 | 0.85 |
| 11 | KTASRFTSAIKRTFSTQSSS | 468 | 0.85 |
| 11 | EESSGASSYSSLSDVFQCFS | 304 | 0.85 |
| 11 | GSPGVSGALLSPAAGSKSVP | 2042 | 0.85 |
| 11 | GDKVVSEEAFPVIGEEEGER | 1161 | 0.85 |
| 12 | ISVPVSPSVTAVATAAVTQV | 984 | 0.84 |
| 12 | NMQHFQKVKHLFTNPAHSSP | 832 | 0.84 |
| 12 | ASQKAFAAGTGRGSGPLEED | 610 | 0.84 |
| 12 | VEETRAAGGATAPGTSVTHT | 1125 | 0.84 |
| 12 | TVAGGGAGSETQPAMASVAS | 1027 | 0.84 |
| 13 | DKIIEECEFFEHGKLSFPEF | 700 | 0.83 |
| 13 | PGIDKETFLQYFPLPGLWGE | 64 | 0.83 |
| 13 | GYRHWEQSRMSPQLAVDIVS | 1198 | 0.83 |
| 13 | TTGATTAVGGPVSEGPATTP | 1072 | 0.83 |
| 14 | QTGPSGPSSPPGTPASVVSP | 958 | 0.82 |
| 14 | GTLSQQPRGGITKTASRFTS | 456 | 0.82 |
| 14 | AGAVSSSPGNLDDEDDEEDT | 200 | 0.82 |
| 14 | PCGTLAYVAPEVLTMEGYNH | 1692 | 0.82 |
| 14 | NSVGRGSHGGKEEEQNLFSP | 166 | 0.82 |
| 14 | LLSTVYYLHKCGIVHRDLKP | 1639 | 0.82 |
| 14 | GFAIVHPKGETVSKRLLFAN | 1489 | 0.82 |
| 14 | GFMFLEGCYVELLSEQVGGR | 1467 | 0.82 |
| 14 | CRQCGRLGGSSSSLSDAGAQ | 1340 | 0.82 |
| 14 | LSGASGPGSGALASPSEGAS | 1249 | 0.82 |
| 14 | KELVDFIRSSHQSLHSAELP | 1218 | 0.82 |
| 14 | ATAAAAAFVEETRAAGGATA | 1117 | 0.82 |
| 14 | AGAAAAAATAAAAAFVEETR | 1110 | 0.82 |
| 15 | DPATPAPQPSRLSSSPQMQA | 859 | 0.81 |
| 15 | FFEHGKLSFPEFKTWLERNE | 708 | 0.81 |
| 15 | TGLEELGEGATPGGDAGREA | 591 | 0.81 |
| 15 | APPAALSSRPSIDSLVSASS | 369 | 0.81 |
| 15 | SPTGEQTGAPPAALSSRPSI | 361 | 0.81 |
| 15 | EEPEAEPARQDERACGTPAE | 2096 | 0.81 |
| 15 | PAARTEGDTGPVEGAAVSPS | 1940 | 0.81 |
| 15 | KAQQAHSEGSNSVGRGSHGG | 156 | 0.81 |
| 15 | VGGPVSEGPATTPSITLQVT | 1079 | 0.81 |
| 16 | SGSVDYEEFLIGIAVCCRGT | 94 | 0.80 |
| 16 | ICPQGGISPVGSAHANAPPP | 523 | 0.80 |
| 16 | VQSTRVGAGGGAGGAGPANS | 4 | 0.80 |
| 16 | ASSSEGYGRSFDEESSGASS | 292 | 0.80 |
| 16 | GESVSDAAPVAGRGEVDLTR | 1978 | 0.80 |
| 17 | PAPPPIVPTSSGGVPAPGGV | 547 | 0.79 |
| 17 | GGGSRPVSVLPSRQSSEASV | 503 | 0.79 |
| 17 | SATPPVAAEAGSPGVSGALL | 2032 | 0.79 |
| 17 | NHGAPVATAGPPAALRPPVS | 1847 | 0.79 |
| 17 | LFSPQCQRTPQNGGSSGTAG | 182 | 0.79 |
| 17 | LLLRGRLPFPINQAFGHPSF | 1724 | 0.79 |
| 17 | ELVRGGELFDLIQQNHRLPE | 1610 | 0.79 |
| 17 | NELYAIKVIDKGKINGHERE | 1556 | 0.79 |
| 17 | AAIARAHDDPLACSGHSPRD | 1286 | 0.79 |
| 17 | EAVAVASVPGTPTTGATTAV | 1060 | 0.79 |
| 18 | PSKGPTKSAMLLQAEKDKTR | 645 | 0.78 |
| 18 | QEAPGTVSSPTGEQTGAPPA | 353 | 0.78 |
| 18 | APGCSDLSSASSGTQRRGTE | 2077 | 0.78 |
| 18 | ASPASLLNLTLQDGSEGRRM | 2011 | 0.78 |
| 18 | DEVRHSTRYGEERTMACCPE | 1812 | 0.78 |
| 18 | VHRDLKPENILLTDRTPNAT | 1652 | 0.78 |
| 18 | GGASGSADPSGGPGAEEDRV | 1371 | 0.78 |
| 19 | PSRQSSEASVICPQGGISPV | 513 | 0.77 |
| 19 | AAVSPSSLPAGSLDEVPESG | 1954 | 0.77 |
| 19 | YENTPVSFDGAVWREVSSSA | 1744 | 0.77 |
| 19 | ETLYIVMELVRGGELFDLIQ | 1603 | 0.77 |
| 19 | LLSEQVGGRQYGFAIVHPKG | 1478 | 0.77 |
| 19 | GASARAQLPYREGELRQADL | 1266 | 0.77 |
| 20 | PLPGLWGERLFQKFDFKGSG | 76 | 0.76 |
| 20 | PRAETQENETGLEELGEGAT | 582 | 0.76 |
| 20 | FGLSTLCAPNEVLHQPCGTL | 1677 | 0.76 |
| 20 | ATIKLTDFGLSTLCAPNEVL | 1670 | 0.76 |
| 20 | KHLHQWQARYYVLVDNMLYY | 1437 | 0.76 |
| 20 | CSGHSPRDLYSCPNCCNPLL | 1298 | 0.76 |
| 20 | AGGGAGGAGPANSLAFSTQL | 11 | 0.76 |
| 21 | GREGGSGKVFRRSKLLSSRT | 778 | 0.75 |
| 21 | VCVSDFVPSQQYVATGSGLS | 243 | 0.75 |
| 21 | AVSISSAPPAARTEGDTGPV | 1932 | 0.75 |
| 21 | KNIDVYISQLDEVRHSTRYG | 1802 | 0.75 |
| 21 | ELVAMLSNLPNLDRYMSIRK | 137 | 0.75 |
| 21 | REGELRQADLAAIARAHDDP | 1276 | 0.75 |
| 21 | VFDLNSDGYIQKSELVAMLS | 124 | 0.75 |
| 21 | TAPGTSVTHTATATAVQGPP | 1135 | 0.75 |
Figure 6Propensity scale plots of CDPK7 protein. (a) Bepipred linear; (b) beta-turn; (c) surface accessibility; (d) flexibility; (e) antigenicity; (f) hydrophilicity. x-axis and y-axis represent position and score, respectively. The horizontal line indicates the threshold or the average score. Yellow colors (above the threshold) indicate favorable regions related to the properties of interest. Green color (below the threshold) indicates the unfavorable regions related to the properties of interest.
Conformational B-cell epitopes of CDPK7 protein predicted by the ElliPro server.
| Residues | Number of residues | Score | 3D structure |
|---|---|---|---|
| A:V1431, A:L1432, A:Y1433, A:K1434, A:K1435, A:G1436, A:K1437, A:H1438, A:L1439, A:H1440, A:Q1441, A:W1442, A:Q1443, A:A1444, A:R1445, A:Y1456, A:Y1457, A:R1458, A:R1459, A:K1460, A:G1461, A:D1462, A:A1463, A:K1464, A:P1465, A:R1466, A:G1467, A:F1468, A:E1477, A:L1478, A:L1479, A:S1480, A:E1481, A:Q1482, A:V1483, A:G1484, A:G1485, A:R1486, A:Q1487, A:Y1488, A:G1489, A:L1504, A:L1505, A:F1506, A:A1507, A:N1508, A:S1509, A:A1510, A:K1511, A:Q1513, A:R1514 | 51 | 0.82 |
|
| A:V1493, A:H1494, A:P1495, A:K1496, A:G1497, A:E1498, A:T1499, A:V1500, A:S1501, A:K1502, A:R1503 | 11 | 0.755 |
|
| A:A1528, A:L1529, A:E1530, A:Q1531, A:L1532, A:Y1533, A:Q1534, A:V1535, A:G1536, A:E1537, A:Q1538, A:H1541, A:I1546, A:Y1548, A:K1549, A:G1550, A:I1551, A:H1552, A:R1553, A:A1554, A:T1555, A:N1556, A:E1557, A:L1558, A:L1611, A:V1612, A:R1613, A:G1614, A:Q1623, A:N1624, A:H1625, A:L1627, A:P1628, A:E1629, A:L1630, A:H1631, A:N1633, A:R1634, A:T1664, A:D1665, A:R1666, A:T1667, A:P1668, A:N1669, A:A1670, A:V1699, A:A1700, A:P1701, A:L1704, A:T1705, A:M1706, A:L1726, A:R1727, A:G1728, A:R1729, A:L1730, A:P1731, A:F1732, A:P1733, A:I1734, A:N1735, A:Q1736, A:A1737, A:F1738, A:G1739, A:P1741, A:S1742, A:F1743, A:Y1744, A:E1745, A:N1746, A:T1747, A:P1748, A:V1749, A:S1750, A:F1751, A:D1752, A:G1753, A:A1754, A:V1755, A:W1756, A:E1758, A:V1759, A:S1760, A:S1761, A:S1762, A:A1763, A:K1764, A:D1765, A:V1768, A:R1769, A:L1771, A:Q1772, A:P1773, A:N1774, A:P1775, A:R1776, A:R1777, A:R1778 | 99 | 0.677 |
|
| A:F1544, A:D1565, A:K1566, A:G1567, A:K1568, A:I1569, A:N1570, A:G1571, A:H1572, A:E1603, A:T1604, A:Y1606 | 12 | 0.645 |
|
| A:C1683, A:A1684, A:P1685, A:N1686, A:E1687, A:V1688, A:L1689, A:Q1691, A:P1692, A:C1693 | 10 | 0.535 |
|
IC50 values for CDPK7 binding to MHC class I molecules obtained using the IEDBa.
| MHC-II alleleb | Start-stopc | Peptide sequence | Percentile rankd | Antigenicity |
|---|---|---|---|---|
| CDPK7 | CDPK7 | |||
| H2-Db | 1882-1891 | SSPSSLPTPI | 0.15 | 0.4079 |
| 143-152 | SNLPNLDRYM | 0.21 | -0.4284 | |
| 1637-1646 | SQLLSTVYYL∗ | 0.24 | 0.9146 | |
|
| ||||
| H2-Dd | 647-656 | KGPTKSAMLL∗ | 0.18 | 0.9201 |
| 1612-1621 | VRGGELFDLI | 0.28 | 0.1770 | |
| 1483-1492 | VGGRQYGFAI | 0.64 | 0.1561 | |
|
| ||||
| H2-Kb | 1721-1730 | IMYLLLRGRL∗ | 0.55 | 1.4751 |
| 798-807 | SSASFSSRGM∗ | 1.0 | 1.3623 | |
| 1643-1652 | VYYLHKCGIV | 1.2 | 0.1373 | |
|
| ||||
| H2-Kd | 1474-1483 | CYVELLSEQV∗ | 0.79 | 0.5079 |
| 1445-1454 | RYYVLVDNML∗ | 1.15 | 0.9079 | |
| 822-831 | GYSASGGMIV∗ | 1.3 | 0.9579 | |
|
| ||||
| H2-Kk | 1555-1564 | TNELYAIKVI | 0.12 | 0.2695 |
| 1084-1093 | SEGPATTPSI | 0.75 | 0.2366 | |
| 694-703 | DVEGIVDKII | 1.5 | 0.1511 | |
|
| ||||
| H2-Ld | 1589-1598 | HPNVIYMKEL | 3.8 | 0.2773 |
| 1729-1738 | RLPFPINQAF | 4.2 | 0.3317 | |
| 279-288 | YEPHPLLARL | 4.6 | 0.1027 | |
aThe immune epitope database (http://tools.iedb.org/mhci/). bH2-Db, H2-Dd, H2-Kb, H2-Kd, H2-Kk, and H2-Ld alleles are mouse MHC class I molecules. cTen amino acids for analysis were used each time. dLow percentile rank = high level binding; high percentile rank = low level binding; IC50 values = percentile rank. ∗ indicates potential antigenic epitopes (threshold = 0.5).
IC50 values for CDPK7 binding to MHC class II molecules obtained using the IEDBa.
| MHC-II alleleb | Start-stopc | Peptide sequence | Percentile rankd | Antigenicity |
|---|---|---|---|---|
| CDPK7 | CDPK7 | |||
| H2-IAb | 1109-1123 | AAGAAAAAATAAAAA∗ | 0.07 | 0.8045 |
| 1108-1122 | AAAGAAAAAATAAAA∗ | 0.08 | 0.8354 | |
| 1110-1124 | AGAAAAAATAAAAAF∗ | 0.08 | 0.7176 | |
|
| ||||
| H2-IAd | 1035-1049 | SETQPAMASVASGSS∗ | 0.13 | 0.6766 |
| 1034-1048 | GSETQPAMASVASGS∗ | 0.15 | 0.7059 | |
| 1036-1050 | ETQPAMASVASGSSP∗ | 0.25 | 0.6536 | |
|
| ||||
| H2-IEd | 1451-1465 | DNMLYYYRRKGDAKP∗ | 0.14 | 0.6972 |
| 1452-1466 | NMLYYYRRKGDAKPR∗ | 0.14 | 0.8298 | |
| 1450-1464 | VDNMLYYYRRKGDAK∗ | 0.19 | 0.6159 | |
aThe immune epitope database (http://tools.immuneepitope.org/mhcii). bH2-IAb, H2-IAd, and H2-IEd alleles are mouse MHC class II molecules. cFifteen amino acids for analysis were used each time. dLow percentile rank = high level binding; high percentile rank = low level binding; IC50 values = percentile rank. ∗ indicates potential antigenic epitopes (threshold = 0.5).
Predicted CDPK7 epitopes by CTLpreda.
| Peptide rank | Start positionb | Sequence | Score (ANN/SVM)c | Antigenicity |
|---|---|---|---|---|
| 1 | 280 | EPHPLLARL | 0.83/1.3591088 | 0.0131 |
| 2 | 1716 | WSIGVIMYL | 0.96/1.1120848 | 0.1711 |
| 3 | 1398 | GSSRVFTRC | 0.94/1.0685326 | -0.7197 |
| 4 | 1187 | ARDDDVYER | 0.65/1.3441588 | 0.3493 |
| 5 | 715 | SFPEFKTWL∗ | 0.98/0.95345497 | 1.0485 |
| 6 | 1763 | AKDLIVRML∗ | 0.98/0.89030833 | 0.8096 |
| 7 | 724 | ERNEGILSM∗ | 0.65/1.0757075 | 0.5393 |
| 8 | 470 | ASRFTSAIK∗ | 0.80/0.85963689 | 1.0303 |
| 9 | 1573 | ERELLRSEM∗ | 0.51/1.0720792 | 0.9337 |
| 10 | 1188 | RDDDVYERI | 0.85/0.73017891 | 0.0942 |
| 11 | 1666 | RTPNATIKL | 0.99/0.58481613 | 0.2323 |
| 12 | 32 | KECLKQYLK∗ | 0.99/0.58376856 | 1.2628 |
| 13 | 1411 | WELSKCAEM | 0.19/1.3750392 | 0.3168 |
| 14 | 1749 | VSFDGAVWR∗ | 0.96/0.59370426 | 1.2284 |
| 15 | 743 | GLQGNALYR∗ | 0.99/0.54484483 | 1.4369 |
aCTLpred, available online at http://www.imtech.res.in/raghava/ctlpred/index.html. bNine amino acids for analysis were used. cThe default artificial neural network (ANN) and support vector machine (SVM) cut-off scores were set 0.51 and 0.36, respectively. ∗ indicates potential antigenic epitopes (threshold = 0.5).