| Literature DB >> 23586017 |
Hua Cong1, Min Zhang, Qingli Zhang, Jing Gong, Haizi Cong, Qing Xin, Shenyi He.
Abstract
Toxoplasma gondii is a protozoan parasite capable of infecting humans and animals. Surface antigen glycoproteins, SAG2C, -2D, -2X, and -2Y, are expressed on the surface of bradyzoites. These antigens have been shown to protect bradyzoites against immune responses during chronic infections. We studied structures of SAG2C, -2D, -2X, and -2Y proteins using bioinformatics methods. The protein sequence alignment was performed by T-Coffee method. Secondary structural and functional domains were predicted using software PSIPRED v3.0 and SMART software, and 3D models of proteins were constructed and compared using the I-TASSER server, VMD, and SWISS-spdbv. Our results showed that SAG2C, -2D, -2X, and -2Y are highly homologous proteins. They share the same conserved peptides and HLA-I restricted epitopes. The similarity in structure and domains indicated putative common functions that might stimulate similar immune response in hosts. The conserved peptides and HLA-restricted epitopes could provide important insights on vaccine study and the diagnosis of this disease.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23586017 PMCID: PMC3618923 DOI: 10.1155/2013/165342
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The original resources for SAG2C, D, X, and Y.
| Protein | Sizea | Coding gene location in types I, II, and III parasitesb | Derivationc |
|---|---|---|---|
| SAG2C | 365 aa | TGGT1_chrX: 7,358,316–7,360,225(−); | |
| TGME49_chrX: 7353954–7,365860 (−); | SRS49D | ||
| TGVEG_chrX: 7,440,957–7,442,866(−) | |||
| SAG2D | 196 aa | TGGT1_chrX: 7,352,615–7,353,874(−); | |
| TGME49_chrX: 7,353,328–7,354,587(−); | SRS49C | ||
| TGVEG_chrX: 7,435,249–7,436,508 (−) | |||
| SAG2X | 367 aa | TGGT1_chrX: 7,352,615–7,353,874(−); | |
| TGME49_chrX: 7,353,328–7,354,587(−); | SRS49B | ||
| TGVEG_chrX: 7,435,249–7,436,508 (−) | |||
| SAG2Y | 316 aa | TGGT1_chrX: 7,358,316–7,360,225(−); | |
| TGME49_chrX: 7,355,817–7,356,572 (−); | SRS49A | ||
| TGVEG_chrX: 7,440,957–7,442,866(−) |
aSize is the amino acid number that the protein has.
bCoding gene is the location of the gene that coded the protein.
cSRS domain-containing protein number.
Figure 1Alignment analysis for SAG2C, -2D, -2X, -2Y proteins. T-Coffee: multiple sequence alignment tools were used to obtain the alignment analysis result for SAG2C, -2D, -2X, and -2Y. Color bar indicated the identity, from bad identity to good identity.
Figure 2The 2D Structures of SAG2C, -2D, -2X, and -2Y proteins. PSIPREDv3.0 was used to predict the secondary structure for SAG2C, -2D, -2X, and -2Y proteins (C stands for coil, H stands for α-helix, and E stands for β-strand).
Figure 3Prediction for protein domain. A web-based tool—SMARTM was used to figure out the domains of these proteins: transmembrane segments predicted by the TMHMM2 program (segments in blue color), segments of low compositional complexity determined by the SEG program (segments in purple color), signal peptides determined by the SignalP program (segments in red color), and domain (segments in gray color).
Parameters for predicted best 3-D model.
| Name | C-scorea | TM-scoreb | RMSDb | No. of decoysc | Cluster densityd |
|---|---|---|---|---|---|
| SAG2C | −2.18 | 0.46 ± 0.15 | 11.8 ± 4.5 | 460 | 0.0285 |
| SAG2D | −1.45 | 0.54 ± 0.15 | 8.5 ± 4.5 | 4662 | 0.0842 |
| SAG2X | −2.22 | 0.45 ± 0.15 | 11.9 ± 4.4 | 455 | 0.0280 |
| SAG2Y | −1.59 | 0.52 ± 0.15 | 9.9 ± 4.6 | 2877 | 0.0525 |
aC-score is a confidence score for estimating the quality of predicted models by I-TASSER. C-score is typically in the range of [−5, 2], where a C-score of higher value signifies a model with a high confidence and vice versa.
bTM-score and RMSD are known standards for measuring structural similarity between two structures which are usually used to measure the accuracy of structure modeling when the native structure is known.
cNumber of decoys represents the number of structural decoys that are used in generating each model.
dCluster density represents the density of cluster.
Figure 4The 3D models of SAG2C, -2D, -2X, and -2Y. The sequences of proteins were sent to Dr. Zhang's lab from the website http://zhanglab.ccmb.med.umich.edu/I-TASSER/. The 3D models with the highest score for each protein were selected. The models were viewed by VMD software, color method was secondary structure (yellow: β-strands, purple: α-helix, gray: coil), and draw method was new cartoon. The domain of each model was shown out in sheet form.
Prediction binding site residues in the model.
| Protein model | C-scoreLB a | TM-scoreb | RMSDc | IDENd | Cov.e | BS-scoref | Lig. name | Predicted binding site residues in the model |
|---|---|---|---|---|---|---|---|---|
| SAG2C | 0.09 | 0.350 | 7.07 | 0.031 | 0.604 | 0.84 | MAL | 222, 223, 224, 225, 226, 227, 273, 279 |
| SAG2D | 0.11 | 0.435 | 4.90 | 0.043 | 0.668 | 0.75 | ANP | 96, 153, 155 |
| SAG2X | 0.06 | 0.388 | 5.94 | 0.053 | 0.597 | 0.79 | NA | 261, 292, 326 |
| SAG2Y | 0.07 | 0.412 | 6.36 | 0.057 | 0.687 | 0.77 | FES | 144, 145, 146, 147, 149, 150, 151, 153, 243 |
aC-scoreLB is the confidence score of predicted binding site. C-scoreLB values range between [0-1], where a higher score indicates a more reliable ligand-binding site prediction.
bTM-score is a measure of global structural similarity between query and template protein.
cRMSD is the RMSD between residues that are structurally aligned by TM-align.
dIDEN is the percentage sequence identity in the structurally aligned region.
eCov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.
fBS-score is a measure of local similarity (sequence and structure) between template binding site and predicted binding site in the query structure.
Similarity between SAG2C, -2D, -2X, -2Y and top identified structural analogs BSR4 protein.
| Protein model | TM-scorea | RMSDb | IDENc | Cov.d |
|---|---|---|---|---|
| SAG2A | 0.583 | 1.20 | 0.244 | 0.596 |
| SAG2B | 0.661 | 2.42 | 0.167 | 0.755 |
| SAG2C | 0.673 | 1.68 | 0.179 | 0.698 |
| SAG2D | 0.670 | 2.85 | 0.194 | 0.763 |
aTM-score of the structural alignment between the query structure and known structures in the PDB library.
bRMSD is the RMSD between residues that are structurally aligned by TM-align.
cIDEN is the percentage sequence identity in the structurally aligned region.
dCov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein.
Figure 5Fit analyses of SAG2C, -2D, -2X, and -2Y. Swiss-Pdb Viewer was used to show the fitness. SAG2C and SAG2X are in yellow color; SAG2D and SAG2Y are in blue color.
Predicted HLA restricted CD8+ T cell epitopes for SAG2C, -2D of T. gondii.
| Allele | Sequence | Pecentile rank | Method used | Location | |
|---|---|---|---|---|---|
| SAG2C | SAG2D | ||||
| HLA-A*0201 | VVLGSAFMI | 3.4 | Consensus ( | 342–350 | 174–182 |
| HLA-A*0201 | FMIAFISCF | 1.4 | Consensus ( | 348–356 | 180–188 |
| HLA-A*0201 | AFISCFALV | 2.8 | Consensus ( | 351–359 | 183–191 |
| HLA-A*1101 | QVTVAVTSK | 1.45 | Consensus ( | 206–214 | 38–46 |
| HLA-A*1101 | SSPQNIFYK | 0.2 | Consensus ( | 290–298 | 122–130 |
| HLA-A*1101 | QVGTQTECK | 2.35 | Consensus ( | 308–316 | 140–148 |
| HLA-A*1101 | KVLINIEEK | 1 | Consensus ( | 316–324 | 148–156 |
| HLA-B*0702 | LPSSPQNIF | 1.1 | Consensus ( | 288–296 | 120–128 |
| HLA-B*0702 | KPEAETPAT | 2.6 | Consensus ( | 324–332 | 156–164 |
Predicted HLA-restricted CD8+ T cell epitopes for SAG2X, -2Y of T. gondii.
| Allele | Sequence | Pecentile rank | Method used | Location | |
|---|---|---|---|---|---|
| SAG2C | SAG2D | ||||
| HLA-A*0201 | ALVPNSSLV | 1.6 | Consensus ( | 260–268 | 204–212 |
| HLA-A*0201 | VLSSSFMIV | 1.2 | Consensus ( | 346–354 | 290–298 |
| HLA-A*1101 | ALAITSTTK | 1.6 | Consensus (ANN, SMM) | 208–216 | 152–160 |
| HLA-A*1101 | SSAQTFFYK | 0.1 | Consensus ( | 289–297 | 234–242 |
| HLA-A*1101 | KVLISVEKR | 2.75 | Consensus ( | 311–319 | 263–271 |
| HLA-B*0702 | LPSSAQTFF | 3 | Consensus ( | 285–293 | 232–240 |
| HLA-B*0702 | RPDSDATAT | 2.2 | Consensus ( | 314–322 | 271–279 |
Figure 6Identifying HLA-restricted epitopes on the surface of 3D models. The predicted HLA-restricted epitopes sequences shown in Tables 2 and 3 were marked out on the surface of 3D models of SAG2C, -2D, -2X, and -2Y. The 3D structures of proteins were shown using Surf method. Red color balls stand for epitopes restricted by HLA-A*1101, green color balls stand for epitopes restricted by HLA-A*0201, and blue color balls stand for epitopes restricted by HLA-B*0702.