| Literature DB >> 25705652 |
Pingping Sun1, Haixu Ju2, Baowen Zhang2, Yu Gu2, Bo Liu3, Yanxin Huang4, Huijie Zhang2, Yuxin Li4.
Abstract
Identification of epitopes which invokes strong humoral responses is an essential issue in the field of immunology. Various computational methods that have been developed based on the antigen structures and the mimotopes these years narrow the search for experimental validation. These methods can be divided into two categories: antigen structure-based methods and mimotope-based methods. Though new methods of the two kinds have been proposed in these years, they cannot maintain a high degree of satisfaction in various circumstances. In this paper, we proposed a new conformational B-cell epitope prediction method based on antigen preprocessing and mimotopes analysis. The method classifies the antigen surface residues into "epitopes" and "nonepitopes" by six epitope propensity scales, removing the "nonepitopes" and using the preprocessed antigen for epitope prediction based on mimotope sequences. The proposed method gives out the mean F score of 0.42 on the testing dataset. When compared with other publicly available servers by using the testing dataset, the new method yields better performance. The results demonstrate the proposed method is competent for the conformational B-cell epitope prediction.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25705652 PMCID: PMC4326220 DOI: 10.1155/2015/257030
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The detailed information of the testing dataset.
| PDB_ID | Template chain | Target | Mimotope size | Reference |
|---|---|---|---|---|
| Antigen-antibody complex | ||||
| 3IU3 | I | Basiliximab | 6 ∗ 9 | 17440057 |
| 1YY9 | A | Cetuximab | 4 ∗ 12 | 16288119 |
| 1N8Z | C | Herceptin | 5 ∗ 12 | 15210798 |
| 2ADF | A | 82D6A3, IgG | 3 ∗ 8 | 12855711 |
| 1IQD | C | Anti-coagulation factor VIII monoclonal antibody BO2C11 | 27 ∗ 12 | 12676786 |
| 2GHW | A | 80R | 18 ∗ 15 | 16630634 |
| 2NY7 | G | B12 | 17 ∗ 14, 1 ∗ 10, 1 ∗ 13 | 16940148 |
| 1G9M | G | Anti-gp120 monoclonal antibody 17b | 1 ∗ 10, 10 ∗ 12 | 14596802 |
| 1E6J | P | 13b5 | 14 ∗ 14, 2 ∗ 7 | 14596802 |
| 1ZTX | E | E16 | 3 ∗ 13, 19 ∗ 14 | 18760481 |
| 2AJF | A | SARS-coronavirus spike protein | 18 ∗ 15 | 1116480 |
| 1BJ1 | W | rhuMAb | 36 ∗ 6, 3 ∗ 5, 2 ∗ 4 | 10543973 |
| 1JRH | I | A6, IgG1 | 59 ∗ 5 | 11123892 |
|
| ||||
| Protein-protein | ||||
| 1AVZ | B | Fyn | 8 ∗ 10,10 ∗ 12 | 7988556 |
| 1HX1 | B | Heat shock cognate 71 kDa protein | 8 ∗ 15 | 7649995 |
| 2GSK | A | Protein TONB | 6 ∗ 9 | 16414071 |
| 3EZE | B | Protein (phosphotransferase system, HPR) | 11 ∗ 6 | 10048929 |
| 1II4 | A | Fibroblast growth factor receptor 2 | 30 ∗ 7 | 12032665 |
Figure 1The algorithm flowchart of the method.
The prediction results on testing datasets.
| PDB_ID | MimoPro/this method | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| True epitopes | Predicted epitope | Sen | Spe | PPV | MCC | (Sen + Spe)/2 | ACC |
| |
| Antigen-antibody interactions | |||||||||
| 3IU3_I | 23 | 34/32 | 0.61/0.61 | 0.75/0.78 | 0.41/0.44 | 0.18/0.19 | 0.68/0.69 | 0.72/0.74 | 0.49/0.51 |
| 1YY9_A | 15 | 43/18 | 0.00/0.00 | 0.91/0.96 | 0.00/0.00 | −0.01/−0.01 | 0.45/0.48 | 0.88/0.93 | 0.00/0.00 |
| 1N8Z_C | 20 | 38/36 | 0.90/0.85 | 0.95/0.95 | 0.47/0.47 | 0.14/0.13 | 0.93/0.90 | 0.95/0.95 | 0.62/0.61 |
| 2ADF_A | 15 | 24/20 | 0.87/0.87 | 0.90/0.94 | 0.54/0.65 | 0.23/0.25 | 0.89/0.90 | 0.90/0.93 | 0.67/0.74 |
| 1IQD_C | 16 | 40/37 | 0.56/0.50 | 0.68/0.70 | 0.23/0.22 | 0.08/0.07 | 0.62/0/60 | 0.66/0.67 | 0.32/0.30 |
| 2GHW_A | 29 | 38/37 | 0.48/0.48 | 0.80/0.81 | 0.37/0.38 | 0.13/0.14 | 0.64/0.64 | 0.73/0.74 | 0.42/0.42 |
| 2NY7_G | 25 | 30/29 | 0.12/0.12 | 0.87/0.87 | 0.10/0.10 | 0.00/0.00 | 0.49/0.50 | 0.78/0.79 | 0.11/0.11 |
| 1G9M_G | 15 | 47/46 | 0.73/0.73 | 0.83/0.84 | 0.23/0.24 | 0.10/0.10 | 0.78/0.79 | 0.83/0.83 | 0.35/0.36 |
| 1E6J_P | 11 | 45/43 | 0.73/0.73 | 0.77/0.79 | 0.18/0.19 | 0.08/0.08 | 0.75/0.76 | 0.77/0.78 | 0.29/0.30 |
| 1ZTX_E | 14 | 39/37 | 0.10/0.10 | 0.63/0.66 | 0.36/0.38 | 0.24/0.24 | 0.81/0.83 | 0.69/0.72 | 0.53/0.55 |
| 2AJF_A | 20 | 43/39 | 0.10/0.10 | 0.90/0.91 | 0.05/0.05 | 0.00/0.00 | 0.50/0.50 | 0.86/0.87 | 0.06/0.07 |
| 1BJ1_W | 19 | 32/32 | 0.68/0.68 | 0.72/0.72 | 0.41/0.41 | 0.19/0.19 | 0.70/0.70 | 0.72/0.72 | 0.51/0.51 |
| 1JRH_I | 21 | 31/30 | 0.95/0.90 | 0.82/0.82 | 0.65/0.63 | 0.38/0.36 | 0.88/0.86 | 0.85/0.84 | 0.77/0.75 |
|
| |||||||||
| Protein-protein interactions | |||||||||
| 1AVZ_B | 16 | 32/31 | 0.69/0.69 | 0.71/0.72 | 0.34/0.35 | 0.16/0.17 | 0.70/0.70 | 0.70/0.72 | 0.46/0.47 |
| 1HX1_B | 20 | 38/38 | 0.75/0.75 | 0.69/0.69 | 0.39/0.39 | 0.20/0.20 | 0.72/0.72 | 0.70/0.70 | 0.52/0.52 |
| 2GSK_A | 33 | 40/37 | 0.21/0.21 | 0.93/0.93 | 0.18/0.19 | 0.04/0.04 | 0.57/0.57 | 0.88/0.88 | 0.19/0.20 |
| 3EZE_B | 20 | 27/26 | 0.75/0.70 | 0.74/0.74 | 0.56/0.54 | 0.29/0.27 | 0.75/0.72 | 0.75/0.73 | 0.64/0.61 |
| 1II4_A | 30 | 41/39 | 0.60/0.60 | 0.69/0.72 | 0.44/0.46 | 0.18/0.19 | 0.64/0.66 | 0.66/0.68 | 0.51/0.52 |
|
| |||||||||
| Average | 0.60/0.58 | 0.79/0.81 | 0.33/0.34 | 0.14/0.15 | 0.70/0.70 | 0.78/0.79 | 0.41/0.43 | ||
Figure 2Sensitivity versus 1− specificity scores of the method on testing dataset.
The prediction performance of SVM and RF with different parameters.
| Different parameters | Sen | Spe | PPV | MCC | (Sen + Spe)/2 | ACC |
|
|---|---|---|---|---|---|---|---|
| RF | |||||||
|
| 0.58 | 0.81 | 0.34 | 0.15 | 0.70 | 0.79 | 0.43 |
|
| 0.56 | 0.83 | 0.34 | 0.14 | 0.69 | 0.80 | 0.42 |
|
| 0.53 | 0.83 | 0.34 | 0.14 | 0.68 | 0.80 | 0.40 |
| LibSVM | |||||||
| Blocked | 0.58 | 0.80 | 0.32 | 0.14 | 0.69 | 0.79 | 0.42 |
| Weight | 0.57 | 0.81 | 0.33 | 0.14 | 0.69 | 0.79 | 0.41 |
| Blocked and weight | 0.58 | 0.81 | 0.33 | 0.14 | 0.69 | 0.79 | 0.42 |
Figure 3The sensitivity of each method on the testing dataset.
Figure 4The PPV of each method on the testing dataset.
Figure 5The ACC of each method on the testing dataset.
Figure 6The F of each method on the testing dataset.
The overall performance of the compared methods on testing dataset.
| Methods | Sen | Spe | PPV | MCC | (Sen + Spe)/2 | ACC |
|
|---|---|---|---|---|---|---|---|
| Pep-3D-Search | 0.48 | 0.78 | 0.29 | 0.08 | 0.63 | 0.75 | 0.35 |
| EpiSearch | 0.31 | 0.89 | 0.28 | 0.09 | 0.60 | 0.70 | 0.19 |
| PepSurf | 0.36 | 0.86 | 0.26 | 0.07 | 0.61 | 0.79 | 0.31 |
| This method | 0.58 | 0.81 | 0.33 | 0.14 | 0.69 | 0.79 | 0.42 |