| Literature DB >> 24753488 |
Konrad Krawczyk1, Xiaofeng Liu1, Terry Baker1, Jiye Shi2, Charlotte M Deane1.
Abstract
MOTIVATION: Antibodies are currently the most important class of biopharmaceuticals. Development of such antibody-based drugs depends on costly and time-consuming screening campaigns. Computational techniques such as antibody-antigen docking hold the potential to facilitate the screening process by rapidly providing a list of initial poses that approximate the native complex.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24753488 PMCID: PMC4207425 DOI: 10.1093/bioinformatics/btu190
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Left: Example of a case when intramolecular distances can provide information about which intermolecular contacts can exist. The antibody–antigen contacts between Tyr-22 and Lys-27 and Gly-56 and Lys-34 (blue dashes) can exist as the intramolecular distance between Tyr-22 and Gly-56 is 9.4 Å and the distance between the two Lys residues is 10.2 Å. The difference between those two intramolecular distances is 0.8 Å, which is below the cutoff of 1 Å. As a counterexample, the contacts between Tyr-22 and Lys-27 and Asp-102 and Lys-34 (black dashes) cannot be satisfied simultaneously because the intramolecular distance between Tyr-22 and Asp-102 is 17.5 Å. Right: The top epitope prediction for the antigen 1boy (human tissue factor, the unbound form of the antigen complexed in 1ahw in H-test). The prediction consists of a set of residues, which are considered to constitute the general area of the epitope. The true positives are shown in green, false positives in teal, false negatives in red and true negatives in dark blue. This prediction achieved 36% precision and 94% recall. (The target comes from the dataset H-test, and thus, the antibody used in the prediction was a homology model and the corresponding antigen was in the unbound form)
Table summarizing the results of epitope prediction on the X-test set
| PDB | Ag size | Epitope prediction | Random | ||||||
|---|---|---|---|---|---|---|---|---|---|
| EpiPred | DiscoTope 2.0 | ||||||||
| Precision (%) | Recall (%) | MCC | Precision (%) | Recall (%) | MCC | Precision (%) | Recall (%) | ||
| 4hj0 | 92 | 0.27 | 0 | 0 | 0.0 | 29 | 50 | ||
| 1tzh | 94 | 1 | 6 | 0.04 | 0.72 | 12 | 25 | ||
| 4am0 | 96 | 0.09 | 33 | 20 | 0.19 | 14 | 60 | ||
| 2ih3 | 97 | 0.08 | 0 | 0 | 0.0 | 15 | 27 | ||
| 4i77 | 97 | 0.0 | 0 | 0 | 0.0 | 21 | 31 | ||
| 3q1s | 113 | 0.15 | 0 | 0 | 0.0 | 20 | 37 | ||
| 1p2c | 129 | 0 | 0 | 0.0 | 0.0 | 39 | 32 | ||
| 4ht1 | 131 | 0.05 | 0 | 0 | 0.0 | 28 | 44 | ||
| 3ab0 | 136 | 0.34 | 0 | 0 | 0.0 | 33 | 52 | ||
| 1v7m | 145 | 0.29 | 0 | 0 | 0.0 | 9 | 16 | ||
| 4g3y | 148 | 3 | 8 | 0.04 | 0.33 | 11 | 25 | ||
| 2vxt | 156 | 4 | 9 | 0.04 | 0.3 | 14 | 23 | ||
| 3u9p | 169 | 0.47 | 6 | 5 | 0.0 | 8 | 15 | ||
| 3o2d | 178 | 0.28 | 0 | 0 | 0.0 | 9 | 16 | ||
| 1fns | 196 | 0 | 0 | 0.0 | 0.0 | 33 | 11 | ||
| 3ma9 | 197 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 21 | 33 |
| 3rvv | 223 | 0.39 | 15 | 17 | 0.07 | 6 | 15 | ||
| 3raj | 230 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 24 | 21 |
| 1nfd | 239 | 7 | 23 | 0.04 | 0.75 | 10 | 15 | ||
| 3i50 | 273 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 1 | 6 |
| 3gjf | 276 | 0.2 | 5 | 11 | 0.05 | 6 | 15 | ||
| 3liz | 329 | 0.34 | 0 | 0 | 0.0 | 10 | 15 | ||
| 3pgf | 358 | 0.0 | 0 | 0 | 0.0 | 13 | 18 | ||
| 3zkm | 375 | 0.46 | 0 | 0 | 0.0 | 11 | 15 | ||
| 3r1g | 381 | 0.57 | 0 | 0 | 0.0 | 7 | 9 | ||
| 4jr9 | 409 | 19 | 85 | 0.34 | 0.46 | 4 | 11 | ||
| 4ene | 442 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 1 | 5 |
| 3o0r | 449 | 0.19 | 0 | 0 | 0.0 | 2 | 14 | ||
| 3t3p | 453 | 0 | 0 | 0.0 | 0.0 | 4 | 6 | ||
| 1n8z | 581 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 6 | 5 |
Note: We present the top EpiPred prediction and the corresponding results for DiscoTope 2.0 using a score threshold of −3.7. The values in bold indicate the best prediction result. Precision and recall were computed by the following formula: where TP stands for true positives, FP for false positives and FN for false negatives. In each case, we also give the Matthews correlation coefficient [MCC (Matthews, 1975)]. As control, the corresponding result using randomized score is given for each target.
Comparison of the specificity of EpiPred predictions evaluated on its capacity to distinguish between antibodies binding to lysozyme: epitope I (1a2y and 1jhl), epitope II (1p2c and 2iff) and epitope III (1j1x)
| Epitope | Prediction from | Epitope I | Epitope II | Epitope III | ||
|---|---|---|---|---|---|---|
| Evaluated on | PDB | 1a2y | 1jhl | 1p2c | 2iff | 1j1x |
| Epitope I | 1a2y | – | 0 | 0 | 0.19 | |
| 1jhl | – | 0 | 0 | 0 | ||
| Epitope II | 1p2c | 0.01 | 0.01 | – | 0 | |
| 2iff | 0 | 0 | – | 0 | ||
| Epitope III | 1j1x | 0.1 | 0 | 0 | 0 | |
Note: The row indicates the antibody for which the EpiPred prediction was performed and the column the antibody with respect to which the prediction was evaluated using MCC.
Fig. 2.Success rates of rescoring compared with the raw decoy lists given by the docking algorithms. We show results for each docking program (ZDOCK or ClusPro) on each test set (X-test or H-test) for top one, five and ten results. The leftmost bars are the number of times our global docking pipeline improved results. Bars that are second to left are the corresponding number of cases when including epitope information made the results worse. Bars that are second to right are the number of times including the epitope information did not change the raw result. The rightmost bars are the number of cases for which both procedures reported no close-to-native decoys. See Supplementary Section 6 for the per-complex information. (A) Success rate of ClusPro on dataset X-test. (B) Success rate of ZDOCK on dataset X-test. (C) Success rate of ClusPro on dataset H-test. (D) Success rate of ZDOCK on dataset H-test