| Literature DB >> 20961417 |
Hifzur Rahman Ansari1, Gajendra Ps Raghava.
Abstract
BACKGROUND: One of the major challenges in the field of vaccine design is to predict conformational B-cell epitopes in an antigen. In the past, several methods have been developed for predicting conformational B-cell epitopes in an antigen from its tertiary structure. This is the first attempt in this area to predict conformational B-cell epitope in an antigen from its amino acid sequence.Entities:
Year: 2010 PMID: 20961417 PMCID: PMC2974664 DOI: 10.1186/1745-7580-6-6
Source DB: PubMed Journal: Immunome Res ISSN: 1745-7580
Figure 1Feature extraction for a 19 window length pattern. Antibody interacting residues are marked in red e.g. S/T, Positive pattern shaded in green where S is at the center with 9 neighboring residues on either side, other overlapping negative patterns are shown in blue. a) Creation of 19 window overlapping patterns from amino acid sequence, b) generation of binary profile of pattern (BPP), c) generation of physico-chemical profile (PPP) and d) generation of composition profile of pattern (CPP).
Figure 2Comparison of amino acid composition of antibody interacting residues (B-cell epitope) and non-interacting residues (non-epitope).
The performance of BPP based SVM model developed using different window lengths from 5 to 21 residues
| Window size | Kernel parameters | Thr* | Sen | Spe | Acc | MCC |
|---|---|---|---|---|---|---|
| t 2 g 0.01 j 1 c 10 | 0.1 | 58.38 | 58.55 | 58.47 | 0.17 | |
| t 2 g 0.01 j 1 c 1 | 0.1 | 55.87 | 59.81 | 57.84 | 0.16 | |
| t 2 g 0.01 j 1 c 1 | 0.1 | 55.66 | 58.85 | 57.26 | 0.15 | |
| t 2 g 0.001 j 1 c 10 | 0 | 61.55 | 56.99 | 59.27 | 0.19 | |
| t 2 g 0.1 j 1 c 1 | 0 | 62.58 | 59.09 | 60.84 | 0.22 | |
| t 2 g 0.1 j 1 c 10 | 0 | 59.93 | 57.63 | 58.78 | 0.18 | |
| t 2 g 0.001 j 1 c 10 | 0 | 58.37 | 57.18 | 57.78 | 0.16 | |
| t 2 g 0.001 j 1 c 10 | 0.1 | 52.92 | 63.78 | 58.35 | 0.17 | |
| t 2 g 0.001 j 1 c 10 | 0 | 59.69 | 57.22 | 58.45 | 0.17 |
*(Thr- Threshold, Sen - Sensitivity, Spe - Specificity, Acc - Accuracy, MCC - Matthew's correlation coefficient)
The performance of PPP based SVM model developed different window lengths from 5 to 21 residues
| W | Kernel parameters | Thr* | Sen | Spe | Acc | MCC |
|---|---|---|---|---|---|---|
| t 2 g 0.00001 j 1 c 10 | -0.3 | 53.95 | 59.62 | 56.78 | 0.14 | |
| t 2 g 0.00001 j 1 c 10 | 0.1 | 55.82 | 58.03 | 56.93 | 0.14 | |
| t 2 g 0.00001 j 1 c 10 | 0 | 54.56 | 55.84 | 55.2 | 0.1 | |
| t 2 g 0.00001 j 1 c 10 | 0.1 | 52.3 | 62.48 | 57.39 | 0.15 | |
| t 2 g 0.00001 j 1 c 10 | 0.1 | 55.11 | 60.37 | 57.74 | 0.16 | |
| t 2 g 0.00001 j 1 c 10 | 0 | 56.57 | 60.06 | 58.31 | 0.17 | |
| t 2 g 0.00001 j 1 c 10 | 0 | 60.19 | 55.77 | 57.98 | 0.16 | |
| t 2 g 0.00001 j 1 c 10 | 0 | 57.82 | 54.15 | 55.98 | 0.12 | |
| t 1 d 1 | 0 | 57.31 | 58.32 | 57.81 | 0.16 |
The performance SVM models developed using composition profile of patterns at different window lengths
| Window size | Kernel parameters | Thr* | Sen | Spe | Acc | MCC |
|---|---|---|---|---|---|---|
| t 2 g 0.001 j 1 c 1 | 0 | 61.75 | 58.11 | 59.93 | 0.2 | |
| t 2 g 0.001 j 1 c 10 | 0 | 68.35 | 62.2 | 65.27 | 0.31 | |
| t 2 g 0.001 j 1 c 10 | 0 | 73.45 | 67.21 | 70.33 | 0.41 | |
| t 2 g 0.01 j 1 c 1 | -0.1 | 82.08 | 77.26 | 79.67 | 0.59 | |
| t 2 g 0.01 j 1 c 10 | -0.1 | 82.57 | 84.17 | 83.37 | 0.67 | |
| t 2 g 0.01 j 1 c 1 | -0.1 | 79.96 | 90.31 | 85.14 | 0.71 | |
| t 2 g 0.01 j 1 c 1 | -0.1 | 80.69 | 90.1 | 85.4 | 0.71 | |
| t 2 g 0.01 j 1 c 1 | -0.1 | 83.62 | 88.96 | 86.29 | 0.73 |
Figure 3The performance of SVM models developed using composition, binary and physic-chemical property profile.
The performance of BPP and CPP based SVM model on Benchmark dataset, developed using balance and realistic set of patterns.
| Type of Pattern set | Model | SVM parameters | Thr* | Sen | Spe | Acc | MCC |
|---|---|---|---|---|---|---|---|
| BPP | t 2 g 0.001 j 10 c 10 | -0.2 | 50.49 | 60.28 | 59.49 | 0.06 | |
| CPP | t 2 g 0.001 j 10 c 10 | -0.3 | 80.41 | 84.64 | 84.30 | 0.44 | |
| BPP | t 2 g 0.01 j 1 c 10 | 0.1 | 61.31 | 51.22 | 56.27 | 0.13 | |
| CPP | t 2 g 0.01 j 1 c 10 | 0 | 82.36 | 89.42 | 85.89 | 0.72 | |
Models were developed using window size 19.
Figure 4The performance of SVM models on Benchmark dataset as shown by ROC plot.
Overall performance of structure based and CBTOPE algorithms on benchmark dataset
| Evaluation parameter | ProMate | PSI-PRED best patch | Patch Dock | ClusPro (DOT) best model | CEP | DiscoTope (-7.7) | CBTOPE* (This Study) |
|---|---|---|---|---|---|---|---|
| 0.09 | 0.33 | 0.43 | 0.45 | 0.31 | 0.42 | ||
| 0.08 | 0.14 | 0.11 | 0.07 | 0.22 | 0.21 | ||
| 0.10 | 0.19 | 0.26 | 0.39 | 0.11 | 0.16 | ||
| 0.84 | 0.82 | 0.85 | 0.89 | 0.74 | 0.75 | ||
| 0.51 | 0.60 | 0.66 | 0.69 | 0.54 | 0.60 |
*(Thr- Threshold, Sen - Sensitivity, Spe - Specificity, Acc - Accuracy, PPV - positive predictive value)