| Literature DB >> 23517638 |
Ankur Gautam1, Kumardeep Chaudhary, Rahul Kumar, Arun Sharma, Pallavi Kapoor, Atul Tyagi, Gajendra P S Raghava.
Abstract
BACKGROUND: Cell penetrating peptides have gained much recognition as a versatile transport vehicle for the intracellular delivery of wide range of cargoes (i.e. oligonucelotides, small molecules, proteins, etc.), that otherwise lack bioavailability, thus offering great potential as future therapeutics. Keeping in mind the therapeutic importance of these peptides, we have developed in silico methods for the prediction of cell penetrating peptides, which can be used for rapid screening of such peptides prior to their synthesis.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23517638 PMCID: PMC3615965 DOI: 10.1186/1479-5876-11-74
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Figure 1Amino acid composition comparison. Comparison of percent average amino acid composition of (a) whole peptides, (b) N-terminal residues, and (c) C-terminal residues between CPPs and non-CPPs.
Figure 2Sequence logo of first ten residues (N-terminus) of CPPs. The figure depicts the sequence logo of first ten residues (N-terminus) of CPPs, where size of residue is proportional to its propensity.
Figure 3Sequence logo of last ten residues (C-terminus) of CPPs. The figure depicts the sequence logo of last ten residues (C-terminus) of CPPs, where size of residue is proportional to its propensity.
Performance of composition-based SVM method
| 89.12 | 91.81 | 90.47 | 0.81 | 0.96 | |
| 92.51 | 88.24 | 90.37 | 0.81 | 0.96 | |
| 70.59 | 67.38 | 68.98 | 0.38 | 0.73 |
Performance of dipeptide-based SVM method
| 88.14 | 91.95 | 90.04 | 0.80 | 0.95 | |
| 90.91 | 94.65 | 92.78 | 0.86 | 0.97 | |
| 72.73 | 61.50 | 67.11 | 0.34 | 0.71 |
Performance of binary profile-based SVM method
| 80.08 | 85.73 | 82.91 | 0.66 | 0.89 | 86.63 | 87.17 | 86.90 | 0.74 | 0.90 | 62.03 | 65.78 | 63.90 | 0.28 | 0.64 | |
| 84.60 | 82.20 | 83.40 | 0.67 | 0.91 | 91.44 | 82.35 | 86.90 | 0.74 | 0.95 | 64.17 | 67.38 | 65.78 | 0.32 | 0.66 | |
| 83.19 | 88.98 | 86.09 | 0.72 | 0.96 | 91.98 | 82.35 | 87.17 | 0.75 | 0.95 | 66.84 | 66.84 | 66.84 | 0.34 | 0.69 | |
| 83.95 | 86.19 | 85.03 | 0.70 | 0.91 | 89.44 | 90.34 | 89.87 | 0.80 | 0.95 | 66.67 | 63.27 | 65.05 | 0.30 | 0.65 | |
| 86.55 | 83.22 | 84.95 | 0.70 | 0.93 | 87.04 | 91.10 | 88.96 | 0.78 | 0.95 | 66.05 | 61.90 | 64.08 | 0.28 | 0.68 | |
| 90.60 | 86.89 | 88.81 | 0.78 | 0.95 | 93.21 | 93.84 | 93.51 | 0.87 | 0.96 | 66.67 | 64.63 | 65.70 | 0.31 | 0.68 | |
Sn: sensitivity, Sp: specificity, AC: accuracy.
Performance of physicochemical properties-based SVM method
| 91.24 | 90.25 | 90.75 | 0.82 | 0.95 | |
| 91.98 | 89.84 | 90.91 | 0.82 | 0.95 | |
| 73.80 | 63.64 | 68.72 | 0.32 | 0.70 |
Performance of MEME/MAST-based SVM method
| 0.50 | 81.17 | 0.48 | 79.88 | 0.54 | 79.88 | |
| 0.50 | 74.40 | 0.48 | 74.71 | 0.56 | 74.71 | |
| 0.48 | 63.10 | 0.50 | 69.54 | 0.60 | 69.54 | |
| 0.5 | 54.97 | 0.53 | 62.64 | 0.63 | 62.64 | |
| 0.56 | 50 | 0.57 | 56.32 | 0.64 | 56.32 | |
| 0.64 | 45.03 | 0.62 | 52.87 | 0.65 | 52.87 | |
| 0.74 | 42.92 | 0.70 | 51.14 | 0.66 | 51.14 | |
| 0.83 | 39.46 | 0.83 | 48.28 | 0.66 | 48.28 | |
| 0.90 | 36.45 | 0.88 | 45.98 | 0.68 | 45.98 | |
PCP: Percentage of correct prediction.
Performance of hybrid method
| 91.90 | 93.88 | 92.85 | 0.86 | 0.97 | 98.15 | 96.58 | 97.40 | 0.95 | 0.99 | 80.86 | 76.87 | 78.96 | 0.58 | 0.86 | |
| 91.41 | 93.88 | 92.60 | 0.85 | 0.97 | 96.91 | 96.58 | 96.75 | 0.93 | 0.99 | 79.01 | 76.87 | 77.99 | 0.56 | 0.84 | |
| 91.25 | 93.88 | 92.51 | 0.85 | 0.97 | 95.68 | 96.58 | 96.10 | 0.92 | 0.99 | 76.54 | 76.87 | 76.70 | 0.53 | 0.83 | |
| 90.76 | 93.88 | 92.26 | 0.85 | 0.97 | 95.06 | 96.58 | 95.78 | 0.92 | 0.99 | 74.07 | 76.87 | 75.40 | 0.51 | 0.81 | |
| 89.63 | 93.88 | 91.67 | 0.83 | 0.97 | 94.44 | 96.58 | 95.45 | 0.91 | 0.98 | 71.60 | 76.87 | 74.11 | 0.48 | 0.79 | |
| 88.65 | 93.88 | 91.17 | 0.83 | 0.97 | 94.44 | 96.58 | 95.45 | 0.91 | 0.98 | 53.09 | 76.87 | 64.40 | 0.31 | 0.68 | |
| 88.17 | 93.88 | 90.92 | 0.82 | 0.96 | 94.44 | 96.58 | 95.45 | 0.91 | 0.98 | 53.09 | 76.87 | 64.40 | 0.31 | 0.68 | |
| 88.01 | 93.88 | 92.83 | 0.82 | 0.96 | 94.44 | 96.58 | 95.45 | 0.91 | 0.98 | 53.09 | 76.87 | 64.40 | 0.31 | 0.68 | |
| 87.52 | 93.88 | 90.58 | 0.81 | 0.96 | 94.44 | 96.58 | 95.45 | 0.91 | 0.98 | 70.59 | 67.38 | 68.98 | 0.38 | 0.73 | |
Sn: sensitivity, Sp: specificity, AC: accuracy.
Figure 4The performance of SVM models developed using composition, dipeptide and physicochemical property profile on CPPsite-1 dataset (where 1-specificity represents the false positive rate and value in bracket shows area under curve).
Figure 5The performance of SVM models developed using composition and hybrid models on CPPsite-1 dataset (where 1-specificity represents the false positive rate and value in bracket shows area under curve).
Comparison with previous methods
| 95.94 | 96.40 | 98.65 | 97.75 | |
| 75.86 | 82.07 | 83.45 | 83.45 | |
| 88.73 | 88.74 | 89.64 | 90.09 | |
| 83.16 | 81.63 | 81.63 | 83.33 | |
| 67.44 | 78.82 | 83.53 | 80.00 | |
| 77.27 | 92.75 | 95.65 | 97.06 | |
Figure 6Schematic presentation of CellPPD webserver with an example of SVM based prediction results.