| Literature DB >> 26636321 |
Yongtao Xu1,2, Shui Yu1, Jian-Wei Zou3, Guixiang Hu3, Noorsaadah A B D Rahman4,5, Rozana Binti Othman5,6, Xia Tao7, Meilan Huang1.
Abstract
The peptides derived from envelope proteins have been shown to inhibit the protein-protein interactions in the virus membrane fusion process and thus have a great potential to be developed into effective antiviral therapies. There are three types of envelope proteins each exhibiting distinct structure folds. Although the exact fusion mechanism remains elusive, it was suggested that the three classes of viral fusion proteins share a similar mechanism of membrane fusion. The common mechanism of action makes it possible to correlate the properties of self-derived peptide inhibitors with their activities. Here we developed a support vector machine model using sequence-based statistical scores of self-derived peptide inhibitors as input features to correlate with their activities. The model displayed 92% prediction accuracy with the Matthew's correlation coefficient of 0.84, obviously superior to those using physicochemical properties and amino acid decomposition as input. The predictive support vector machine model for self- derived peptides of envelope proteins would be useful in development of antiviral peptide inhibitors targeting the virus fusion process.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26636321 PMCID: PMC4670226 DOI: 10.1371/journal.pone.0144171
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Experimentally validated peptide inhibitors of E proteins.
| Active peptides | Non-active peptides | Ref | |
|---|---|---|---|
|
|
|
| [ |
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
| RIPTGERVWDRGNVTLLCDC |
| ||
| TLLCDCPNGPWVWVPAFCQA |
| ||
| WDRGNVTLLCDCPNGPWVWV |
| ||
| WVWVPAFCQAVGWGDPITHW |
| ||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
| TCILDRRPASCGTCVRDCWP | |||
| TEVSEALGGAGLTGGFYEPL | |||
| TGTFGFFPGVPPINNCMPLG | |||
| TKIRDSLHLVKCPTPAIEPP | |||
| TTPFTIRGPLGNQGRGNPVR | |||
| VGSASCTIAALGSSDRDTVV | |||
| VRRCSELMGRRNPVCPGYAW | |||
| VSVTCVWGSVSWFASTGGRD | |||
|
| [ | ||
|
| |||
|
| |||
| YTSLIHSLIEESQNQQEKNEQELLELD | |||
| WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL | |||
|
| [ | ||
|
|
|
| [ |
|
|
|
| |
| RQMRAWGQDYQHGGMGYSC |
| ||
|
| |||
|
| |||
| WHTVEPIVTEKDRPVNYEWE | |||
|
| [ | ||
|
|
| [ | |
| MVDRGWGNGCGLFGKGGIV | |||
| AWLVHTQWFLDLPLPWLPGADTQGSNWI | |||
| DENV-DET |
| [ | |
|
| |||
|
| |||
|
|
|
| [ |
|
| TFLVHREWFMDLNLPWSSA | ||
|
|
| [ | |
|
|
| ||
|
|
| ||
|
|
| ||
| CDVIALLCHLNTPSF |
| ||
|
| |||
|
| |||
|
| |||
|
| |||
| CDVIALLACHLNTPSF | |||
| CDVIALLECHLNT | |||
| DTRACDVIALLECHLNT | |||
|
|
| [ | |
|
|
| ||
|
| |||
| FFIFPNYTIVSDFGRPNAA | |||
|
|
|
| [ |
|
|
|
| |
|
|
| ||
| TTPKFTVAWDWVPKR |
| ||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
| PYKFKATMYYKDVTV | |||
| TVSTFIDLNITMLED | |||
| APTSPGTPGVAAATQ | |||
| AYQPLLSNTLAELYV | |||
| CIVEEVDARSVYPYD | |||
|
|
|
| [ |
|
|
|
| |
| AAHLIDALYAEFLGGRVLTT | |||
|
|
|
| [ |
|
|
|
| |
|
|
| ||
|
|
| ||
| STALLLFPNGTVIHLLAFDTQPVAAKKKK |
| ||
| KKSTALLLFPNGTVIHLLAFDTQPVAAKK | TVIHLLAFDTQPVAAIAPGFLAASA | ||
| SHVLTAPALTFNLTDFVPILALAGIQA | |||
|
|
|
| [ |
|
|
|
| |
|
|
| ||
| YAYSHQLSRADITTVSTFI | FVRGHTGFVYCYGYTGFPR | ||
|
|
|
| [ |
|
|
|
| |
|
|
| ||
| AGDNATVAAGHATLREHLRDIKAENTDAN |
| ||
|
| |||
|
| |||
|
| |||
| GENNELRLTRDAI | |||
| DVREEEQLGERATGLNLNI | |||
|
|
| [ | |
|
|
| ||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
| FARLQFTYNHIQRHVNDMLGRVAIAWCE | |||
| FARLQFTYNHIQRHVNDMLGRVKKAWEE | |||
| SIEFARLQFTYNHIQRHVNDMLGRVAIAWCELQNHE | |||
|
|
| [ | |
|
| |||
|
|
| [ | |
|
|
| ||
|
|
| [ | |
|
|
| ||
|
|
| ||
|
| |||
|
| |||
|
| |||
|
| |||
|
| |||
| MANAGLQLLGFILAFLGWIG | |||
| MANAGLQLLGFILAFLGWIGAI | |||
| MANAGLQLLGFILAFLGW | |||
|
|
| [ | |
|
|
| ||
|
| |||
|
| |||
| AACEVAKNLNESLIDLQELGKYEQYIKW | |||
|
| [ | ||
|
| |||
|
| |||
| IQKEIDRLNEVAKNLNESLI | |||
|
|
| [ | |
|
|
| ||
|
| |||
|
| |||
| SNNTIAIPTNFSISITTEVM | |||
| GIGVTQNVLYENQKQIANQF | |||
|
|
| [ | |
|
| |||
|
|
| [ | |
|
|
| ||
|
| |||
|
| |||
| LNLFKKTINGLISDSLVIR | |||
|
|
| [ | |
|
|
| ||
|
| |||
| YLYGIGSAVVSFAIKWEY |
* The sequences in bold were used in the 75p+75n training set; the rest sequences were used in the 26p+26n test set.
Performance of the AVPpred and EAPpred models training set V75p+75n.
| Data set | Model | Sensitivity | Specificity | Accuracy | MCC |
|---|---|---|---|---|---|
|
| EAPphysico | 79.37 | 71.26 | 74.67 | 0.5 |
|
| |||||
| EAPcompo | 66.99 | 87.23 | 73.33 | 0.5 | |
| EAPscoring | 100 | 92.59 | 96 | 0.92 | |
|
| EAPphysico | 80 | 72.94 | 76 | 0.52 |
|
| |||||
| EAPcompo | 94.67 | 94.67 | 94.67 | 0.89 | |
| EAPscoring | 100 | 97.4 | 98.67 | 0.97 |
Performance of AVPpred and EAPpred models on independent test set V26p+26n.
| Model | Features | Sensitivity | Specificity | Accuracy | MCC |
|---|---|---|---|---|---|
|
| AVPmotif | 100 | 50.98 | 51.92 | 0.14 |
| AVPphysico | 72.22 | 61.76 | 65.38 | 0.32 | |
| AVPcompo | 63.16 | 57.58 | 59.62 | 0.20 | |
| AVPalign | 92.86 | 65.79 | 73.08 | 0.52 | |
|
| EAPphysico | 68.18 | 63.33 | 65.38 | 0.31 |
| EAPcompo | 72.41 | 78.26 | 75 | 0.5 | |
| EAPscoring | 92.3 | 92.3 | 92.3 | 0.84 |
Fig 1Feature ranking of the EAPcompo model.
X-axis is the type of amino acid, Y-axis is W * W.