| Literature DB >> 30619195 |
Akanksha Rajput1, Manoj Kumar1.
Abstract
Flaviviruses are arboviruses, which comprises more than 70 viruses, covering broad geographic ranges, and responsible for significant mortality and morbidity globally. Due to the lack of efficient inhibitors targeting flaviviruses, the designing of novel and efficient anti-flavi agents is an important problem. Therefore, in the current study, we have developed a dedicated prediction algorithm anti-flavi, to identify inhibition ability of chemicals and peptides against flaviviruses through quantitative structure-activity relationship based method. We extracted the non-redundant 2168 chemicals and 117 peptides from ChEMBL and AVPpred databases, respectively, with reported IC50 values. The regression based model developed on training/testing datasets of 1952 chemicals and 105 peptides displayed the Pearson's correlation coefficient (PCC) of 0.87, 0.84, and 0.87, 0.83 using support vector machine and random forest techniques correspondingly. We also explored the peptidomimetics approach, in which the most contributing descriptors of peptides were used to identify chemicals having anti-flavi potential. Conversely, the selected descriptors of chemicals performed well to predict anti-flavi peptides. Moreover, the developed model proved to be highly robust while checked through various approaches like independent validation and decoy datasets. We hope that our web server would prove a useful tool to predict and design the efficient anti-flavi agents. The anti-flavi webserver is freely available at URL http://bioinfo.imtech.res.in/manojk/antiflavi.Entities:
Keywords: QSAR; flaviviruses; inhibitor; machine learning techniques; peptidomimetics; prediction algorithm; random forest; support vector machine
Year: 2018 PMID: 30619195 PMCID: PMC6305493 DOI: 10.3389/fmicb.2018.03121
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Performance of training/testing and independent validation data sets of anti-flavi chemicals and peptides on 10-fold cross validation using Support Vector Machine and Random Forest techniques.
| Training/testing | Independent validation | ||||||
|---|---|---|---|---|---|---|---|
| Data | Descriptors | Features | MLTs | PCC | Data set | PCC | Data set |
| Chemicals | 16383 | 124 | SVM | 0.87 | TESTSET = 1952 | 0.87 | TESTSET = 216 |
| Peptides | 16383 | 19 | SVM | 0.84 | TESTSET = 105 | 0.84 | TESTSET = 12 |
| Chemicals | 16383 | 124 | RF | 0.87 | TESTSET = 1952 | 0.86 | TESTSET = 216 |
| Peptides | 16383 | 19 | RF | 0.83 | TESTSET = 105 | 0.86 | TESTSET = 12 |
Table depicting the performance of swapped most-contributing features of chemicals and peptides over each other during 10-fold cross validation employing support vector machine.
| Data | Descriptors | Features | PCC | Dataset |
|---|---|---|---|---|
| Chemicals | 16383 | 19 | 0.74 | TESTSET = 2168 |
| Peptides | 16383 | 124 | 0.53 | TESTSET = 117 |
| Chemicals | 16383 | 143 | 0.87 | TESTSET = 2168 |
| Peptides | 16383 | 143 | 0.83 | TESTSET = 117 |