| Literature DB >> 36158882 |
Divye Singh1, Avani Mahadik1, Shraddha Surana1, Pooja Arora1.
Abstract
Viruses remain an area of concern despite constant development of antiviral drugs and therapies. One of the contributors is the Flaviviridae family of viruses causing diseases that need attention. Among other anitviral methods, antiviral peptides are being studied as viable candidates. Although antiviral peptides (AVPs) are emerging as potential therapeutics, it is important to assess the efficacy of a given peptide in terms of its bioactivity. Experimental identification of the bioactivity of each potential peptide is an expensive and time consuming task. Computational methods like proteochemometric modeling (PCM) is a promising method for prediction of bioactivity (pIC50) based on peptide and target sequence pair. In this study, we propose a prediction of pIC50 of AVP against the Flaviviridae family that may help make the decision to choose a peptide with desired efficacy. The peptides data was collected from a public database and target sequences were manually curated from literature. Features are calculated using peptide and target sequence PCM descriptors which consist of individual and cross-term features of peptide and respective target. The resultant R 2 and MAPE values are 0.85 and 8.44%, respectively, for prediction of pIC50 value of AVPs.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36158882 PMCID: PMC9499780 DOI: 10.1155/2022/7901791
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Figure 1PCM model flowchart. Illustration of the descriptors and multiple combinations of descriptor groups used to predict the pIC50 using Random Forest.
Figure 2Distribution of IC50 and pIC50 values.
Range of values used to tune hyperparameters for Random Forest.
| Hyperparameter | Ranges |
|---|---|
| n_estimators | 100, 200, 250, 300, 500, 1000, 1500 |
| min_samples_split | 2, 5, 10 |
| min_samples_leaf | 1, 2, 4 |
| max_features | auto, sqrt, log2 |
| max_depth | 2, 3, 5, 10, 15, 20, None |
| bootstrap | True, False |
Actual and predicted pIC50 values for example sequences.
| Sequence | Actual pIC50 value | Predicted pIC50 value |
|---|---|---|
| AFLGWIGAIVSTALPQWR | 11.289 | 11.261 |
| ACFPWGNTWCGGK | 11.250 | 11.262 |
| MANAGLQLLGFILAFLGWIGAI | 12.429 | 11.224 |
| RWMVWRHWFHRLRLPYNPGK NKQNQQWP | 11.736 | 11.246 |
| AAQRRGRIGRNPSQVGD | 7.934 | 8.158 |
| RTGRGRRGIYR | 10.271 | 11.294 |
| GELGRLVYLLDGPGYDPIHCSL AYGDASTLVVF | 17.678 | 19.780 |
Results of obtained models.
| Descriptor combinations |
| MAPE | PCC | MSE |
|---|---|---|---|---|
| PP | 0.48 | 14.04 | 0.72 | 6.88 |
| PP+TP | 0.72 | 11.46 | 0.84 | 3.70 |
| PZ+TZ | 0.72 | 11.32 | 0.85 | 3.63 |
| PZ+TZ+XZ | 0.76 | 9.48 | 0.87 | 3.14 |
| PP+TP and PZ+TZ+XZ |
|
|
|
|