| Literature DB >> 22848570 |
Qi Huang1, Haixiao Jin, Qi Liu, Qiong Wu, Hong Kang, Zhiwei Cao, Ruixin Zhu.
Abstract
HIV-1 protease is one of the main therapeutic targets in HIV. However, a major problem in treatment of HIV is the rapid emergence of drug-resistant strains. It should be particularly helpful to clinical therapy of AIDS if one method can be used to predict antivirus capability of compounds for different variants. In our study, proteochemometric (PCM) models were created to study the bioactivity spectra of 92 chemical compounds with 47 unique HIV-1 protease variants. In contrast to other PCM models, which used Multiplication of Ligands and Proteins Descriptors (MLPD) as cross-term, one new cross-term, i.e. Protein-Ligand Interaction Fingerprint (PLIF) was introduced in our modeling. With different combinations of ligand descriptors, protein descriptors and cross-terms, nine PCM models were obtained, and six of them achieved good predictive abilities (Q(2)(test)>0.7). These results showed that the performance of PCM models could be improved when ligand and protein descriptors were complemented by the newly introduced cross-term PLIF. Compared with the conventional cross-term MLPD, the newly introduced PLIF had a better predictive ability. Furthermore, our best model (GD & P & PLIF: Q(2)(test) = 0.8271) could select out those inhibitors which have a broad antiviral activity. As a conclusion, our study indicates that proteochemometric modeling with PLIF as cross-term is a potential useful way to solve the HIV-1 drug-resistant problem.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22848570 PMCID: PMC3407198 DOI: 10.1371/journal.pone.0041698
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Q2 CV of each model with different combinations of descriptor blocks.
| Models with different descriptor combinations | Normalized Poly Kernel | Poly Kernel | Puk | RBF Kernel |
| GD×P |
| 0.3643 | 0.1586 | 0.2988 |
| DLI×P |
| 0.2054 | 0.2511 | 0.4221 |
| PLIF |
| 0.5727 | 0.1627 | 0.5475 |
| GD & P & GD×P |
| 0.3615 | 0.1581 | 0.2916 |
| GD & P & PLIF |
| 0.3572 | −0.0214 | 0.6988 |
| GD & P |
| 0.5702 | 0.2759 | 0.6731 |
| DLI & P & DLI×P |
| 0.2243 | 0.2509 | 0.4155 |
| DLI & P & PLIF |
| 0.3475 | 0.0306 | 0.6880 |
| DLI & P |
| 0.5195 | 0.3831 | 0.6544 |
Summary of Kernels.
| Type of Kernels | Functions |
| Normalized Poly Kernel |
|
| Poly Kernel |
|
| Puk |
|
| RBF Kernel |
|
Goodness-of-fit (R2) and predictive ability (Q2 test) of the obtained models.
| Models with different descriptor combinations | GD | DLI | ||
| R2 | Q2 test | R2 | Q2 test | |
| PLIF |
|
|
|
|
| MLPD | 0.9700 | 0.7101 | 0.9722 | 0.6702 |
| L & P & PLIF |
|
|
|
|
| L & P &MLPD | 0.9696 | 0.7129 | 0.9727 | 0.6612 |
| L&P | 0.9350 | 0.7298 | 0.9241 | 0.6134 |
Models created using only cross-terms.
Models created using ligand and protein descriptors with cross-terms.
Models created using ligand and protein descriptors.
Figure 1Graphical illustrations of the goodness-of-fit and predictive ability of the obtained models with the selected kernel.
Goodness-of-fit is shown as red solid circles, and predictive ability is shown as blue solid circles. The predicted versus measured activity values using different combinations of descriptor blocks, i.e. GD×P (a), DLI×P (b), PLIF (c), GD & P & GD×P (d), GD & P & PLIF (e), GD & P (f), DLI & P & DLI×P (g), DLI & P & PLIF (h), DLI & P (i) are shown in the figure.
Figure 2Predicted inhibitory activity (pKi) of the selected eight compounds against 47 proteases.
Red, pink, brown, orange circles stand for the first-generation inhibitors, i.e. Saquinavir, Ritonavir, Indinavir, Nelfinavir respectively; Darkgreen, cadetblue, cyan, blue triangles stand for the second-generation ones, i.e. Darunavir, Tipranavir, TMC-126, XV638 respectively. The lines indicate the average values for each of them.
Figure 3General framework for our proteochemometric modeling.