| Literature DB >> 30023496 |
Patric Schyman1, Ruifeng Liu1, Anders Wallqvist1.
Abstract
Permeability glycoprotein (Pgp) is an essential membrane-bound transporter that efficiently extracts compounds from a cell. As such, it is a critical determinant of the pharmacokinetic properties of drugs. Multidrug resistance in cancer is often associated with overexpression of Pgp, which increases the efflux of chemotherapeutic agents from the cell. This, in turn, may prevent an effective treatment by reducing the effective intracellular concentrations of such agents. Consequently, identifying compounds that can either be transported out of the cell by Pgp (substrates) or impair Pgp function (inhibitors) is of great interest. Herein, using publically available data, we developed quantitative structure-activity relationship (QSAR) models of Pgp substrates and inhibitors. These models employed a variable-nearest neighbor (v-NN) method that calculated the structural similarity between molecules and hence possessed an applicability domain, that is, they used all nearest neighbors that met a minimum similarity constraint. The performance characteristics of these v-NN-based models were comparable or at times superior to those of other model constructs. The best v-NN models for identifying either Pgp substrates or inhibitors showed overall accuracies of >80% and κ values of >0.60 when tested on external data sets with candidate Pgp substrates and inhibitors. The v-NN prediction model with a well-defined applicability domain gave accurate and reliable results. The v-NN method is computationally efficient and requires no retraining of the prediction model when new assay information becomes available-an important feature when keeping QSAR models up-to-date and maintaining their performance at high levels.Entities:
Year: 2016 PMID: 30023496 PMCID: PMC6044698 DOI: 10.1021/acsomega.6b00247
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Scheme 1Illustration of the Drug Efflux Mediated by ATP-Driven Pgp Transport
(i) Drug binds to the substrate binding site (■),[5] (ii) ATP binds to the ATP-binding site (▲), and (iii) hydrolysis of ATP to ADP + Pi (★). Although only one binding site is shown for simplicity, multiple binding sites could contribute to substrate promiscuity.
Figure 1Performance measures of the v-NN Pgp substrate model as a function of the Tanimoto-distance threshold d0 at a constant smoothing factor h of 0.6 (a) and as a function of the smoothing factor h at a constant Tanimoto-distance threshold d0 of 1.0 (b), evaluated using the 10-fold cross validation.
Performance Measures in Predicting Pgp Substrates by Using the Data Set of Li et al.[16]
| method | parameters | accuracy | sensitivity | specificity | κ | coverage |
|---|---|---|---|---|---|---|
| v-NN | HC ( | 0.71 | 0.78 | 0.65 | 0.42 | 1.00 |
| v-NN | HA ( | 0.77 | 0.78 | 0.75 | 0.53 | 0.60 |
| BC | ECFP10 + 8MP | 0.72 | 0.65 | 0.79 | 0.44 | 1.00 |
| v-NN | HC ( | 0.76 | 0.80 | 0.71 | 0.51 | 1.00 |
| v-NN | HA ( | 0.81 | 0.82 | 0.78 | 0.60 | 0.70 |
| BC | ECFP10 + 8MP | 0.73 | 0.66 | 0.81 | 0.47 | 1.00 |
| v-NN | HC ( | 0.50 | 0.50 | 0.50 | 0.01 | 1.00 |
| v-NN | HA ( | 0.52 | 0.50 | 0.54 | 0.04 | 0.65 |
Performance in a 10-fold cross validation.
v-NN parameters (smoothing factor, h, and Tanimoto-distance threshold, d0).
Bayesian classifier that employs ECFP10 fingerprints and eight molecular properties (MPs).
Training set compounds for Pgp were randomly assigned as substrates or nonsubstrates.
Performance Measures in Predicting Pgp Inhibitors by Using the Data Set of Broccatelli et al[10]
| method | parameters | accuracy | sensitivity | specificity | κ | coverage |
|---|---|---|---|---|---|---|
| v-NN | HC ( | 0.85 | 0.86 | 0.84 | 0.70 | 1.00 |
| v-NN | HA ( | 0.91 | 0.93 | 0.88 | 0.81 | 0.67 |
| FLAP/VolSurf+ | 0.88 | 0.84 | 0.91 | 0.75 | 1.00 | |
| v-NN | HC ( | 0.84 | 0.84 | 0.83 | 0.67 | 1.00 |
| v-NN | HA ( | 0.89 | 0.88 | 0.91 | 0.78 | 0.66 |
| FLAP/VolSurf+ | 0.85 | 0.82 | 0.87 | 0.69 | 1.00 | |
| v-NN | HC ( | 0.76 | 0.81 | 0.67 | 0.48 | 1.00 |
| v-NN | HA ( | 0.88 | 0.91 | 0.80 | 0.71 | 0.53 |
| FLAP/VolSurf+ | 0.86 | 0.90 | 0.80 | 0.70 | 1.00 | |
| v-NN | HC ( | 0.55 | 0.41 | 0.67 | 0.08 | 1.00 |
| v-NN | HA ( | 0.53 | 0.41 | 0.67 | 0.08 | 0.67 |
Performance of 10-fold cross validation.
v-NN parameters (smoothing factor, h, and Tanimoto-distance threshold, d0).
Training set compounds for Pgp were randomly assigned as substrates or nonsubstrates.
Performance Measures in Predicting Pgp Inhibitors by Using the Data set of Chen et al.[12]
| method | parameters | accuracy | sensitivity | specificity | κ | coverage |
|---|---|---|---|---|---|---|
| v-NN | HC ( | 0.76 | 0.84 | 0.64 | 0.49 | 1.00 |
| v-NN | HA ( | 0.83 | 0.91 | 0.63 | 0.57 | 0.68 |
| BC + MP | FCFP4 + 8 MP | 0.81 | 0.80 | 0.82 | 0.61 | 1.00 |
| v-NN | HC ( | 0.76 | 0.87 | 0.59 | 0.48 | 1.00 |
| v-NN | HA ( | 0.80 | 0.91 | 0.50 | 0.45 | 0.72 |
| BC + MP | FCFP4 + 8 MP | 0.79 | 0.80 | 0.78 | 0.57 | 1.00 |
| v-NN | HC ( | 0.53 | 0.59 | 0.43 | 0.02 | 1.00 |
| v-NN | HA ( | 0.55 | 0.59 | 0.46 | 0.04 | 0.67 |
Performance of LOO cross validation.
v-NN parameters (smoothing factor, h, and Tanimoto-distance threshold, d0).
Bayesian classifier that employs FCFP4 fingerprints and eight MPs.
Training set compounds for Pgp were randomly assigned as substrates or nonsubstrates.
Performance Measures in Predicting Pgp Inhibitors by Using All Inhibitor Data
| method | parameters | accuracy | sensitivity | specificity | κ | coverage |
|---|---|---|---|---|---|---|
| v-NN | HC ( | 0.83 | 0.87 | 0.77 | 0.65 | 1.00 |
| v-NN | HA ( | 0.87 | 0.93 | 0.74 | 0.68 | 0.75 |
| v-NN | HC ( | 0.77 | 0.88 | 0.66 | 0.54 | 1.00 |
| v-NN | HA ( | 0.81 | 0.92 | 0.69 | 0.62 | 0.81 |
Performance of 10-fold cross validation.
v-NN parameters (smoothing factor, h, and Tanimoto-distance threshold, d0).
Figure 2Prediction of substrates by using the v-NN Pgp inhibitor model and the prediction of inhibitors by using the v-NN Pgp substrate model, as evaluated by the 10-fold cross validation.