| Literature DB >> 35547066 |
Xiaocong Pang1, Baoyue Zhang1, Guangyan Mu1, Jie Xia2, Qian Xiang1, Xia Zhao1, Ailin Liu2, Guanhua Du2, Yimin Cui1.
Abstract
Cytochrome P450 3A4 (CYP3A4) is an important member of the CYP family and responsible for metabolizing a broad range of drugs. Potential drug-drug interactions (DDIs) caused by CYP3A4 inhibitors could lead to increasing risk of side-effects/toxicity or decreasing effectiveness. The evaluation of CYP3A4 inhibitory activity is time-consuming, labor-intensive, and costly, and it is necessary to establish virtual screening models for predicting CYP3A4 inhibitors. In this study, 4 classifier algorithms, including support vector machine (SVM), naive Bayesian (NB), recursive partitioning (RP), and K-nearest neighbor (KNN), were applied to discriminate CYP3A4 inhibitors from the non-inhibitors. Correlation analysis and stepwise linear regression methods were used for descriptor selection and optimization. The performance of classifiers was measured by 5-fold cross-validation, Y-scrambling and test set validation. Finally, the optimal NB model with Matthews correlation coefficients of 0.894 for the test set was developed to screen FDA-approved drugs and natural products database. As a result, 90 compounds from FDA-approved drug databases were predicted as inhibitors, and 46% of them were identified as known CYP3A4 inhibitors. 6 natural products were selected for further bioactivity assay and molecular docking. 2 of them with good docking score also exerted significant CYP3A4 inhibitory activities with IC50 values of 0.052 and 1.120 μM, respectively. This study proved the feasibility of a new method for predicting CYP3A4 inhibitory activity and preventing the occurrence of DDIs at early stage in drug development. This journal is © The Royal Society of Chemistry.Entities:
Year: 2018 PMID: 35547066 PMCID: PMC9086869 DOI: 10.1039/c8ra06311g
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1Workflow for classification models building, validation, and virtual screening as applied to CYP3A4 inhibitors and non-inhibitors data sets.
Fig. 2Diversity distribution of the training set and test set. Chemical space was defined by molecular weight (MW) as X-axis, and A log P as Y-axis. In the picture, red stands for training set compounds, and blue stands for the test set compounds.
Performance of classification models for training set and test seta
| Validation method | Model | SE | SP |
|
| MCC |
|---|---|---|---|---|---|---|
| Training set 5-fold cross-validation | SVM | 0.987 | 0.986 | 0.951 | 0.986 | 0.967 |
| KNN | 0.993 | 0.993 | 0.986 | 0.993 | 0.998 | |
| RP | 0.874 | 0.875 | 0.919 | 0.875 | 0.899 | |
| NB | 0.813 | 0.793 | 0.770 | 0.793 | 0.869 | |
| Test validation | SVM | 1.000 | 1.000 | 0.006 | 1.000 | 0.502 |
| KNN | 1.000 | 1.000 | 0.004 | 1.000 | 0.502 | |
| RP | 0.796 | 0.801 | 0.844 | 0.801 | 0.886 | |
| NB | 0.852 | 0.809 | 0.211 | 0.809 | 0.842 |
SE: Sensitivity, SP: specificity, Q+: prediction accuracy of inhibitors, Q−: prediction accuracy of inhibitors, MCC: matthews correlation coefficient. SVM: support vector machine, NB: naive Bayesian, RP: recursive partitioning, KNN: K-nearest neighbor.
Performance of optimized NB and RP models with ECFP-6 for training set and test set
| Validation method | Model with DS 2D descriptor + ECFP-6 | SE | SP |
|
| MCC |
|---|---|---|---|---|---|---|
| Training set 5-fold cross-validation | NB | 0.948 | 0.946 | 0.960 | 0.955 | 0.946 |
| RP | 0.875 | 0.877 | 0.907 | 0.916 | 0.877 | |
| Test validation | NB | 0.902 | 0.894 | 0.954 | 0.886 | 0.894 |
| RP | 0.827 | 0.832 | 0.917 | 0.880 | 0.832 |
Fig. 3Examples of the top 10 good (top) and bad (bottom) fragments estimated by NB model. The Bayesian score (score) is given for each fragment.
Fig. 4The prediction of CYP3A4 inhibitors from FDA-approved drug database. (A) drug distribution of predicted CYP3A4 inhibitors. (B) the performance of NB prediction model.
The IC50 values and -CDOCKER interaction energy of 6 natural products and ketoconazole determined by P450-Glo™ CYP3A4 assay and CDOCKER approach, respectively
| Name | Structure | -CDOCKER interaction energy | IC50 (μM) |
|---|---|---|---|
| Ketoconazole |
| 65.399 | 0.047 ± 0.002 |
| Isoimperatorin |
| 36.424 | 18.231 ± 1.721 |
| Bergaptin |
| 47.241 | 12.921 ± 1.171 |
| Bisdemethoxycurcumin |
| 42.251 | 14.821 ± 2.087 |
| Azulene |
| 13.430 | 22.485 ± 2.539 |
| Pterostilbene |
| 33.310 | 1.120 ± 0.056 |
| Ellipticine |
| 34.819 | 0.052 ± 0.003 |
Fig. 5The interaction between CYP3A4 and pterostilbene/ellipticine. (A and B) Elliptisine has a good interaction with CYP3A4 through forming hydrogen bond and pi–pi stacked bond with ARG372, ALA370 and PHE215. (C and D) Pterostilbene binds to CYP3A4 by interacting with the amino acids including ARG212, SER119 and ALA370 via conventional hydrogen bond/attractive charge and pi–alkyl interaction.