| Literature DB >> 32221780 |
Anita Rácz1, György M Keserű2.
Abstract
Cytochrome P450 (CYP) enzymes play an important role in the metabolism of xenobiotics. Since they are connected to drug interactions, screening for potential inhibitors is of utmost importance in drug discovery settings. Our study provides an extensive classification model for P450-drug interactions with one of the most prominent members, the 2C9 isoenzyme. Our model involved the largest set of 45,000 molecules ever used for developing prediction models. The models are based on three different types of descriptors, (a) typical one, two and three dimensional molecular descriptors, (b) chemical and pharmacophore fingerprints and (c) interaction fingerprints with docking scores. Two machine learning algorithms, the boosted tree and the multilayer feedforward of resilient backpropagation network were used and compared based on their performances. The models were validated both internally and using external validation sets. The results showed that the consensus voting technique with custom probability thresholds could provide promising results even in large-scale cases without any restrictions on the applicability domain. Our best model was capable to predict the 2C9 inhibitory activity with the area under the receiver operating characteristic curve (AUC) of 0.85 and 0.84 for the internal and the external test sets, respectively. The chemical space covered with the largest available dataset has reached its limit encompassing publicly available bioactivity data for the 2C9 isoenzyme.Entities:
Keywords: ADME-tox; CYP 2C9; Classification; Cytochrome P450; Machine learning
Year: 2020 PMID: 32221780 PMCID: PMC7320947 DOI: 10.1007/s10822-020-00308-y
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Fig. 1a Principal component analysis of the three PubChem datasets (grey: AID 777, red: AID 1851, green: AID 883). The first two principal component scores are plotted against each other. b Comparison of the three datasets based on the log P and molecular weight values of the molecules. The coloring is the same as in the previous case
Fig. 2An example of the ROC curves with the threshold optimum determination (blue circle: point with the optimal probability value; d: distance between the optimal point and perfect classification)
Fig. 3The complete modeling workflow. The explanation of the abbreviations can be found in the beginning of the manuscript
AUC values of the prepared primary and consensus models for training, CV, internal and external validation
| Method | Validation | Primary models | Consensus 1 | Consensus 2 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| IFP + DS | ECFP + PFP | 1D, 2D, 3D MD | Min | Ave | Max | Min | Ave | Max | ||
| MLP | Training | 0.69 | 0.89 | 0.82 | 0.86 | 0.88 | 0.87 | 0.89 | 0.91 | 0.90 |
| CV | 0.65 | 0.75 | 0.76 | 0.75 | 0.78 | 0.76 | 0.81 | 0.83 | 0.81 | |
| Test | 0.66 | 0.76 | 0.76 | 0.76 | 0.77 | 0.76 | 0.81 | 0.82 | 0.81 | |
| External | 0.65 | 0.74 | 0.74 | 0.75 | 0.76 | 0.74 | 0.79 | 0.81 | 0.80 | |
| GBT | Training | 0.73 | 0.84 | 0.87 | 0.83 | 0.84 | 0.88 | 0.88 | ||
| CV | 0.69 | 0.79 | 0.80 | 0.78 | 0.78 | 0.82 | 0.83 | |||
| Test | 0.69 | 0.79 | 0.80 | 0.77 | 0.78 | 0.82 | 0.82 | |||
| External | 0.67 | 0.75 | 0.76 | 0.75 | 0.73 | 0.80 | 0.80 | |||
Abbreviations can be found in the beginning of the text. The best models (based on the AUC of the validation sets) are marked with bold
The number of molecules in each validation part, and the number of active molecules in consensus 2 models
| Samples | Validation | Consensus 1 | Consensus 2 | Active molecules in consensus 2 (%) | Ratio of excluded molecules (consensus 2) |
|---|---|---|---|---|---|
| MLP | Training | 25,013 | 15,567 | 52.5 | 0.41 |
| CV | 25,013 | 14,696 | 48.4 | 0.41 | |
| Test | 10,720 | 6608 | 49.6 | 0.38 | |
| External | 9983 | 5919 | 21.0 | 0.41 | |
| GBT | Training | 25,013 | 17,348 | 51.8 | 0.31 |
| CV | 25,013 | 16,587 | 48.1 | 0.34 | |
| Test | 10,720 | 7260 | 48.9 | 0.32 | |
| External | 9983 | 6013 | 22.1 | 0.40 |
The ratio of excluded molecules is assigned to consensus 2 models
The AUC values of the prepared consensus models based on the six primary models for training, CV, internal and external validation
| Validation | Consensus 1 (MLP + GBT) | Consensus 2 (MLP + GBT) | ||||
|---|---|---|---|---|---|---|
| Min | Ave | Max | Min | Ave | Max | |
| Training | 0.87 | 0.88 | 0.87 | 0.91 | 0.91 | |
| CV | 0.76 | 0.79 | 0.76 | 0.84 | 0.84 | |
| Test | 0.77 | 0.79 | 0.76 | 0.83 | 0.83 | |
| External | 0.75 | 0.77 | 0.74 | 0.83 | 0.82 | |
Abbreviations can be found in the beginning of the text. The AUC values of the best model are marked with bold numbers
The amount of molecules and the ratio of actives in consensus 2 model together with the ratio of excluded molecules compared to the primary models
| Samples | Validation | Consensus 1 (MLP + GBT) | Consensus 2 (MLP + GBT) | Active molecules in consensus 2 (%) (MLP + GBT) | Ratio of excluded molecules (consensus 2 for MLP + GBT) |
|---|---|---|---|---|---|
| MLP | Training | 25,013 | 14,105 | 52.2 | 0.44 |
| CV | 25,013 | 12,526 | 49.3 | 0.50 | |
| Test | 10,720 | 5738 | 50.7 | 0.46 | |
| External | 9983 | 4802 | 23.0 | 0.52 |
Fig. 4The number of used molecules in the previous studies compared to the top three models in our study. Sun et al. [15], Rostkowski et al. [40], Li et al. [41], Cheng et al. [17], and Wu et al. [18] can be found in the reference list. Model 1: consensus 2 model based on the MLP + GBT algorithms; model 2: consensus 2 model based on the GBT algorithm and model 3: consensus 1 model based on the GBT algorithm. Our selected models can be found in Table 1 and 3 marked with bold
The AUC values of the best three models compared to the previous literature models
| AUC values | References | ||
|---|---|---|---|
| Internal test set | External test set | ||
| Sun et al. | 0.85 | – | [ |
| Rostkowski et al. | 0.90 | – | [ |
| Li et al. | – | 0.77 | [ |
| Cheng et al. | – | 0.86 | [ |
| Wu et al. | – | 0.81 | [ |
| Model 1 | 0.85 | 0.84 | – |
| Model 2 | 0.83 | 0.81 | – |
| Model 3 | 0.79 | 0.76 | – |
Comparison of the Matthews Correlation Coefficients of the top three models and the previous studies
| MCC values | Reference | ||
|---|---|---|---|
| Cross-validation | External test set | ||
| Wu et al | – | 0.35 | [ |
| Cheng et al | 0.49 | – | [ |
| Li et al | – | 0.32 | [ |
| Model 1 | 0.61 | 0.48 | – |
| Model 2 | 0.53 | 0.42 | – |
| Model 3 | 0.44 | 0.31 | – |
Model 1: consensus 2 model based on the MLP + GBT algorithms; model 2: consensus 2 model based on the GBT algorithm and model 3: consensus 1 model based on the GBT algorithm
Comparison of the sensitivities (Sn) and specificities (Sp) of the top three models and the previous studies
| Cross-validation | External test set | References | |||
|---|---|---|---|---|---|
| Sn | Sp | Sn | Sp | ||
| Wu | – | – | 0.29 | 0.97 | [ |
| Cheng et al | 0.63 | 0.85 | – | – | [ |
| Li et al | – | – | 0.32 | 0.77 | [ |
| Model 1 | 0.82 | 0.79 | 0.82 | 0.76 | – |
| Model 2 | 0.78 | 0.75 | 0.79 | 0.70 | – |
| Model 3 | 0.73 | 0.71 | 0.67 | 0.71 | – |
Model 1: consensus 2 model based on the MLP + GBT algorithms; model 2: consensus 2 model based on the GBT algorithm and model 3: consensus 1 model based on the GBT algorithm