| Literature DB >> 35993595 |
Hiroaki Iwata1, Tatsuru Matsuo2, Hideaki Mamada3, Takahisa Motomura4, Mayumi Matsushita2, Takeshi Fujiwara1, Kazuya Maeda5, Koichi Handa6.
Abstract
Pharmacokinetic research plays an important role in the development of new drugs. Accurate predictions of human pharmacokinetic parameters are essential for the success of clinical trials. Clearance (CL) and volume of distribution (Vd) are important factors for evaluating pharmacokinetic properties, and many previous studies have attempted to use computational methods to extrapolate these values from nonclinical laboratory animal models to human subjects. However, it is difficult to obtain sufficient, comprehensive experimental data from these animal models, and many studies are missing critical values. This means that studies using nonclinical data as explanatory variables can only apply a small number of compounds to their model training. In this study, we perform missing-value imputation and feature selection on nonclinical data to increase the number of training compounds and nonclinical datasets available for these kinds of studies. We could obtain novel models for total body clearance (CLtot) and steady-state Vd (Vdss) (CLtot: geometric mean fold error [GMFE], 1.92; percentage within 2-fold error, 66.5%; Vdss: GMFE, 1.64; percentage within 2-fold error, 71.1%). These accuracies were comparable to the conventional animal scale-up models. Then, this method differs from animal scale-up methods because it does not require animal experiments, which continue to become more strictly regulated as time passes.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35993595 PMCID: PMC9472274 DOI: 10.1021/acs.jcim.2c00318
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 6.162
Figure 1Workflow of our novel human CLtot and Vdss prediction method. (A) CLtot analysis flow. (i) There were 741 compounds with human CLtot data and 46 that had values for all 11 features. (ii) All feature values were estimated via prediction using ADMEWORKS. (iii) Feature extraction was performed using XGBoost or Random Forest, and a prediction model was constructed. (B) Vdss analysis flow. (i) There were 751 compounds with human Vdss data and 46 that had values for all 11 features. (ii) All feature values were estimated via prediction using ADMEWORKS. (iii) Feature extraction was performed using XGBoost or Random Forest, and a prediction model was constructed.
Details of the Compound Data
| feature | number of compounds | source |
|---|---|---|
| human CLtot | 741 | JCP2013, ChEMBL23 |
| rat CLtot | 387 | JCP2013, ChEMBL23 |
| dog CLtot | 284 | JCP2013, ChEMBL23 |
| monkey CLtot | 129 | JCP2013, ChEMBL23 |
| human Vdss | 751 | JCP2013, ChEMBL23 |
| rat Vdss | 351 | JCP2013, ChEMBL23 |
| dog Vdss | 274 | JCP2013, ChEMBL23 |
| monkey Vdss | 125 | JCP2013, ChEMBL23 |
| human fu | 577 | JCP2013, ChEMBL23 |
| rat fu | 237 | JCP2013, ChEMBL23 |
| dog fu | 179 | JCP2013, ChEMBL23 |
| monkey fu | 88 | JCP2013, ChEMBL23 |
| p | 334 | Pubchem, DrugBank |
| p | 335 | Pubchem, DrugBank |
| solubility | 339 | Pubchem, DrugBank |
| caco-2 permeability | 307 | Pubchem, DrugBank |
Figure 2Overview of the multimodal Deep Tensor model.
Results of the Accuracy Evaluations for Imputations of Rat CLtot Data
| method | GMFE | % of 2-fold error | |
|---|---|---|---|
| CLtot | 695 compounds | 2.53 | 45.7 |
| XGBoost: only CS | |||
| 343 compounds | 2.15 | 52.2 | |
| XGBoost: CS + rat CLtot | |||
| 695 compounds | 2.09 | 54.3 | |
| XGBoost: CS + rat CLtot imputed | |||
| 695 compounds | 2.44 | 45.7 | |
| Deep Tensor: only CS | |||
| 343 compounds | 2.15 | 54.8 | |
| Deep Tensor: CS + rat CLtot | |||
| 695 compounds | 2.09 | 54.3 | |
| Deep Tensor: CS + rat CLtot imputed | |||
| Vdss | 706 compounds | 1.66 | 82.2 |
| XGBoost: only CS | |||
| 306 compounds | 1.72 | 75.6 | |
| XGBoost: CS + rat Vdss | |||
| 706 compounds | 1.73 | 68.9 | |
| XGBoost: CS + rat Vdss imputed | |||
| 706 compounds | 1.85 | 62.2 | |
| Deep Tensor: only CS | |||
| 306 compounds | 1.89 | 56.9 | |
| Deep Tensor: CS + rat Vdss | |||
| 706 compounds | 1.75 | 64.4 | |
| Deep Tensor: CS + rat Vdss imputed |
Results of Accuracy Evaluations
| method | GMFE | % of 2-fold error | |
|---|---|---|---|
| CLtot | SSS rat | 2.36 | 43.5 |
| SSS dog | 2.30 | 39.1 | |
| SSS monkey | 1.93 | 58.7 | |
| SA | 2.33 | 45.7 | |
| FCIM | 1.99 | 52.2 | |
| XGBoost: only CS | 2.40 | 50.0 | |
| XGBoost: CS + 11 features | 2.06 | 58.7 | |
| XGBoost: CS + selected features | 1.98 | 50.0 | |
| Deep Tensor: only CS | 2.44 | 45.7 | |
| Deep Tensor: CS + 11 features | 2.11 | 52.2 | |
| Deep Tensor: CS + selected features | 1.92 | 66.5 | |
| Vdss | SSS rat | 1.91 | 62.2 |
| SSS dog | 1.93 | 71.1 | |
| SSS monkey | 1.60 | 80.0 | |
| SA | 2.07 | 68.9 | |
| Øie–Tozer | 1.46 | 84.4 | |
| XGBoost: only CS | 1.70 | 77.8 | |
| XGBoost: CS + 11 features | 1.64 | 71.1 | |
| XGBoost: CS + selected features | 1.66 | 71.1 | |
| Deep Tensor: only CS | 1.85 | 62.2 | |
| Deep Tensor: CS + 11 features | 1.75 | 69.8 | |
| Deep Tensor: CS + selected features | 1.74 | 74.2 |
SSS: single-species allometric scaling; SA: simple allometry; FCIM: fu-corrected intercept method; CS: chemical structure.
GMFE: geometric mean fold error.