| Literature DB >> 31867169 |
Qianqian Zhao1, Zhuyifan Ye1, Yan Su1, Defang Ouyang1.
Abstract
Most pharmaceutical formulation developments are complex and ideEntities:
Keywords: Binding free energy; Cyclodextrin; Deep learning; Ketoprofen; LightGBM; Machine learning; Molecular modeling; Random forest
Year: 2019 PMID: 31867169 PMCID: PMC6900559 DOI: 10.1016/j.apsb.2019.04.004
Source DB: PubMed Journal: Acta Pharm Sin B ISSN: 2211-3835 Impact factor: 11.413
Figure 1Relative distribution of data points in the full dataset: (A) The percentage of experimental data for each cyclodextrin. (B) The distribution of complexation free energy between cyclodextrins and guest molecules. (C) The molecular weight distribution of guest molecules. (D) The XlogP3 distribution of guest molecules. The full dataset was comprised of 3000 data points with 1320 different structures. Hp-β-CD, 2-hydroxypropyl-β-cyclodextrin; RMCD, randomly methylated β-cyclodextrin; TMCD, (2,3,6-tri-O-methyl)-β-cyclodextrin; DMCD, (2,6-di-O-methyl)-β-cyclodextrin; SBE-β-CD, sulfobutylether-β-cyclodextrin.
Figure 2Scatter plot of predicted vs. observed complexation free energy on full dataset by three machine learning methods. The predicted results of (A) LightGBM model, (B) random forest model, and (C) deep learning model. For all the models, an 80/10/10 split of training, validation and test set was used to measure the predictive performance.
Statistical performance of the predictive models for CDs complexation free energy.
| Method | Statistical parameter | Training set | Validation set | Test set |
|---|---|---|---|---|
| LightGBM | 0.86 | 0.87 | 0.86 | |
| MAE (kJ/mol) | 1.41 | 1.28 | 1.38 | |
| RMSE (kJ/mol) | 1.94 | 1.63 | 1.83 | |
| RF | 0.83 | 0.83 | 0.81 | |
| MAE (kJ/mol) | 1.54 | 1.39 | 1.54 | |
| RMSE (kJ/mol) | 2.17 | 1.89 | 2.11 | |
| DL | 0.76 | 0.63 | 0.62 | |
| MAE (kJ/mol) | 2.53 | 2.41 | 2.56 | |
| RMSE (kJ/mol) | 3.34 | 3.12 | 3.36 |
RF, random forest; DL, deep learning.
Figure 3Training and test set performance of the LightGBM model with sparse data. The data points were gradually reduced from 3000 to 500 and the smaller datasets were randomly extracted from the full datasets. A gradual increase of MAE and a significant drop of R2 occurred with the decrease of dataset size.
Figure 4The relative importance of the molecular descriptors in the LightGBM model. Molecular descriptor with the suffix of x represented the descriptor of guest molecules, while with the suffix of y meant the descriptor of cyclodextrins. pH and T (K) represented the experimental conditions in the phase solubility study. The relative importance of molecular descriptors was determined by the measuring the number of times of the features used in the LightGBM model.
Phase solubility study of ketoprofen with different CDs in distilled water at 37 ± 0.5 °C.
| System | Equation | Δ | ||
|---|---|---|---|---|
| KTP- | 6.408 | 0.578 | N/A | |
| KTP- | 598.523 | 0.993 | −16.489 | |
| KTP- | 104.388 | 0.983 | −11.986 | |
| KTP-Hp- | 1152.340 | 0.985 | −18.178 | |
| KTP-DMe- | 1084.530 | 0.999 | −18.021 | |
| KTP-SBE- | 1025.145 | 0.993 | −17.876 |
Stability constant.
Correlation coefficient.
Gibbs free energy at the temperature of 37 ± 0.5 °C.
Calculated complexation free energy and energy components of KTP–CD inclusion complexes by MD simulation.
| System | ELE | VDW | PBSOL | PBTOT (kcal/mol) | Δ | Δ | |
|---|---|---|---|---|---|---|---|
| N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| −7.82 | −25.91 | 16.95 | −16.78 | −13.21 | −3.57 | −14.94 | |
| −9.41 | −26.38 | 18.71 | −17.08 | −13.81 | −3.27 | −13.68 | |
| Hp- | −10.33 | −33.21 | 23.41 | −20.14 | −14.06 | −6.08 | −25.44 |
| DMe- | −6.89 | −29.06 | 16.7 | −19.24 | −14.28 | −4.96 | −20.75 |
| SBE- | −10.25 | −27.45 | 19.02 | −18.68 | −13.44 | −4.94 | −20.67 |
ELE, electrostatic energy as calculated by the MM force field; VDW, van der Waals contribution from MM; PBSOL, non-polar and polar contributions to solvation; PBTOT, binding enthalpy; TΔS, total Entropy. Temperature has been multiplied in as 310 K.
Figure 5Complexation free energy between ketoprofen and various cyclodextrins by the experimental determination (ΔG_Experiment), MD simulation calculation (ΔG_MD simulation) and LightGBM prediction (ΔG_LightGBM).