| Literature DB >> 36014380 |
Bader Huwaimel1, Ahmed Alobaida2.
Abstract
Nowadays, supercritical CO2(SC-CO2) is known as a promising alternative for challengeable organic solvents in the pharmaceutical industry. The mathematical prediction and validation of drug solubility through SC-CO2 system using novel artificial intelligence (AI) approach has been considered as an interesting method. This work aims to evaluate the solubility of tamoxifen as a chemotherapeutic drug inside the SC-CO2 via the machine learning (ML) technique. This research employs and boosts three distinct models utilizing Adaboost methods. These models include K-nearest Neighbor (KNN), Theil-Sen Regression (TSR), and Gaussian Process (GPR). Two inputs, pressure and temperature, are considered to analyze the available data. Furthermore, the output is Y, which is solubility. As a result, ADA-KNN, ADA-GPR, and ADA-TSR show an R2 of 0.996, 0.967, 0.883, respectively, based on the analysis results. Additionally, with MAE metric, they had error rates of 1.98 × 10-6, 1.33 × 10-6, and 2.33 × 10-6, respectively. A model called ADA-KNN was selected as the best model and employed to obtain the optimum values, which can be represented as a vector: (X1 = 329, X2 = 318.0, Y = 6.004 × 10-5) according to the mentioned metrics and other visual analysis.Entities:
Keywords: drug solubility; pharmaceutical industry; predictive models; supercritical CO2
Mesh:
Substances:
Year: 2022 PMID: 36014380 PMCID: PMC9413580 DOI: 10.3390/molecules27165140
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Dataset.
| No. | X1 = P (bar) | X2 = T (K) | Y (Solubility/Mole Fraction) |
|---|---|---|---|
| 1 | 120 | 308 | 4 × 10−6 |
| 2 | 160 | 308 | 4.94 × 10−6 |
| 3 | 200 | 308 | 5.49 × 10−6 |
| 4 | 240 | 308 | 5.96 × 10−6 |
| 5 | 280 | 308 | 3.99 × 10−6 |
| 6 | 320 | 308 | 3.88 × 10−6 |
| 7 | 360 | 308 | 8.38 × 10−6 |
| 8 | 400 | 308 | 1.24 × 10−5 |
| 9 | 120 | 318 | 2.15 × 10−6 |
| 10 | 160 | 318 | 5.79 × 10−6 |
| 11 | 200 | 318 | 8.95 × 10−6 |
| 12 | 240 | 318 | 7.27 × 10−6 |
| 13 | 280 | 318 | 3.40 × 10−6 |
| 14 | 320 | 318 | 7.03 × 10−5 |
| 15 | 360 | 318 | 4.01 × 10−6 |
| 16 | 400 | 318 | 1.39 × 10−5 |
| 17 | 120 | 328 | 1.79 × 10−6 |
| 18 | 160 | 328 | 5.13 × 10−6 |
| 19 | 200 | 328 | 1.05 × 10−6 |
| 20 | 240 | 328 | 5.48 × 10−5 |
| 21 | 280 | 328 | 2.31 × 10−5 |
| 22 | 320 | 328 | 2.04 × 10−5 |
| 23 | 360 | 328 | 2.50 × 10−5 |
| 24 | 400 | 328 | 4.41 × 10−5 |
| 25 | 120 | 338 | 1.52 × 10−5 |
| 26 | 160 | 338 | 3.84 × 10−6 |
| 27 | 200 | 338 | 1.05 × 10−5 |
| 28 | 240 | 338 | 2.08 × 10−5 |
| 29 | 280 | 338 | 3.13 × 10−5 |
| 30 | 320 | 338 | 1.95 × 10−5 |
| 31 | 360 | 338 | 5.47 × 10−5 |
| 32 | 400 | 338 | 6.0 × 10−5 |
Output.
| Models | MAE | R2 |
|---|---|---|
| ADA-KNN | 1.98 × 10−6 | 0.996 |
| ADA-GPR | 1.33 × 10−6 | 0.967 |
| ADA-TSR | 2.33 × 10−6 | 0.883 |
Figure 1Fitting chart for ADA-KNN.
Figure 2Fitting chart for ADA-GPR.
Figure 3Fitting chart for ADA-TSR.
Figure 4Three-dimensional illustration of pressure (X1), temperature (X2), and solubility (Y).
Figure 5Tendency of X1.
Figure 6Tendency of X2.
Modified parameters applying maximum response.
| X1 = P (bar) | X2 = T (K) | Y (Solubility) |
|---|---|---|
| 329 | 318.0 | 7.03 × 10−5 |