| Literature DB >> 36015258 |
Maryam Najmi1, Mohamed Arselene Ayari2,3, Hamidreza Sadeghsalehi4, Behzad Vaferi5, Amith Khandakar6, Muhammad E H Chowdhury6, Tawsifur Rahman6, Zanko Hassan Jawhar7.
Abstract
Synthesizing micro-/nano-sized pharmaceutical compounds with an appropriate size distribution is a method often followed to enhance drug delivery and reduce side effects. Supercritical CO2 (carbon dioxide) is a well-known solvent utilized in the pharmaceutical synthesis process. Reliable knowledge of a drug's solubility in supercritical CO2 is necessary for feasible study, modeling, design, optimization, and control of such a process. Therefore, the current study constructs a stacked/ensemble model by combining three up-to-date machine learning tools (i.e., extra tree, gradient boosting, and random forest) to predict the solubility of twelve anticancer drugs in supercritical CO2. An experimental databank comprising 311 phase equilibrium samples was gathered from the literature and applied to design the proposed stacked model. This model estimates the solubility of anticancer drugs in supercritical CO2 as a function of solute and solvent properties and operating conditions. Several statistical indices, including average absolute relative deviation (AARD = 8.62%), mean absolute error (MAE = 2.86 × 10-6), relative absolute error (RAE = 2.42%), mean squared error (MSE = 1.26 × 10-10), and regression coefficient (R2 = 0.99809) were used to validate the performance of the constructed model. The statistical, sensitivity, and trend analyses confirmed that the suggested stacked model demonstrates excellent performance for correlating and predicting the solubility of anticancer drugs in supercritical CO2.Entities:
Keywords: anticancer solid drugs; artificial intelligence technique; ensemble model; solubility; supercritical CO2
Year: 2022 PMID: 36015258 PMCID: PMC9416672 DOI: 10.3390/pharmaceutics14081632
Source DB: PubMed Journal: Pharmaceutics ISSN: 1999-4923 Impact factor: 6.525
Experimental data reported in the literature for the solubility of anticancer drugs in supercritical CO2.
| Anticancer Drug | Pressure | Temperature | CO2 Density | Drug Solubility | No. of Data | Ref. |
|---|---|---|---|---|---|---|
| bar | °C | kg/m3 | Mole Fraction | |||
| Sunitinib malate | 120–270 | 35–65 | 388–914 | 5.00 × 10−6–8.56 × 10−5 | 24 | [ |
| Busulfan | 120–400 | 35–65 | 383–971 | 3.27 × 10−5–8.65 × 10−4 | 32 | [ |
| Tamsulosin | 120–270 | 35–65 | 384–914 | 1.80 × 10−7–1.01 × 10−5 | 24 | [ |
| Azathioprine | 120–270 | 35–65 | 388–914 | 2.70 × 10−6–1.83 × 10−5 | 24 | [ |
| Paclitaxel | 100–275 | 35–55 | 654–915 | 1.20 × 10−6–6.20 × 10−6 | 21 | [ |
| 5-Fluorouracil | 125–250 | 35–55 | 541–901 | 3.80 × 10−6–1.46 × 10−5 | 18 | [ |
| Thymidine | 100–300 | 35–55 | 325–928 | 1.20 × 10−6–8.00 × 10−6 | 25 | [ |
| Capecitabine | 152–354 | 35–75 | 477–955 | 2.70 × 10−6–1.59 × 10−4 | 35 | [ |
| Decitabine | 120–400 | 35–65 | 383–971 | 2.84 × 10−5–1.07 × 10−3 | 32 | [ |
| Letrozole | 120–360 | 45–75 | 319–922 | 1.60 × 10−6–8.51 × 10−5 | 20 | [ |
| Sorafenib tosylate | 120–270 | 35–65 | 388–914 | 6.80 × 10−7–1.26 × 10−5 | 24 | [ |
| Tamoxifen | 120–400 | 35–65 | 383–971 | 1.88 × 10−5–9.89 × 10−4 | 32 | [ |
Molecular weights and chemical structures of the investigated anticancer drugs.
| Anticancer Drug | Molecular Weight | Molecular Structure |
|---|---|---|
| 5-Fluorouracil | 130 |
|
| Azathioprine | 277.26 |
|
| Busulfan | 246.3 |
|
| Capecitabine | 359.35 |
|
| Decitabine | 228.21 |
|
| Letrozole | 285.3 |
|
| Paclitaxel | 854 |
|
| Sorafenib tosylate | 637.03 |
|
| Sunitinib malate | 532.56 |
|
| Tamoxifen | 371.51 |
|
| Tamsulosin | 408.05 |
|
| Thymidine | 242 |
|
Figure 1The general architecture of the stacked approach.
Prediction accuracy of the base leaner machines.
| Base Learner Model | Subgroup |
|
|
| ||
|---|---|---|---|---|---|---|
| Extra tree | Internal | 11.52 | 9.21 × 10−6 | 7.71 | 1.19 × 10−9 | 0.98283 |
| External | 37.44 | 2.58 × 10−5 | 22.85 | 2.36 × 10−9 | 0.95534 | |
| All data | 16.77 | 1.26 × 10−5 | 10.63 | 1.43 × 10−9 | 0.97838 | |
| Gradient boosting | Internal | 21.04 | 1.57 × 10−5 | 13.13 | 1.40 × 10−9 | 0.97898 |
| External | 43.07 | 2.48 × 10−5 | 21.99 | 1.97 × 10−9 | 0.95756 | |
| All data | 25.50 | 1.75 × 10−5 | 14.82 | 1.52 × 10−9 | 0.97560 | |
| Random forest | Internal | 20.27 | 1.50 × 10−5 | 12.53 | 1.54 × 10−9 | 0.98354 |
| External | 44.29 | 2.51 × 10−5 | 22.20 | 2.51 × 10−9 | 0.94926 | |
| All data | 25.14 | 1.70 × 10−5 | 14.38 | 1.73 × 10−9 | 0.97844 |
Prediction accuracy of the stacked model.
| AI Scenario | Subgroup |
|
|
| ||
|---|---|---|---|---|---|---|
| Stacked model | Internal | 9.46 | 3.18 × 10−11 | 2.66 | 1.51 × 10−20 | 0.99791 |
| External | 5.35 | 1.62 × 10−11 | 1.44 | 2.66 × 10−21 | 0.99946 | |
| All data | 8.62 | 2.86 × 10−6 | 2.42 | 1.26 × 10−10 | 0.99809 |
Prediction accuracy of the stacked model.
| Drug | Model |
| Reference |
|---|---|---|---|
| Decitabine | Adaptive neuro-fuzzy inference systems | 0.99663 | [ |
| Stacked model | 0.99508 | This work | |
| Busulfan | Support vector machines | 0.98327 | [ |
| Stacked model | 0.99054 | This work |
Figure 2Correlation between experimental and calculated solubilities of the studied anticancer drugs.
Figure 3The histogram of residual errors provided by the stacked model (blue graph shows the normal distribution).
Figure 4The kernel density estimation graphs for (a) internal and (b) external groups.
Figure 5The Bland-Altman plots for (a) internal and (b) external groups.
Figure 6Monitoring the effect of pressure on the anticancer drug (Capecitabine) solubility in supercritical CO2 from the laboratory and modeling perspectives.
Figure 7The experimental and modeling profiles of the effect of temperature on the solubility of the anticancer drug Decitabine in supercritical CO2.
Figure 8Average values of the solubility in supercritical CO2 of the studied anticancer drugs achieved from experimental data and modeling results.
Dependency of anticancer drug solubility in supercritical CO2 on the independent variables.
| Information | Dependent–Independent Pairs | |||
|---|---|---|---|---|
|
|
|
|
| |
| Pearson coefficient | −0.248 | 0.295 | 0.204 | 0.617 |
| Direction of relationship | Indirect | Direct | Direct | Direct |
| Importance | Third | Second | Fourth | First |