| Literature DB >> 35456693 |
Jakub Szlęk1, Mohammad Hassan Khalid1, Adam Pacławski1, Natalia Czub1, Aleksander Mendyk1.
Abstract
Tablets are the most common dosage form of pharmaceutical products. While tablets represent the majority of marketed pharmaceutical products, there remain a significant number of patients who find it difficult to swallow conventional tablets. Such difficulties lead to reduced patient compliance. Orally disintegrating tablets (ODT), sometimes called oral dispersible tablets, are the dosage form of choice for patients with swallowing difficulties. ODTs are defined as a solid dosage form for rapid disintegration prior to swallowing. The disintegration time, therefore, is one of the most important and optimizable critical quality attributes (CQAs) for ODTs. Current strategies to optimize ODT disintegration times are based on a conventional trial-and-error method whereby a small number of samples are used as proxies for the compliance of whole batches. We present an alternative machine learning approach to optimize the disintegration time based on a wide variety of machine learning (ML) models through the H2O AutoML platform. ML models are presented with inputs from a database originally presented by Han et al., which was enhanced and curated to include chemical descriptors representing active pharmaceutical ingredient (API) characteristics. A deep learning model with a 10-fold cross-validation NRMSE of 8.1% and an R2 of 0.84 was obtained. The critical parameters influencing the disintegration of the directly compressed ODTs were ascertained using the SHAP method to explain ML model predictions. A reusable, open-source tool, the ODT calculator, is now available at Heroku platform.Entities:
Keywords: AutoML; ODTs; explainable models; machine learning; orally disintegrating tablets; partial dependence plots; shapley values
Year: 2022 PMID: 35456693 PMCID: PMC9044744 DOI: 10.3390/pharmaceutics14040859
Source DB: PubMed Journal: Pharmaceutics ISSN: 1999-4923 Impact factor: 6.525
Source of data–publications [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47].
| API | Dose [mg] | Filler | Binder | Disintegrant | Lubricant | Solubilizer | No. of Formulations | Reference |
|---|---|---|---|---|---|---|---|---|
| Aceclofenac | 100 | Lactose, MCC | - | CC-Na | MgSt | - | 9 | [ |
| Carbinoxamine maleate | 4 | Mannitol, MCC | - | L-HPC | MgSt | Amberlite | 5 | [ |
| Carvedilol | 12.5 | Mannitol, MCC | - | SSG | MgSt, Talc | 2-hydroxypropyl-β-cyclodextrin, Camphore-as a porophore | 15 | [ |
| Dexamethasone | 2 | Mannitol, Lactose, MCC | - | Crospovidone | MgSt, Colloidal sillica | - | 13 | [ |
| Dextromethorphan | 15 | Mannitol, Lactose, MCC | - | - | MgSt | Amberlite | 2 | [ |
| Donepezil | 10 | Mannitol | - | Crospovidone, CC-Na, SSG | Sodium stearyl fumarate | Poloxamer, Amberlite | 6 | [ |
| Drotaverine HCl | 40 | Mannitol | Calcium silicate, HPMC | Crospovidone, CC-Na | MgSt | PVP | 20 | [ |
| Eletriptan | 20 | Mannitol, MCC | CC-Na, SSG, Crospovidone | MgSt, Talc | - | 9 | [ | |
| Eslicarbazepine | 800 | Mannitol, MCC | - | Crospovidone, SSG, Pregelatinized starch | MgSt, Talc | β-cyclodextrin | 8 | [ |
| Glipizide | 10 | Mannitol, MCC | CC-Na, SSG, Crospovidone, Pregelatinized starch | MgSt, Aerosil, Talc | - | 9 | [ | |
| Granisetron HCl | 50 | Mannitol, MCC | - | Crospovidone, CC-Na, SSG | MgSt, Aerosil | - | 6 | [ |
| Granisetron HCl | 2.4 | Mannitol, MCC | - | CC-Na, SSG, Crospovidone | MgSt, Talc | Camphore–as porophore | 12 | [ |
| Loratadine | 10 | Mannitol | - | Crospovidone, CC-Na | MgSt | PVA | 6 | [ |
| Lornoxicam | 4 | Mannitol, MCC | - | CC-Na, L-HPC | MgSt, Aerosil | Cyclodextrin methacrylate | 3 | [ |
| Lornoxicam | 8 | Mannitol | - | Crospovidone, SSG, Pregelatinized starch | MgSt | - | 4 | [ |
| Mefenamic acid | 100 | MCC | - | Crospovidone | MgSt, Aerosil | Eudragit EPO | 2 | [ |
| Meloxicam | 7.5 | Mannitol, Lactose, MCC | - | Crospovidone | MgSt | - | 1 | [ |
| Memantine HCl | 5 | Mannitol, MCC | - | CC-Na | MgSt, Colloidal silica | Eudragit EPO | 15 | [ |
| Memantine HCl | 10 | Mannitol, MCC | - | CC-Na | MgSt, Aerosil | - | 3 | [ |
| Montelukast sodium | 5.2 | Mannitol, MCC | Sodium bicarbonate | Crospovidone | MgSt | - | 8 | [ |
| Mosapride citrate | 5 | Mannitol, Lactose, MCC | - | CC-Na, Sodium carboxymethyl starch, L-HPC, Crospovidone, Pregelatinized starch | MgSt | - | 7 | [ |
| Olanzapine | 10 | Mannitol, MCC | - | SSG, CC-Na, Crospovidone | MgSt, Aerosil | 2-hydroxypropyl-β-cyclodextrin | 10 | [ |
| Ondansetron | 8 | Mannitol, MCC | - | Crospovidone, CC-Na, SSG, L-HPC | SSF, Aerosil | - | 20 | [ |
| Propafenone HCl | 150 | Lactose | - | Crospovidone, CC-Na | MgSt | Camphore–as porophore | 15 | [ |
| Propranolol HCl | 40 | Mannitol | - | Crospovidone, CC-Na, SSG | MgSt, Talc | SLS | 9 | [ |
| Salbutamol suphate | 4 | Mannitol, MCC | - | CC-Na, SSG | MgSt, Talc | - | 7 | [ |
| Simvastatin | 5 | Mannitol, MCC | - | CC-Na | MgSt | Poloxamer | 9 | [ |
| Tadalafil | 5 | Mannitol, MCC | - | CC-Na | Talc | PVP | 5 | [ |
| Ziprasidone HCl Monohydrate | 22.63 | Mannitol, MCC | - | CC-Na | MgSt | PVP | 18 | [ |
MCC, microcrystalline celulose; CC-Na, croscarmelose sodium; SSG, sodium starch glycollate; L-HPC, low-substituted hydroxypropylcellulose; HPMC, hydroxypropylcellulose; PVP, polyvinylpyrrolidone; PVA, polyvinyl alcohol; SLS, sodium lauryl sulfate; MgSt, magnesium stearate; SSF, sodium stearyl fumarate; *—formulations formerly present in database by Han et al. [16].
Figure 1Schematic representation of the applied workflow. Models: distributed random forest (DRF), extremely randomized trees (XRT), generalized linear model (GLM), extreme gradient boosting machine (XGBoost), gradient boosting machine (GBM), deep learning (fully connected multilayer artificial neural network, DL), and stacked ensemble (SE); n_try, number of starting points for probing hyperparameter space; cv, cross-validation; API, active pharmaceutical ingredient.
Figure 2Correlation matrix of the database.
Descriptive statistics of the database. MCC, microcrystalline cellulose; CC-Na, croscarmellose sodium; SSG, sodium starch glycollate; L-HPC, low-substituted hydroxypropyl-cellulose; PVP, polyvinylpyrrolidone; PVA, polyvinyl alcohol; SLS, sodium lauryl sulfate; MgSt, magnesium stearate; SSF, sodium stearyl fumarate; API, active pharmaceutical ingredient; 2-HP-beta-CD, 2-hydroxypropyl-beta-cyclodextrin; CD-methacrylate, beta-cyclodextrin-methacrylate.
| Variable | Count | Mean | Std | Min | 25% | 50% | 75% | Max |
|---|---|---|---|---|---|---|---|---|
| Tablet mass [mg] | 243 | 274.10 | 252.29 | 67.13 | 116.4 | 180 | 336 | 1179.98 |
| API [%] | 243 | 12.81 | 16.25 | 1 | 3.02 | 5.56 | 11.59 | 67.8 |
| Mannitol [%] | 243 | 37.76 | 24.24 | 0 | 23.7 | 32.61 | 60.35 | 86.84 |
| MCC [%] | 243 | 22.62 | 20.37 | 0 | 4.57 | 18.12 | 37.32 | 84.1 |
| Lactose [%] | 243 | 7.19 | 15.45 | 0 | 0 | 0 | 0 | 62 |
| SSG [%] | 243 | 1.35 | 3.08 | 0 | 0 | 0 | 0 | 18.21 |
| CC-Na [%] | 243 | 3.43 | 4.99 | 0 | 0 | 1 | 5 | 31.95 |
| Crospovidone [%] | 243 | 2.55 | 4.28 | 0 | 0 | 0 | 4.5 | 20.03 |
| L-HPC [%] | 243 | 0.40 | 1.94 | 0 | 0 | 0 | 0 | 14.71 |
| Pregelatinized starch [%] | 243 | 0.07 | 0.53 | 0 | 0 | 0 | 0 | 5.08 |
| Sodium carboxymethyl starch [%] | 243 | 0.02 | 0.32 | 0 | 0 | 0 | 0 | 5 |
| 2-HP-beta-CD [%] | 243 | 3.12 | 9.56 | 0 | 0 | 0 | 0 | 36.46 |
| beta-CD [%] | 243 | 0.31 | 1.66 | 0 | 0 | 0 | 0 | 9.31 |
| CD-methacrylate [%] | 243 | 0.06 | 0.77 | 0 | 0 | 0 | 0 | 11.39 |
| Amberlite [%] | 243 | 0.27 | 1.38 | 0 | 0 | 0 | 0 | 8.35 |
| Eudragit-EPO [%] | 243 | 0.46 | 4.22 | 0 | 0 | 0 | 0 | 61.54 |
| Poloxamer [%] | 243 | 0.37 | 1.46 | 0 | 0 | 0 | 0 | 7.95 |
| PVP [%] | 243 | 0.55 | 1.51 | 0 | 0 | 0 | 0 | 7.99 |
| SLS [%] | 243 | 0.08 | 0.41 | 0 | 0 | 0 | 0 | 2.16 |
| PVA [%] | 243 | 0.06 | 0.50 | 0 | 0 | 0 | 0 | 4.52 |
| Camphor [%] | 243 | 0.97 | 2.50 | 0 | 0 | 0 | 0 | 10.31 |
| Hardness [N] | 243 | 36.58 | 18.98 | 2.4 | 27.415 | 35.69 | 44.075 | 155.43 |
| Thickness [mm] | 243 | 3.50 | 0.93 | 1.86 | 2.995 | 3.34 | 4.01 | 6.5 |
| Punch die of tablet press [mm] | 243 | 8.86 | 2.86 | 5.5 | 7 | 8 | 10 | 16 |
| Disintegration time [s] | 243 | 41.13 | 27.35 | 4.98 | 22.5 | 34.66 | 52.34 | 140 |
Figure 3Box and violin plot of selected variables present in the database. Boxes represent interquartile range (IQR), namely: first quartile (Q1), median (horizontal line), third quartile (Q3), and the lower whisker = Q1–1.5*IQR; the higher whisker = Q3 + 1.5*IQR; curves represent distributions of numeric data using kernel density function.
Hyperparameters and robustness of the H2O AutoML model development (multistart); mean values of RMSE, NRMSE, and R2 are provided for the developed models in a multistart procedure with 30 repetitions; standard deviation is in round brackets. DRF, distributed random forest; XRT, extremely randomized trees; GLM, generalized linear model; XGBoost, extreme gradient boosting machine; GBM, gradient boosting machine; DL, deep learning (fully connected multilayer artificial neural network); SE, stacked ensemble.
| Repetition | Hyperparameter Search | RMSE [s] | NRMSE [%] | R2 |
|---|---|---|---|---|
| 30 | Feature selection short loop time = 180 s | 11.37 (0.42) | 8.42 (0.31) | 0.83 (0.01) |
Selected input vector for the best predictive model.
| Variable | Variable Type | Scaled Variable Importance |
|---|---|---|
| CC-Na [%] | Composition, disintegrant | 1.0000 |
| Crospovidone [%] | Composition, disintegrant | 0.8013 |
| SSG [%] | Composition, disintegrant | 0.7341 |
| Hardness [N] | Manufacturing parameter | 0.6564 |
| Eudragit EPO [%] | Composition, solubilizer | 0.5620 |
| MgSt [%] | Composition, lubricant | 0.5008 |
| Aerosil [%] | Composition, lubricant | 0.3991 |
| GATS7i | API molecular descriptor | 0.3441 |
| MCC [%] | Composition, filler | 0.3394 |
| Colloidal silica [%] | Composition, lubricant | 0.2336 |
| Mannitol [%] | Composition, filler | 0.2335 |
| Pregelatinized starch [%] | Composition, disintegrant | 0.2009 |
| PVA [%] | Composition, solubilizer | 0.1618 |
| Thickness [mm] | Manufacturing parameter | 0.1482 |
| CD-methacrylate [%] | Composition, solubilizer | 0.1253 |
| GGI7 | API molecular descriptor | 0.1168 |
| MATS4p | API molecular descriptor | 0.1148 |
| MIC2 | API molecular descriptor | 0.1133 |
| API [%] | Composition | 0.1109 |
| Punch die of tablet press [mm] | Manufacturing parameter | 0.1058 |
| nT12Ring | API molecular descriptor | 0.1053 |
| XLogP | API molecular descriptor | 0.1048 |
| GATS7p | API molecular descriptor | 0.1046 |
| nF8HeteroRing | API molecular descriptor | 0.1038 |
| Amberlite [%] | Composition, solubilizer | 0.0972 |
| Sodium carboxymethyl starch [%] | Composition, disintegrant | 0.0955 |
| SLS [%] | Composition, solubilizer | 0.0952 |
| Camphor [%] | Composition, solubilizer (porophore) | 0.0896 |
| Calcium silicate [%] | Composition, binder | 0.0868 |
| Poloxamer [%] | Composition, solubilizer | 0.0862 |
| Sodium bicarbonate [%] | Composition, binder | 0.0839 |
| beta-CD [%] | Composition, solubilizer | 0.0831 |
| Talc [%] | Composition, lubricant | 0.0830 |
| 2-HP-beta-CD [%] | Composition, solubilizer | 0.0816 |
| SSF [%] | Composition, lubricant | 0.0751 |
| HPMC [%] | Composition, binder | 0.0675 |
| Lactose [%] | Composition, filler | 0.0591 |
| L-HPC [%] | Composition, disintegrant | 0.0542 |
| PVP [%] | Composition, solubilizer | 0.0525 |
Figure 4SHAP dependence plot of the top 20 features of the deep learning model. MCC, microcrystalline cellulose; CC-Na, croscarmellose sodium; SSG, sodium starch glycollate; MgSt, magnesium stearate; SSF, sodium stearyl fumarate; API, active pharmaceutical ingredient.
Figure 5SHAP plots representing the effects of formulation composition on the disintegration time [s] for: Crosspovidone [%] (A), croscarmellose sodium (CC-Na) [%] (B), sodium starch glycolate (SSG) [%] (C), Eudragit EPO [%] (D), Aerosil [%] (E), Talc [%] (F).
Figure 6SHAP plots representing effects of various manufacturing parameters on the disintegration time: punch die of tablet press [mm] (A), thickness [mm] (B), hardness [N] (C), amount of API [%] (D).
Figure 7SHAP plots representing the effects of APIs molecular descriptors on disintegration time. GATS7i and GATS7p, the Geary autocorrelation with lag 7 descriptors, weighted by ionization potential (A) or polarizability (B); GGI7, the topological charge index of order 7 (C); MATS4p, the Moran autocorrelation of lag 4 weighted by polarizability (D); MIC2, a modified information content index, neighborhood symmetry of 2-order descriptor (E); XLogP, a theoretical n-octanol–water partition coefficient (F).
Figure 8Partial dependency plots for XLogP vs. super-disintegrants: crospovidone (A), CC-Na (B), SSG (C), and lubricants: Aerosil (D), MgSt (E), SSF (F). XLogP, a theoretical n-octanol–water partition coefficient; SSG, sodium starch glycolate; MgSt, magnesium stearate; SSF, sodium stearyl fumarate; CC-Na, croscarmellose sodium.