| Literature DB >> 30339708 |
Zhiyuan Ma1,2, Ping Wang1, Zehui Gao3, Ruobing Wang4,5, Koroush Khalighi1,6,7.
Abstract
Warfarin dosing remains challenging due to narrow therapeutic index and highly individual variability. Incorrect warfarin dosing is associated with devastating adverse events. Remarkable efforts have been made to develop the machine learning based warfarin dosing algorithms incorporating clinical factors and genetic variants such as polymorphisms in CYP2C9 and VKORC1. The most widely validated pharmacogenetic algorithm is the IWPC algorithm based on multivariate linear regression (MLR). However, with only a single algorithm, the prediction performance may reach an upper limit even with optimal parameters. Here, we present novel algorithms using stacked generalization frameworks to estimate the warfarin dose, within which different types of machine learning algorithms function together through a meta-machine learning model to maximize the prediction accuracy. Compared to the IWPC-derived MLR algorithm, Stack 1 and 2 based on stacked generalization frameworks performed significantly better overall. Subgroup analysis revealed that the mean of the percentage of patients whose predicted dose of warfarin within 20% of the actual stable therapeutic dose (mean percentage within 20%) for Stack 1 was improved by 12.7% (from 42.47% to 47.86%) in Asians and by 13.5% (from 22.08% to 25.05%) in the low-dose group compared to that for MLR, respectively. These data suggest that our algorithms would especially benefit patients requiring low warfarin maintenance dose, as subtle changes in warfarin dose could lead to adverse clinical events (thrombosis or bleeding) in patients with low dose. Our study offers novel pharmacogenetic algorithms for clinical trials and practice.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30339708 PMCID: PMC6195267 DOI: 10.1371/journal.pone.0205872
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic and clinical characteristics of the IWPC cohort.
| Variable | IWPC data (n = 5743) |
|---|---|
| Warfarin dose—mg/week | |
| Mean (SD) | 32.0 (16.8) |
| Median | 28.0 |
| Interquartile range | 20.0–40.0 |
| Genotype—no. (%) | |
| | |
| G/G | 1887 (32.9) |
| A/G | 2065 (36.0) |
| A/A | 1683 (29.3) |
| Unknown | 108 (1.9) |
| | |
| *1/*1 | 4232 (73.7) |
| *1/*2 | 755 (13.2) |
| *1/*3 | 482 (8.4) |
| *2/*2 | 58 (1.0) |
| *2/*3 | 68 (1.2) |
| *3/*3 | 20 (0.4) |
| Unknown | 128 (2.2) |
| Age—no. (%) | |
| < 50 | 974 (17.0) |
| 50–80 | 4075 (71.0) |
| > 80 | 694 (12.1) |
| Height—m | |
| Median | 167.6 |
| Interquartile range | 160.0–175.8 |
| Weight—kg | |
| Median | 76.0 |
| Interquartile range | 63.0–90.7 |
| Race—no. (%) | |
| White | 3095 (53.9) |
| Asian | 1517 (26.4) |
| Black | 665 (11.6) |
| Mixed or missing | 466 (8.1) |
| Enzyme inducer | 61 (1.1) |
| Amiodarone | 280 (4.9) |
| Statin | 972 (16.9) |
| Smoker | 482 (8.4) |
| DM | 619 (10.8) |
| Heart failure | 716 (12.5) |
| Valve replacement | 975 (17.0) |
Comparison of the performance of individual machine learning algorithms.
| Algorithms | MAE (95% CI) | Within 20% (95% CI) | P value (vs. MLR) |
|---|---|---|---|
| 8.53 (8.08–8.99) | 46.31 (43.73–48.89) | ||
| 8.52 (8.11–8.93) | 46.51 (44.08–48.97) | 0.639 | |
| 8.52 (8.12–8.92) | 46.29 (43.74–48.84) | 0.758 | |
| 8.84 (8.35–9.33) | 44.35 (41.50–47.20) | <0.001 | |
| 8.82 (8.42–9.23) | 44.88 (42.47–47.29) | <0.001 | |
| 9.28 (8.84–9.73) | 42.88 (40.16–45.59) | <0.001 | |
| 10.18 (9.73–10.63) | 39.02 (36.81–41.22) | <0.001 | |
| 10.86 (10.32–11.40) | 36.43 (33.89–38.96) | <0.001 |
* P value for MAE
# P value for Within 20%
Within 20%: the mean of the percentage of patients whose predicted dose of warfarin within 20% of the actual stable therapeutic dose.
SV: Support Vector Machine; RR: Ridge Regression; MLR: Multivariate Linear Regression; NN: Neural Network; GBT: Light Gradient Boosting Machine; RF: Random Forests; ET: Extremely Randomized Tree; KN: K nearest neighbors.
Fig 1Schematic representation of the configurations of the stacked generalization frameworks (Stack 1 and 2) built from base models.
SV: Support Vector Regression; RR: Ridge Regression; NN: Neural Network; GBT: Light Gradient Boosting Machine.
Overall comparison of the performance of the Stack 1 and 2 with MLR.
| Algorithms | MAE (95% CI) | Within 20% (95% CI) | P value (vs. MLR) |
|---|---|---|---|
| 8.31 (7.86–8.76) | 47.85 (45.43–50.28) | <0.001 | |
| 8.31 (7.87–8.76) | 47.81 (45.44–50.19) | <0.001 | |
| 8.53 (8.08–8.99) | 46.31 (43.73–48.89) |
* P value for MAE
# P value for Within 20%
Within 20%: the mean of the percentage of patients whose predicted dose of warfarin within 20% of the actual stable therapeutic dose.
Comparison of the performance of the Stack 1 and 2 with MLR in Asians, whites and blacks.
| Algorithms | Asian | White | Black | |||
|---|---|---|---|---|---|---|
| MAE | Within 20% | MAE | Within 20% | MAE | Within 20% | |
| 6.13 | 47.86 | 8.70 | 48.43 | 11.88 | 44.90 | |
| 6.14 | 47.66 | 8.70 | 48.46 | 11.92 | 44.73 | |
| 6.64 | 42.47 | 8.85 | 48.13 | 12.00 | 43.97 | |
| <0.001 | <0.001 | 0.002 | 0.244 | 0.360 | 0.093 | |
* P value for Stack1 vs. MLR
# P value for Stack2 vs. MLR
Within 20%: the mean of the percentage of patients whose predicted dose of warfarin within 20% of the actual stable therapeutic dose.
Comparison of the Stack 1 and 2 with MLR in three dose ranges.
| Algorithms | Low dose | Intermediate dose | High dose | |||
|---|---|---|---|---|---|---|
| MAE | Within 20% | MAE | Within 20% | MAE | Within 20% | |
| 8.11 | 25.05 | 5.47 | 62.13 | 14.43 | 41.75 | |
| 8.27 | 24.66 | 5.50 | 61.96 | 14.23 | 42.35 | |
| 8.62 | 22.08 | 5.59 | 60.90 | 14.61 | 41.11 | |
| <0.001 | <0.001 | <0.001 | <0.001 | 0.072 | 0.071 | |
* P value for Stack1 vs. MLR
# P value for Stack2 vs. MLR
Within 20%: the mean of the percentage of patients whose predicted dose of warfarin within 20% of the actual stable therapeutic dose.