| Literature DB >> 28176850 |
Jie Tang1,2, Rong Liu1,2, Yue-Li Zhang1,2, Mou-Ze Liu1,2, Yong-Fang Hu3, Ming-Jie Shao4, Li-Jun Zhu4, Hua-Wen Xin5, Gui-Wen Feng6, Wen-Jun Shang6, Xiang-Guang Meng7, Li-Rong Zhang7, Ying-Zi Ming4, Wei Zhang1,2.
Abstract
Tacrolimus has a narrow therapeutic window and considerable variability in clinical use. Our goal was to compare the performance of multiple linear regression (MLR) and eight machine learning techniques in pharmacogenetic algorithm-based prediction of tacrolimus stable dose (TSD) in a large Chinese cohort. A total of 1,045 renal transplant patients were recruited, 80% of which were randomly selected as the "derivation cohort" to develop dose-prediction algorithm, while the remaining 20% constituted the "validation cohort" to test the final selected algorithm. MLR, artificial neural network (ANN), regression tree (RT), multivariate adaptive regression splines (MARS), boosted regression tree (BRT), support vector regression (SVR), random forest regression (RFR), lasso regression (LAR) and Bayesian additive regression trees (BART) were applied and their performances were compared in this work. Among all the machine learning models, RT performed best in both derivation [0.71 (0.67-0.76)] and validation cohorts [0.73 (0.63-0.82)]. In addition, the ideal rate of RT was 4% higher than that of MLR. To our knowledge, this is the first study to use machine learning models to predict TSD, which will further facilitate personalized medicine in tacrolimus administration in the future.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28176850 PMCID: PMC5296901 DOI: 10.1038/srep42192
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Basic characteristic of the patients.
| Variable | The Derivation Cohort (N = 838) | The Validating Cohort (N = 207) |
|---|---|---|
| Continuous variable mean (sd) | ||
| Tacrolimus stable dose − mg/day | 3.48 (1.28) | 3.50 (1.32) |
| Age (year) | 36.19 (10.62) | 35.82 (10.34) |
| Height− cm | 165.36 (7.84) | 166.14 (7.53) |
| Weight − kg | 58.43 (11.01) | 58.39 (9.81) |
| Hemoglobin − g/dl | 118.29 (26.63) | 121.69 (26.06) |
| Leukocyte | 7.63 (2.66) | 7.64 (2.87) |
| Serum creatinine | 422.07 (463.78) | 429.15 (524.21) |
| Total bilirubin | 11.42 (6.68) | 11.28 (4.41) |
| Albumin − g/l | 43.35 (5.47) | 44.76 (4.68) |
| Categorical variable n. (%) | ||
| Male | 597 (71.2) | 148 (71.5) |
| Diabetes | 19 (2.3) | 3 (1.4) |
| Hypertension | 561 (66.9) | 134 (64.7) |
| Living donor | 420 (50.1) | 107 (51.7) |
| Anemia | 409 (48.8) | 94 (45.4) |
| Cardiac insufficiency | 33 (3.9) | 11 (5.3) |
| Use of Calcium channel blocker | 544 (64.9) | 127 (61.4) |
| Use of Metoprolol | 377 (45.0) | 84 (40.6) |
| Use of Omeprazole | 246 (29.4) | 59 (28.5) |
| Use of Furosemide | 420 (50.1) | 107 (51.7) |
| ACEI/ARA | 185 (22.1) | 43 (20.8) |
| Cephalosporin | 513 (61.2) | 126 (60.9) |
| Infected | 190 (22.7) | 44 (21.2) |
| CYP3A5 *3 | ||
| A/A | 77 (9.2) | 15 (7.2) |
| A/G | 321 (38.3) | 87 (42.0) |
| G/G | 424 (50.6) | 97 (46.9) |
| Unknown | 16 (1.9) | 8 (3.8) |
*ACEI/ARA: Angiotensin converting enzyme inhibition, Angiotensin II receptor antagonist.
Figure 1Ideal rate and mean absolute error in train and test partitions for nine techniques averaged over 100 round of resampling process results for different models fitted.
Predicted dose within 20% of the actual dose in the train (A) and test (B) set of the derivation cohort. Mean absolute error between the predicted and actual dose in the train (C) and test (D) set. The vertical bars represent the 95% CIs of MAE. MLR: multiple linear regression; SVR: support vector regression; ANN: artificial neural network; RT: regression tree; RFR: random forest regression; BRT: boosted regression tree; MARS: multivariate adaptive regression splines; LAR: lasoo regression; BART: Bayesian additive regression trees.
Figure 2Predicted tacrolimus doses according to the regression tree algorithm.
N and dose represent the sample size and predicted tacrolimus dose, respectively.
Predicted tacrolimus stable doses with the regression tree algorithm compared with the actual stable dose in the derivation and validation cohorts.
| Cohort | Mean Absolute Error (95% CI) mg/day | |
|---|---|---|
| Derivation cohort | Validation cohort | |
| Overall | 0.71 (0.66–0.75) | 0.73 (0.63–0.84) |
| ≤2.5 mg/day | 1.31 (1.20–1.42) | 1.33 (1.13–1.53) |
| >2.5 mg/day to <4 mg/day | 0.50 (0.46–0.54) | 0.48 (0.39–0.58) |
| ≥4 mg/day | 1.07 (0.96–1.18) | 1.14 (0.92–1.37) |
Percentage of patients in the validation cohort and in the derivation-plus-validation cohort with an ideal, underestimated, or overestimated dose of tacrolimus in renal transplant patients requiring low, intermediate, or high actual doses of tacrolimus.
| Actual Dose Required | N/o. of Patients (%) | Ideal Dose | Underestimation& | Overestimation |
|---|---|---|---|---|
| Validation cohort only | 126 | 54.8 | 19.8 | 25.4 |
| ≤2.5 mg/day | 32 | 31.2 | 0 | 68.8 |
| >2.5 mg/day to ≤4 mg/day | 69 | 72.5 | 14.5 | 13.0 |
| >4 mg/day | 25 | 36.0 | 60.0 | 4.0 |
| Derivation-plus-validation cohort | 693 | 57.3 | 19.2 | 23.4 |
| ≤2.5 mg/day | 169 | 38.5 | 0 | 61.5 |
| >2.5 mg/day to ≤4 mg/day | 380 | 70.5 | 14.4 | 15.0 |
| >4 mg/day | 144 | 44.4 | 54.9 | 0.7 |
*The ideal dose was defined as a predicted dose that was within 20% of the actual stable therapeutic dose of tacrolimus. &We defined underestimation as a predicted dose that was at least 20% lower than the actual stable dose. $We defined overestimation as a predicted dose that was at least 20% higher than the actual stable dose.
Figure 3Study flow chart.
MLR: multiple linear regression; SVR: support vector regression; ANN: artificial neural network; RT: regression tree; RFR: random forest regression; BRT: boosted regression tree; MARS: multivariate adaptive regression splines; LAR: lasoo regression; BART: Bayesian additive regression trees.