| Literature DB >> 35228813 |
Xiaolan Mo1,2, Xiujuan Chen3, Xianggui Wang2, Xiaoli Zhong2, Huiying Liang3, Yuanyi Wei1, Houliang Deng1, Rong Hu1, Tao Zhang1, Yilu Chen1, Xia Gao4, Min Huang2, Jiali Li2.
Abstract
PURPOSE: Tacrolimus (TAC) is a first-line immunosuppressant for patients with refractory nephrotic syndrome (NS). However, there is a high inter-patient variability of TAC pharmacokinetics, thus therapeutic drug monitoring (TDM) is required. In this study, we aimed to employ machine learning algorithms to investigate the impact of clinical and genetic variables on the TAC dose/weight-adjusted trough concentration (C0/D) in Chinese children with refractory NS, and then develop and validate the TAC C0/D prediction models. PATIENTS AND METHODS: The association of 82 clinical variables and 244 single nucleotide polymorphisms (SNPs) with TAC C0/D in the third month since TAC treatment was examined in 171 children with refractory NS. Extremely randomized trees (ET), gradient boosting decision tree (GBDT), random forest (RF), extreme gradient boosting (XGBoost), and Lasso regression were carried out to establish and validate prediction models, respectively. The best prediction models were validated on a cohort of 30 refractory NS patients.Entities:
Keywords: genetic polymorphism; machine learning; nephrotic syndrome; prediction model; tacrolimus
Year: 2022 PMID: 35228813 PMCID: PMC8881964 DOI: 10.2147/PGPM.S339318
Source DB: PubMed Journal: Pharmgenomics Pers Med ISSN: 1178-7066
Figure 1The flowchart of model generation and validation.
Demographics and Clinical Characteristics of All Pediatric Patients with Primary Nephrotic Syndrome
| Characteristics | Whole Group (n=171) | CYP3A5 Expresser Group (n=84) | CYP3A5 Nonexpresser Group (n=87) |
|---|---|---|---|
| Male/Female | 128/43 | 64/20 | 64/23 |
| Age (years) | 5.4±3.5 | 5.9±3.5 | 4.9±3.3 |
| Weight (kg) | 19.5±8.3 | 20.8±8.8 | 18.4±7.8 |
| Alanine transaminase (ALT, U/L) | 19.416±14.449 | 17.760±11.063 | 21.039±17.093 |
| Aspartate transaminase (AST, U/L) | 28.180±11.714 | 27.700±11.249 | 28.660±12.257 |
| serum creatinine (SCr, µmol/L) | 30.733±13.678 | 32.855±13.657 | 28.466±13.448 |
| TAC C0/D ((ng/mL)/(mg/kg)) | 84.660 (20.300~228.004) | 61.622 (20.300~139.360) | 95.763 (42.900~228.004) |
Notes: Data are presented as median with P25~P75 (Percentile:25%~75%), mean ± standard deviation or amount.
Figure 2The influence of ALB0 (A), CYP3A5 (B), MYH9 rs2239781 (C and D) and CTLA4 rs4553808 (E) on TAC C0/D in whole group (* p<0.05, ** p<0.01).
Figure 3The influence of AGE0 (A), MAP3K11 rs7946115 (B), MYH9 rs2239781 (C) and CTLA4 rs4553808 (D) on TAC C0/D in CYP3A5 expressers (* p<0.05, ** p<0.01).
Figure 4The influence of GENDER (A), MYH9 rs4821478 (B), MYH9 rs2239781 (C), IL2RA rs12722489 (D), INF2 rs1128880 (E and F), ACTN4 rs56113315 (G), ACTN4 rs62121818 (H) and ACTN4 rs3745859 (I) on TAC C0/D in CYP3A5 nonexpressers (* p<0.05, ** p<0.01).
The Performances Comparison of Five Models in Test Set in Whole Group, CYP3A5 Expresser Group, and CYP3A5 Nonexpresser Group
| Groups | R Square | Precision (Mean Square Error) | Bias (Mean Error) | Mean Absolute Error | Median Absolute Error |
|---|---|---|---|---|---|
| Lasso | 0.428 | 608.533 | 4.536 | 21.161 | 21.052 |
| XGBoost | 0.438 | 597.490 | 3.700 | 21.739 | 19.158 |
| ET | 0.377 | 663.175 | 3.005 | 22.149 | 19.236 |
| RF | 0.397 | 641.602 | 3.085 | 21.215 | 19.242 |
| GBDT | 0.444 | 591.032 | 1.638 | 20.782 | 18.980 |
| Lasso | 0.333 | 514.357 | −4.795 | 18.285 | 11.228 |
| XGBoost | 0.345 | 504.838 | −6.665 | 17.962 | 13.691 |
| ET | 0.380 | 477.948 | −4.191 | 18.119 | 18.771 |
| RF | 0.310 | 531.457 | −5.238 | 18.697 | 16.292 |
| GBDT | 0.252 | 576.909 | −4.291 | 19.683 | 18.421 |
| Lasso | 0.207 | 1980.247 | −11.376 | 33.291 | 21.655 |
| XGBoost | 0.263 | 1840.371 | −10.599 | 31.696 | 20.332 |
| ET | 0.245 | 1887.017 | −10.643 | 32.081 | 18.825 |
| RF | 0.230 | 1924.323 | −9.710 | 32.737 | 20.518 |
| GBDT | 0.264 | 1839.459 | −11.002 | 31.257 | 19.399 |
Figure 5Observed and predicted C0/D in the whole group (A), the CYP3A5 expresser group (B), and the CYP3A5 nonexpresser group (C).
The Clinical Application of These Prediction Models. After Inputting the Variables into the Predicted Models, These Models Finally Output the Predicted C0/D Values
| Name | Input Variables | Output (ng/mL/(mg/kg)) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ALB0 (g/l) | / | / | / | / | / | ||||||
| X1 | 33.2 | 0 | 1 | 0 | 0 | / | / | / | / | 72.286 | |
| Y1 | 13.2 | 1 | 1 | 0 | 0 | / | / | / | / | 52.771 | |
| AGE0 (y) | / | / | / | / | / | / | |||||
| X2 | 4.9 | 0 | 1 | 0 | / | / | / | / | / | 51.082 | |
| Y2 | 6.4 | 0 | 0 | 0 | / | / | / | / | / | 60.876 | |
| GENDER | / | ||||||||||
| X3 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 106.600 | |
| Y3 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 117.945 | |
Notes: Xn~Yn are names of patients. C0/D means the TAC dose/weight-adjusted trough concentrations. 0 represents that the patient does not carry the genotype, and 1 represents that the patient carries the genotype.