| Literature DB >> 32547808 |
Mohammad Moqaddasi Amiri1, Leili Tapak1,2, Javad Faradmal2,3, Javad Hosseini1, Ghodratollah Roshanaei2,3.
Abstract
OBJECTIVES: Longitudinal data are prevalent in clinical research; due to their correlated nature, special analysis must be used for this type of data. Creatinine is an important marker in predicting end-stage renal disease, and it is recorded longitudinally. This study compared the prediction performance of linear regression (LR), linear mixed-effects model (LMM), least-squares support vector regression (LS-SVR), and mixed-effects least-squares support vector regression (MLS-SVR) methods to predict serum creatinine as a longitudinal outcome.Entities:
Keywords: Creatinine; Longitudinal Studies; Machine Learning; Renal Dialysis; Support Vector Machine
Year: 2020 PMID: 32547808 PMCID: PMC7278511 DOI: 10.4258/hir.2020.26.2.112
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Figure 1Framework of data pre-processing. LR: linear regression, LMM: linear mixed-effects model, LS-SVR: leastsquares support vector regression, MLS-SVR: mixed-effects least-squares support vector regression.
Descriptive statistics of hemodialysis data
| Variable | Male | Female | |
|---|---|---|---|
| Creatinine | 7.45 ± 3.13 | 6.99 ± 2.45 | 0.302 |
| Blood urea nitrogen | 130.06 ± 48.67 | 123.00 ± 43.08 | 0.335 |
| Hematocrit | 31.20 ± 5.79 | 32.00 ± 4.93 | 0.355 |
| Hemoglobin | 9.83 ± 2.05 | 10.01 ± 1.77 | 0.567 |
| Fasting blood sugar | 112.80 ± 50.11 | 108.71 ± 42.09 | 0.578 |
| Potassium (K) | 4.83 ± 1.00 | 4.93 ± 0.96 | 0.501 |
| Phosphorous (P) | 5.08 ± 1.65 | 5.09 ± 1.46 | 0.965 |
| Calcium (Ca) | 8.61 ± 1.28 | 8.95 ± 0.98 | 0.058 |
| Age | 59.19 ± 15.87 | 61.63 ± 14.93 | 0.321 |
Values are presented as mean ± standard deviation.
Regression coeficient of covariates of fitting the linear mixed-effects model to serum creatinine
| Variable | Coefficient (standard error) | |
|---|---|---|
| Time | 0.036 (0.010) | 0.0002 |
| Blood urea nitrogen | 0.019 (0.001) | <0.001 |
| Hematocrit | −0.038 (0.012) | 0.001 |
| Hemoglobin | 0.208 (0.036) | <0.001 |
| Fasting blood sugar | −0.001 (0.001) | 0.312 |
| Potassium (K) | 0.129 (0.040) | 0.001 |
| Phosphorous (P) | 0.172 (0.029) | <0.001 |
| Calcium (Ca) | 0.086 (0.035) | 0.015 |
| Age | −0.049 (0.023) | <0.001 |
| Diabetes (yes) | 0.119 (0.256) | 0.644 |
| Gender (male) | 0.616 (0.247) | 0.014 |
| Number of weekly dialysis | 0.630 (0.216) | 0.004 |
| Hypertension (yes) | 0.587 (0.254) | 0.022 |
Performance of the models in predicting creatinine for training and testing sets
| MLS-SVR | LMM | LS-SVR | LR | |
|---|---|---|---|---|
| Training data | ||||
| MSE | 1.280 | 1.525 | 3.720 | 4.001 |
| MAE | 0.833 | 0.921 | 1.480 | 1.545 |
| MAPE | 0.129 | 0.142 | 0.230 | 0.241 |
| 0.805 | 0.766 | 0.426 | 0.381 | |
|
| ||||
| Testing data | ||||
| MSE | 3.275 | 3.885 | 6.646 | 6.865 |
| MAE | 1.319 | 1.495 | 2.008 | 2.072 |
| MAPE | 0.159 | 0.171 | 0.238 | 0.244 |
| 0.654 | 0.648 | 0.349 | 0.345 | |
MLS-SVR: mixed-effects least-squares support-vector regression, LMM: linear mixed-effects model, LS-SVR: least-squares support vector regression, LR: linear regression, MSE: mean squared error, MAE: mean absolute error, MAPE: mean absolute-prediction error, R2: determination coefficient.
Figure 2Comparison of predicted and observed values for MLS-SVR and LMM for three patients: (A) patient #1, (B) patient #2, and (C) patient #3. The extended line is bisector. MLS-SVR: mixed-effects least-squares support-vector regression, LMM: linear mixed-effects model.
Figure 3Variable importance (VIMP) of each factor in prediction of creatinine using MLS-SVR method. Mean of changes in MAE after each permutation (standard error). BUN: blood urea nitrogen, FBS: fasting blood sugar, HCT: hematocrit, P: phosphorous, HB: hemoglobin, Ca: calcium, K: potassium, MLS-SVR: mixed-effects least-squares support-vector regression, MAE: mean absolute error.