Richard John Woodman1, Kimberley Bryant1, Michael J Sorich1, Alberto Pilotto2,3, Arduino Aleksander Mangoni1.
Abstract
BACKGROUND: The Multidimensional Prognostic Index (MPI) is an aggregate, comprehensive, geriatric assessment scoring system derived from eight domains that predict adverse outcomes, including 12-month mortality. However, the prediction accuracy of using the three MPI categories (mild, moderate, and severe risk) was relatively poor in a study of older hospitalized Australian patients. Prediction modeling using the component domains of the MPI together with additional clinical features and machine learning (ML) algorithms might improve prediction accuracy.
Keywords: Multidimensional Prognostic Index; XGBoost; diagnostic accuracy; machine learning; mortality
Year: 2021 PMID: 34152274 PMCID: PMC8277374 DOI: 10.2196/26139
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Patient characteristics according to vital status at 12 months after hospital discharge.
| Characteristics | Alive (n=536) | Deceased (n=201) | P value^a |
| Age (years), median (IQR) | 79 (72-85) | 82 (74-88) | .002 |
| Gender, n (%) | | | .14 |
| Female | 278 (51.9) | 92 (45.6) | |
| Male | 257 (47.9) | 110 (54.4) | |
| MPI^b category, n (%) | | | <.001 |
| Mild | 211 (39.4) | 39 (19.3) | |
| Moderate | 290 (54.2) | 136 (67.3) | |
| Severe | 34 (6.4) | 27 (13.4) | |
| ADL^c, median (IQR) | 6 (5-6) | 5 (4-6) | <.001 |
| IADL^d, median (IQR) | 6 (4-8) | 4 (3-6) | <.001 |
| SPMSQ^e, median (IQR) | 1 (0-2) | 1 (0-3) | .05 |
| ESS^f, median (IQR) | 18 (17-19) | 17 (15-18) | <.001 |
| CIRS^g, mean (SD) | 2.4 (0.4) | 2.6 (0.4) | <.001 |
| MNA^h, mean (SD) | 20.9 (3.9) | 18.0 (4.6) | <.001 |
| Total number of medications, mean (SD) | 10.0 (4.4) | 10.3 (4.5) | .55 |
| Cohabitation status, n (%) | | | .009 |
| Living alone | 199 (37.2) | 70 (34.7) | |
| Family or friends | 300 (56.1) | 104 (51.5) | |
| Institute | 36 (6.7) | 28 (13.9) | |
| BMI (kg/m²), median (IQR) | 26.9 (23.8-31.8) | 25.2 (22.0-29.1) | <.001 |
| Sodium (mmol/L), median (IQR) | 138 (135-140) | 138 (135-140) | .006 |
| Albumin (g/L), mean (SD) | 32.5 (5.6) | 30.4 (5.7) | <.001 |
| Hemoglobin (g/L), mean (SD) | 118.7 (18.2) | 112.0 (18.3) | <.001 |
| eGFR^i (mL/min/1.73 m²), mean (SD) | 55.3 (24.2) | 52.2 (26.6) | .14 |
| CRP^j (mg/L), median (IQR) | 29.0 (6.0-81.0) | 33.0 (14.2-84.0) | .048 |
| Creatinine (µmol/L), median (IQR) | 95 (74-134) | 103 (72-151) | .09 |
| Urea (mmol/L), median (IQR) | 7.40 (5.3-11.6) | 8.90 (5.7-15.0) | <.001 |
| Urea-to-creatinine ratio, median (IQR) | 0.08 (0.06-0.10) | 0.09 (0.06-0.10) | .001 |
| ARS^k, median (IQR) | 0 (0-2) | 0 (0-2) | .41 |
^a Using two-tailed independent t test, Mann-Whitney U test, or chi-square test, as appropriate.
^b MPI: Multidimensional Prognostic Index.
^c ADL: activities of daily living.
^d IADL: instrumental activities of daily living.
^e SPMSQ: Short Portable Mental Status Questionnaire.
^f ESS: Exton Smith Scale.
^g CIRS: Cumulative Illness Rating Scale.
^h MNA: Mini Nutritional Assessment.
^i eGFR: estimated glomerular filtration rate.
^j CRP: C-reactive protein.
^k ARS: anticholinergic risk score.
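The chi-square test named in the first footnote can be illustrated with the cohabitation counts from the table. The following is a minimal sketch using only the Python standard library, not the authors' analysis code; the helper names `chi_square_stat` and `p_value_2dof` are hypothetical, and the closed-form P value is valid only for 2 degrees of freedom, which is what a 2×3 table has.

```python
import math

def chi_square_stat(table):
    """Pearson chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of row and column variables.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat

def p_value_2dof(stat):
    """Upper-tail P value of a chi-square distribution with exactly 2 degrees of freedom."""
    return math.exp(-stat / 2)

# Counts as printed in the table: living alone, family or friends, institute.
cohabitation = [
    [199, 300, 36],  # alive at 12 months
    [70, 104, 28],   # deceased at 12 months
]

stat = chi_square_stat(cohabitation)
p = p_value_2dof(stat)
print(f"chi-square = {stat:.2f}, P = {p:.3f}")
```

With the counts as printed, the statistic is about 9.41 and the P value about .009, in line with the value reported for cohabitation status.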
Figure 1. Spearman ρ correlation matrix heatmap for feature set 4. ADL: activities of daily living; ARS: anticholinergic risk score; CIRS: Cumulative Illness Rating Scale; Cohab1: living alone; Cohab2: living with family or friends; Cohab3: living in an institute; CRP: C-reactive protein; eGFR: estimated glomerular filtration rate; ESS: Exton Smith Scale; Hgb: serum hemoglobin; IADL: instrumental activities of daily living; MNA: Mini Nutritional Assessment; No.Meds: number of medications; SPMSQ: Short Portable Mental Status Questionnaire; Urea/Cr: urea-to-creatinine ratio.
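Each cell of such a heatmap is a Spearman ρ, that is, a Pearson correlation computed on ranks, with tied values sharing the mean of their ranks. The sketch below shows the calculation for one pair of features. The function names are hypothetical, and the `adl` and `iadl` values are made-up toy data, not study data.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Hypothetical toy scores for eight patients (illustration only).
adl = [6, 5, 6, 4, 3, 6, 2, 5]
iadl = [8, 6, 7, 4, 3, 8, 1, 5]
print(f"Spearman rho (toy ADL vs IADL) = {spearman_rho(adl, iadl):.2f}")
```

Repeating the calculation for every pair of features yields the full matrix that a heatmap of this kind displays.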
Diagnostic accuracy for logistic regression with maximum likelihood estimation and the 9 machine learning algorithms using feature sets 1 to 4 with the test data set.
| Model | AUC^a, feature set 1^b | AUC, feature set 2^c,d | AUC, feature set 3^e | AUC, feature set 4^f,d | AUC, mean (SD) |
| LR-MLE^g | 0.632 | 0.688 | 0.738 | 0.757 | 0.704 (0.06) |
| XGB^h | 0.635 | 0.706 | 0.756 | 0.757 | 0.714 (0.06) |
| Neural network | 0.637 | 0.689 | 0.749 | 0.758 | 0.708 (0.06) |
| Random forest | 0.621 | 0.684 | 0.753 | 0.751 | 0.702 (0.06) |
| Ridge^i | 0.632 | 0.671 | 0.738 | 0.749 | 0.698 (0.06) |
| KNN^j | 0.626 | 0.642 | 0.731 | 0.715 | 0.679 (0.06) |
| Nonpenalized logistic regression | 0.627 | 0.642 | 0.707 | 0.690 | 0.667 (0.05) |
| Naïve Bayes | 0.591 | 0.649 | 0.705 | 0.704 | 0.663 (0.04) |
| SVM^k | 0.530 | 0.661 | 0.737 | 0.711 | 0.656 (0.09) |
| Decision tree | 0.604 | 0.588 | 0.695 | 0.686 | 0.643 (0.06) |
^a AUC: area under the receiver operating curve.
^b Multidimensional Prognostic Index categories, age, gender (n=5 features).
^c Multidimensional Prognostic Index categories, age, gender, BMI, anticholinergic risk score, laboratory data (n=15 features).
^d Lab data=serum albumin, sodium, serum hemoglobin, C-reactive protein, creatinine, urea, urea-to-creatinine ratio, and estimated glomerular filtration rate.
^e Multidimensional Prognostic Index domains, age, gender (n=10 features).
^f Multidimensional Prognostic Index domains, age, gender, BMI, anticholinergic risk score, laboratory data (n=20 features).
^g LR-MLE: logistic regression with maximum likelihood estimation.
^h XGB: extreme gradient boosting.
^i Ridge: ridge regression.
^j KNN: K-nearest neighbors.
^k SVM: support vector machine.
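The final column of the table summarizes each model's AUC across the four feature sets. The sketch below recomputes that summary for four of the rows, using the AUC values as printed. It assumes the SD is the sample SD (n-1 denominator), which is not stated in the source; the variable names are illustrative only.

```python
from statistics import mean, stdev

# Test-set AUCs for feature sets 1 to 4, copied from the table.
auc_by_feature_set = {
    "LR-MLE": [0.632, 0.688, 0.738, 0.757],
    "XGB": [0.635, 0.706, 0.756, 0.757],
    "Neural network": [0.637, 0.689, 0.749, 0.758],
    "Random forest": [0.621, 0.684, 0.753, 0.751],
}

# Mean and sample SD (assumed n-1 denominator) per model.
summary = {
    model: (mean(aucs), stdev(aucs))
    for model, aucs in auc_by_feature_set.items()
}

for model, (auc_mean, auc_sd) in summary.items():
    print(f"{model}: mean AUC = {auc_mean:.3f}, SD = {auc_sd:.2f}")
```

For these rows the sample SD rounds to 0.06, which matches the printed column. The population SD (n denominator) would give 0.05 for XGB, so the n-1 assumption appears to fit the table.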
Figure 2. Test accuracy of the 9 machine learning algorithms using feature sets 1 to 4. AUC: area under the receiver operating curve; dt: decision tree; knn: K-nearest neighbors; lr: logistic regression without penalization; nb: naive bayes; nn: neural network; rf: random forest; ridge: ridge regression; ROC: receiver operating curve; svm: support vector machine; xgb: eXtreme gradient boosting.
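The AUC reported for each model can be read as the probability that a randomly chosen patient who died receives a higher predicted risk than a randomly chosen survivor, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The function name `auc` is hypothetical, and the `died` and `risk` values are made-up toy data, not study predictions.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positive and negative cases."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0      # correctly ordered pair
            elif pos_score == neg_score:
                wins += 0.5      # tie counts as half
    return wins / (len(positives) * len(negatives))

# Hypothetical toy data: 1 = deceased at 12 months, 0 = alive.
died = [1, 0, 1, 0, 0, 1, 0, 0]
risk = [0.81, 0.35, 0.42, 0.50, 0.12, 0.66, 0.42, 0.20]
print(f"AUC (toy data) = {auc(died, risk):.3f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for the values between roughly 0.53 and 0.76 in the table.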
Figure 3. Feature importance plot for the eXtreme gradient boosting algorithm using test data with feature sets 1 to 4. ADL: activities of daily living; ARS: anticholinergic risk score; CIRS: Cumulative Illness Rating Scale; Cohab1: living alone; Cohab2: living with family or friends; Cohab3: living in an institute; Creat: creatinine; CRP: C-reactive protein; eGFR: estimated glomerular filtration rate; ESS: Exton Smith Scale; Hgb: serum hemoglobin; IADL: instrumental activities of daily living; MNA: Mini Nutritional Assessment; MPI: Multidimensional Prognostic Index; ROC: receiver operating curve; SPMSQ: Short Portable Mental Status Questionnaire; Ur/Cr: urea-to-creatinine ratio.
Figure 4. Violin plots showing distributions for the top 4 features for eXtreme gradient boosting in the second test feature set by patient vital status at 12 months after hospital discharge. CIRS: Cumulative Illness Rating Scale; IADL: instrumental activities of daily living; MNA: Mini Nutritional Assessment.
Odds ratios (95% CIs) for the logistic regression with maximum likelihood estimation model using the test data with feature set 4^a.
| Feature | Odds ratio (95% CI) | P value |
| Age | 1.20 (0.90-1.61) | .21 |
| ADL^b | 0.99 (0.71-1.39) | .96 |
| IADL^c | 0.88 (0.63-1.22) | .44 |
| SPMSQ^d | 0.99 (0.78-1.25) | .91 |
| ESS^e | 0.89 (0.62-1.27) | .51 |
| CIRS^f | 1.81 (1.32-2.49) | <.001 |
| BMI | 0.82 (0.63-1.07) | .14 |
| MNA^g | 0.57 (0.44-0.74) | <.001 |
| Sodium | 0.96 (0.77-1.20) | .74 |
| Urea | 1.77 (0.89-3.51) | .10 |
| Creatinine | 0.85 (0.53-1.36) | .50 |
| Albumin | 0.83 (0.64-1.07) | .15 |
| Hemoglobin | 0.88 (0.68-1.14) | .35 |
| Number of medications | 0.69 (0.52-0.93) | .02 |
| ARS^h | 1.06 (0.82-1.36) | .66 |
| eGFR^i | 1.15 (0.74-1.80) | .53 |
| CRP^j | 0.94 (0.73-1.20) | .61 |
| Cohabitation status | | |
| Alone | 1.00^k | N/A^l |
| Family or friends | 0.96 (0.73-1.25) | .75 |
| Institute | 1.22 (0.95-1.57) | .12 |
| Gender | | |
| Female | 1.00^k | N/A |
| Male | 0.69 (0.40-1.18) | .18 |
| Urea-to-creatinine | 1.13 (0.67-1.91) | .64 |
^a All continuous variables were scaled before analysis to have a mean of zero and an SD of 1. Gender and cohabitation status were dummy coded for each category.
^b ADL: activities of daily living.
^c IADL: instrumental activities of daily living.
^d SPMSQ: Short Portable Mental Status Questionnaire.
^e ESS: Exton Smith Scale.
^f CIRS: Cumulative Illness Rating Scale.
^g MNA: Mini Nutritional Assessment.
^h ARS: anticholinergic risk score.
^i eGFR: estimated glomerular filtration rate.
^j CRP: C-reactive protein.
^k This is the reference group. Therefore, there is no CI.
^l N/A: not applicable.
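Because the continuous features were standardized, each odds ratio in the table is the change in odds of death per 1-SD increase in that feature. The link between an odds ratio, its 95% CI, and its P value can be sketched as follows. This assumes a Wald-type interval that is symmetric on the log-odds scale, which the source does not state; `wald_p_from_or` is a hypothetical helper, and results are approximate because the published values are rounded.

```python
import math

def wald_p_from_or(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover the log-odds coefficient, its SE, the z statistic, and a two-sided P value.

    Assumes the 95% CI was built as exp(beta +/- z_crit * SE).
    """
    beta = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se
    # Two-sided P value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p

# Odds ratios and CIs as printed in the table.
rows = {
    "Age": (1.20, 0.90, 1.61),
    "CIRS": (1.81, 1.32, 2.49),
    "MNA": (0.57, 0.44, 0.74),
}

results = {name: wald_p_from_or(*values) for name, values in rows.items()}
for name, (beta, se, z, p) in results.items():
    print(f"{name}: beta = {beta:.3f}, SE = {se:.3f}, z = {z:.2f}, P = {p:.4f}")
```

Under this assumption, CIRS and MNA both give P values below .001 and age gives a P value near .2, consistent with the table. An odds ratio below 1, as for MNA, indicates lower odds of death per 1-SD increase in the score.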