| Literature DB >> 30383850 |
Aaron Jones1, Andrew P Costa1,2, Angelina Pesevski3, Paul D McNicholas4.
Abstract
OBJECTIVE: The objective of this study was to compare the performance of several commonly used machine learning methods to traditional statistical methods for predicting emergency department and hospital utilization among patients receiving publicly-funded home care services. STUDY DESIGN ANDEntities:
Mesh:
Year: 2018 PMID: 30383850 PMCID: PMC6211724 DOI: 10.1371/journal.pone.0206662
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of study participants.
| Year of Assessment | 2014 (n = 29,132) | 2015 (n = 29,278) | 2016 (n = 29,935) |
|---|---|---|---|
| n(%) | n(%) | n(%) | |
| Age (Median (IQR)) | 82 (16) | 82 (16) | 82 (16) |
| Sex (F) | 18,497 (63.49) | 18,301 (62.51) | 18,250 (60.97) |
| Lives Alone | 10,169 (34.91) | 9,985 (34.10) | 10,050 (33.57) |
| ADL Impairment | 18,489 (63.47) | 19,423 (66.34) | 21,004 (70.17) |
| Cognitive Impairment | 18,789 (64.50) | 19,776 (67.55) | 20,958 (70.01) |
| Fall in last 90 days | 12,104 (41.55) | 13,051 (44.58) | 14,181 (47.37) |
| Bladder Incontinence | 12,393 (42.54) | 12,728 (43.47) | 13,315 (44.48) |
| Poor Self-Reported Health | 5,956 (20.44) | 6,253 (21.36) | 8,173 (27.30) |
| Dyspnea | 8,191 (28.12) | 8,492 (29.00) | 9,047 (30.22) |
| Mood Symptoms | 12,843 (44.09) | 13,644 (46.60) | 15,239 (50.91) |
| Aggressive Behaviour | 2,659 (9.13) | 3,057 (10.44) | 3,650 (12.19) |
| Wandering | 803 (2.76) | 9,37 (3.20) | 1,101 (3.68) |
| Number of Medications (Mean (SD)) | 7.40 (2.33) | 7.41 (2.32) | 7.37 (2.34) |
| Caregiver expresses distress | 6,093 (20.92) | 6,743 (23.03) | 9,507 (31.76) |
| Informal care hours per day (Mean (SD)) | 19.57 (22.11) | 19.47 (21.87) | 20.32 (23.92) |
| Cardiovascular | 15,053 (51.67) | 15,301 (52.26) | 15,958 (53.31) |
| Dementia | 6,869 (23.58) | 7,280 (24.87) | 7,915 (26.44) |
| Neurological | 2,981 (10.23) | 3,341 (11.41) | 3,961 (13.23) |
| Musculoskeletal | 19,141 (65.70) | 19,389 (66.22) | 20,028 (66.90) |
| Psychiatric | 5,995 (20.58) | 6,408 (21.89) | 7,183 (24.00) |
| Cancer | 3,669 (12.59) | 3,779 (12.91) | 4,014 (13.41) |
| Diabetes | 7,700 (26.43) | 7,754 (26.48) | 8,316 (27.78) |
| COPD | 5,464 (18.76) | 5,581 (19.06) | 5,793 (19.35) |
aADL Long Form > 0.
bCognitive Performance Scale > 0.
cDepression Rating Scale > 0.
dVerbal abuse, physical abuse, socially inappropriate behavior, or resistance to care in last 3 days.
eCaregiver expresses feelings of distress, anger, or depression.
fStroke, congestive heart failure, coronary artery disease, dysrhythmia, peripheral vascular disease.
gHead trauma, hemiplegia, multiple sclerosis, parkinsonism.
hArthritis, fracture, osteoporosis.
i Any psychiatric diagnosis
Distribution of observed outcomes.
| Year of Assessment | 2014 | 2015 | 2016 |
|---|---|---|---|
| Outcome | % | % | % |
| ED visit with injurious fall | 9.1 | 9.8 | 9.7 |
| Unplanned hospital admission | 27.4 | 28.0 | 28.0 |
| ED visit count—0 | 53.4 | 52.5 | 51.6 |
| ED visit count—1 | 24.6 | 24.6 | 24.6 |
| ED visit count—2+ | 22.0 | 22.9 | 23.8 |
Performance metrics for the ED with injurious fall outcome.
| Outcome | Training Method | Score | Method | |||||
|---|---|---|---|---|---|---|---|---|
| Null | LR | FL | NN | GBT | RF | |||
| ED visit with injurious fall | 2015-training | Logarithmic | -0.319 | -0.305 | -0.306 | -0.303 | -0.303 | -0.306 |
| Brier | 0.176 | 0.171 | 0.170 | 0.170 | 0.170 | 0.171 | ||
| AUC | 0.500 | 0.663 | 0.663 | 0.668 | 0.668 | 0.659 | ||
| 2014-training 2015-training | Logarithmic | -0.319 | -0.303 | -0.303 | -0.302 | -0.302 | -0.304 | |
| Brier | 0.176 | 0.170 | 0.170 | 0.170 | 0.169 | 0.170 | ||
| AUC | 0.500 | 0.670 | 0.671 | 0.673 | 0.673 | 0.664 | ||
| 2014-validation 2015-training | Logarithmic | -0.319 | -0.305 | -0.306 | -0.303 | -0.305 | ||
| Brier | 0.176 | 0.171 | 0.170 | 0.170 | 0.170 | |||
| AUC | 0.500 | 0.663 | 0.666 | 0.668 | 0.659 | |||
LR, logistic regression; FL, Forward-stepping logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.
Performance metrics for the unplanned hospital admission outcome.
| Outcome | Training Method | Score | Method | |||||
|---|---|---|---|---|---|---|---|---|
| Null | LR | FL | NN | GBT | RF | |||
| Unplanned hospital admission | 2015-training | Logarithmic | -0.593 | -0.554 | -0.553 | -0.551 | -0.547 | -0.552 |
| Brier | 0.404 | 0.372 | 0.372 | 0.369 | 0.367 | 0.370 | ||
| AUC | 0.500 | 0.675 | 0.675 | 0.680 | 0.686 | 0.683 | ||
| 2014-training 2015-training | Logarithmic | -0.593 | -0.551 | -0.550 | -0.548 | -0.551 | ||
| Brier | 0.404 | 0.369 | 0.369 | 0.367 | 0.369 | |||
| AUC | 0.500 | 0.681 | 0.681 | 0.685 | 0.686 | |||
| 2014-validation 2015-training | Logarithmic | -0.593 | -0.554 | -0.553 | -0.551 | -0.546 | -0.551 | |
| Brier | 0.404 | 0.372 | 0.372 | 0.369 | 0.366 | 0.369 | ||
| AUC | 0.500 | 0.675 | 0.675 | 0.680 | 0.688 | 0.683 | ||
LR, logistic regression; FL, Forward-stepping logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.
Performance metrics for the ED visit count outcome.
| Outcome | Training Method | Score | Method | |||||
|---|---|---|---|---|---|---|---|---|
| Null | MLR | MFL | NN | GBT | RF | |||
| ED visit count | 2015-training | Logarithmic | -1.029 | -0.972 | -0.972 | -0.968 | -0.965 | -0.969 |
| Brier | 0.617 | 0.577 | 0.577 | 0.575 | 0.573 | 0.576 | ||
| AUC | 0.500 | 0.643 | 0.643 | 0.646 | 0.655 | 0.651 | ||
| 2014-training 2015-training | Logarithmic | -1.029 | -0.968 | -0.968 | -0.965 | -0.965 | -0.969 | |
| Brier | 0.617 | 0.575 | 0.575 | 0.573 | 0.573 | 0.575 | ||
| AUC | 0.500 | 0.648 | 0.647 | 0.651 | 0.655 | 0.652 | ||
| 2014-validation 2015-training | Logarithmic | -1.029 | -0.972 | -0.972 | -0.967 | -0.968 | ||
| Brier | 0.617 | 0.577 | 0.577 | 0.575 | 0.575 | |||
| AUC | 0.500 | 0.643 | 0.643 | 0.647 | 0.651 | |||
MLR, multinomial logistic regression; MFL, Forward-stepping multinomial logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.
Pairwise paired t-tests of difference in logarithmic score with Hochberg’s correction.
| Outcome | Method | LR | FL | NN | GBT |
|---|---|---|---|---|---|
| ED visit with injurious fall | 0.700 | ||||
| 0.068 | 0.700 | ||||
| 0.011 | 0.035 | 0.110 | |||
| 0.110 | 0.106 | 0.017 | <0.001 | ||
| Unplanned hospital admission | 0.279 | ||||
| <0.001 | 0.099 | ||||
| <0.001 | <0.001 | 0.009 | |||
| 0.988 | 0.426 | 0.007 | <0.001 | ||
| ED visit count | 0.949 | ||||
| <0.001 | <0.001 | ||||
| <0.001 | <0.001 | 0.008 | |||
| 0.949 | 0.949 | 0.008 | <0.001 |
LR, logistic regression; FL, Forward-stepping logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.
Diagnostic accuracy measures for the ED visit with injurious fall outcome.
| Threshold | Measure | Method | ||||
|---|---|---|---|---|---|---|
| LR | FL | NN | GBT | RF | ||
| Sensitivity fixed at 80% | Sensitivity | 80.1% | 80.1% | 80.1% | 80.2% | |
| Specificity | 41.8% | 41.2% | 42.6% | 40.0% | ||
| LR+ | 1.38 | 1.36 | 1.39 | 1.34 | ||
| LR- | 0.48 | 0.48 | 0.47 | 0.50 | ||
| Odds Ratio | 2.89 | 2.82 | 2.98 | 2.70 | ||
| 80% of predicted probability distribution | ||||||
| Sensitivity | 37.5% | 38.6% | 36.5% | |||
| Specificity | 81.9% | 82.0% | 82.0% | |||
| LR+ | 2.07 | 2.14 | 2.03 | |||
| LR- | 0.76 | 0.75 | 0.77 | |||
| Odds Ratio | 2.72 | 2.86 | 2.61 | |||
LR, logistic regression; FL, Forward-stepping logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.
Diagnostic accuracy measures for the unplanned hospital admission outcome.
| Threshold | Measure | Method | ||||
|---|---|---|---|---|---|---|
| LR | FL | NN | GBT | RF | ||
| Sensitivity fixed at 80% | Sensitivity | 80.0% | 80.1% | 79.9% | 80.0% | |
| Specificity | 42.1% | 41.8% | 42.7% | 42.7% | ||
| LR+ | 1.38 | 1.38 | 1.40 | 1.40 | ||
| LR- | 0.47 | 0.48 | 0.47 | 0.47 | ||
| Odds Ratio | 2.91 | 2.89 | 2.97 | 2.99 | ||
| 80% of predicted probability distribution | ||||||
| Sensitivity | 34.7% | 34.4% | 35.3% | 35.5% | ||
| Specificity | 85.7% | 85.6% | 86.0% | 86.0% | ||
| LR+ | 2.43 | 2.39 | 2.52 | 2.53 | ||
| LR- | 0.76 | 0.77 | 0.75 | 0.75 | ||
| Odds Ratio | 3.19 | 3.13 | 3.35 | 3.37 | ||
LR, logistic regression; FL, Forward-stepping logistic regression with interactions and squared terms; NN, Neural Network; GBT, Gradient boosted trees; RF, Random forest.