| Literature DB >> 32894128 |
Vincent J Major1, Yindalon Aphinyanaphongs2.
Abstract
BACKGROUND: Automated systems that use machine learning to estimate a patient's risk of death are being developed to influence care. There remains sparse transparent reporting of model generalizability in different subpopulations especially for implemented systems.Entities:
Keywords: Advance directives; Electronic health records; End-of-life care; Machine learning; Medical informatics; Mortality prediction; Palliative care; Supportive care
Mesh:
Year: 2020 PMID: 32894128 PMCID: PMC7487547 DOI: 10.1186/s12911-020-01235-6
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Monthly admissions stratified by a) model development cohort and b) hospital location
Demographics, outcome, comorbidity, and model predictor characteristics of the model development population
| All Patients | Training Set | Testing Set | |||
|---|---|---|---|---|---|
| Measure | Value | ||||
| Age | % (n) | % (n) | % (n) | * | |
| | 11.5% (14786) | 10.7% (7778) | 13.1% (6087) | ||
| | 17.5% (22607) | 18.0% (13053) | 18.0% (8361) | ||
| | 9.45% (12183) | 9.49% (6877) | 9.69% (4504) | ||
| | 13.3% (17204) | 13.5% (9784) | 13.4% (6206) | ||
| | 18.2% (23500) | 18.7% (13556) | 17.3% (8026) | ||
| | 15.8% (20388) | 15.8% (11439) | 15.1% (7008) | ||
| | 10.7% (13839) | 10.5% (7588) | 10.2% (4748) | ||
| | 3.44% (4434) | 3.26% (2362) | 3.27% (1518) | ||
| Ethnicity b | % (n) | % (n) | % (n) | * | |
| | 9.75% (3467) | 9.77% (2336) | 8.62% (666) | ||
| | 90.3% (32086) | 90.2% (21584) | 91.4% (7060) | ||
| | -- (93388) | -- (48517) | -- (38732) | ||
| Race | % (n) | % (n) | % (n) | * | |
| | 10.9% (14033) | 11.0% (7933) | 10.7% (4987) | ||
| | 7.38% (9520) | 6.50% (4707) | 9.10% (4230) | ||
| | 1.66% (2146) | 1.68% (1219) | 1.74% (807) | ||
| | 61.6% (79424) | 64.1% (46404) | 57.3% (26642) | ||
| | 16.4% (21181) | 14.8% (10692) | 18.8% (8714) | ||
| | 2.05% (2637) | 2.05% (1482) | 2.32% (1078) | ||
| Sex | % (n) | % (n) | % (n) | ||
| | 60.1% (77478) | 60.3% (43664) | 60.5% (28130) | ||
| | 39.9% (51459) | 39.7% (28770) | 39.4% (18327) | ||
| | 0% (4) | 0% (3) | 0% (1) | ||
| Site | % (n) | % (n) | % (n) | * | |
| | 63.4% (81807) | 72.3% (52398) | 49.2% (22877) | ||
| | 15.6% (20137) | 18.1% (13122) | 12.8% (5938) | ||
| | 20.9% (26997) | 9.55% (6917) | 38% (17643) | ||
| % (n) | % (n) | % (n) | |||
| Any known death | 7.93% (10229) | 9.00% (6521) | 5.20% (2414) | * | |
| 60-day death | 4.15% (5356) | 4.05% (2935) | 3.57% (1657) | * | |
| Median [IQR] | Median [IQR] | Median [IQR] | |||
| Days from admission to death | 53 [6, 205] | 83 [12, 306] | 21 [1, 92.75] | * | |
| Median [IQR] | Median [IQR] | Median [IQR] | |||
| Charlson Score | 1 [0, 2] | 1 [0, 2] | 0 [0, 2] | * | |
| % (n) | % (n) | % (n) | |||
| AIDS/HIV | 0.626% (635) | 0.61% (349) | 0.506% (176) | ||
| Cancer (any malignancy) | 16.8% (17094) | 18.2% (10432) | 13.2% (4594) | * | |
| Cerebrovascular disease | 10.0% (10149) | 9.99% (5716) | 8.13% (2826) | * | |
| Chronic obstructive pulmonary disease | 17.9% (18218) | 18.6% (10649) | 13.5% (4703) | * | |
| Congestive heart failure | 12.0% (12144) | 11.8% (6774) | 8.56% (2978) | * | |
| Dementia | 3.67% (3721) | 3.18% (1819) | 3.09% (1075) | ||
| Diabetes with chronic complications | 6.34% (6439) | 4.9% (2806) | 5.68% (1977) | * | |
| Diabetes without chronic complications | 16.8% (17019) | 16.2% (9256) | 14.4% (4995) | * | |
| Hemiplegia or paraplegia | 2.92% (2962) | 2.83% (1617) | 2.35% (817) | * | |
| Metastatic solid tumor | 6.02% (6115) | 6.39% (3657) | 4.55% (1584) | * | |
| Mild liver disease | 6.40% (6495) | 6.23% (3566) | 5.14% (1787) | * | |
| Moderate or severe liver disease | 1.62% (1642) | 1.59% (910) | 1.11% (385) | * | |
| Myocardial infarction | 9.73% (9874) | 9.48% (5423) | 6.9% (2400) | * | |
| Peptic ulcer disease | 1.84% (1871) | 1.76% (1009) | 1.27% (443) | * | |
| Peripheral vascular disease | 13.1% (13278) | 13.0% (7446) | 9.97% (3469) | * | |
| Renal disease | 10.9% (11093) | 10.4% (5937) | 7.93% (2759) | * | |
| Rheumatoid disease | 2.87% (2915) | 3.11% (1781) | 2.06% (718) | * | |
| Median [IQR] | Median [IQR] | ||||
| 1–30 days | # of diagnoses | 3 [0, 12] | 3 [0, 13] | 2 [0, 10] | * |
| 1–30 days | # of lab results | 0 [0, 46] | 3 [0, 47] | 0 [0, 43] | * |
| 1–30 days | # of office visits | 3 [1, 6] | 3 [1, 6] | 2 [1, 5] | * |
| 1–30 days | # of emergency department visits | 0 [0, 0] | 0 [0, 0] | 0 [0, 0] | * |
| 1–30 days | # of hospitalizations | 0 [0, 0] | 0 [0, 0] | 0 [0, 0] | * |
| 1–365 days | # of diagnoses | 15 [2, 51] | 14 [2, 52] | 11 [0, 36] | * |
| 1–365 days | # of lab results | 35 [0, 151] | 34 [0, 142] | 15 [0, 84] | * |
| 1–365 days | # of office visits | 11 [5, 25] | 11 [5, 25] | 9 [4, 20] | * |
| 1–365 days | # of emergency department visits | 0 [0, 1] | 0 [0, 1] | 0 [0, 1] | |
| 1–365 days | # of hospitalizations | 0 [0, 1] | 0 [0, 1] | 0 [0, 0] | * |
*: Differences between training and testing sets are computed with: 1) χ2 tests for demographics; 2) proportion tests for individual comorbidities and mortality rates; and 3) Mann-Whitney tests for Charlson score and days from admission to death. In all cases, statistical significance is indicated (*) for adjusted p < 0.05 using a Bonferroni correction
a: Demographics coded within the EHR at the time of admission
b: Ethnicity contains many missing values which are omitted before computing the proportion and difference between groups
c: Including death and initiation of hospice care
d: Comorbidities are derived from ICD-10 diagnosis codes present in each patient’s year of history pre-admission using the diagnostic groups of the Charlson Comorbidity Index as implemented in the comorbidity R package [24]. Patients with no documented history are omitted from the denominator of each comorbidity
Model performance within cross-validation, applied to the testing set, and stratified by site
| Model | Cohort | Measure | AUROC | AUPRC |
|---|---|---|---|---|
| Lasso regression | Training (Cross-validation) | Mean [min, max] | 78.8 [78.0, 80.2] | 21.0 [18.3, 22.0] |
| XGBoost | Training (Cross-validation) | Mean [min, max] | 84.6 [83.8, 86.0] | 25.7 [21.2, 27.4] |
| Random forest | Training (Cross-validation) | Mean [min, max] | 86.9 [85.3, 87.7] | 26.4 [20.1, 31.0] |
Testing (Bootstrapped) | Median [95% CI] | 87.2 [86.1, 88.2] | 28.0 [25.0, 31.0] | |
| Brooklyn | Median [95% CI] | 83.8 [81.9, 85.6] | 26.6 [22.5, 31.0] | |
| Non-Brooklyn | Median [95% CI] |
Fig. 2Performance curves from the complete testing cohort and further stratified by location. a receiver operating characteristic, and b precision-recall curves. The selected threshold is highlighted along with each corresponding point once stratified by location