| Literature DB >> 35322668 |
Jeremiah R Brown1, Iben M Ricket1, Ruth M Reeves2,3, Rashmee U Shah4, Christine A Goodrich1, Glen Gobbel2,3,5,6, Meagan E Stabler1, Amy M Perkins3,5, Freneka Minter2, Kevin C Cox1, Chad Dorn2, Jason Denton2, Bruce E Bray6,7, Ramkiran Gouripeddi7,8, John Higgins1, Wendy W Chapman9, Todd MacKenzie1, Michael E Matheny2,3,5,6.
Abstract
Background Social risk factors influence rehospitalization rates yet are challenging to incorporate into prediction models. Integration of social risk factors using natural language processing (NLP) and machine learning could improve risk prediction of 30-day readmission following an acute myocardial infarction. Methods and Results Patients were enrolled into derivation and validation cohorts. The derivation cohort included inpatient discharges from Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary diagnosis of acute myocardial infarction, who were discharged alive, and not transferred from another facility. The validation cohort included patients from Dartmouth-Hitchcock Health Center between April 2, 2011, and December 31, 2016, meeting the same eligibility criteria described above. Data from both sites were linked to Centers for Medicare & Medicaid Services administrative data to supplement 30-day hospital readmissions. Clinical notes from each cohort were extracted, and an NLP model was deployed, counting mentions of 7 social risk factors. Five machine learning models were run using clinical and NLP-derived variables. Model discrimination and calibration were assessed, and receiver operating characteristic comparison analyses were performed. The 30-day rehospitalization rates among the derivation (n=6165) and validation (n=4024) cohorts were 15.1% (n=934) and 10.2% (n=412), respectively. The derivation models demonstrated no statistical improvement in model performance with the addition of the selected NLP-derived social risk factors. Conclusions Social risk factors extracted using NLP did not significantly improve 30-day readmission prediction among hospitalized patients with acute myocardial infarction. Alternative methods are needed to capture social risk factors.Entities:
Keywords: electronic health records; machine learning; myocardial infarction; natural language processing; patient readmission
Mesh:
Year: 2022 PMID: 35322668 PMCID: PMC9075435 DOI: 10.1161/JAHA.121.024198
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 6.106
Characteristics for 6195 Patients Hospitalized at VUMC With a Primary Diagnosis of AMI (Derivation Cohort)
| Characteristic | Readmission, % (n=934) | Nonreadmission, % (n=5231) |
|---|---|---|
| Sex | ||
| Men | 63.5 (n=593) | 67.8 (n=3545) |
| Women | 36.5 (n=341) | 32.2 (n=1686) |
| Race | ||
| White | 83.7 (n=782) | 83.4 (n=4363) |
| Black | 10.8 (n=101) | 9.4 (n=492) |
| Other | 5.5 (n=51) | 7.2 (n=376) |
| Comorbidities | ||
| Arrhythmia | 21.0 (n=197) | 12.7 (n=666) |
| Anemia | 17.0 (n=160) | 8.2 (n=430) |
| Hypertension | 38.3 (n=358) | 30.2 (n=1580) |
| COPD | 4.5 (n=42) | 2.9 (n=150) |
| CKD | 16.0 (n=149) | 6.7 (n=353) |
| Tobacco use | 6.2 (n=58) | 4.7 (n=246) |
| Depression | 6.9 (n=64) | 4.1 (n=217) |
| CAD | 10.3 (n=96) | 10.1 (n=528) |
| CHF | 21.2 (n=198) | 11.5 (n=599) |
| Dementia | 2.6 (n=24) | 1.9 (n=101) |
| Cardiac arrest | 5.7 (n=53) | 5.1 (n=269) |
| STEMI | 48.2 (n=450) | 50.7 (n=2651) |
| Heart failure during hospitalization | 53.2 (n=497) | 35.8 (n=1871) |
| Ischemia during hospitalization | 17.0 (n=159) | 11.5 (n=600) |
| Histories | ||
| AMI | 24.0 (n=224) | 21.4 (n=1122) |
| Peripheral vascular disease | 21.2 (n=198) | 12.4 (n=647) |
| Angina | 15.2 (n=142) | 11.0 (n=575) |
| Unstable angina | 24.4 (n=228) | 19.9 (n=1042) |
| Hypertension | 51.1 (n=477) | 42.8 (n=2241) |
| Depression | 12.8 (n=120) | 10.2 (n=535) |
| Discharge location | ||
| Home | 78.1 (n=729) | 89.3 (n=4671) |
| Health facility | 21.9 (n=205) | 10.7 (n=560) |
| Mean continuous scores | ||
| Age, y | 67.78 (SD=13.04) | 63.22 (SD=12.99) |
| LACE score | 5.71 (SD=2.35) | 4.67 (SD=2.0) |
| GRACE score | 141.06 (SD=33.3) | 129.55 (SD=33.18) |
| HOSPITAL score | 3.42 (SD=1.65) | 2.63 (SD=1.58) |
| Charlson Deyo score | 1.19 (SD=1.86) | 0.75 (SD=1.46) |
| Length of stay, d | 7.47 (SD= 5.64) | 5.67 (SD=5.06) |
AMI indicates acute myocardial infarction; CAD, coronary artery disease; CHF, congestive heart failure; CKD, chronic kidney disease; COPD, chronic obstructive pulmonary disease; STEMI, ST‐segment–elevation myocardial infarction; and VUMC, Vanderbilt University Medical Center.
LACE indicates length of stay, acuity of admission, comorbidity of the patient (measured with the Charlson comorbidity index score), and emergency department use (measured as the number of visits in the 6 months before admission). Possible score range is 1 to 19.
GRACE indicates Global Registry of Acute Coronary Events; possible score is 1 to 372 points.
HOSPITAL score indicates hemoglobin levels at discharge, discharge from an oncology service, sodium level at discharge, procedure during the index admission, index type of admission, number of admissions during the past 12 months, and length of stay. Possible score range is 0 to 13.
Other includes all racial groups other than White and Black.
Univariate, Bivariate, and Adjusted Relationships of NSRF and 30‐Day Readmission Following an AMI Among Derivation and Validation Cohorts
| Variable | Unadjusted odds ratio | Outcome, N (%) | Nonoutcome, N (%) | Adjusted variable importance | Adjusted coefficient |
|---|---|---|---|---|---|
| VUMC derivation cohort (n=6195) | |||||
| Dementia positive | 2.920 | 114 (2.2) | 57 (6.1) | 4.450 | 0.128 |
| Depression any | 1.770 | 591 (11.3) | 172 (18.4) | 5.600 | 0.051 |
| Impaired ADL/IADL any | 2.400 | 1276 (24.4) | 408 (43.7) | 7.020 | 0.033 |
| Instrumental support any | 2.200 | 1692 (32.3) | 479 (51.3) | 7.310 | 0.034 |
| Living alone uncertain | 2.940 | 626 (12.0) | 267 (28.6) | 8.910 | 0.082 |
| Medical condition positive | 2.440 | 1257 (24.0) | 407 (43.6) | 8.060 | 0.040 |
| Medication compliance any | 1.280 | 258 (4.9) | 58 (6.2) | 3.440 | −0.002 |
| DHMC validation cohort (n=4024) | |||||
| Dementia positive | 1.800 | 274 (7.6) | 53 (12.9) | 4.450 | 0.128 |
| Depression any | 1.220 | 1337 (37.0) | 172 (41.7) | 5.600 | 0.051 |
| Impaired ADL/IADL any | 1.630 | 2163 (59.9) | 292 (70.9) | 7.020 | 0.033 |
| Instrumental support any | 1.330 | 3040 (84.2) | 361 (87.6) | 7.310 | 0.034 |
| Living alone uncertain | 1.620 | 260 (7.2) | 46 (11.2) | 8.910 | 0.082 |
| Medical condition positive | 2.020 | 1195 (33.1) | 206 (50.0) | 8.060 | 0.040 |
| Medication compliance any | 1.300 | 2153 (59.6) | 271 (65.8) | 3.440 | −0.002 |
ADL indicates activity of daily living; AMI, acute myocardial infarction; DHMC, Dartmouth‐Hitchcock Medical Center; IADL, instrumental activity of daily living; NLP, natural language processing; and VUMC, Vanderbilt University Medical Center.
Pooled variable importance from best‐performing nonparametric model in derivation cohort.
Pooled coefficients from best‐performing parametric model in derivation cohort.
Statistically significant at P<0.05.
ROC Comparison Analysis of Pooled AUROC Calculated on Test Set at VUMC for 5 ML Models, Scored on DHMC, Run on SCD Only, NSRF Only, and SCD+NSRF
| Model | AUC SCD only | AUC NSRF only | AUC SCD+NSRF |
|
| 95% CI SCD vs NSRF only | 95% CI SCD only vs SCD+NSRF |
|---|---|---|---|---|---|---|---|
| VUMC models | |||||||
| RF default test | 0.683 | 0.526 | 0.696 | 6.493 | −0.537 | −0.222 to −0.094 | −0.071 to 0.097 |
| RF optimized test | 0.686 | 0.519 | 0.703 | 7.211 | −0.546 | −0.230 to −0.105 | −0.067 to 0.099 |
| GB default test | 0.705 | 0.629 | 0.691 | 2.505 | 0.499 | −0.146 to −0.007 | −0.085 to 0.055 |
| GB optimized test | 0.673 | 0.628 | 0.654 | 1.475 | 0.657 | −0.121 to 0.030 | −0.098 to 0.058 |
| EN default test | 0.695 | 0.626 | 0.699 | 2.192 | −0.216 | −0.147 to 0.009 | −0.092 to 0.100 |
| EN optimized test | 0.682 | 0.612 | 0.692 | 2.367 | −0.326 | −0.153 to 0.0124 | −0.068 to 0.087 |
| RR default test | 0.696 | 0.626 | 0.704 | 2.218 | −0.328 | −0.150 to 0.009 | −0.137 to 0.153 |
| RR optimized test | 0.692 | 0.629 | 0.703 | 2.077 | −0.387 | −0.138 to 0.012 | −0.060 to 0.082 |
| LASSO default test | 0.695 | 0.626 | 0.699 | 2.184 | −0.223 | −0.148 to 0.009 | −0.094 to 0.102 |
| LASSO optimized test | 0.681 | 0.605 | 0.691 | 2.598 | −0.326 | −0.162 to 0.010 | −0.069 to 0.088 |
| Models scored on DHMC | |||||||
| RF default | 0.603 | 0.543 | 0.609 | 3.120 | −0.464 | −0.118 to −0.001 | −0.040 to 0.054 |
| RF optimized | 0.608 | 0.535 | 0.614 | 3.950 | −0.679 | −0.125 to −0.022 | −0.040 to 0.048 |
| GB default | 0.630 | 0.586 | 0.634 | 2.480 | −0.462 | −0.093 to 0.004 | −0.044 to 0.053 |
| GB optimized | 0.606 | 0.586 | 0.595 | 1.053 | 0.664 | −0.095 to 0.056 | −0.117 to 0.096 |
| EN default | 0.527 | 0.589 | 0.520 | −3.170 | 0.004 | −0.034 to 0.158 | −0.099 to 0.085 |
| EN optimized | 0.655 | 0.584 | 0.572 | 4.424 | 5.259 | −0.193 to 0.051 | −0.202 to 0.036 |
| RR default | 0.541 | 0.590 | 0.518 | −2.676 | 0.896 | −0.092 to 0.188 | −0.159 to 0.111 |
| RR optimized | 0.558 | 0.600 | 0.528 | −2.566 | 1.529 | −0.058 to 0.141 | −0.133 to 0.072 |
| LASSO default | 0.528 | 0.590 | 0.520 | −3.125 | 0.134 | −0.033 to 0.155 | −0.098 to 0.080 |
| LASSO optimized | 0.595 | 0.576 | 0.578 | 1.182 | 0.686 | −0.195 to 0.157 | −0.178 to 0.145 |
AUCs can decrease when variables (eg, natural language processing) are added to a model, especially if the AUC is based on a validation sample and the additional variables have no additional discriminatory ability. AUC indicates area under the curve; AUROC, area under the ROC curve; DHMC, Dartmouth‐Hitchcock Medical Center; EN, elastic net; GB, gradient boosting; LASSO, least absolute shrinkage and selection operator; ML, machine learning; NSRF, natural language processing–derived social risk factors (main effects); RF, random forest; ROC, receiver operating characteristic; RR, ridge regression; SCD, structured clinical data; and VUMC, Vanderbilt University Medical Center.
Additional Pooled Metrics From Best‐Performing Models From VUMC, Scored on DHMC, Run on SCD Only, NSRF Only, and SCD+NSRF
| Variable | VUMC models | Models score on DHMC | ||||
|---|---|---|---|---|---|---|
| SCD only | NSRF only | SCD+NSRF | SCD only | NSRF only | SCD+NSRF | |
| Sensitivity | 0.021 | 0.513 | 0.026 | 0.012 | 0.375 | 0.003 |
| Specificity | 0.992 | 0.708 | 0.994 | 0.984 | 0.765 | 0.999 |
| Precision | 0.306 | 0.238 | 0.426 | 0.081 | 0.154 | 0.228 |
| F1 | 0.039 | 0.325 | 0.050 | 0.021 | 0.218 | 0.005 |
SCD and SCD+NSRF used 0.5 cutoff. NSRF only used third quartile cutoff. DHMC indicates Dartmouth‐Hitchcock Medical Center; NSRF, natural language processing–derived social risk factors (main effects); SCD, structured clinical data; and VUMC, Vanderbilt University Medical Center.
Best‐performing models by area under the curve.
Figure 1Percentage calibrated for test on Vanderbilt University Medical Center (VUMC) using structured clinical data (SCD) only and SCD with natural language processing–derived social risk factors (main effects) (NSRF), scored on Dartmouth‐Hitchcock Medical Center (DHMC).
Bars represent the percentage of aligned risk predictions. Model 1, SCD. Model 2, SCD+NSRF. Default, models with untuned hyperparameters. Optimized, models with tuned hyperparameters. LASSO indicates least absolute shrinkage and selection operator.