| Literature DB >> 30646095 |
Andrew Wong1, Albert T Young1, April S Liang1, Ralph Gonzales2, Vanja C Douglas3, Dexter Hadley4.
Abstract
Importance: Current methods for identifying hospitalized patients at increased risk of delirium require nurse-administered questionnaires with moderate accuracy. Objective: To develop and validate a machine learning model that predicts incident delirium risk based on electronic health data available on admission. Design, Setting, and Participants: Retrospective cohort study evaluating 5 machine learning algorithms to predict delirium using 796 clinical variables identified by an expert panel as relevant to delirium prediction and consistently available in electronic health records within 24 hours of admission. The training set comprised 14 227 adult patients with non-intensive care unit hospital stays and no delirium on admission who were discharged between January 1, 2016, and August 31, 2017, from UCSF Health, a large academic health institution. The test set comprised 3996 patients with hospital stays who were discharged between August 1, 2017, and November 30, 2017. Exposures: Patient demographic characteristics, diagnoses, nursing records, laboratory results, and medications available in electronic health records during hospitalization. Main Outcomes and Measures: Delirium was defined as a positive Nursing Delirium Screening Scale or Confusion Assessment Method for the Intensive Care Unit score. Models were assessed using the area under the receiver operating characteristic curve (AUC) and compared against the 4-point scoring system AWOL (age >79 years, failure to spell world backward, disorientation to place, and higher nurse-rated illness severity), a validated delirium risk-assessment tool routinely administered in this cohort.Entities:
Mesh:
Year: 2018 PMID: 30646095 PMCID: PMC6324291 DOI: 10.1001/jamanetworkopen.2018.1018
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Study Flow Outlining Exclusion Criteria
CAM-ICU indicates Confusion Assessment Method for the Intensive Care Unit; ICD-9, International Classification of Diseases, Ninth Revision; ICU, intensive care unit; Nu-DESC, Nursing Delirium Screening Scale.
Characteristics of the 18 223 Included Patients
| Characteristic | No. (%) | |
|---|---|---|
| Training Set (n = 14 227) | Test Set (n = 3996) | |
| Age, y | ||
| 18-39 | 2589 (18.2) | 743 (18.6) |
| 40-64 | 6525 (45.9) | 1762 (44.1) |
| 65-79 | 3887 (27.3) | 1118 (28.0) |
| >79 | 1226 (8.6) | 373 (9.3) |
| Sex | ||
| Male | 6892 (48.4) | 2030 (50.8) |
| Female | 7335 (51.6) | 1966 (49.2) |
| Race | ||
| Asian | 1751 (12.3) | 514 (12.9) |
| Black | 1443 (10.1) | 391 (9.8) |
| Native Hawaiian or Pacific Islander | 127 (0.9) | 47 (1.2) |
| White | 8372 (58.8) | 2320 (58.1) |
| Other or declined | 2534 (17.8) | 724 (18.1) |
| Ethnicity | ||
| Hispanic or Latino | 1818 (12.8) | 536 (13.4) |
| Not Hispanic or Latino | 12 113 (85.1) | 3391 (84.9) |
| Unknown or declined | 296 (2.1) | 69 (1.7) |
| Marital status | ||
| Married | 6620 (46.5) | 1899 (47.5) |
| Single | 5157 (36.2) | 1447 (36.2) |
| Divorced or legally separated | 1255 (8.8) | 327 (8.2) |
| Widowed | 994 (7.0) | 255 (6.4) |
| Other or declined | 201 (1.4) | 68 (1.7) |
| Delirium | ||
| Yes | 687 (4.8) | 191 (4.8) |
| Age 18-64 y | 330 (2.3) | 86 (2.2) |
| Age >64 y | 357 (2.5) | 105 (2.6) |
| No | 13 540 (95.2) | 3805 (95.2) |
| Age 18-64 y | 8784 (61.7) | 2419 (60.5) |
| Age >64 y | 4756 (33.4) | 1386 (34.7) |
Defined as Nursing Delirium Screening Scale score of 2 or greater or positive result for Confusion Assessment Method for the Intensive Care Unit at any time between 1 and 30 days after admission.
Figure 2. Receiver Operating Characteristic Curves for Machine Learning Models and AWOL
Model performance was evaluated on a prospective test set (receiver operating characteristic curves shown are determined using the subset of the test set with AWOL [age, inability to spell world backward, orientation, illness severity] measurements). ANN indicates artificial neural network; GBM, gradient boosting machine; LR, penalized logistic regression; RF, random forest; and SVM, support vector machine.
Categorical Variables With Top Importance by Gradient Boosting Machine Occurring in at Least 10 Samples
| Variable | Variable Category | Variable Importance | Variable Frequency by Delirium Status, No. (%) | Selection by Other Models | |
|---|---|---|---|---|---|
| Yes (n = 191) | No (n = 3805) | ||||
| Neurologic examination | |||||
| Best verbal response | 4 | 100.0 | 65 (34.0) | 138 (3.6) | RF, LR |
| Neurologic symptoms (other) | Yes | 13.2 | 25 (13.1) | 116 (3.0) | RF, LR |
| Best motor response (upper extremities) | 5 | 10.2 | 5 (2.6) | 23 (0.6) | RF, LR |
| Best eye response | 3 | 2.7 | 35 (18.3) | 226 (5.9) | RF, LR |
| Best motor response (upper extremities) | 6 | 1.1 | 104 (54.5) | 1929 (50.7) | RF |
| Admission status | |||||
| Source | Transfer-acute hospital | 17.8 | 38 (19.9) | 237 (6.2) | RF, LR |
| Category | Urgent | 4.8 | 65 (34.0) | 684 (18.0) | RF, LR |
| Service | Neurology | 2.6 | 13 (6.8) | 166 (4.4) | RF, LR |
| Department | Neurosciences | 1.7 | 20 (10.5) | 246 (6.5) | RF, LR |
| Department | Other | 1.6 | 132 (69.1) | 2367 (62.2) | RF |
| Readmission (ie, recent hospitalization within prior 30 d) | Yes | 1.3 | 27 (14.1) | 507 (13.3) | LR |
| Activities of daily living | |||||
| Elimination | Incontinence | 8.5 | 36 (18.8) | 116 (3.0) | RF, LR |
| Feeding | Independent | 6.8 | 114 (59.7) | 3264 (85.8) | RF, LR |
| Bowel and bladder habits | Unable to assess | 6.4 | 2 (1.0) | 24 (0.6) | RF, LR |
| Grooming | Independent | 4.0 | 73 (38.2) | 2825 (74.2) | RF, LR |
| Bathing | Independent | 1.1 | 59 (30.9) | 2533 (66.6) | RF, LR |
| Home medications and devices | |||||
| Psychotherapeutic agents | Yes | 5.7 | 83 (43.5) | 1253 (32.9) | RF, LR |
| Parasympathomimetic or cholinergic agents | Yes | 3.1 | 9 (4.7) | 38 (1.0) | LR |
| Antimanic agents | Yes | 2.3 | 3 (1.6) | 37 (1.0) | NS |
| Devices | Yes | 1.0 | 24 (12.6) | 416 (10.9) | NS |
| Admission medications and devices | |||||
| Antimigraine agents | Yes | 1.8 | 0 | 25 (0.7) | NS |
| Abdominal binder | Yes | 1.6 | 11 (5.8) | 96 (2.5) | NS |
| β-Adrenergic blocking agents | Yes | 1.4 | 3 (1.6) | 15 (0.4) | LR |
| Indwelling urinary Foley catheter | NA | 1.4 | 108 (56.5) | 2378 (62.5) | RF |
| Analgesic and antipyretics | Yes | 1.4 | 37 (19.4) | 631 (16.6) | LR |
| Diagnostic agents | Yes | 1.2 | 0 | 19 (0.5) | NS |
| Opiate antagonists | Yes | 1.0 | 9 (4.7) | 166 (4.4) | NS |
| Comorbidities | |||||
| Depression | Yes | 3.1 | 1 | 25 (0.7) | LR |
| Peripheral vascular disease | Yes | 3.0 | 4 (2.1) | 73 (1.9) | LR |
| Pulmonary disease | Yes | 1.6 | 3 (1.6) | 102 (2.7) | NS |
| Liver disease | Yes | 1.6 | 9 (4.7) | 99 (2.6) | LR |
| Alcohol use | Yes | 1.6 | 4 (2.1) | 33 (0.9) | LR |
| Difficulty chewing | Yes | 1.5 | 12 (6.3) | 172 (4.5) | NS |
| Nonhealing wound | NA | 1.5 | 27 (14.1) | 213 (5.6) | NS |
| Tumor | Yes | 1.0 | 23 (12.0) | 280 (7.4) | LR |
| Renal disease | Yes | 1.0 | 12 (6.3) | 104 (2.7) | NS |
| Mobility and fall risk | |||||
| Schmid fall score | 4 | 1.9 | 1 (0) | 18 (0.5) | NS |
| Mobility | Unable to ambulate or transfer | 1.4 | 42 (22.0) | 293 (7.7) | RF |
| Schmid fall score | 3 | 1.1 | 20 (10.5) | 150 (3.9) | LR |
| Patient demographic characteristics | |||||
| Race | Asian | 1.9 | 25 (13.1) | 489 (12.9) | LR |
Abbreviations: LR, penalized logistic regression; NA, not applicable; NS, not selected; RF, random forest.
Rather than P values or coefficients, the gradient boosting machine model reports the importance of predictor variables included in a model. Importance is a measure of each variable’s cumulative contribution toward reducing square error, or heterogeneity within the subset, after the data set is sequentially split based on that variable. Thus, it is a reflection of a variable’s impact on the predictor. Absolute importance is then scaled to give relative importance, with a maximum importance of 100.
Defined as Nursing Delirium Screening Scale score of 2 or greater or positive result for Confusion Assessment Method for the Intensive Care Unit at any time between 1 and 30 days after admission.
Variable selected by model and ranked among top 50 in importance.
Continuous Variables With Top Importance Selected by Gradient Boosting Machine and Coselection by Random Forest and Penalized Logistic Regression
| Variable | Variable Importance | Value by Delirium Status, Mean (SD) | Selection by Other Models | |
|---|---|---|---|---|
| Yes (n = 191) | No (n = 3805) | |||
| Patient demographic characteristics | ||||
| Age, y | 18.6 | 65.0 (15.7) | 57.0 (17.3) | RF, LR |
| Time since onset of pain, d | 1.4 | 530 (1109) | 560 (1790) | RF |
| Vitals | ||||
| Temperature, °F | 17.0 | 97.1 (7.9) | 97.5 (3.8) | RF |
| Heart rate, beats/min | 8.3 | 88.4 (20.5) | 78.7 (22.8) | RF, LR |
| Respiratory rate, breaths/min | 7.5 | 12.7 (2.4) | 13.4 (4.1) | RF, LR |
| NR average diastolic blood pressure, mm Hg | 7.2 | 60.6 (11.7) | 61.4 (10.4) | RF |
| NR average systolic blood pressure, mm Hg | 6.6 | 104.6 (7.8) | 106.4 (8.6) | RF, LR |
| Sp | 0.9 | 99.1 (2.7) | 98.2 (4.6) | RF |
| Comprehensive metabolic panel | ||||
| Calcium, mg/dL | 6.8 | 8.8 (0.8) | 8.8 (0.7) | RF |
| Total bilirubin, mg/dL | 5.4 | 1.5 (3.2) | 1.3 (2.5) | RF, LR |
| Chloride, mmol/L | 5.3 | 101.7 (6.2) | 102.6 (5.3) | RF |
| Minimum BUN, mg/dL | 4.5 | 28.1 (25.3) | 19.7 (17.7) | RF, LR |
| AST, units/L | 2.8 | 65.0 (148.9) | 56.1 (170.2) | RF |
| Maximum glucose, mg/dL | 2.1 | 137.4 (52.8) | 138.0 (62.9) | RF, LR |
| Bicarbonate, mmol/L | 2.0 | 25.0 (4.3) | 24.7 (4.0) | RF |
| Ammonia, μmol/L | 1.4 | 33.0 (NC) | 41.4 (26.5) | RF |
| ALT, units/L | 1.2 | 46.8 (95.9) | 50.0 (169.4) | RF |
| CBC | ||||
| Platelet, ×103/μL | 6.5 | 240.2 (130.7) | 238.1 (108.1) | RF |
| Hematocrit, % | 2.5 | 34.4 (6.4) | 35.3 (6.5) | RF |
Abbreviations: ALT, alanine aminotransferase; AST, aspartate aminotransferase; BUN, blood urea nitrogen; CBC, complete blood cell count; LR, penalized logistic regression; NC, not calculable; NR, nursing record; RF, random forest; Spo2, oxygen saturation as measured by pulse oximetry.
SI conversion factors: To convert calcium to mmol/L, multiply by 0.25; AST and ALT to μkat/L, multiply by 0.0167; total bilirubin to μmol/L, multiply by 17.104; glucose to mmol/L, multiply by 0.0555; and platelet count to ×109, multiply by 1.0.
Rather than P values or coefficients, the gradient boosting machine model reports the importance of predictor variables included in a model. Importance is a measure of each variable's cumulative contribution toward reducing square error, or heterogeneity within the subset, after the data set is sequentially split based on that variable. Thus, it is a reflection of a variable's impact on the predictor. Absolute importance is then scaled to give relative importance, with a maximum importance of 100.
Mean values are calculated excluding missing values.
Defined as Nursing Delirium Screening Scale score of 2 or greater or positive Confusion Assessment Method for the Intensive Care Unit result at any time between 1 and 30 days after admission.
Variable selected by model and ranked among top 50 in importance.