| Literature DB >> 35663114 |
Jae Hyun Kim1, May Hua2,3, Robert A Whittington2, Junghwan Lee1, Cong Liu1, Casey N Ta1, Edward R Marcantonio4,5,6, Terry E Goldberg2,7, Chunhua Weng1.
Abstract
The identification of delirium in electronic health records (EHRs) remains difficult due to inadequate assessment or under-documentation. The purpose of this research is to present a classification model that identifies delirium using retrospective EHR data. Delirium was confirmed with the Confusion Assessment Method for the Intensive Care Unit. Age, sex, Elixhauser comorbidity index, drug exposures, and diagnoses were used as features. The model was developed based on the Columbia University Irving Medical Center EHR data and further validated with the Medical Information Mart for Intensive Care III dataset. Seventy-six patients from Surgical/Cardiothoracic ICU were included in the model. The logistic regression model achieved the best performance in identifying delirium; mean AUC of 0.874 ± 0.033. The mean positive predictive value of the logistic regression model was 0.80. The model promises to identify delirium cases with EHR data, thereby enable a sustainable infrastructure to build a retrospective cohort of delirium.Entities:
Keywords: Confusion Assessment Method for the Intensive Care Unit (CAM-ICU); delirium; electronic health records; logistic regression; machine learning model
Year: 2022 PMID: 35663114 PMCID: PMC9152701 DOI: 10.1093/jamiaopen/ooac042
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Characteristics of patients according to the delirium evaluation results
| Variable | CUIMC dataset | MIMIC-III dataset | |||||
|---|---|---|---|---|---|---|---|
| Delirium positive | Delirium negative | Total | Delirium positive | Delirium negative | Total | ||
| ( | ( | ( | ( | ( | ( | ||
| Age, mean (SD) | 68.6 (12.3) | 61.9 (15.6) | 63.4 (15.1) | 65.8 (17.3) | 63.6 (16.9) | 64.1 (17.0) | .722 |
| Male, | 9 (52.9) | 36 (61.0) | 45 (59.2) | 471 (55.0) | 1516 (55.2) | 1987 (55.1) | .556 |
| Elixhauser comorbidity index, mean (SD) | 5.2 (3.0) | 3.8 (3.2) | 4.1 (3.2) | 4.8 (2.3) | 3.8 (2.1) | 4.0 (2.2) | .698 |
| Hospitalization duration (days), mean (SD) | 52.1 (49.9) | 21.3 (28.4) | 28.1 (36.4) | 13.9 (11.8) | 7.7 (7.7) | 9.1 (9.3) | <.001 |
| Race, | .773 | ||||||
| White | 10 (58.8) | 38 (64.4) | 48 (63.2) | 538 (62.8) | 1872 (68.2) | 2410 (66.9) | |
| Black/African American | 2 (11.8) | 5 (8.5) | 7 (9.2) | 61 (7.1) | 217 (7.9) | 278 (7.7) | |
| Other/Unknown | 5 (29.4) | 16 (27.1) | 21 (27.6) | 258 (30.1) | 657 (23.9) | 915 (25.4) | |
| Delirium diagnosis code, | 6 (35.2) | 10 (16.9) | 16 (21.1) | 152 (17.7) | 104 (3.8) | 256 (7.1) | <.001 |
CUIMC: Columbia University Irving Medical Center; MIMIC-III: Medical Information Mart for Intensive Care III; SD: standard deviation.
P values for comparison between the CUIMC dataset and MIMIC-III dataset.
Age over 89 was capped to 89 in the MIMIC-III dataset.
ICD-10-CM F05 and R41.0 were used in the CUIMC dataset and ICD-9-CM codes 293.0, 293.1, 290.11, 290.3, 290.41, 291.0, 292.81, and 780.97 were used in the MIMIC-III dataset for delirium diagnosis code.
Figure 1.ROC curve of all models. ROC: receiver operating curve; CUIMC: Columbia University Irving Medical Center; MIMIC: Medical Information Mart for Intensive Care; LR: logistic regression; MLP: multi-layer perceptron.
Figure 2.Precision–recall curve of the LR and MLP model in the CUIMC (A) and MIMIC-III datasets (B). CUIMC: Columbia University Irving Medical Center; MIMIC: Medical Information Mart for Intensive Care; LR: logistic regression; MLP: multi-layer perceptron. Dash lines show the proportion of positive cases among evaluated sample (0.22 and 0.23 for CUIMC and MIMIC-III dataset, respectively). All curves were above the baseline.