| Literature DB >> 35396283 |
Chung Kwan Wong1, Barbara C van Munster1, Athanasios Hatseras1, Else Huis In 't Veld1, Barbara L van Leeuwen2, Sophia E de Rooij1, Rick G Pleijhuis3.
Abstract
OBJECTIVES: Delirium is associated with increased morbidity, mortality, prolonged hospitalisation and increased healthcare costs. The number of clinical prediction models (CPM) to predict postoperative delirium has increased exponentially. Our goal is to perform a head-to-head comparison of CPMs predicting postoperative delirium in non-intensive care unit (non-ICU) elderly patients to identify the best performing models.Entities:
Keywords: delirium & cognitive disorders; geriatric medicine; internal medicine; risk management; surgery
Mesh:
Year: 2022 PMID: 35396283 PMCID: PMC8996014 DOI: 10.1136/bmjopen-2021-054023
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flow chart indicating the selection process of included delirium prediction models.
Baseline characteristics of external validation cohort
| Characteristic | No. of patients |
| Gender, n (%) | |
| Men | 175 (60) |
| Age, mean (SD), years | 66 (8) |
| Age category (years), n (%) | |
| 50–59 | 75 (26) |
| 60–69 | 128 (44) |
| 70–79 | 69 (24) |
| >80 | 20 (7) |
| APACHE II, median (IQR), points | 5 (4–7) |
| Number of comorbidities, n (%) | |
| 0–1 | 82 (28) |
| >2 | 210 (72) |
| Type of comorbidities, n (%) | |
| Diabetes mellitus | 54 (19) |
| Hypertension | 109 (37) |
| Other cardiovascular disease | 87 (30) |
| Cerebrovascular disease | 28 (10) |
| Other neurological disease | 12 (4) |
| Chronic pulmonary disease | 35 (12) |
| Chronic renal disease | 20 (7) |
| Medication use, n (%) | |
| 0–1 | 144 (49) |
| >4 | 148 (51) |
| History of delirium, n (%) | 41 (14) |
| Cognitive impairment*, n (%) | 40 (14) |
| Admission type, n (%) | |
| Elective | 264 (90) |
| Emergency | 28 (10) |
| Type of surgery, n (%) | |
| Oncological | 93 (32) |
| General | 84 (29) |
| Vascular | 54 (18) |
| Hepatobiliary | 39 (13) |
| Other | 22 (8) |
| Length of stay, median (IQR), days | 8 (4–14) |
| Postoperative delirium, n (%) | 25 (9) |
Categorical variables are expressed as the total number of patients with corresponding percentages between brackets. Continuous variables are expressed as median values with IQR unless specified otherwise.
APACHE, Acute Physiology, Age, Chronic Health Evaluation.
Overview of study populations, diagnostic instruments and model variables for all models included for analysis
| Study | Population | Mean age | Diagnostic instrument | Variables |
| Ten Broeke | Cardiac surgery | 68 | DOSS | Age (continuous variable) |
| Carrasco | Mixed | 78 | CAM | BUN/Creatinine ratio |
| Dai | Orthopaedic/Urological surgery | 72.7 | DSM-IV, | Old age (>80 years) |
| Ettema | Mixed | 81 | DOSS | Previous confusion |
| Freter | Orthopaedic surgery | 76.8 | CAM | Cognitive impairment (MMSE <24) |
| Halladay | Mixed | 75 | DSM-IV | Cognitive impairment (prior diagnosis of dementia in the EMR or outpatient prescription of a medication for dementia at admission) |
| Kim | Mixed surgical (>50% open) | NR | ICDSC | Age (continuous variable) |
| Litaker | Mixed surgical | 67 | CAM, DSM-IV | Age >70 years |
| Pendlebury | Mixed | 81 | CAM, DSM-V | Dementia/Cognitive impairment (AMTS <9 or MMSE <24) |
| Pompei | Mixed | 74.3 | CAM, DSM-III | Cognitive impairment (MMSE cut-off 21–24, based on education level) |
| Rudolph | Cardiac surgery | 74.7 | CAM | MMSE <23 |
| Rudolph | Mixed | 72.1 | DSM-IV-TR | Cognitive impairment (MOCA ≤18) |
| de Wit | Mixed | 76.9 | NS | Age (continuous variable) |
| Zhang | Orthopaedic surgery | 79 | DSM-V | Preoperative cognitive impairment (not defined in original article) |
ADL, activities of daily life; AMTS, Abbreviated Mental Test Score; ASA, American Association of Anesthesiologists; CAM(-ICU), confusion assessment method (for the intensive care unit); DOSS, Delirium Observation Screening Scale; DSM, Diagnostic and Statistical Manual for Mental Disorders; EMR, electronic medical record; FE, femoral endarterectomy; GDS, Geriatric Depression Scale; ICDSC, Intensive Care Delirium Screening Checklist; MDC, major diagnostic categories; RBCs, red blood cells; SMMSE, Standardised Mini Mental State Examination; TIA, transient ischaemic attack; TICS, Telephone Interview For Cognitive Status.
Performance of the included clinical prediction models on external model validation
| Study | Sample size derivation cohort | Reported TRIPOD items | Validation previously performed | C-statistic previous validation | Sample size validation cohort | C | ∆ C-statistic (%) | Intercept | Slope | Brier score | Scaled Brier score |
| Pompei | 432 | 17/20 | Yes | 0.64 | 281 | 0.543 (0.441 to 0.645) | 0.097 (15) | 0.069 | 0.211 | 0.086 | −0.059 |
| Dai | 701 | 12/20 | No | NA | 283 | 0.739 (0.664 to 0.813) | NA | −0.018 | 1.96 | 0.077 | 0.049 |
| Litaker | 500 | 14/20 | No | NA | 282 | 0.706 (0.590 to 0.823) | NA | −0.015 | 0.995 | 0.074 | 0.088 |
| Freter | 132 | 16/20 | No | NA | 282 | 0.576 (0.472 to 0.680) | NA | 0.045 | 0.267 | 0.093 | −0.148 |
| Rudolph | 122 | 16/20 | Yes | 0.75 | 167 | 0.610 (0.485 to 0.734) | 0.14 (19) | 0.002 | 0.249 | 0.220 | −1.289 |
| Carrasco | 374 | 18/20 | Yes | 0.78 | 268 | 0.563 (0.435 to 0.692) | 0.217 (28) | 0.340 | −0.743 | 0.144 | −0.834 |
| Kim | 561 | 17/20 | Yes | 0.94 | 206 | 0.610 (0.505 to 0.715) | 0.33 (35) | 0.018 | 0.309 | 0.124 | −0.410 |
| Rudolph | 27 625 | 17/20 | Yes | 0.74 | 231 | 0.624 (0.504 to 0.743) | 0.116 (16) | 0.054 | 0.638 | 0.080 | −0.018 |
| de Wit | 1291 | 18/20 | Yes | 0.77 | 206 | 0.635 (0.501 to 0.769) | 0.135 (18) | 0.076 | 0.592 | 0.092 | −0.045 |
| Pendlebury et al. | 308 | 19/20 | Yes | 0.81 | 219 | 0.539 (0.424 to 0.654) | 0.271 (33) | 0.039 | 0.329 | 0.088 | −0.062 |
| Ettema | 3786 | 17/20 | No | NA | 281 | 0.580 (0.478 to 0.683) | NA | 0.070 | 0.225 | 0.08 | −0.020 |
| Halladay | 27 625 | 18/20 | Yes | 0.91 | 227 | 0.519 (0.412 to 0.626) | 0.391 (43) | 0.063 | 0.295 | 0.095 | −0.181 |
| Ten Broeke | 329 | 18/20 | No | NA | 283 | 0.635 (0.521 to 0.749) | NA | 0.037 | 0.219 | 0.114 | −0.419 |
| Zhang | 825 | 16/20 | No | NA | 282 | 0.650 (0.541 to 0.759) | NA | 0.024 | 0.258 | 0.117 | −0.45 |
Overview of model performance on external validation of all 14 eligible clinical prediction models for delirium, expressed as model discrimination (C-statistic with corresponding 95% CI), calibration (model intercept and slope) and a composite measure of discrimination and calibration (Brier score and scaled Brier score).
*Full model as reported by de Wit et al.29
†Three-question model as reported by Ettema et al.45
NA, not available; TRIPOD, Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis.
Figure 2Head-to-head comparison of discriminative power of delirium prediction models. Discriminative power of externally validated delirium prediction models is reported as c-indices with associated 95% CIs, ranked from low to high. A c-index of 0.5 resembles a situation in which the model has no discriminative power, that is, the model predicts no better than flipping a coin. Only 2 out of 14 validated models showed fair discrimination with c-indices >0.70 (0.71 and 0.74 for the models developed by Litaker et al and Dai et al, respectively) and 95% CIs with lower bounds >0.50. Discriminative power of the remaining 12 models was considered poor.
Figure 3Discrimination, calibration and clinical utility of best performing models. Panels A and B show the receiver operating characteristic (ROC) curve of the delirium prediction models by Litaker et al and Dai et al, respectively, with the area under the ROC curve (c-index) indicating the discriminative power of the model. A graphical representation of the calibration of both models is shown in panels C and D, plotting the predicted probability (x-axis) with corresponding 95% CI against the actually observed occurrence of delirium in the validation cohort (y-axis). The model by Litaker et al showed adequate calibration (panel C), correctly differentiating patients at low risk of delirium (20%). The model by Dai et al correctly identified patients at low risk (20%). Panels E and F show decision curve analyses as a measure of clinical utility of both models. For the models by Litaker et al and Dai et al, a positive net benefit was observed in the 10%–35% threshold probability range (panel E) and the 5%–20% threshold probability range (panel F), respectively.