| Literature DB >> 35177406 |
Fahad Kamran1,2, Shengpu Tang1,2, Erkin Otles3,4, Dustin S McEvoy5, Sameh N Saleh6,7, Jen Gong8, Benjamin Y Li1,4, Sayon Dutta5,9, Xinran Liu10, Richard J Medford6,7, Thomas S Valley11,12, Lauren R West13, Karandeep Singh11,14, Seth Blumberg10,15, John P Donnelly11,14, Erica S Shenoy13,16,17, John Z Ayanian11,12, Brahmajee K Nallamothu11,12, Michael W Sjoding11,12,18, Jenna Wiens19,11,18.
Abstract
OBJECTIVE: To create and validate a simple and transferable machine learning model from electronic health record data to accurately predict clinical deterioration in patients with covid-19 across institutions, through use of a novel paradigm for model development and code sharing.Entities:
Mesh:
Year: 2022 PMID: 35177406 PMCID: PMC8850910 DOI: 10.1136/bmj-2021-068576
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Characteristics of internal and external validation cohorts of adults admitted to hospital with covid-19 (see supplemental eTable 1 for characteristics of the development cohort). Values are numbers (percentages) unless stated otherwise
| Cohort | Internal validation cohort* (n=887) | External validation cohorts† | ||||||
|---|---|---|---|---|---|---|---|---|
| A (n=2161) | B (n=1252) | C (n=1180) | D (n=1009) | E (n=909) | F (n=747) | G (n=555) | ||
| No of hospital admissions | 956 | 2320 | 1320 | 1256 | 1073 | 965 | 794 | 607 |
| Median (IQR) age (years) | 64 (52-75) | 63 (50-76) | 62 (50-73) | 68 (56-79) | 65 (53-76) | 69 (58-80) | 73 (59-84) | 62 (48-75) |
| Age group (years): | ||||||||
| 18-25 | <25 | 52 (2.2) | <25 | <25 | <25 | <25 | <25 | <25 |
| 26-45 | 129 (13.5) | 398 (17.2) | 225 (17.1) | 159 (12.7) | 159 (14.8) | 77 (8.0) | 74 (9.3) | 114 (18.8) |
| 46-65 | 374 (39.1) | 800 (34.5) | 518 (39.2) | 380 (30.3) | 358 (33.4) | 327 (33.9) | 204 (25.7) | 215 (35.4) |
| 66-85 | 365 (38.2) | 873 (37.6) | 497 (37.7) | 539 (42.9) | 435 (40.5) | 412 (42.7) | 331 (41.7) | 184 (30.3) |
| >85 | 70 (7.3) | 197 (8.5) | 57 (4.3) | 159 (12.7) | 97 (9.0) | 145 (15.0) | 177 (22.3) | 74 (12.2) |
| Sex: | ||||||||
| Women | 420 (43.9) | 993 (42.8) | 612 (46.3) | 564 (44.9) | 533 (49.7) | 445 (46.1) | 363 (45.7) | 313 (51.6) |
| Men | 536 (56.1) | 1327 (57.2) | 709 (53.7) | 692 (55.1) | 540 (50.3) | 520 (53.9) | 431 (54.3) | 294 (48.4) |
| Race‡: | ||||||||
| White | 649 (67.9) | 1364 (58.8) | 733 (55.6) | 935 (74.4) | 589 (54.9) | 636 (65.9) | 584 (73.6) | 214 (35.3) |
| Black | 187 (19.6) | 190 (8.2) | 332 (25.2) | 123 (9.8) | 234 (21.8) | 135 (14.0) | 49 (6.2) | 62 (10.2) |
| Asian§ | 30 (3.1) | 80 (3.4) | 29 (2.2) | 51 (4.1) | 39 (3.6) | <25 | 39 (4.9) | 135 (22.2) |
| Other or unknown¶ | 90 (9.4) | 686 (29.6) | 226 (17.1) | 147 (11.7) | 211 (19.7) | 168 (17.4) | 122 (15.4) | 196 (32.3) |
| Ethnicity: | ||||||||
| Hispanic or Latino | 34 (3.6) | 587 (25.3) | 379 (28.7) | 350 (27.9) | 210 (19.6) | 138 (14.3) | 107 (13.5) | 176 (29.0) |
| Non-Hispanic or non-Latino | 883 (92.4) | 1569 (67.6) | 915 (69.3) | 875 (69.7) | 841 (78.4) | 783 (81.1) | 637 (80.2) | 414 (68.2) |
| Other or unknown | 39 (4.1) | 164 (7.1) | 26 (1.8) | 31 (2.5) | <25 | 44 (4.6) | 50 (6.3) | <25 |
| Median (IQR) length of stay (hours) | 138 (83-261) | 160 (95-284) | 141 (96-257) | 136 (93-235) | 167 (100-287) | 143 (92-234) | 154 (95-256) | 183 (113-324) |
| Outcome ever: | ||||||||
| Death | 60 (6.3) | 197 (8.5) | 108 (8.2) | 125 (10.0) | 96 (8.9) | 93 (9.6) | 123 (15.5) | 42 (6.9) |
| Mechanical ventilation | 98 (10.3) | 259 (11.2) | 142 (10.7) | 135 (10.7) | 116 (10.8) | 69 (7.2) | 69 (8.7) | 52 (8.6) |
| Intravenous vasopressors | 87 (9.1) | 299 (12.9) | 152 (11.5) | 139 (11.1) | 125 (11.6) | 65 (6.7) | 74 (9.3) | 70 (11.5) |
| Heated high flow nasal cannula | 218 (22.4) | 132 (5.7) | 263 (19.9) | 121 (9.6) | 95 (8.9) | 99 (10.3) | 106 (13.4) | 101 (16.6) |
| Primary outcome ≤5 days | 206 (21.6) | 311 (13.4) | 249 (18.8) | 206 (16.4) | 155 (14.4) | 136 (14.1) | 155 (19.5) | 92 (15.2) |
| Reason for primary outcome (% of outcomes): | ||||||||
| Death | 5 (2.4) | 34 (10.9) | 4 (1.6) | 21 (10.2) | 16 (10.3) | 25 (18.4) | 37 (23.9) | 2 (2.2) |
| Mechanical ventilation | 20 (9.7) | 89 (28.6) | 25 (10.0) | 52 (25.2) | 52 (33.5) | 22 (16.2) | 18 (11.6) | 8 (8.7) |
| Intravenous vasopressors | 9 (4.4) | 95 (30.5) | 18 (7.2) | 33 (16.0) | 26 (16.8) | 10 (7.4) | 21 (13.5) | 16 (17.4) |
| Heated high flow nasal cannula | 172 (83.5) | 93 (29.9) | 202 (81.1) | 100 (48.5) | 61 (39.4) | 79 (58.1) | 79 (51.0) | 66 (71.7) |
IQR=interquartile range.
Patients with covid-19 admitted to one institution during 2020-21.
Patients admitted with covid-19 during 2020-21 at 12 external medical centers. Six sites with fewer than 100 patients that met the primary outcome were combined into a single cohort when performing evaluation, resulting in seven external validation cohorts.
Race was self-identified by patients or their guardian, with options: American Indian or Alaska Native, Asian, Black, native Hawaiian or other Pacific Islander, White, other, patient refused, or unknown.
As defined by the US Census Bureau,51 the Asian race refers to people having origins in any of the original peoples of the Far East, Southeast Asia, or the Indian subcontinent including, for example, Cambodia, China, India, Japan, Korea, Malaysia, Pakistan, the Philippine Islands, Thailand, and Vietnam.
Includes American Indian or Alaskan, native Hawaiian or other Pacific Islander, other, unknown, or patient refused.
Fig 1Model performance across internal and external validation cohorts. Discriminative performance was measured using receiver operating characteristic curves and precision-recall curves. Model calibration is shown in reliability plots based on quintiles of predicted scores. The table summarizes results with 95% confidence intervals. The thick line shows the internal validation cohort at Michigan Medicine (MM) and the different colors represent the external validation cohorts (A-G). PPV=positive predictive value; AUROC=area under the receiver operating characteristics curve; AUPR=area under the precision-recall curve; ECE=expected calibration error
Fig 2Model discriminative performance (area under the receiver operating characteristics curve (AUROC) and area under the precision-recall curve (AUPR) scores) over the year (March 2020 to February 2021) by quarter. The table shows the number (percentage) of patient hospital admissions in each cohort in each quarter and met the primary outcome of a composite of clinical deterioration within the first five days of hospital admission, defined as in-hospital mortality or any of three treatments indicating severe illness: mechanical ventilation, heated high flow nasal cannula, and intravenous vasopressors. MM=Michigan Medicine; A-G represent the external validation cohorts
Fig 3Model discriminative performance (area under the receiver operating characteristics curve (AUROC) scores) evaluated across subgroups. Values are macro-average performance across institutions (error bars are ±1 standard deviation). No error bar shown for age subgroup 18-25 years because only a single institution had enough positive cases to calculate the AUROC score
Fig 4Model used to identify potential patients with covid-19 for early discharge after 48 hours of observation. A decision threshold was chosen that achieves a negative predictive value of ≥95%. Figure depicts both the proportion of patients who could be discharged early and the number of bed days saved, normalized by the number of correctly discharged patients in each validation cohort. Results are computed over 1000 bootstrap replications. MM=Michigan Medicine; A-G represent the external validation cohorts