| Literature DB >> 33785839 |
Brett K Beaulieu-Jones1, William Yuan1, Gabriel A Brat2, Andrew L Beam3, Griffin Weber2, Marshall Ruffin1, Isaac S Kohane4.
Abstract
Machine learning can help clinicians to make individualized patient predictions only if researchers demonstrate models that contribute novel insights, rather than learning the most likely next step in a set of actions a clinician will take. We trained deep learning models using only clinician-initiated, administrative data for 42.9 million admissions using three subsets of data: demographic data only, demographic data and information available at admission, and the previous data plus charges recorded during the first day of admission. Models trained on charges during the first day of admission achieve performance close to published full EMR-based benchmarks for inpatient outcomes: inhospital mortality (0.89 AUC), prolonged length of stay (0.82 AUC), and 30-day readmission rate (0.71 AUC). Similar performance between models trained with only clinician-initiated data and those trained with full EMR data purporting to include information about patient state and physiology should raise concern in the deployment of these models. Furthermore, these models exhibited significant declines in performance when evaluated over only myocardial infarction (MI) patients relative to models trained over MI patients alone, highlighting the importance of physician diagnosis in the prognostic performance of these models. These results provide a benchmark for predictive accuracy trained only on prior clinical actions and indicate that models with similar performance may derive their signal by looking over clinician's shoulders-using clinical behavior as the expression of preexisting intuition and suspicion to generate a prediction. For models to guide clinicians in individual decisions, performance exceeding these benchmarks is necessary.Entities:
Year: 2021 PMID: 33785839 PMCID: PMC8010071 DOI: 10.1038/s41746-021-00426-3
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1Clinician-initiated data alone is a filtered representation of patient physiology.
a Clinician-initiated and non-clinician-initiated data are distinguished by their proximity as readouts of patient physiology, as well as the presence of the expertise of the clinician. b Physician actions are a reflection of their beliefs regarding a patient, which are formed through examination of patient physiology.
Example first day charge details for a patient with MI.
| Description | Department | Quantity |
|---|---|---|
| EKG Routine tracing only | EKG | 1 |
| ECHO 2D W/OR W/O M-Mode complete W/color flow | Cardiology | 1 |
| ER Level V | Emergency room | 1 |
| XR Chest 2 views | Diagnostic imaging | 1 |
| Culture blood | Laboratory | 2 |
| Partial thromboplastin time (PTT) | Laboratory | 1 |
| Prothrombin time (PT) | Laboratory | 1 |
| Complete CBC AUTO W/O DIFF | Laboratory | 1 |
| TROPONIN QN | Laboratory | 2 |
| B-Type natriuretic peptide | Laboratory | 1 |
| Lactate/lactic acid | Laboratory | 1 |
| Creatine kinase (CPK) MB only | Laboratory | 1 |
| Creatine kinase (CPK) | Laboratory | 2 |
| Comprehensive metabolic panel | Laboratory | 1 |
| Therapeutic/DIAG INJ IV push single INITI SUB/drug | IV Therapy | 1 |
| DOCUSATE NA, COLACE CAP 100 mg | Pharmacy | 1 |
| Aspirin Tab 325 mg (EA) | Pharmacy | 1 |
| Moxifloxacin, Avelox IVPB 400 mg | Pharmacy | 1 |
| Moxifloxacin, Avelox tab 400 mg | Pharmacy | 1 |
| Metoprolol, lopressor tab 25 mg | Pharmacy | 1 |
| Ipratropium, atrovent INH SOL 0.02% 2.5 ml | Pharmacy | 1 |
| Heparin NA VL 5000 U/ml 1 ml | Pharmacy | 1 |
| Furosemide, Lasix tab 20 mg | Pharmacy | 2 |
| Albuterol, proventil INH SOL 0.083% 3 ml (2.5 mg) | Pharmacy | 3 |
| R&B Telemetry private | Room and board | 1 |
Population information for data included for risk stratification using machine learning.
| 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | Total | |
|---|---|---|---|---|---|---|---|
| Hospitals included | 778 | 783 | 797 | 786 | 770 | 755 | 973 |
| Total encounters | 79,209,178 | 82,145,811 | 85,037,615 | 85,391,057 | 84,448,480 | 84,641,611 | 500,873,752 |
| Inpatient admissions | 8,556,411 | 8,682,382 | 8,812,595 | 8,683,133 | 8,288,089 | 8,052,278 | 51,074,888 |
| Multiday inpatient admissions | 7,175,154 | 7,338,193 | 7,425,860 | 7,296,849 | 6,939,021 | 6,720,949 | 42,896,026 |
| Total population: mortality | 120,583 (1.68%) | 123,764 (1.69%) | 129,640 (1.75%) | 126,844 (1.74%) | 124,310 (1.79%) | 121,549 (1.81%) | 746,690 (1.74%) |
| Total population: extended length of stay | 1,466,580 (20.44%) | 1,492,958 (20.35%) | 1,518,803 (20.45%) | 1,506,125 (20.64%) | 1,449,174 (20.88%) | 1,437,552 (21.39%) | 8,871,192 (20.68%) |
| Total population: 30-day readmission | 941,911 (13.13%) | 937,562 (12.78%) | 950,561 (12.80%) | 887,418 (12.16%) | 925,833 (13.34%) | 901,290 (13.41%) | 5,544,575 (12.93%) |
| All MI admissions (% of all admissions) | 69,448 (0.81%) | 71,609 (0.82%) | 78,975 (0.90%) | 82,952 (0.96%) | 84,407 (1.02%) | 84,551 (1.05%) | 471,942 (0.92%) |
| Multiday MI admissions (% of total multiday admissions) | 56,594 (0.79%) | 57,665 (0.79%) | 63,026 (0.85%) | 65,925 (0.90%) | 66,859 (0.96%) | 67,135 (1.00%) | 319,539 (0.88%). |
| MI Cohort: mortality | 3497 (6.18%) | 3393 (5.88%) | 3625 (5.75%) | 3569 (5.41%) | 3583 (5.36%) | 3337 (4.97%) | 21,004 (5.57%) |
| MI Cohort: extended length of stay | 9172 (16.21%) | 8941 (15.51%) | 9463 (15.01%) | 10,024 (15.21%) | 10,036 (15.01%) | 10,174 (15.15%) | 57,810 (15.33%) |
Fig. 2Performance comparison between charge and EMR data across cohorts and outcomes.
a Comparison of mortality, readmission and length of stay performance (area under receiver-operating curve, AUROC) on randomly selected validation data. b Average relative features per patient for each model version. c Outcome comparison on a myocardial infarction (MI) patient cohort between models trained on MI patients exclusively and all available patients.