| Literature DB >> 35639380 |
Finly J Zachariah1, Lorenzo A Rossi2, Laura M Roberts3, Linda D Bosserman4.
Abstract
Importance: To date, oncologist and model prognostic performance have been assessed independently and mostly retrospectively; however, how model prognostic performance compares with oncologist prognostic performance prospectively remains unknown. Objective: To compare oncologist performance with a model in predicting 3-month mortality for patients with metastatic solid tumors in an outpatient setting. Design, Setting, and Participants: This prognostic study evaluated prospective predictions for a cohort of patients with metastatic solid tumors seen in outpatient oncology clinics at a National Cancer Institute-designated cancer center and associated satellites between December 6, 2019, and August 6, 2021. Oncologists (57 physicians and 17 advanced practice clinicians) answered a 3-month surprise question (3MSQ) within clinical pathways. A model was trained with electronic health record data from January 1, 2013, to April 24, 2019, to identify patients at high risk of 3-month mortality and deployed silently in October 2019. Analysis was limited to oncologist prognostications with a model prediction within the preceding 30 days. Exposures: Three-month surprise question and gradient-boosting binary classifier. Main Outcomes and Measures: The primary outcome was performance comparison between oncologists and the model to predict 3-month mortality. The primary performance metric was the positive predictive value (PPV) at the sensitivity achieved by the medical oncologists with their 3MSQ answers.Entities:
Mesh:
Year: 2022 PMID: 35639380 PMCID: PMC9157269 DOI: 10.1001/jamanetworkopen.2022.14514
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Patients With Metastatic Solid Tumors Subject to the 3-Month Surprise Question Prognostications and Model Predictions
| Variable | Cohort, No. (%) |
|---|---|
| Encounters/prognostications | 3099 |
| Patients | 2041 |
| Medical oncologists and advanced practice clinicians | 74 |
| Prognostication count per oncologist, mean (range) | 41.9 (1-245) |
| Days between appointment and prognostication, median (SD) | 2 (15.5) |
| Sex | |
| Male | 770 (37.7) |
| Female | 1271 (62.3) |
| Age, median (range), y | 62.6 (18-96) |
| Disease group | |
| Breast | 482 (23.6) |
| Gastrointestinal | 629 (30.8) |
| Genitourinary | 280 (13.7) |
| Lung | 378 (18.5) |
| Rare | 272 (13.3) |
| Race | |
| American Indian/Alaska Native | 12 (0.6) |
| Asian | 479 (23.5) |
| Black/African American | 102 (5.0) |
| White | 1322 (64.8) |
| Other/unknown | 126 (6.2) |
| Ethnicity | |
| Hispanic or Latino | 488 (23.9) |
| Not Hispanic or Latino | 1498 (73.4) |
| Unknown/declined to answer | 55 (2.7) |
Age is reported at the encounter level.
Other includes Native Hawaiian and other Pacific Islander (9 [0.4%]), as well as other races not discretely captured within the electronic health record.
Performance of Oncologists Answering a 3-Month Surprise Question Compared With the 90-Day Mortality Prediction Model
| Variable | Oncologists | ML model | Oncologist-ML model concordant decisions |
|---|---|---|---|
| No. | 3099 | 3099 | 3099 |
| Prevalence (90-d mortality), % | 15.2 | 14.4 | 15.2 |
| Area under the receiver operating characteristic curve, % | 59.8 (57.7-62.0) | 81.2 (79.1-83.3) | 55.7 (54.2-57.3) |
| Area under the precision-recall curve, % | NA | 46.2 (41.4-51.3) | NA |
| PPV (precision) | 34.8 (30.1-39.5) | 60.0 (53.6-66.3) | 68.6 (58.2-78.4) |
| Sensitivity (recall), % | 29.7 (25.6-33.8) | 29.5 (25.4-34.0) | 12.5 (9.6-15.6) |
| Specificity, % | 90.0 (88.9-91.2) | 96.7 (96.0-97.3) | 99.0 (98.6-99.4) |
| PPV-to-prevalence ratio | 2.3 (2.0-2.6) | 4.2 (3.7-4.7) | 4.5 (3.8-5.2) |
| Negative predictive value, % | 87.7 (86.4-88.9) | 89.1 (87.9-90.2) | 86.3 (85.1-87.5) |
| Median lead days | 37.5 (31.5-45.0) | 28.5 (25.0-36.0) | 30.0 (20.5-32.5) |
Abbreviations: ML, machine learning; NA, not applicable; PPV, positive predictive value.
Performance of Medical Oncologists With a 3-Month Surprise Question Compared With an ML Model With Stratification by Disease Groups and Presence of Systemic Therapy Changes
|
|
|
|
|
|
|
|
|
|
| |||||||
| Oncologists | 3099 | 15.2 | NC | 2.3 (2.0 to 2.6) | 29.7 | 34.8 | 18.5 to 31.9 |
| ML model | 14.4 | 81.2 (79.1 to 83.3) | NC | 29.5 | 60.0 | ||
|
| |||||||
| Oncologists | 697 | 10.3 | NC | 3.5 (2.7 to 4.6) | 37.5 | 36.5 | 1.7 to 32.5 |
| Model | 9.9 | 87.3 (83.0 to 91.1) | NC | 36.2 | 53.2 | ||
|
| |||||||
| Oncologists | 937 | 15.4 | NC | 2.1 (1.8 to 2.4) | 52.1 | 32.5 | 4.1 to 18.5 |
| ML model | 14.4 | 81 (76.8 to 85.0) | NC | 52.6 | 43.8 | ||
|
| |||||||
| Oncologists | 376 | 12 | NC | 2.7 (1.4 to 4.2) | 20 | 32.1 | −15.7 to 35.6 |
| ML model | 11.2 | 85 (78.8 to 90.5) | NC | 19 | 42.1 | ||
|
| |||||||
| Oncologists | 639 | 22.8 | NC | 2 (1.2 to 2.9) | 9.6 | 46.7 | −10.6 to 43.4 |
| ML model | 21.6 | 77.7 (73.2 to 82.2) | NC | 10.1 | 63.6 | ||
|
| |||||||
| Oncologists | 450 | 14.4 | NC | 2.7 (1.7 to 3.8) | 23.1 | 38.5 | −1.3 to 45.3 |
| ML model | 14 | 76.4 (69.3 to 82.8) | NC | 22.2 | 60.9 | ||
|
| |||||||
| Oncologists | 1333 | 13.4 | NC | 2.5 (2.0 to 3.0) | 26.8 | 33.1 | 5.9 to 26.0 |
| ML model | 13.2 | 80.1 (76.6 to 83.6) | NC | 27.3 | 49 | ||
|
| |||||||
| Oncologists | 1766 | 16.6 | NC | 2.2 (1.8 to 2.5) | 31.4 | 35.8 | 19.7 to 37.2 |
| ML model | 15.4 | 82.4 (79.7 to 85.0) | NC | 31.7 | 64.2 | ||
Abbreviations: AUROC, area under the receiving operator characteristic curve; ML, machine learning; NC, not calculated; PPV, positive predictive value.
If the 95% CI of the precision difference does not include 0, the precision of the model is statistically significantly better than that of the oncologists.
Figure 1. Receiver Operating Characteristic Curve (ROC), Precision Recall Curve (PRC), and Survival Curves for the Model and Oncologists
Receiver operating characteristic curve (A) and positive predictive value (PPV)–sensitivity or PRC (B) for the machine learning model (model) vs oncologists. The model area under the ROC curve was 81%; area under the PRC, 50%; and prevalence, 14.4%. In survival plots for oncologists (C) and the model (D), the continuous curve was associated with a predicted low risk of death and, in the ideal case, would be horizontal for the first 3 months.
Figure 2. Positive Predictive Value (PPV) and Sensitivity Scatterplots in PPV Plots With Exclusion of Near-Death Encounters
A, For oncologists, the PPV was 34.8% and sensitivity was 29.7%. B, For the machine learning model, PPV was 60.0% and sensitivity was 29.5%. Each dot in the scatterplots corresponds to the prognostications of an oncologist (A) and to associated model predictions (B). The size of a dot is proportional to the prediction count, and the hue represents the ratio of PPV over prevalence (darker color indicates better performance). C, Comparison of PPV for the model and oncologists with progressive exclusion of near-death encounters.