| Literature DB >> 30646043 |
Aymen A Elfiky1,2,3, Maximilian J Pany2,3, Ravi B Parikh4,5, Ziad Obermeyer2,3.
Abstract
Importance: Patients with cancer who die soon after starting chemotherapy incur costs of treatment without the benefits. Accurately predicting mortality risk before administering chemotherapy is important, but few patient data-driven tools exist. Objective: To create and validate a machine learning model that predicts mortality in a general oncology cohort starting new chemotherapy, using only data available before the first day of treatment. Design, Setting, and Participants: This retrospective cohort study of patients at a large academic cancer center from January 1, 2004, through December 31, 2014, determined date of death by linkage to Social Security data. The model was derived using data from 2004 through 2011, and performance was measured on nonoverlapping data from 2012 through 2014. The analysis was conducted from June 1 through August 1, 2017. Participants included 26 946 patients starting 51 774 new chemotherapy regimens. Main Outcomes and Measures: Thirty-day mortality from the first day of a new chemotherapy regimen. Secondary outcomes included model discrimination by predicted mortality risk decile among patients receiving palliative chemotherapy, and 180-day mortality from the first day of a new chemotherapy regimen.Entities:
Mesh:
Year: 2018 PMID: 30646043 PMCID: PMC6324307 DOI: 10.1001/jamanetworkopen.2018.0926
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Patient Characteristics of Model Derivation and Validation Sets
| Variable | Set | ||
|---|---|---|---|
| Derivation (n = 17 832) | Validation (n = 9114) | Difference (n = 8718) | |
| No. of chemotherapy regimens | 36 007 | 15 767 | 20 240 |
| Age, mean (95% CI), y | 58.7 (58.5 to 58.9) | 60.7 (60.5 to 61.0) | −2.1 (−2.4 to −1.7) |
| Female, % (95% CI) | 61.1 (60.4 to 61.9) | 60.7 (59.7 to 61.7) | 0.5 (−0.7 to 1.7) |
| White, % (95% CI) | 86.9 (86.4 to 87.4) | 88.3 (87.7 to 89.0) | −1.5 (−2.3 to −0.7) |
| Inpatient visit, % (95% CI) | 43.9 (43.2 to 44.6) | 38.0 (37.0 to 38.9) | 6.0 (4.7 to 7.2) |
| Gagne score, median (IQR) | 2 (1 to 6) | 1 (1 to 6) | −1 (0 to 0) |
| Cancer type, % (95% CI) | |||
| Breast | 23.6 (23.0 to 24.3) | 21.1 (20.2 to 21.9) | 2.5 (1.5 to 3.6) |
| Colon and rectum | 17.6 (17.1 to 18.2) | 19.3 (18.5 to 20.2) | −1.7 (−2.7 to −0.7) |
| Lung and bronchus | 17.7 (17.2 to 18.3) | 18.0 (17.2 to 18.8) | −0.3 (−1.3 to 0.7) |
| Hematologic | 7.1 (6.7 to 7.5) | 8.4 (7.8 to 8.9) | −1.3 (−2.0 to −0.6) |
| Ovary | 6.7 (6.3 to 7.0) | 7.7 (7.1 to 8.2) | −1.0 (−1.7 to −0.3) |
| Other | 27.8 (27.1 to 28.4) | 25.8 (24.9 to 26.7) | 2.0 (0.9 to 3.1) |
| Chemotherapy beyond first line, % (95% CI) | 49.0 (48.3 to 49.7) | 42.4 (41.4 to 43.4) | 6.6 (5.3 to 7.8) |
| Intent of chemotherapy, % (95% CI) | |||
| Curative | 15.7 (15.3 to 16.1) | 26.8 (26.1 to 27.5) | −11.1 (−11.9 to −10.3) |
| Palliative | 33.8 (33.3 to 34.3) | 46.6 (45.8 to 47.3) | −12.7 (−13.7 to −11.8) |
Abbreviation: IQR, interquartile range.
Calculated by quantile regression.
Intent of chemotherapy does not add up to 100% owing to missing values, resulting in unknown intent.
Figure 1. Observed 180-Day Survival From the Initiation of Palliative Chemotherapy
Data are stratified by decile of model-predicted mortality risk. Decile 1 denotes the highest predicted risk decile; decile 10, the lowest. Overall denotes overall mean survival among all patients, irrespective of model-predicted risk.
Model Performance in Selected Subgroups
| Subgroup | AUC (95% CI) | No. in Validation Sample | |
|---|---|---|---|
| 30-d Mortality | 180-d Mortality | ||
| Intent of chemotherapy | |||
| Curative | 0.981 (0.970-0.992) | 0.892 (0.872-0.912) | 4220 |
| Palliative | 0.924 (0.910-0.939) | 0.827 (0.817-0.838) | 7345 |
| Stage | |||
| Distant | 0.938 (0.926-0.951) | 0.852 (0.842-0.861) | 8531 |
| Nondistant | 0.936 (0.916-0.955) | 0.874 (0.862-0.887) | 7236 |
| Cancer | |||
| Breast | 0.970 (0.956-0.983) | 0.939 (0.928-0.951) | 3408 |
| Colon and rectum | 0.924 (0.898-0.949) | 0.843 (0.827-0.860) | 2868 |
| Lung and bronchus | 0.916 (0.884-0.949) | 0.820 (0.803-0.838) | 2822 |
| Ovary | 0.956 (0.922-0.989) | 0.895 (0.871-0.918) | 1632 |
| Hematologic | 0.943 (0.902-0.983) | 0.819 (0.782-0.856) | 1435 |
| Head and neck | 0.970 (0.933-1.000) | 0.839 (0.798-0.879) | 739 |
| Cervix uteri | 0.955 (0.906-1.000) | 0.888 (0.847-0.929) | 599 |
| Prostate | 0.936 (0.849-1.000) | 0.805 (0.751-0.858) | 429 |
| Chemotherapy regimen | |||
| Carboplatin plus paclitaxel | 0.940 (0.859-1.000) | 0.846 (0.813-0.880) | 1297 |
| FOLFOX | 0.952 (0.923-0.982) | 0.863 (0.833-0.893) | 719 |
| Gemcitabine hydrochloride | 0.865 (0.789-0.942) | 0.992 (0.982-1.000) | 547 |
| Pemetrexed plus carboplatin | 0.960 (0.939-0.981) | 0.807 (0.769-0.845) | 518 |
| Paclitaxel | 0.989 (0.972-1.000) | 0.738 (0.686-0.791) | 505 |
| Novel clinical trial agents | 0.942 (0.882-1.000) | 0.870 (0.817-0.923) | 343 |
| Line of chemotherapy | |||
| First | 0.941 (0.925-0.956) | 0.865 (0.854-0.875) | 9114 |
| Subsequent | 0.938 (0.924-0.952) | 0.864 (0.854-0.874) | 6653 |
| Year | |||
| 2012 | 0.951 (0.935-0.966) | 0.887 (0.875-0.899) | 5169 |
| 2013 | 0.935 (0.914-0.955) | 0.869 (0.857-0.882) | 4863 |
| 2014 | 0.928 (0.908-0.947) | 0.837 (0.824-0.851) | 4915 |
| Age group, y | |||
| 21-40 | 0.963 (0.939-0.988) | 0.901 (0.874-0.928) | 1057 |
| 41-60 | 0.950 (0.935-0.966) | 0.893 (0.883-0.903) | 6263 |
| 61-80 | 0.936 (0.920-0.951) | 0.850 (0.839-0.861) | 7626 |
| 81-100 | 0.876 (0.806-0.947) | 0.803 (0.768-0.838) | 806 |
| Sex | |||
| Male | 0.927 (0.910-0.944) | 0.840 (0.828-0.852) | 5799 |
| Female | 0.948 (0.935-0.962) | 0.886 (0.877-0.894) | 9968 |
| Race/ethnicity | |||
| White | 0.940 (0.929-0.951) | 0.870 (0.862-0.877) | 13 960 |
| Nonwhite | 0.941 (0.909-0.972) | 0.870 (0.850-0.891) | 1807 |
| Insurance | |||
| Private | 0.949 (0.937-0.962) | 0.886 (0.878-0.895) | 10 415 |
| Medicare | 0.938 (0.911-0.965) | 0.854 (0.837-0.872) | 2699 |
| Medicaid | 0.945 (0.914-0.976) | 0.824 (0.784-0.865) | 611 |
| Self-pay | 0.932 (0.851-1.000) | 0.821 (0.753-0.888) | 210 |
Abbreviations: AUC, area under the receiver operating characteristic curve; FOLFOX, leucovorin calcium (folinic acid), fluorouracil, and oxaliplatin.
Indicates subgroup of patients receiving new clinical trial regimens first observed during 2012 through 2014 (ie, years of data to which the model was not exposed in the training process).
Observations in the validation set occurring in 2011 (n = 717) were reassigned to 2012 for the purpose of this Table.
Figure 2. One-Year Mortality After Chemotherapy Initiation
A-D, Mortality data from 4 randomized clinical trials[40,41,42,43] compared with the machine learning model predictions for patients in the validity sample. The randomized clinical trials compared bevacizumab and oxaliplatin,[40] pemetrexed and carboplatin,[41] etoposide and carboplatin,[42] and carboplatin and paclitaxel.[43] E-H, Mortality estimates from the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) program compared with the machine learning model predictions. The 45° dotted lines denote equivalence of observed and estimated mortality. Orange lines and shaded 95% CIs show observed 1-year mortality against quintiles of model-predicted 30-day mortality risk. Blue lines and shaded 95% CIs show observed 1-year mortality against predictions from 1-year mortality and 95% CI for trial patients taking a given regimen and 1-year mortality estimates from the SEER program, by type, age, sex, and race/ethnicity.
Selected Predictors by Risk Decile and Model Variance Explained
| Predictor | Risk Decile | Model Variance Explained, % | ||
|---|---|---|---|---|
| Top | Median | Bottom | ||
| Cancer of the brain and with other nervous system areas, % | 6.7 | 1.6 | 0.0 | 1.94 |
| Demographics | ||||
| Mean age, y | 62.3 | 62.1 | 51.9 | 1.30 |
| Female, % | 56.4 | 60.9 | 86.9 | 1.17 |
| Black, % | 3.8 | 3.6 | 3.4 | 0.07 |
| Mean comorbidity score | 5.14 | 3.44 | 2.01 | 0.03 |
| Prior diagnoses, % | ||||
| Ascites | 0.31 | 0.07 | 0.01 | 0.39 |
| Mouth disorder | 0.02 | 0.01 | 0.01 | 0.25 |
| Nausea and vomiting | 0.23 | 0.09 | 0.01 | 0.18 |
| Lower respiratory tract disorders | 2.10 | 1.36 | 0.16 | 0.03 |
| Secondary malignant neoplasm | 5.57 | 1.69 | 0.36 | 0.01 |
| Failure to thrive | 0.05 | 0.01 | 0.00 | 0.01 |
| Nutritional disorders | 2.53 | 1.63 | 0.54 | 0.00 |
| Malaise and fatigue | 0.22 | 0.08 | 0.02 | 0.00 |
| Medications, % | ||||
| Corticosteroids | 0.53 | 0.00 | 0.00 | 0.15 |
| Opioids | 0.29 | 0.00 | 0.00 | 0.05 |
| Anxiolytics | 0.60 | 0.24 | 0.18 | 0.02 |
| Cathartics | 0.54 | 0.00 | 0.00 | 0.00 |
| Vital signs | ||||
| Maximum pulse, bpm (baseline) | 106.1 | 95.7 | 87.1 | 0.37 |
| Maximum weight, kg (baseline) | 79.5 | 80.2 | 76.8 | 0.11 |
| Weight, SD, kg (baseline) | 3.1 | 2.1 | 1.3 | 0.06 |
| Minimum pulse, bpm (recent) | 83.4 | 72.8 | 63.0 | 0.01 |
| Weight change, kg | −3.1 | −1.0 | 0.1 | 0.00 |
| Laboratory findings | ||||
| Maximum C-reactive protein level, mg/L | 93.9 | 65.6 | 2.2 | 0.19 |
| Maximum ALT level, U/L | 75.9 | 57.3 | 24.3 | 0.07 |
| Maximum AST level, U/L | 73.7 | 54.2 | 23.8 | 0.09 |
| Maximum white blood cell count, ×103/μL | 13.9 | 12.4 | 9.8 | 0.03 |
| Maximum alkaline phosphatase, IU/L | 199.5 | 128.7 | 76.5 | 0.02 |
| Mean lymphocyte count, ×103/μL | 1.0 | 1.3 | 1.8 | 0.00 |
| Mean platelet count, ×103/μL | 251.8 | 241.4 | 265.1 | 0.00 |
| Ejection fraction, % | 54.4 | 48.0 | 51.9 | 0.01 |
| Total linear terms | NA | NA | NA | 13.6 |
| Total nonlinear terms and interactions | NA | NA | NA | 86.4 |
Abbreviations: ALT, alanine aminotransferase; AST, aspartate aminotransferase; NA, not applicable.
SI conversion factors: To convert ALT, AST, and alkaline phosphatase to microkatals per liter, multiply by 0.0167; C-reactive protein to nanomoles per liter, multiply by 9.524; and white blood cell count, lymphocyte count, and platelet count to ×109 per liter, multiply by 1.
Scores range from 0 to 26, with higher scores indicating more comorbidites.
Baseline and recent denote the value of the predictor (minimum, maximum, or SD, as noted) during the 2 to 12 months and the 1 month before chemotherapy initiation, respectively.
Indicates the change from the mean value during the 2 to 12 months before to the mean value during the 1 month before chemotherapy initiation.
Indicates the value of the predictor (minimum or maximum as noted) during the 12 months before chemotherapy initiation.