| Literature DB >> 32790207 |
Noah Bice1, Neil Kirby1, Tyler Bahr1, Karl Rasmussen1, Daniel Saenz1, Timothy Wagner1, Niko Papanikolaou1, Mohamad Fakhreddine1.
Abstract
PURPOSE: Prognostic indices such as the Brain Metastasis Graded Prognostic Assessment have been used in clinical settings to aid physicians and patients in determining an appropriate treatment regimen. These indices are derivative of traditional survival analysis techniques such as Cox proportional hazards (CPH) and recursive partitioning analysis (RPA). Previous studies have shown that by evaluating CPH risk with a nonlinear deep neural network, DeepSurv, patient survival can be modeled more accurately. In this work, we apply DeepSurv to a test case: breast cancer patients with brain metastases who have received stereotactic radiosurgery.Entities:
Keywords: brain metastasis; deep learning; machine learning; survival
Mesh:
Year: 2020 PMID: 32790207 PMCID: PMC7285927 DOI: 10.1002/acm2.12995
Source DB: PubMed Journal: J Appl Clin Med Phys ISSN: 1526-9914 Impact factor: 2.243
Fig. 1GPA is an online free‐to‐use tool utilized by some oncologists for patient prognosis (www.brainmetgpa.com). This validity of this tool has been established by Sperduto et al. in several journal publications with timely updates (reference: PMID: 22203767). The GPA tool uses five covariates determined to be significant by MCR to make survival predictions for patients with brain metastases and breast primary site. In this image, a breast cancer patient with age> 60, KPS in the range 90–100, tumor subtype luminal B triple‐positive, etc. is given a score of 2.5, indicating an expected survival of 11.1–47 months.
Fig. 2t‐SNE visualization of the NCDB dataset, labeled by event occurrence. Patients with recorded death times have a yellow marker, while those which are lost to follow‐up are labeled purple. Because directions in the embedding space do not correspond to known physical parameters, axis labels in t‐SNE visualization are arbitrary.
The significance of various covariates according to the proportional hazards assumption is listed in the table.
| Variable |
| Variable |
| Variable |
|
|---|---|---|---|---|---|
| Age | 0.02 | Sex | 0.64 | Race | 0.51 |
| Charleon/Deyo | 0.19 |
|
| Tumor Size | 0.87 |
| Regional LNs Positive | 0.94 |
|
| AJCC Pathologic N | 0.32 |
| Bone Mets at DX | 0.59 |
|
| Lung Mets at DX | 0.85 |
| ER Assay | 0.45 | PR Assay | 0.93 |
|
|
| Multigene Signature Method | 0.62 | Multigene Signature Results | 0.91 |
|
|
| Rad Days After DX | 0.38 | Radiation Type | 0.97 |
|
|
| Regional Dose | 0.99 | Chemotherapy Type | 0.16 | Hormone Therapy Type | 0.26 |
|
|
| Mets at DX | 0.42 | Systemic Surgery Sequence | 0.04 |
Some factors that are obviously significant in survival predictions have significant P‐values. The many covariates with insignificant P‐values will not greatly contribute to risk calculation in the CPH framework. Bold indicates statistical significance.
Fig. 3Left: Hyperparameters that yielded the highest validation accuracy. Right: A plot of learning rates versus validation accuracy demonstrates the impact of one hyperparameter choice on generalization error. A learning rate of was chosen for deployment.
Fig. 4The test set concordance indices for three models are shown in this plot. The 95% confidence intervals of the median are given by notches in the boxes. DeepSurv appears to yield the smallest generalization error.
A Tukey HSD post‐hoc test supports the hypothesis that DeepSurv and the RSF have greater mean test concordances than CPH.
| Tukey HSD – Multiple comparison of means | |||||
|---|---|---|---|---|---|
| Group 1 | Group 2 | Difference of means | Lower bound | Upper bound | Reject null |
| DeepSurv | CPH | 0.1237 | 0.1074 | 0.1399 | True |
| RSF | CPH | 0.1117 | 0.0957 | 0.1277 | True |
| DeepSurv | RSF | 0.0120 | ‐0.0042 | 0.0282 | False |
Fig. 5Survival probability curves for three patients predicted by each model according to a Kaplan–Meier estimate of the baseline survival. The true times of death for these patients are indicated by the vertical lines.
Fig. 6The distribution of errors predicted with for the validation dataset suggests all three models overestimate patient survival times.