| Literature DB >> 29482517 |
Jared L Katzman1, Uri Shaham2,3,4, Alexander Cloninger5,6, Jonathan Bates5,7,3, Tingting Jiang8, Yuval Kluger9,10,11.
Abstract
BACKGROUND: Medical practitioners use survival models to explore and understand the relationships between patients' covariates (e.g. clinical and genetic features) and the effectiveness of various treatment options. Standard survival models like the linear Cox proportional hazards model require extensive feature engineering or prior medical knowledge to model treatment interaction at an individual level. While nonlinear survival methods, such as neural networks and survival forests, can inherently model these high-level interaction terms, they have yet to be shown as effective treatment recommender systems.Entities:
Keywords: Deep learning; Survival analysis; Treatment recommendations
Mesh:
Year: 2018 PMID: 29482517 PMCID: PMC5828433 DOI: 10.1186/s12874-018-0482-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Diagram of DeepSurv. DeepSurv is a configurable feed-forward deep neural network. The input to the network is the baseline data x. The network propagates the inputs through a number of hidden layers with weights θ. The hidden layers consist of fully-connected nonlinear activation functions followed by dropout. The final layer is a single node which performs a linear combination of the hidden features. The output of the network is taken as the predicted log-risk function . The hyper-parameters of the network (e.g. number of hidden layers, number of nodes in each layer, dropout probability, etc.) were determined from a random hyper-parameter search and are detailed in Table 3
DeepSurv’s experimental hyper-parameters
| Hyper-parameter | Sim linear | Sim nonlinear | WHAS | SUPPORT | METABRIC | Sim treatment | GBSG |
|---|---|---|---|---|---|---|---|
| Optimizer | sgd | sgd | adam | adam | adam | adam | adam |
| Activation | SELU | ReLU | ReLU | SELU | SELU | SELU | SELU |
| # Dense layers | 1 | 3 | 2 | 1 | 1 | 1 | 1 |
| # Nodes / Layer | 4 | 17 | 48 | 44 | 41 | 45 | 8 |
| Learning rate (LR) | 2.922e −4 | 3.194e −4 | 0.067 | 0.047 | 0.010 | 0.026 | 0.154 |
| 1.999 | 4.425 | 16.094 | 8.120 | 10.891 | 9.722 | 6.551 | |
| Dropout | 0.375 | 0.401 | 0.147 | 0.255 | 0.160 | 0.109 | 0.661 |
| LR decay | 3.579e −4 | 3.173e −4 | 6.494e −4 | 2.573e −3 | 4.169e −3 | 1.636e −4 | 5.667e −3 |
| Momentum | 0.906 | 0.936 | 0.863 | 0.859 | 0.844 | 0.845 | 0.887 |
Experimental results for all experiments C-index (95% confidence interval)
| Experiment | CPH | DeepSurv | RSF |
|---|---|---|---|
| Simulated Linear |
| 0.778065 (0.776,0.780) | 0.757863 (0.756,0.760) |
| Simulated Nonlinear | 0.486728 (0.484,0.489) |
| 0.626552 (0.624,0.629) |
| WHAS | 0.816025 (0.813, 0.819) | 0.866723 (0.863,0.870) |
|
| SUPPORT | 0.583076 (0.581,0.585) | 0.618907 (0.617,0.621) |
|
| METABRIC | 0.631674 (0.627,0.636) |
| 0.619517 (0.615,0.624) |
| Simulated Treatment | 0.516620 (0.514,0.519) |
| 0.550298 (0.548,0.553) |
| Rotterdam & GBSG | 0.658773 (0.655, 0.662) |
| 0.647924 (0.644, 0.651) |
The bold faced numbers signify the best performing algorithm
Fig. 2Simulated Linear Experimental Log-Risk Surfaces. Predicted log-risk surfaces and errors for the simulated survival data with linear log-risk function with respect to a patient’s covariates x0 and x1. a The true log-risk h(x)=x0+2x1 for each patient. b The predicted log-risk surface of from the linear CPH model parameterized by β. c The output of DeepSurv predicts a patient’s log-risk. d The absolute error between true log-risk h(x) and CPH’s predicted log-risk . e The absolute error between true log-risk h(x) and DeepSurv’s predicted log-risk
Fig. 3Simulated Nonlinear Experimental Log-Risk Surfaces. Log-risk surfaces of the nonlinear test set with respect to patient’s covariates x0 and x1. a The calculated true log-risk h(x) (Eq. 9) for each patient. b The predicted log-risk surface of from the linear CPH model parameterized on β. The linear CPH predicts a constant log-risk. c The output of DeepSurv is the estimated log-risk function
Fig. 4Simulated Treatment Log-Risk Surface. Treatment Log-Risk Surfaces as a function of a patient’s relevant covariates x0 and x1. a The true log-risk h1(x) if all patients in the test set were given treatment τ=1. We then manually set all treatment groups to either τ=0 or τ=1. b The predicted log-risk for patients with treatment group τ=0. c The network’s predicted log-risk for patients in treatment group τ=1
Fig. 5Simulated Treatment Survival Curves. Kaplan-Meier estimated survival curves with confidence intervals (α=.05) for the patients who were given the treatment concordant with a method’s recommended treatment (Recommendation) and the subset of patients who were not (Anti-Recommendation). We perform a log-rank test to validate the significance between each set of survival curves. a Effect of DeepSurv’s Treatment Recommendations (Simulated Data), b Effect of RSF’s Treatment Recommendations (Simulated Data)
Experimental results for treatment recommendations: median survival time (months)
| Experiment | DeepSurv | RSF | ||
|---|---|---|---|---|
| Rec | Anti-Rec | Rec | Anti-Rec | |
| Simulated |
|
| 3.270 | 3.334 |
| Rotterdam & GBSG |
|
| 39.014 | 30.752 |
The bold faced numbers signify the best performing algorithm
Fig. 6Rotterdam & German Breast Cancer Study Group (GBSG) Survival Curves. Kaplan-Meier estimated survival curves with confidence intervals (α=.05) for the patients who were given the treatment concordant with a method’s recommended treatment (Recommendation) and the subset of patients who were not (Anti-Recommendation). We perform a log-rank test to validate the significance between each set of survival curves. a Effect of DeepSurv’s Treatment Recommendations (GBSG), b Effect of RSF’s Treatment Recommendations (GBSG)