| Literature DB >> 32241264 |
Zhi Huang1,2,3, Travis S Johnson2,4, Zhi Han2, Bryan Helm2, Sha Cao5, Chi Zhang3,5, Paul Salama3, Maher Rizkalla3, Christina Y Yu2,4, Jun Cheng2,6, Shunian Xiang5,7, Xiaohui Zhan2,7, Jie Zhang5, Kun Huang8,9.
Abstract
BACKGROUND: Recent advances in kernel-based Deep Learning models have introduced a new era in medical research. Originally designed for pattern recognition and image processing, Deep Learning models are now applied to survival prognosis of cancer patients. Specifically, Deep Learning versions of the Cox proportional hazards models are trained with transcriptomic data to predict survival outcomes in cancer patients.Entities:
Keywords: Cancer prognosis; Cox regression; Deep learning; Survival analysis; Tumor mutation burden
Mesh:
Substances:
Year: 2020 PMID: 32241264 PMCID: PMC7118823 DOI: 10.1186/s12920-020-0686-1
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Neural network architectures of three Deep Learning-based models. a Cox-nnet with a single hidden layer; b DeepSurv with multiple hidden layers having consistent dimensions; c AECOX with multiple hidden layers in the both encoder and decoder part. Last hidden layers in all models were indicated in orange and were connect to a Cox regression neural networks with hazard ratios as the outputs
Comparison of model architectures and settings across three Deep Learning-based cancer survival prognosis approaches
| Properties | Models | ||
|---|---|---|---|
| Cox-nnet | DeepSurv | AECOX | |
| Deep Learning Architecture | Single-layer neural networks | Multi-layer neural networks | Multi-layer Autoencoder neural networks |
| Deep Learning Programming Framework | Theano | Theano, Lasagne | PyTorch |
| Hyper-parameters | L2 regularization weight | Learning rate; Number of hidden layers; Hidden layer sizes; Learning rate decay; Momentum; L2 regularization weight | Learning rate; Autoencoder input-output error weight |
| Hyper-parameters Searching Methods | Line search | Sobol solver | Sobol solver |
| Number of iterations for searching hyper-parameters | 12 | 100 | 100 |
| Maximum epochs | 4000 | 500 | 300 |
| Number of Hidden Layers | 1 | 1, 2, 3, or 4 | 0, 2, 4, 6, or 8 |
| Last hidden Layer sizes | Integer value in range [131, 135] | Integer value in range [30, 50] | 16 |
| Regularization Methods | L1, L2, Dropout | L2, Dropout | Dropout, L1, L2, Elastic Net |
| Basic Objective (Loss) Functions | |||
| Optimization Methods | Nesterov accelerated gradient descent | Stochastic gradient descent (SGD) | Adaptive Moment Estimation (Adam) |
| Network Architectures | (Input Layer) – (Hidden Layer) (tanh) – (Hazard Ratio) | (Input Layer) – (Hidden Layer) (ReLU/SELU) – … – (Hidden Layer) (ReLU/SELU) – (Hazard Ratio) | (Input Layer) – (Hidden Layers) (ReLU/Dropout) – (Code) – (Hidden Layers) (ReLU/Dropout) – (Output Layer); (Code) (tanh) – (Hazard Ratio) |
The Cancer Genome Atlas (TCGA) 12 cancers’ statistics. Cancers were sorted based on averaged concordance index in descending order according to Fig. 2
| TCGA Cancers | TCGA Cancer | Total Cases | Censored (Living) Group | Uncensored (Deceased) Group | Number of Genes After Pre-processing | Age | Overall Survival Months | ||
|---|---|---|---|---|---|---|---|---|---|
| Median | Range | Median | Range | ||||||
| Kidney | KIRP | 286 | 242 | 44 | 17,867 | 61.5 | 28–88 | 25.45 | 0.00–194.65 |
| Kidney | KIRC | 531 | 357 | 174 | 17,870 | 61 | 26–90 | 38.96 | 0.00–149.05 |
| Liver | LIHC | 369 | 239 | 130 | 17,963 | 61 | 16–90 | 19.32 | 0.00–120.73 |
| Breast | BRCA | 1083 | 933 | 150 | 18,030 | 58 | 26–90 | 27.56 | 0.00–282.69 |
| Cervical | CESC | 302 | 231 | 71 | 17,731 | 46 | 20–88 | 20.93 | 0.00–210.51 |
| Lung | LUAD | 495 | 315 | 180 | 17,715 | 66 | 38–88 | 21.55 | 0.00–238.11 |
| Bladder | BLCA | 402 | 225 | 177 | 18,008 | 69 | 34–90 | 17.61 | 0.43–165.90 |
| Head-Neck | HNSC | 514 | 296 | 218 | 17,968 | 61 | 19–90 | 21.46 | 0.07–210.81 |
| Pancreatic | PAAD | 176 | 83 | 93 | 17,150 | 65 | 35–88 | 15.20 | 0.00–90.05 |
| Ovarian | OV | 299 | 119 | 180 | 17,635 | 58 | 30–87 | 31.27 | 0.30–180.06 |
| Stomach | STAD | 397 | 244 | 153 | 18,172 | 67 | 30–90 | 14.03 | 0.00–122.21 |
| Lung | LUSC | 489 | 283 | 206 | 18,030 | 68 | 39–90 | 21.91 | 0.00–173.69 |
Fig. 2a, b: Performance comparisons between three Deep Learning-based models across 12 TCGA (The Cancer Genome Atlas) cancers. a concordance index; b p-value of log-rank test (in −log10 scale). c, d: Performance comparisons between three Deep Learning-based models and three traditional machine learning models across 12 TCGA (The Cancer Genome Atlas) cancers. c concordance index; d p-value of log-rank test (in −log10 scale). Cancers were sorted based on averaged concordance index across models and experiments. For detailed cancer names, please refer to the Additional file 1
Model-wised performances comparison at pan-cancer level (12 TCGA (The Cancer Genome Atlas) cancer types) by pairwise paired t-test (A) and linear mixed-effects models test (B), according to metrics concordance index and p-value of log-rank test. Note that for concordance index, larger t-statistic/coefficient indicated better performance at pan-cancer level, while the p-value of log-rank test was on the contrary
| (A) Pairwise Paired T-test | ||||||
|---|---|---|---|---|---|---|
| Distribution 2 | ||||||
| DeepSurv | AECOX | |||||
| t | P | t | P | |||
| Distribution 1 | Cox-nnet | concordance index | 3.1843 | 2.32E-03 | 3.2281 | 2.04E-03 |
| p-value of log-rank test | −1.4006 | 1.67E-01 | −0.8962 | 3.74E-01 | ||
| DeepSurv | concordance index | – | – | −0.6732 | 5.03E-01 | |
| p-value of log-rank test | – | – | 0.5164 | 6.07E-01 | ||
| Notes: t denotes the pairwise paired Student’s t-test statistic, P denotes the p-value obtained. | ||||||
| (B) Linear Mixed-Effects Models Test | ||||||
| Distribution 2 | ||||||
| DeepSurv | AECOX | |||||
| P | P | |||||
| Distribution 1 | Cox-nnet | concordance index | 0.0195 | 1.97E-02 | 0.0142 | 1.12E-01 |
| p-value of log-rank test | −0.0489 | 2.52E-01 | −0.0294 | 4.85E-01 | ||
| DeepSurv | concordance index | – | – | −0.0052 | 5.85E-01 | |
| p-value of log-rank test | – | – | 0.0195 | 6.62E-01 | ||
Notes: β denotes the coefficient (slope) of linear mixed-effects models, P denotes the p-value obtained.
P-value of the log-rank test of lower dimensional representation, generated by Partitioning Around Medoids (PAM) clustering algorithm on the last hidden layers of three Deep Learning-based approaches (testing set only). 12 TCGA (The Cancer Genome Atlas) cancers are being compared. Bolded values indicate the smallest p-value among three Deep Learning approaches, refer to better low dimensional representation
| TCGA Cancers | |||
|---|---|---|---|
| Cox-nnet | DeepSurv | AECOX | |
| KIRP | 2.45E-01 | 1.40E-01 | |
| KIRC | 9.79E-02 | 3.01E-01 | |
| LIHC | 6.14E-01 | 3.86E-01 | |
| BRCA | 4.81E-01 | 4.85E-01 | |
| CESC | 3.26E-01 | 3.92E-01 | |
| LUAD | 4.07E-01 | 1.11E-01 | |
| BLCA | 3.27E-01 | 4.94E-01 | |
| HNSC | 4.06E-01 | 6.19E-01 | |
| PAAD | 2.97E-01 | 4.04E-01 | |
| OV | 3.38E-01 | 4.42E-01 | |
| STAD | 5.72E-01 | 7.01E-01 | |
| LUSC | 4.14E-01 | 6.05E-01 | |
The boldface p-value indicates it is the smallest one among all three algorithms
Fig. 3Cox-nnet performances with TMB feature input and without TMB feature input across 12 TCGA (The Cancer Genome Atlas) cancers. a concordance index; b p-value of log-rank test (in −log10 scale). Red diamonds and texts on the boxplot indicate the mean values. Cancers were ordered based on Fig. 2. Note that the performances are differ from Fig. 2 due to new patient cohorts (intersection of patients who has both RNA-seq data and TMB data). For detailed cancer names, please refer to the Additional file 1
Fig. 4a Box plot of log2 transformed tumor mutation burden (TMB) values from all available TCGA (The Cancer Genome Atlas) patients with respect to each cancer type, ordered according to Fig. 2. Texts and diamond symbols in red color indicated the mean values. b Mean TMB versus averaged concordance index results across 12 cancer types with three survival prognosis models. Pearson ρ = − 0.45 (p-value = 5.79E-03). Individual model correlations are range from −0.46 to −0.44, described in Additional file 1: Table S4. Other results of TMB statistics versus concordance index were shown in Additional file 1: Figure S2 – Figure S6.