| Literature DB >> 31510656 |
Anika Cheerla1, Olivier Gevaert2.
Abstract
MOTIVATION: Estimating the future course of patients with cancer lesions is invaluable to physicians; however, current clinical methods fail to effectively use the vast amount of multimodal data that is available for cancer patients. To tackle this problem, we constructed a multimodal neural network-based model to predict the survival of patients for 20 different cancer types using clinical data, mRNA expression data, microRNA expression data and histopathology whole slide images (WSIs). We developed an unsupervised encoder to compress these four data modalities into a single feature vector for each patient, handling missing data through a resilient, multimodal dropout method. Encoding methods were tailored to each data type-using deep highway networks to extract features from clinical and genomic data, and convolutional neural networks to extract features from WSIs.Entities:
Mesh:
Year: 2019 PMID: 31510656 PMCID: PMC6612862 DOI: 10.1093/bioinformatics/btz342
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Data distribution of TCGA data including missing data
| Data type | Number of cases | Number of missing cases | Percentage missing (%) |
|---|---|---|---|
| Gene expression data | 10 198 | 962 | 8.62 |
| MicroRNA expression data | 10 125 | 1035 | 9.27 |
| WSI slide data | 10 914 | 246 | 2.2 |
| Clinical data | 7512 | 3648 | 32.69 |
| Survival target data (time of death) | 11 121 | 39 | 0.35 |
| Patients with complete data | 6404 | 4756 | 42.62 |
Note: Survival data are available for the majority of patients, while microRNA and clinical data are missing in a subset of patients. Nearly 43% of patients have at least one type of missing data.
Fig. 1.Kaplan–Meier survival curves for all cancer sites in TCGA demonstrating that overall survival is tissue specific. The first graph contains the 10 cancers with the highest mean overall survival, the second graph contains the 10 cancers with the lowest mean overall survival
Fig. 2.Structure of the unsupervised model: the similarity loss can be visualized as projecting representations of different modalities in the same space. Each modality uses a different network architecture. For the clinical data, we use FC layers with sigmoid activations, for the genomic data we use deep highway networks (Srivastava ) and for the WSI images, we use the SqueezeNet architecture (Iandola ) (see main text for architecture details). These architectures generate feature vectors that are then aggregated into a single representation and used to predict overall survival
Fig. 3.The SqueezeNet model architecture. The SqueezeNet architecture consists of a set of fire modules interspersed with maxpool layers. Each fire module consists of a squeeze layer (with 1 × 1 convolution filters) and expand layer (with a mix of 1 × 1 and 3 × 3 convolution filters). This fire module architecture helps to reduce the parameter space for faster training. We replaced the final softmax layer of the original SqueezeNet model with the 512-length feature encoding predictor
Fig. 4.T-SNE-mapped representations of feature vectors T-SNE-mapped representations of feature vectors for 500 patients within the testing set. The 512-length feature vectors were compressed using PCA (50 features) and T-SNE into the 2D space. These representations manage to capture relationships between patients; e.g. patients with the same sex were generally clustered together (left image), and to a lesser extent, patients of the same race and same cancer type tended to be clustered as well (center and right), even when those clinical features were not provided to the model
Fig. 5.Evaluation of multimodal dropout: learning rate in terms of C-index of the model on the validation dataset for predicting prognosis across 20 cancer sites combining multimodal data. The model converges after 40 epochs and shows that multimodal dropout improves the validation performance
Model performance using C-index on the 20 studied cancer types, using different combinations of data modalities
| Clin+miRNA+mRNA+WSI | Clin+miRNA | Clin+mRNA | Clin+miRNA+mRNA | Clin+miRNA+WSI | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cancer site | Baseline | Multimodal dropout | Delta (%) | Baseline | Multimodal dropout | Delta (%) | Baseline | Multimodal dropout | Delta (%) | Baseline | Multimodal dropout | Delta (%) | Baseline | Multimodal dropout | Delta (%) |
| BLCA | 0.65 | 0.73 | 12.6 | 0.66 | 0.69 | 4.4 | 0.60 | 0.58 | −4.4 | 0.65 | 0.62 | −5.1 | 0.65 | 0.68 | 4.3 |
| BRCA | 0.77 | 0.79 | 3.0 | 0.80 | 0.80 | −0.1 | 0.57 | 0.56 | −1.9 | 0.73 | 0.73 | 0.3 | 0.77 | 0.77 | 0.0 |
| CESC | 0.73 | 0.76 | 4.6 | 0.77 | 0.76 | −1.2 | 0.67 | 0.62 | −6.9 | 0.74 | 0.74 | 0.4 | 0.78 | 0.76 | −2.5 |
| COADREAD | 0.72 | 0.74 | 3.8 | 0.78 | 0.75 | −4.8 | 0.72 | 0.58 | −20.0 | 0.77 | 0.64 | −16.9 | 0.70 | 0.74 | 4.5 |
| HNSC | 0.61 | 0.67 | 10.4 | 0.64 | 0.64 | 0.7 | 0.58 | 0.55 | −5.4 | 0.63 | 0.66 | 4.6 | 0.61 | 0.65 | 6.6 |
| KICH | 0.95 | 0.93 | −2.0 | 0.82 | 0.85 | 3.0 | 0.80 | 0.84 | 5.5 | 0.73 | 0.77 | 5.9 | 0.81 | 0.88 | 9.7 |
| KIRC | 0.73 | 0.73 | −0.3 | 0.70 | 0.72 | 3.1 | 0.61 | 0.65 | 5.9 | 0.65 | 0.66 | 2.7 | 0.68 | 0.61 | −11.1 |
| KIRP | 0.84 | 0.79 | −6.0 | 0.76 | 0.79 | 4.1 | 0.65 | 0.64 | −1.0 | 0.61 | 0.70 | 14.5 | 0.79 | 0.86 | 9.2 |
| LAML | 0.66 | 0.67 | 1.8 | 0.69 | 0.79 | 14.9 | 0.57 | 0.61 | 7.4 | 0.66 | 0.57 | −12.8 | 0.61 | 0.59 | −2.8 |
| LGG | 0.83 | 0.85 | 3.4 | 0.79 | 0.81 | 2.0 | 0.63 | 0.67 | 6.3 | 0.77 | 0.78 | 1.4 | 0.76 | 0.82 | 8.2 |
| LIHC | 0.72 | 0.77 | 7.6 | 0.73 | 0.74 | 2.7 | 0.64 | 0.69 | 7.7 | 0.68 | 0.67 | −1.8 | 0.70 | 0.77 | 11.2 |
| LUAD | 0.72 | 0.73 | 1.3 | 0.72 | 0.72 | −0.9 | 0.63 | 0.58 | −8.9 | 0.73 | 0.69 | −5.1 | 0.69 | 0.77 | 10.5 |
| LUSC | 0.67 | 0.66 | −0.9 | 0.72 | 0.67 | −6.5 | 0.50 | 0.51 | 2.1 | 0.62 | 0.60 | −2.9 | 0.67 | 0.68 | 0.5 |
| OV | 0.63 | 0.67 | 6.4 | 0.65 | 0.63 | −2.2 | 0.47 | 0.52 | 11.5 | 0.59 | 0.61 | 3.5 | 0.62 | 0.69 | 10.4 |
| PAAD | 0.71 | 0.74 | 3.5 | 0.68 | 0.71 | 3.8 | 0.57 | 0.61 | 7.6 | 0.59 | 0.64 | 8.9 | 0.69 | 0.69 | 0.3 |
| PRAD | 0.77 | 0.81 | 0.0 | 0.64 | 0.64 | −0.3 | 0.60 | 0.58 | −3.5 | 0.59 | 0.78 | 32.8 | 0.53 | 0.60 | 13.4 |
| SKCM | 0.68 | 0.72 | 5.2 | 0.68 | 0.68 | −0.1 | 0.56 | 0.55 | −0.1 | 0.58 | 0.72 | 24.3 | 0.67 | 0.72 | 6.8 |
| STAD | 0.76 | 0.78 | 2.6 | 0.75 | 0.76 | 1.5 | 0.63 | 0.54 | −13.9 | 0.80 | 0.69 | −14.1 | 0.72 | 0.74 | 2.6 |
| THCA | 0.95 | 0.90 | −4.8 | 0.97 | 0.95 | −2.6 | 0.82 | 0.54 | −34.2 | 0.70 | 0.83 | 18.7 | 0.93 | 0.94 | 1.4 |
| UCEC | 0.85 | 0.85 | 0.6 | 0.81 | 0.85 | 4.3 | 0.63 | 0.63 | 0.0 | 0.66 | 0.78 | 18.2 | 0.77 | 0.80 | 3.0 |
| Average improvement | 2.8% | 1.3% | −2.3% | 3.9% | 4.3% | ||||||||||
| Pancancer | 0.75 | 0.78 | 4.5 | 0.74 | 0.78 | 4.3 | 0.60 | 0.60 | −1.2 | 0.75 | 0.78 | 3.6 | 0.76 | 0.78 | 3.2 |
Note: Cancer sites are defined according to TCGA cancer codes. For each cancer, the best result is bold faced. Delta refers to the relative performance improvement of the multimodal dropout model compared to the baseline.
Clin, clinical data; miRNA, microRNA expression data; mRNA, mRNA expression data; WSI, whole slide images.
Comparison of pancancer training with single cancer training using the C-index showing that in the case of integrating clinical, miRNA, mRNA and WSI using multimodal dropout, for all but one cancer site (KIRC), pancancer training performs equally or outperforms training on each cancer individually
| Cancer site | Single cancer | Pancancer | Difference (%) |
|---|---|---|---|
| BLCA | 0.60 | 0.73 | 22 |
| BRCA | 0.62 | 0.79 | 28 |
| CESC | 0.52 | 0.76 | 48 |
| COADREAD | 0.58 | 0.74 | 28 |
| HNSC | 0.64 | 0.67 | 6 |
| KICH | 0.69 | 0.93 | 34 |
| KIRC | 0.78 | 0.73 | −6 |
| KIRP | 0.51 | 0.79 | 56 |
| LAML | 0.65 | 0.67 | 4 |
| LGG | 0.73 | 0.85 | 18 |
| LIHC | 0.78 | 0.77 | 0 |
| LUAD | 0.72 | 0.73 | 1 |
| LUSC | 0.63 | 0.66 | 5 |
| OV | 0.54 | 0.67 | 24 |
| PAAD | 0.57 | 0.74 | 30 |
| PRAD | 0.76 | 0.81 | 7 |
| SKCM | 0.54 | 0.72 | 33 |
| STAD | 0.60 | 0.78 | 29 |
| THCA | 0.53 | 0.90 | 69 |
| UCEC | 0.67 | 0.85 | 28 |