| Literature DB >> 32241303 |
Milad Mostavi1,2, Yu-Chiao Chiu1, Yufei Huang3,4, Yidong Chen5,6.
Abstract
BACKGROUND: Precise prediction of cancer types is vital for cancer diagnosis and therapy. Through a predictive model, important cancer marker genes can be inferred. Several studies have attempted to build machine learning models for this task however none has taken into consideration the effects of tissue of origin that can potentially bias the identification of cancer markers.Entities:
Keywords: Breast cancer subtype prediction; Cancer gene markers; Cancer type prediction; Convolutional neural networks; Deep learning; The Cancer Genome Atlas
Mesh:
Substances:
Year: 2020 PMID: 32241303 PMCID: PMC7119277 DOI: 10.1186/s12920-020-0677-2
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Illustration of three CNN models. a 1D-CNN with input as a vector format with 7100 genes. b 2D-Vanilla-CNN, with an input reformatted as a 100 × 71 matrix, and one convolution layer. c 2D-Hybrid-CNN, similar input as in (b) but with two parallel convolution layers, vertical and horizontal, as in (a)
Different hyperparameter settings for 1D-CNN model based on the trained and tested statistical measures. The final selected parameters are highlighted
| Hyperparameters | Loss | |||||
|---|---|---|---|---|---|---|
| dense layer size | filter | kernel | mean train_score | stdev train_score | mean test_score | stdev test_score |
| 64 | (1, 50) | 8 | 0.069 | 0.031 | 0.167 | 0.023 |
| 64 | (1, 50) | 16 | 0.037 | 0.013 | 0.140 | 0.007 |
| 64 | (1, 50) | 32 | 0.023 | 0.003 | 0.132 | 0.006 |
| 64 | (1, 50) | 64 | 0.013 | 0.002 | 0.128 | 0.006 |
| 128 | (1, 50) | 8 | 0.032 | 0.008 | 0.147 | 0.006 |
| 128 | (1, 50) | 16 | 0.027 | 0.014 | 0.138 | 0.014 |
| 128 | (1, 50) | 32 | 0.011 | 0.003 | 0.121 | 0.009 |
| 128 | (1, 50) | 64 | 0.004 | 0.001 | 0.126 | 0.012 |
| 512 | (1, 50) | 8 | 0.009 | 0.000 | 0.138 | 0.008 |
| 512 | (1, 50) | 16 | 0.006 | 0.001 | 0.127 | 0.003 |
| 512 | (1, 50) | 32 | 0.124 | 0.179 | 0.265 | 0.160 |
| 512 | (1, 50) | 64 | 0.003 | 0.002 | 0.125 | 0.008 |
| 64 | (1, 71) | 8 | 0.072 | 0.009 | 0.177 | 0.009 |
| 64 | (1, 71) | 16 | 0.044 | 0.009 | 0.149 | 0.006 |
| 64 | (1, 71) | 32 | 0.036 | 0.011 | 0.135 | 0.009 |
| 64 | (1, 71) | 64 | 0.016 | 0.004 | 0.124 | 0.012 |
| 128 | (1, 71) | 8 | 0.046 | 0.007 | 0.154 | 0.015 |
| 128 | (1, 71) | 16 | 0.027 | 0.006 | 0.135 | 0.015 |
| 128 | (1, 71) | 64 | 0.008 | 0.001 | 0.119 | 0.003 |
| 512 | (1, 71) | 8 | 0.023 | 0.018 | 0.152 | 0.023 |
| 512 | (1, 71) | 16 | 0.009 | 0.008 | 0.132 | 0.017 |
| 512 | (1, 71) | 32 | 0.004 | 0.002 | 0.123 | 0.008 |
| 512 | (1, 71) | 64 | 0.011 | 0.016 | 0.134 | 0.015 |
| 64 | (1, 100) | 8 | 0.088 | 0.010 | 0.172 | 0.015 |
| 64 | (1, 100) | 16 | 0.066 | 0.014 | 0.162 | 0.009 |
| 64 | (1, 100) | 32 | 0.037 | 0.007 | 0.132 | 0.009 |
| 64 | (1, 100) | 64 | 0.024 | 0.009 | 0.128 | 0.013 |
| 128 | (1, 100) | 8 | 0.058 | 0.001 | 0.164 | 0.009 |
| 128 | (1, 100) | 16 | 0.031 | 0.008 | 0.144 | 0.014 |
| 128 | (1, 100) | 64 | 0.016 | 0.010 | 0.137 | 0.027 |
| 512 | (1, 100) | 8 | 0.031 | 0.013 | 0.155 | 0.014 |
| 512 | (1, 100) | 16 | 0.009 | 0.001 | 0.135 | 0.009 |
Different hyperparameter settings for 2D-Vanilla-CNN model based on the trained and tested statistical measures. The final selected parameters are highlighted
| Hyperparameters | Loss | ||||||
|---|---|---|---|---|---|---|---|
| dense layer size | filter | kernel | stride | mean train_score | stdev train_score | mean test_score | stdev test_score |
| 128 | 32 | (7, 7) | (1, 1) | 20.999 | 18.228 | 21.281 | 14.904 |
| 128 | 32 | (7, 7) | (2, 2) | 0.005 | 0.002 | 0.192 | 0.022 |
| 128 | 32 | (10, 10) | (1, 1) | 21.398 | 18.582 | 21.771 | 15.298 |
| 128 | 32 | (20, 20) | (1, 1) | 0.027 | 0.004 | 0.202 | 0.029 |
| 128 | 32 | (20, 20) | (2, 2) | 0.043 | 0.011 | 0.206 | 0.009 |
| 128 | 64 | (7, 7) | (1, 1) | 10.213 | 17.688 | 10.566 | 14.618 |
| 128 | 64 | (7, 7) | (2, 2) | 0.004 | 0.001 | 0.187 | 0.018 |
| 128 | 64 | (10, 10) | (1, 1) | 31.430 | 1.149 | 31.675 | 1.019 |
| 128 | 64 | (10, 10) | (2, 2) | 0.012 | 0.006 | 0.177 | 0.014 |
| 128 | 64 | (20, 20) | (1, 1) | 12.020 | 18.052 | 12.149 | 14.818 |
| 128 | 64 | (20, 20) | (2, 2) | 0.055 | 0.016 | 0.204 | 0.020 |
| 512 | 32 | (7, 7) | (1, 1) | 21.245 | 18.419 | 21.175 | 14.815 |
| 512 | 32 | (7, 7) | (2, 2) | 10.944 | 18.953 | 11.022 | 15.306 |
| 512 | 32 | (10, 10) | (1, 1) | 10.964 | 18.987 | 11.148 | 15.482 |
| 512 | 32 | (10, 10) | (2, 2) | 0.003 | 0.001 | 0.213 | 0.025 |
| 512 | 32 | (20, 20) | (1, 1) | 10.988 | 19.002 | 11.132 | 15.436 |
| 512 | 32 | (20, 20) | (2, 2) | 1.110 | 1.849 | 1.271 | 1.397 |
| 512 | 64 | (7, 7) | (1, 1) | 31.430 | 1.149 | 31.675 | 1.019 |
| 512 | 64 | (7, 7) | (2, 2) | 10.213 | 17.688 | 10.560 | 14.622 |
| 512 | 64 | (10, 10) | (1, 1) | 31.497 | 1.211 | 31.648 | 1.087 |
| 512 | 64 | (10, 10) | (2, 2) | 20.628 | 17.858 | 20.481 | 14.363 |
| 512 | 64 | (20, 20) | (1, 1) | 11.299 | 16.825 | 11.562 | 13.969 |
| 512 | 64 | (20, 20) | (2, 2) | 12.020 | 18.046 | 12.152 | 14.776 |
Fig. 2Cancer type prediction performance of three CNN models trained with tumor samples only. a Learning curves for all three CNN models. b Micro-averaged accuracy of three CNN models when trained with only tumor samples (light blue) from 33 tumor types, and with tumors and normal samples together (light brown). c Confusion matrix of normal samples prediction from 1D-CNN model trained with 33 tumor types only. d Confusion matrix of the 1D-CNN model on all 33 tumor types
Fig. 3Cancer type prediction performance of three CNN models trained with combined tumor and normal samples. a Learning curves for all three CNN models trained with combined tumor and normal samples. b Precision (light blue) and recall (light brown) of 1D-CNN model when trained with combined tumor normal samples. c Confusion matrix of all sample prediction from 1D-CNN model trained with 33 tumor types + normal
Fig. 4Interpretation of the 1D-CNN model. a Distributions of gene-effect scores for individual cancer and normal classes. Colors correspond to cancer types denoted in Fig. 4b. b t-SNE plots of pan-cancer and normal samples by expression of marker genes identified using different thresholds. c Marker genes identified in each class with a criterion of gene-effect score > 0.5. The dashed line denotes the average number of marker genes identified across 34 classes. d-e Differential expression of marker genes and other genes between sample classes. Here differential expression is presented by an absolute difference between a class (normal or BRCA) and all other samples in log2(FPKM+ 1). f Pan-classes gene-effect scores of three marker genes of BRCA. g Functions associated with marker genes identified in each class
Breast cancer subtype classification using 1D-CNN model
| Class name | Precision | Recall | F1-score | Number of samples |
|---|---|---|---|---|
| Basal | 0.973 | 0.980 | 0.976 | 147 |
| Her2 | 0.829 | 0.853 | 0.841 | 68 |
| Luminal A | 0.894 | 0.927 | 0.910 | 437 |
| Luminal B | 0.810 | 0.780 | 0.795 | 186 |
| Normal | 0.857 | 0.462 | 0.600 | 26 |
| Avg/Total | 0.883 | 0.884 | 0.882 | 864 |
Hyperparameters and training time of CNN models
| Training | Testing | |||||
|---|---|---|---|---|---|---|
| DL modela | Number of parameters | Loss | Accuracy | Loss | Accuracyb | Timec (seconds) |
| 1D-CNN | 211,489 | 0.01 | 0.9971 | 0.1769 | 0.9567 | 80.3 |
| 2D-Vanilla-CNN | 1,420,737 | 0.007 | 0.9981 | 0.1778 | 0.9557 | 94 |
| 2D-Hybrid-CNN | 362,177 | 0.0149 | 0.996 | 0.1586 | 0.9582 | 80.8 |
| 2D-3Layer-CNN | 26,211,233 | 0.5149 | 0.9654 | 0.6875 | 0.9184 | 214.6 |
| 2D-3Layer-CNN (with patience = 10) | 0.1976 | 0.9869 | 0.3914 | 0.9419 | 379.17 | |
aEarly stopping is used for all models (all with patience = 4, except for the last model)
bResults of 5-fold cross-validations
cAll models were trained using a Linux server with Xeon 8176 CPU @2.1GHz, with 4 × 28 cores
Fig. 5CNN models testing on noisy data. Classification accuracy on TCGA data with different additive Gaussian noise added. Both classifiers were trained with original TCGA data and tested on TCGA data + Gaussian noise