| Literature DB >> 30061525 |
Sen Liang1, Rongguo Zhang2, Dayang Liang3, Tianci Song4, Tao Ai5, Chen Xia6, Liming Xia7, Yan Wang8,9.
Abstract
Non-invasive prediction of isocitrate dehydrogenase (IDH) genotype plays an important role in tumor glioma diagnosis and prognosis. Recently, research has shown that radiology images can be a potential tool for genotype prediction, and fusion of multi-modality data by deep learning methods can further provide complementary information to enhance prediction accuracy. However, it still does not have an effective deep learning architecture to predict IDH genotype with three-dimensional (3D) multimodal medical images. In this paper, we proposed a novel multimodal 3D DenseNet (M3D-DenseNet) model to predict IDH genotypes with multimodal magnetic resonance imaging (MRI) data. To evaluate its performance, we conducted experiments on the BRATS-2017 and The Cancer Genome Atlas breast invasive carcinoma (TCGA-BRCA) dataset to get image data as input and gene mutation information as the target, respectively. We achieved 84.6% accuracy (area under the curve (AUC) = 85.7%) on the validation dataset. To evaluate its generalizability, we applied transfer learning techniques to predict World Health Organization (WHO) grade status, which also achieved a high accuracy of 91.4% (AUC = 94.8%) on validation dataset. With the properties of automatic feature extraction, and effective and high generalizability, M3D-DenseNet can serve as a useful method for other multimodal radiogenomics problems and has the potential to be applied in clinical decision making.Entities:
Keywords: World Health Organization grade; gliomas; isocitrate dehydrogenase genotype; magnetic resonance imaging; multimodal deep learning; three-dimensional DenseNet model
Year: 2018 PMID: 30061525 PMCID: PMC6115744 DOI: 10.3390/genes9080382
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Clinical characteristics of the patients.
| Clinical Features | Value |
|---|---|
| No. of patients | 167 |
| Age, mean ± SD | 52.4 ± 15.5 |
| <30 | 18 (10.8%) |
| 30–60 | 90 (53.9%) |
| 60–80 | 55 (32.9%) |
| ≥80 | 3 (1.8%) |
| Sex | |
| Male | 90 (54.2%) |
| Female | 74 (45.8%) |
| Tumor Grade | |
| Low-grade (grade II, III) | 65 (38.9%) |
| High-grade (grade IV) | 102 (61.1%) |
| Mutant | 53 (31.7%) |
| Wild-type | 114 (68.3%) |
IDH: isocitrate dehydrogenase; SD: standard deviation.
Figure 1Data preprocessing flow diagram. (i) We first masked data with a magnetic resonance imaging (MRI) image volume data and ground truth label, (ii) and then cropped the lesion to a small cube, (iii) followed by filling into a uniform shape to produce our input data called patched data. (iv) At last, considering small data quantity and model overfitting, we did data augmentation for each patched data 648 times by flipping and shifting.
Figure 2A schematic illustration of multimodal three-dimensional DenseNet (M3D-DenseNet). Our M3D-DenseNet takes four MRI sequences (l, w, h) as input and then concatenates the four modalities as a four-dimensional matrix (4, l, w, h). All convolutions and poolings in the network are 3D operations, and are the same in the four dense blocks. T1: native; T1Gd: post-contrast T1-weighted ; T2: T2-weighted; FLAIR: T2 fluid attenuated inversion recovery.
Multimodal three-dimensional DenseNet architecture.
| Layers | Output Size | M3D-DenseNet-121 | M3D-DenseNet-161 | M3D-DenseNet-169 | M3D-DenseNet-201 |
|---|---|---|---|---|---|
| Convolution | |||||
| Pooling | |||||
| 3D Dense Block (1) * | |||||
| Transition Layer (1) * | |||||
| 3D Dense Block (2) * | |||||
| Transition Layer (2) * | |||||
|
| |||||
| 3D Dense Block (3) * |
|
|
|
|
|
| Transition Layer (3) * |
| ||||
|
| |||||
| 3D Dense Block (4) * |
|
|
|
|
|
| Classification Layer |
| ||||
| 2D fully-connected, softmax | |||||
Note: The growth rate parameter of 3D DenseNet is set to 32. Each conv layer shows in the table represents the layer sequence BN-ReLU-Conv (BN: batch normalization layer, ReLU: rectified linear unit activation layer, Conv: convolutional layer). * Numbers refer to Figure 2, 3D Dense Block 1, 2, 3 and 4, respectively.
Isocitrate dehydrogenase genotype prediction performance.
| Training Dataset | Validation Dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| AUC | AUC | |||||||
| 88.9% | 92.6% | 87.2% | 97.1% | 84.6% | 78.5% | 88.0% | 85.7% | |
| 91.3% | 82.9% | 95.3% | 97.5% | 82.1% | 57.1% | 96.0% | 85.0% | |
| 85.0% | 85.3% | 84.9% | 94.2% | 82.1% | 64.3% | 92.0% | 82.8% | |
| 87.4% | 63.4% | 98.8% | 94.6% | 76.9% | 42.8% | 96.0% | 85.7% | |
ACC: overall accuracy; SN: sensitivity; SP: specificity; AUC: area under curve; MNet-121/161/169/201: M3D-DenseNet with 121/161/169/201 layers.
Figure 3Comparing receiver operating characteristic (ROC) curve of different depth layer models on IDH genotype prediction experiments. AUC: area under the curve.
Comparing the performance between single-modality and multi-modality model.
| Training Dataset | Validation Dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| AUC | AUC | |||||||
| 67.7% | 56.6% | 92.4% | 67.6% | 64.1% | 43.2% | 80.2% | 47.4% | |
| 77.9% | 82.9% | 75.6% | 87.5% | 74.4% | 78.6% | 72.0% | 81.6% | |
| 74.8% | 65.9% | 79.0% | 81.0% | 74.3% | 50.0% | 88.0% | 74.6% | |
| 76.3% | 31.7% | 97.7% | 81.0% | 71.8% | 35.7% | 92.0% | 72.6% | |
| 88.9% | 92.6% | 87.2% | 97.1% | 84.6% | 78.5% | 88.0% | 85.7% | |
Single-modality 3D DenseNet with 121 layers; MNet: M3D-DenseNet with 121 layers.
Figure 4Comparing ROC curve between single-modality and multi-modality models.
Figure 5Comparing AUCs between single-modality and multi-modality models. Grouped-SNet: grouped single-modality models, Grouped-MNet: grouped multi-modality models. We grouped the results of five-fold cross-validation from all single-modality models as Grouped-SNet, and also grouped the results of five-fold cross-validation from all multi-modality models as Grouped-MNet. The student’s t-test result comparing Grouped-SNet and Grouped-MNet is statistically significant.
World Health Organization (WHO) grade status prediction performance.
| Training Dataset | Validation Dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| AUC | AUC | |||||||
| 75.2% | 96.3% | 42.3% | 88.4% | 80.0% | 100% | 46.1% | 84.3% | |
| 75.9% | 62.9% | 96.2% | 94.3% | 77.1% | 68.2% | 92.3% | 87.9% | |
| 91.7% | 93.8% | 88.5% | 96.9% | 85.7% | 86.4% | 84.6% | 91.1% | |
| 90.2% | 88.9% | 92.3% | 95.3% | 91.4% | 92.3% | 92.3% | 94.8% | |
M3D-DenseNet with 121/161/169/201 layers.
Figure 6Comparing ROC curve of different depth layer models on World Health Organization (WHO) grade prediction experiments.