| Literature DB >> 30854458 |
Eric Wu1, Lubomir M Hadjiiski1, Ravi K Samala1, Heang-Ping Chan1, Kenny H Cha1, Caleb Richter1, Richard H Cohan1, Elaine M Caoili1, Chintana Paramagul1, Ajjai Alva2, Alon Z Weizer3.
Abstract
We compared the performance of different Deep learning-convolutional neural network (DL-CNN) models for bladder cancer treatment response assessment based on transfer learning by freezing different DL-CNN layers and varying the DL-CNN structure. Pre- and posttreatment computed tomography scans of 123 patients (cancers, 129; pre- and posttreatment cancer pairs, 158) undergoing chemotherapy were collected. After chemotherapy 33% of patients had T0 stage cancer (complete response). Regions of interest in pre- and posttreatment scans were extracted from the segmented lesions and combined into hybrid pre -post image pairs (h-ROIs). Training (pairs, 94; h-ROIs, 6209), validation (10 pairs) and test sets (54 pairs) were obtained. The DL-CNN consisted of 2 convolution (C1-C2), 2 locally connected (L3-L4), and 1 fully connected layers. The DL-CNN was trained with h-ROIs to classify cancers as fully responding (stage T0) or not fully responding to chemotherapy. Two radiologists provided lesion likelihood of being stage T0 posttreatment. The test area under the ROC curve (AUC) was 0.73 for T0 prediction by the base DL-CNN structure with randomly initialized weights. The base DL-CNN structure with pretrained weights and transfer learning (no frozen layers) achieved test AUC of 0.79. The test AUCs for 3 modified DL-CNN structures (different C1-C2 max pooling filter sizes, strides, and padding, with transfer learning) were 0.72, 0.86, and 0.69. For the base DL-CNN with (C1) frozen, (C1-C2) frozen, and (C1-C2-L3) frozen, the test AUCs were 0.81, 0.78, and 0.71, respectively. The radiologists' AUCs were 0.76 and 0.77. DL-CNN performed better with pretrained than randomly initialized weights.Entities:
Keywords: CT; bladder; deep-learning; segmentation; transfer learning; treatment response
Mesh:
Substances:
Year: 2019 PMID: 30854458 PMCID: PMC6403041 DOI: 10.18383/j.tom.2018.00036
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1.Example of a prepost lesion pair generated ROI. In this example the case was stage T2 in pretreatment, and stage T1 in post-treatment, resulting in a label of >T0.
Figure 2.Subset of 6209 total regions of interest (ROIs) used in training set. Cases with complete response (T0) to treatment (A). Cases that did not fully respond (>T0) to treatment (B).
Figure 3.TensorFlow graph of base deep learning-convolutional neural network (DL-CNN) structure with different layers marked. The hybrid pre–post lesion pair ROIs were input to the DL-CNN, which then predicted a likelihood score of a complete response (T0) as an output.
Modifications in Layers C1 and C2 for Each Structure Variation
| Base | DL-CNN-1 | DL-CNN-2 | DL-CNN-3 | |
|---|---|---|---|---|
| C1 | ||||
| Convolution | ||||
| Size | 5 × 5 | 5 × 5 | 5 × 5 | 5 × 5 |
| Stride | 1 | 1 | 2 | 1 |
| Max Pooling | ||||
| Size | 3 × 3 | 5 × 5 | 3 × 3 | 3 × 3 |
| Stride | 2 | 2 | 2 | 2 |
| Padding | Valid | Valid | Valid | Same |
| C2 | ||||
| Convolution | ||||
| Size | 5 × 5 | 5 × 5 | 5 × 5 | 5 × 5 |
| Stride | 1 | 1 | 1 | 1 |
| Max Pooling | ||||
| Size | 3 × 3 | 2 × 2 | 2 × 2 | 4 × 4 |
| Stride | 2 | 1 | 1 | 2 |
Test AUC Values for DL-CNN Models with Modified Structures
| DL-CNN Type | Base DL-CNN Structure(Random Weights) | Base DL-CNN Structure(Pretrained Weights) | DL-CNN-1 | DL-CNN-2 | DL-CNN-3 |
|---|---|---|---|---|---|
| AUC | 0.73 ± 0.08 | 0.79 ± 0.07 | 0.72 ± 0.08 | 0.86 ± 0.06 | 0.69 ± 0.09 |
Test AUC Values for DL-CNN Models with Transfer Learning and Different Frozen Layers
| DL-CNN Type | Base DL-CNN Structure (Pretrained Weights) | C1 Frozen | C1, C2 Frozen | C1, C2, L3 Frozen |
|---|---|---|---|---|
| AUC | 0.79 ± 0.07 | 0.81 ± 0.07 | 0.78 ± 0.08 | 0.71 ± 0.08 |
Figure 4.Test ROC curves of different DL-CNN models. ROC graph comparing base DL-CNN model (base structure) to DL-CNN models with modified structure (A). ROC graph comparing base DL-CNN model (base structure) with pretrained weights but no frozen layers to DL-CNN models with frozen layers (B).
Test AUC Values for Radiologists and Methods Used in Cha et al. Study
| DL-CNN Type | Base DL-CNN Structure(Random Weights) | Radiologist 1 | Radiologist 2 | DL-CNN (Cha) | RF-SL | RF-ROI |
|---|---|---|---|---|---|---|
| AUC | 0.73 ± 0.08 | 0.76 ± 0.08 | 0.77 ± 0.08 | 0.73 ± 0.08 | 0.77 ± 0.08 | 0.69 ± 0.08 |
Test Sensitivity and Accuracy of DL-CNN Models at a Specificity of 80%
| Sensitivity (%) | Accuracy (%) | |
|---|---|---|
| Base Structure (Pretrained weights) | 59.5% | 64.1% |
| Base Structure (Random Weights) | 41.7% | 71.5% |
| DL-CNN-1 | 50.0% | 73.3% |
| DL-CNN-2 | 75.0% | 78.9% |
| DL-CNN-3 | 50.0% | 73.3% |
| C1 Frozen | 58.3% | 75.2% |
| C1, C2 Frozen | 58.3% | 75.2% |
| C1, C2, L3 Frozen | 58.3% | 75.2% |
Figure 5.Examples of cases that the DL-CNN predicted correctly and incorrectly. The base DL-CNN with transferred weights correctly predicted this lesion as >T0 (A). The base DL-CNN with transfer learning incorrectly predicted this T0 lesion as >T0 (B).