| Literature DB >> 32548298 |
Dmitry Cherezov1, Rahul Paul1, Nikolai Fetisov1, Robert J Gillies2, Matthew B Schabath3, Dmitry B Goldgof1, Lawrence O Hall1.
Abstract
Noninvasive diagnosis of lung cancer in early stages is one task where radiomics helps. Clinical practice shows that the size of a nodule has high predictive power for malignancy. In the literature, convolutional neural networks (CNNs) have become widely used in medical image analysis. We study the ability of a CNN to capture nodule size in computed tomography images after images are resized for CNN input. For our experiments, we used the National Lung Screening Trial data set. Nodules were labeled into 2 categories (small/large) based on the original size of a nodule. After all extracted patches were re-sampled into 100-by-100-pixel images, a CNN was able to successfully classify test nodules into small- and large-size groups with high accuracy. To show the generality of our discovery, we repeated size classification experiments using Common Objects in Context (COCO) data set. From the data set, we selected 3 categories of images, namely, bears, cats, and dogs. For all 3 categories a 5- × 2-fold cross-validation was performed to put them into small and large classes. The average area under receiver operating curve is 0.954, 0.952, and 0.979 for the bear, cat, and dog categories, respectively. Thus, camera image rescaling also enables a CNN to discover the size of an object. The source code for experiments with the COCO data set is publicly available in Github (https://github.com/VisionAI-USF/COCO_Size_Decoding/).Entities:
Keywords: Convolutional neural network; camera images; computed tomography; explanation; lung cancer
Mesh:
Year: 2020 PMID: 32548298 PMCID: PMC7289250 DOI: 10.18383/j.tom.2019.00024
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1.Cropping (A) and warping (B) patch extraction methods. The solid line represents the region of interest border. The dashed line represents an extracted patch border. This assumes that convoluted neural network (CNN) input is a 100- × 100-pixel image. X and Y represent the corresponding patch's width and height, respectively.
Number of Patients in Groups after Labeling Nodules by Size
| Threshold | Cohort1 T0 | Cohort2 T0 | Cohort2 T1 | Cohort2 T2 | ||||
|---|---|---|---|---|---|---|---|---|
| Small | Large | Small | Large | Small | Large | Small | Large | |
| Longest diameter 6 mm | 57 | 204 | 44 | 193 | 39 | 171 | 44 | 166 |
| Longest diameter 8 mm | 129 | 132 | 126 | 111 | 106 | 104 | 89 | 121 |
| Longest diameter 10 mm | 183 | 65 | 172 | 65 | 140 | 70 | 126 | 84 |
| Median of min size | 122 | 139 | 89 | 148 | 128 | 82 | 124 | 86 |
| Median nodule area | 128 | 133 | 99 | 138 | 123 | 87 | 117 | 93 |
| Total | 261 | 237 | 210 | 210 | ||||
The number of patients in cohort 2 at T0 and T1/T2 vary because some patients were excluded due to low image quality or patient removal for the trial.
Figure 2.The CNN architecture used for size classification in the Common Objects in Context (COCO) data set. There are 8 convolution layers with 3 × 3 kernels. Each convolution layer is followed by a max-pooling layer with a 2 × 2 window and stride equal to 2. For all but the last layers, the rectified linear unit (ReLU) activation function was used. The softmax activation function was used for the last fully connected (FC) layer. Dropout for all FC layers was set to 0.75.
Accuracy and AUC (in Brackets) of a CNN Trained from Scratch for Classification a Nodule Original Size Group (Experiment 1)
| Threshold | Cohort2 T0 (%) | Cohort2 T1 (%) | Cohort2 T2 (%) |
|---|---|---|---|
| Longest diameter 6 mm | 95 (0.97) | 79.52 (0.85) | 81.4 (0.85) |
| Longest diameter 8 mm | 89 (0.947) | 79 (0.839) | 76 (0.82) |
| Longest diameter 10 mm | 94.5 (0.9784) | 87 (0.867) | 84 (0.877) |
| Median of min size | 99.2 (0.9998) | 92.38 (0.94) | 94.28 (0.95) |
| Median nodule size | 94.93 (0.9894) | 97.14 (0.9978) | 95.7 (0.9974) |
Accuracy and AUC (in Brackets) of a CNN Trained for Nodule Original Size Classification after Tuning for Cancer Classification (Experiment 2)
| Threshold | LD 6 mm | LD 8 mm | LD 10 mm | Median of Min Size | Median Nodule Size |
|---|---|---|---|---|---|
| Accuracy (%) | 72.15 (0.76) | 74.26 (0.788) | 75.1 (0.8182) | 74.26 (0.786) | 74.26 (0.794) |
Accuracy of a CNN trained from scratch to classify cancer is 76%. Accuracy of cancer classification using a tumor volume only is 71.6%.
Accuracy and AUC (in Brackets) of a CNN Trained for Cancer Classification after Tuning to Classify a Nodules Original Size Group (Experiment 3)
| Threshold | Cohort2 T0 (%) | Cohort2 T1 (%) | Cohort2 T2 (%) |
|---|---|---|---|
| Longest diameter 6 mm | 93.67 (0.969) | 79.52 (0.82) | 81.4 (0.858) |
| Longest diameter 8 mm | 90.3 (0.923) | 81 (0.8438) | 80.5 (0.828) |
| Longest diameter 10 mm | 93.67 (0.9763) | 87.14 (0.9235) | 84.76 (0.907) |
| Median of min size | 100 ( | 92.4 (0.937) | 94.3 (0.962) |
| Median nodule size | 97.89 (0.989) | 98.57 (0.989) | 98.09 (0.99) |
Accuracy and AUC (in Brackets) Results for 5- × 2-Fold Cross-Validation in the COCO Data Set
| Run | Fold | Bear | Cat | Dog |
|---|---|---|---|---|
| 1 | A | 89.8 (0.942) | 88.5 (0.929) | 93.4 (0.983) |
| B | 89.9 (0.964) | 88.2 (0.968) | 93.9 (0.974) | |
| 2 | A | 85.2 (0.946) | 88.6 (0.966) | 93 (0.98) |
| B | 88.3 (0.965) | 89.8 (0.956) | 94.1 (0.98) | |
| 3 | A | 88.5 (0.964) | 88.6 (0.951) | 86.3 (0.971) |
| B | 90.7 (0.969) | 86.6 (0.951) | 91.6 (0.98) | |
| 4 | A | 88.5 (0.97) | 87.2 (0.954) | 92.4 (0.986) |
| B | 90.5 (0.944) | 89.2 (0.959) | 93.2 (0.981) | |
| 5 | A | 89.8 (0.95) | 88.8 (0.953) | 93.1 (0.982) |
| B | 88.8 (0.933) | 88.8 (0.94) | 93.5 (0.982) |