| Literature DB >> 30854457 |
Rahul Paul1, Matthew Schabath2, Yoganand Balagurunathan3, Ying Liu4, Qian Li4, Robert Gillies3, Lawrence O Hall1, Dmitry B Goldgof1.
Abstract
Quantitative features are generated from a tumor phenotype by various data characterization, feature-extraction approaches and have been used successfully as a biomarker. These features give us information about a nodule, for example, nodule size, pixel intensity, histogram-based information, and texture information from wavelets or a convolution kernel. Semantic features, on the other hand, can be generated by an experienced radiologist and consist of the common characteristics of a tumor, for example, location of a tumor, fissure, or pleural wall attachment, presence of fibrosis or emphysema, concave cut on nodule surface. These features have been derived for lung nodules by our group. Semantic features have also shown promise in predicting malignancy. Deep features from images are generally extracted from the last layers before the classification layer of a convolutional neural network (CNN). By training with the use of different types of images, the CNN learns to recognize various patterns and textures. But when we extract deep features, there is no specific naming approach for them, other than denoting them by the feature column number (position of a neuron in a hidden layer). In this study, we tried to relate and explain deep features with respect to traditional quantitative features and semantic features. We discovered that 26 deep features from the Vgg-S neural network and 12 deep features from our trained CNN could be explained by semantic or traditional quantitative features. From this, we concluded that those deep features can have a recognizable definition via semantic or quantitative features.Entities:
Keywords: CNN; deep features; explainable AI; interpretation of features; quantitative features; radiomics; semantic features
Mesh:
Year: 2019 PMID: 30854457 PMCID: PMC6403047 DOI: 10.18383/j.tom.2018.00034
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1.Selection of cohort 1 and cohort 2.
Description of Semantic Features
| Characteristic | Definition | Scoring |
|---|---|---|
| Location | ||
| 1. Lobe Location | Lobe location of the nodule | Left lower lobe (5), left upper lobe (4), right lower lobe (3), right middle lobe (2), right upper lobe (1) |
| Size | ||
| 2. Long-Axis Diameter | Longest diameter of the nodule | NA |
| 3. Short-Axis Diameter | Longest perpendicular diameter of nodule in the same section | NA |
| Shape | ||
| 4. Contour | Roundness of the nodule | 1, round; 2, oval; 3, irregular |
| 5. Lobulation | Wavy nodule's surface | 1, none; 2, yes |
| 6. Concavity | Concave cut on nodule surface | 1, none; 2, slight concavity; 3, deep concavity |
| Margin | ||
| 7. Border Definition | Edge appearance of the nodule | 1, well defined; 2, slight poorly; 3, poorly defined |
| 8. Spiculation | Lines radiating from the margins of tumor | 1, none; 2 yes |
| Attenuation | ||
| 9. Texture | Solid, non-solid, part solid | 1, non-solid; 2, part solid; 3, solid |
| 10. Cavitation | Presence of air in the tumor at the time of diagnosis | 0, no; 1, yes |
| External | ||
| 11. Fissure Attachment | Nodule attaches to the fissure | 0, no; 1, yes |
| 12. Pleural Attachment | Nodules attaches to the pleura | 0, no; 1, yes |
| 13. Vascular Convergence | Convergence of vessels to nodule | 0, no significant convergence; 1, significant |
| 14. Pleural Retraction | Retraction of the pleura towards nodule | 0, absence of pleural retraction; 1, present |
| 15. Peripheral Emphysema | Peripheral emphysema caused by nodule | 1, absence of emphysema; 2, slight present; 3 severely present |
| 16. Peripheral Fibrosis | Peripheral fibrosis caused by nodule | 1, absence of fibrosis; 2, slight present; 3 severely present |
| 17. Vessel Attachment | Nodule attachment to blood vessel | 0, no; 1, yes |
| Associated Findings | ||
| 18. Nodules in Primary Lobe | Any nodules suspected to be malignant or intermediate | 0, no; 1, yes |
| 19. Nodules in Nonprimary Lobe | Any nodules suspected to be malignant or intermediate | 0, no; 1, yes |
| 20. Lymphadenopathy | Lymph nodes with short- axis diameter greater than 1 cm | 0, no; 1, yes |
Description of Rider Stable Traditional Quantitative Features
| Characteristic | Features |
|---|---|
| Size | 1. Long-axis diameter |
| 2. Short-axis diameter | |
| 3. Long-axis diameter × short-axis diameter | |
| 4. Volume (cm) | |
| 5. Volume (pixel) | |
| 6. Number of pixels | |
| 7. Length/width | |
| Pixel Intensity Histogram | 8. Mean (HU) |
| 9. Stand deviation (HU) | |
| Tumor Location | 10. 8a_3D_ is attached to pleural wall |
| 11. 8b_3D Relative border to lung | |
| 12. 8c_3D_Relative border to pleural wall | |
| 13. 9e_3D_Standard deviation_COG to border | |
| 14. 9g_3D_max_Dist_COG to border | |
| Tumor Shape (Roundness) | 15. 9b-3D circularity |
| 16. 5a_3D- MacSpic | |
| 17. Asymmetry | |
| 18. Roundness | |
| Run-length and Co-occurrence | 19. Avg_RLN |
| Law's Texture Feature | 20. E5 E5 L5 layer 1 |
| 21. E5 E5 R5 layer 1 | |
| 22. E5 W5 L5 layer 1 | |
| 23. L5 W5 L5 layer 1 |
Figure 2.(Left) lung image with nodule inside outlined in blue (nodule pixel size =0.74 mm), with box used for extracted nodule in red, (Right) extracted nodule.
Our Designed CNN architecture
| Layers | Parameter | Total Parameters |
|---|---|---|
| Left branch | ||
| Input Image | 100 × 100 | |
| Max Pool 1 | 10 × 10 | |
| Dropout | 0.1 | |
| Right branch | ||
| Input Image | 100 × 100 | |
| Conv 1 | 64 × 5 × 5, pad 0, stride 1 | |
| Leaky ReLU | alpha = 0.01 | |
| Max Pool 2a | 3 × 3, pad 0, stride 3 | 39,553 |
| Conv 2 | 64 × 2 × 2, pad 0, stride 1 | |
| Leaky ReLU | alpha = 0.01 | |
| Max Pool 2b | 3 × 3, pad 0, stride 3 | |
| Dropout | 0.1 | |
| Concatenate Left Branch + Right Branch | ||
| Conv 3 + ReLU | 64 × 2 × 2, pad 0, stride 1 | |
| Max Pool 3 | 2 × 2, pad 0, stride 2 | |
| L2 regularizer | 0.01 | |
| Dropout | 0.1 | |
| Fully Connected 1 | 1 sigmoid | |
Figure 3.Overview of the approach taken in this study.
Classification performance After Features Removal
| Features | Feature Names | Accuracy |
|---|---|---|
| Semantic Features | Long-axis diameter | 82.70 (0.82) |
| Lobulation | 82.70 (0.83) | |
| Concavity | 83.24 (0.83) | |
| Spiculation | 83.24 (0.83) | |
| Texture | 82.70 (0.83) | |
| Cavitation | 82.70 (0.83) | |
| Vascular convergence | 83.24 (0.84) | |
| Peripheral fibrosis | 82.70 (0.83) | |
| Nodules in primary lobe | 81.62 (0.83) | |
| Traditional Quantitative Features | 9b-3D circularity | 82.16 (0.86) |
| Roundness | 82.70 (0.87) | |
| L5W5L5 layer 1 | 82.70 (0.87) |
These features were from our chosen subset of features, leaving 12 features for training/testing.
Semantic and Traditional Quantitative Features and Corresponding Deep Feature(s)
| Features | Feature Names | Deep Features from Vgg-S With Correlation Value | Deep Features from Our Trained CNN With Correlation Value | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Semantic Features | Long-axis diameter | 3353 | 2135 | 230 | ||||||
| 0.4334 | 0.42 | 0.3055 | ||||||||
| Lobulation | 3534 | 1372 | 2975 | 2111 | NA | |||||
| 0.5742 | 0.5614 | 0.5611 | 0.5520 | |||||||
| Concavity | 3534 | 2975 | 1372 | 2111 | 3246 | 547 | 440 | |||
| 0.5 | 0.4839 | 0.4837 | 0.475 | 0.4612 | 0.1776 | 0.1514 | ||||
| Spiculation | 2811 | NA | ||||||||
| 0.4111 | ||||||||||
| Texture | 1201 | 3350 | NA | |||||||
| −0.3119 | 0.2936 | |||||||||
| Cavitation | 3353 | 526 | 395 | |||||||
| 0.3888 | 0.3551 | 0.2748 | ||||||||
| Vascular convergence | 1464 | 2115 | NA | |||||||
| 0.7052 | 0.701 | |||||||||
| Peripheral fibrosis | 3305 | 3064 | NA | |||||||
| 0.2076 | 0.2043 | |||||||||
| Nodules in primary lobe | NA | 425 | 57 | |||||||
| 0.1871 | 0.1836 | |||||||||
| Traditional Quantitative Features | Roundness | 1395 | 2510 | 160 | 20 | |||||
| 0.3 | 0.27 | 0.16 | 0.13 | |||||||
| 9b-3d circularity | 1395 | 1757 | 3401 | 2777 | 160 | 20 | ||||
| 0.24 | −0.234 | −0.2069 | −0.2069 | 0.14 | 0.13 | |||||
| L5W5L5 layer 1 | 51 | 66 | 163 | 476 | 928 | 547 | 169 | 265 | 309 | |
| 0.77 | 0.75 | 0.69 | 0.69 | 0.69 | 0.28 | 0.27 | 0.26 | 0.26 | ||