| Literature DB >> 28612050 |
Sebastian Echegaray1, Viswam Nair2,3,4, Michael Kadoch2, Ann Leung2, Daniel Rubin2,3, Olivier Gevaert3, Sandy Napel2.
Abstract
Quantitative imaging approaches compute features within images' regions of interest. Segmentation is rarely completely automatic, requiring time-consuming editing by experts. We propose a new paradigm, called "digital biopsy," that allows for the collection of intensity- and texture-based features from these regions at least 1 order of magnitude faster than the current manual or semiautomated methods. A radiologist reviewed automated segmentations of lung nodules from 100 preoperative volume computed tomography scans of patients with non-small cell lung cancer, and manually adjusted the nodule boundaries in each section, to be used as a reference standard, requiring up to 45 minutes per nodule. We also asked a different expert to generate a digital biopsy for each patient using a paintbrush tool to paint a contiguous region of each tumor over multiple cross-sections, a procedure that required an average of <3 minutes per nodule. We simulated additional digital biopsies using morphological procedures. Finally, we compared the features extracted from these digital biopsies with our reference standard using intraclass correlation coefficient (ICC) to characterize robustness. Comparing the reference standard segmentations to our digital biopsies, we found that 84/94 features had an ICC >0.7; comparing erosions and dilations, using a sphere of 1.5-mm radius, of our digital biopsies to the reference standard segmentations resulted in 41/94 and 53/94 features, respectively, with ICCs >0.7. We conclude that many intensity- and texture-based features remain consistent between the reference standard and our method while substantially reducing the amount of operator time required.Entities:
Keywords: CT; image processing; medical imaging; quantitative imaging; radiomics; segmentation
Year: 2016 PMID: 28612050 PMCID: PMC5466872 DOI: 10.18383/j.tom.2016.00163
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1.Cross-section through part-solid nodule in the right upper lobe (A), and its intersection with the reference standard 3-dimensional (3D) segmentation (B), and the 3D digital biopsy (C).
Figure 2.Cross-sections of digital biopsies obtained by applying morphological operations to the manual digital biopsy shown in Figure 1. The first and second rows show erosions and dilations, respectively.
Overlap Between Digital Biopsies and the Reference Standard Segmentation
| Method | Size | Mean (%) | SD (%) | Median (%) | Minimum (%) | Maximum (%) |
|---|---|---|---|---|---|---|
| Original digital biopsies | None | 74.04 | 10.77 | 76.65 | 25.26 | 85.23 |
| Erosions | 0.5 mm | 64.46 | 11.27 | 67.11 | 20.94 | 82.81 |
| 1.0 mm | 53.04 | 12.03 | 54.87 | 13.63 | 75.52 | |
| 1.5 mm | 42.33 | 13.11 | 42.61 | 8.47 | 69.61 | |
| Dilations | 0.5 mm | 77.73 | 10.38 | 79.49 | 27.11 | 88.58 |
| 1.0 mm | 78.21 | 9.66 | 80.36 | 28.22 | 90.94 | |
| 1.5 mm | 71.43 | 10.22 | 73.34 | 28.55 | 89.17 |
Abbreviation: SD, Standard deviation.
Figure 3.Distribution of the overlap of the reference standard segmentation and in order from top to bottom: the original biopsy, 0.5-mm erosion, 1.0-mm erosion, and 1.5-mm erosion. Statistics regarding the distributions are shown in Table 1.
Figure 4.Distribution of the overlap of the reference standard segmentation and in order from top to bottom: the original biopsy, 0.5-mm dilation, 1.0-mm dilation, and 1.5-mm dilation. Statistics regarding the distributions are shown in Table 1.
Figure 5.The intraclass correlation coefficient (ICC) curves for the features extracted from the digital biopsies and each of the morphological erosions compared with their reference standard segmentation. The features are organized in the descending order by their ICC value. Each line has been marked to indicate the number of features, with ICC > 0.7.
Figure 6.The ICC curves for the features extracted from the digital biopsies and each of the morphological dilations compared with their reference standard segmentation. The features are organized in the descending order by their ICC. The features are organized in the descending order by their ICC. Each line has been marked to indicate the number of features, with ICC > 0.7.
Features Above Thresholds of Agreement
| ICC | Original | Erosion | Dilation | ||||
|---|---|---|---|---|---|---|---|
| 0.5 mm | 1.0 mm | 1.5 mm | 0.5 mm | 1.0 mm | 1.5 mm | ||
| >0.9 | 47 | 6 | 2 | 1 | 27 | 15 | 8 |
| >0.8 | 74 | 56 | 18 | 4 | 64 | 51 | 24 |
| >0.7 | 84 | 68 | 60 | 41 | 88 | 89 | 53 |
| >0.6 | 93 | 74 | 67 | 62 | 92 | 91 | 84 |
The number of features presented in the table are out of 94 that presented ICC > 0.9, > 0.8, > 0.7, and > 0.6 in each of the digital biopsies and its morphological modifications when compared with the features extracted from the reference standard segmentation. Appendix 2 names and ranks the individual features with the highest ICCs across all 7 digital biopsy variations.
Figure 7.Boxplot of ICC of intensity features for original digital biopsies and their erosions/dilations compared with the reference standard segmentations. The Y-axis shows the ICC score and the X-axis is the morphological operation, with “−” and “+” representing that the segmentation underwent erosion and dilation, respectively.
Figure 8.Boxplot of ICC of texture features for original digital biopsies and their erosions/dilations compared with the reference standard segmentations. The Y-axis shows the ICC score and the X-axis is the morphological operation, with “−” and “+” representing that the segmentation underwent erosion and dilation, respectively.
Features of Important Descriptors of Tumoral Heterogeneity
| Feature Name | ICC > 0.7 | ICC > 0.8 | ICC > 0.9 |
|---|---|---|---|
| “Haralick D=3mm std sum of means” | 7 | 7 | 3 |
| “Intensity Entropy” | 7 | 6 | 5 |
| “Intensity Mean” | 7 | 6 | 3 |
| “Intensity Median” | 7 | 6 | 3 |
| “Intensity Trimmed Mean (25%)” | 7 | 6 | 3 |
| “Haralick D=1mm std energy” | 7 | 6 | 1 |
| “Haralick D=2mm std entropy” | 7 | 6 | 1 |
| “Haralick D=1mm std max probability” | 7 | 5 | 4 |
| “Haralick D=1mm std entropy” | 7 | 5 | 2 |
| “Haralick D=1mm mean cluster tendency” | 7 | 5 | 1 |
| “Haralick D=1mm std sum of means” | 7 | 5 | 1 |
| “Haralick D=2mm std sum of means” | 7 | 5 | 1 |
| “Haralick D=3mm std cluster shade” | 7 | 5 | 1 |
| “Haralick D=2mm std max probability” | 7 | 5 | 0 |
| “Haralick D=2mm mean cluster tendency” | 7 | 4 | 1 |
| “Haralick D=3mm mean cluster tendency” | 7 | 4 | 1 |
| “Haralick D=2mm std cluster shade” | 7 | 4 | 1 |
| “Haralick D=3mm std contrast” | 7 | 4 | 1 |
| “Haralick D=3mm std inertia” | 7 | 4 | 1 |
| “Haralick D=2mm std energy” | 7 | 4 | 0 |
| “Haralick D=1mm std variance” | 7 | 3 | 0 |
| “Haralick D=2mm std variance” | 7 | 1 | 0 |
| “Intensity Under −291 HU Percentage” | 6 | 5 | 3 |
| “Intensity Over −291 HU Percentage” | 6 | 5 | 3 |
| “Haralick D=1mm mean variance” | 6 | 5 | 2 |
| “Haralick D=2mm mean variance” | 6 | 5 | 2 |
| “Haralick D=3mm mean variance” | 6 | 5 | 2 |
| “Haralick D=3mm std max probability” | 6 | 5 | 0 |
| “Haralick D=2mm std contrast” | 6 | 4 | 2 |
| “Haralick D=2mm std inertia” | 6 | 4 | 2 |
| “Haralick D=1mm mean cluster shade” | 6 | 4 | 1 |
| “Haralick D=2mm mean cluster shade” | 6 | 4 | 1 |
| “Haralick D=1mm std cluster shade” | 6 | 4 | 1 |
| “Haralick D=2mm std cluster tendency” | 6 | 4 | 1 |
| “Haralick D=3mm std cluster tendency” | 6 | 4 | 1 |
| “Haralick D=3mm mean cluster shade” | 6 | 4 | 0 |
| “Haralick D=1mm std contrast” | 6 | 3 | 2 |
| “Haralick D=1mm std inertia” | 6 | 3 | 2 |
| “Haralick D=1mm mean entropy” | 6 | 3 | 0 |
| “Haralick D=2mm mean entropy” | 6 | 3 | 0 |
| “Haralick D=3mm std variance” | 6 | 3 | 0 |
| “Haralick D=3mm mean energy” | 6 | 2 | 0 |
| “Haralick D=3mm mean max probability” | 6 | 2 | 0 |
| “Haralick D=1mm mean energy” | 6 | 1 | 0 |
| “Haralick D=2mm mean energy” | 6 | 1 | 0 |
| “Haralick D=2mm mean max probability” | 6 | 1 | 0 |
| “Haralick D=3mm std entropy” | 5 | 5 | 1 |
| “Intensity Skewness” | 5 | 4 | 2 |
| “Intensity Min” | 5 | 4 | 2 |
| “Haralick D=1mm mean contrast” | 5 | 4 | 2 |
| “Haralick D=1mm mean inertia” | 5 | 4 | 2 |
| “Haralick D=1mm std cluster tendency” | 5 | 4 | 1 |
| “Haralick D=2mm mean contrast” | 5 | 3 | 2 |
| “Haralick D=2mm mean inertia” | 5 | 3 | 2 |
| “Haralick D=3mm mean contrast” | 5 | 3 | 1 |
| “Haralick D=3mm mean inertia” | 5 | 3 | 1 |
| “Haralick D=1mm std homogeneity” | 5 | 3 | 1 |
| “Haralick D=2mm std homogeneity” | 5 | 3 | 1 |
| “Haralick D=3mm mean entropy” | 5 | 3 | 0 |
| “Haralick D=3mm std homogeneity” | 5 | 3 | 0 |
| “Haralick D=1mm mean max probability” | 5 | 1 | 0 |
| “Intensity Mean Absolute Difference” | 4 | 4 | 1 |
| “Intensity Standard Deviation” | 4 | 4 | 0 |
| “Intensity Interquartile Difference “ | 4 | 2 | 1 |
| “Intensity Range” | 4 | 2 | 0 |
| “Intensity Max” | 4 | 2 | 0 |
| “Haralick D=3mm std energy” | 4 | 1 | 0 |
| “Haralick D=1mm mean sum of means” | 4 | 0 | 0 |
| “Haralick D=2mm mean sum of means” | 4 | 0 | 0 |
| “Haralick D=3mm mean sum of means” | 4 | 0 | 0 |
| “Haralick D=1mm mean correlation” | 3 | 1 | 0 |
| “Intensity Kurtosis” | 3 | 0 | 0 |
| “Haralick D=1mm mean homogeneity” | 3 | 0 | 0 |
| “Haralick D=1mm mean inverse variance” | 3 | 0 | 0 |
| “Haralick D=2mm mean homogeneity” | 3 | 0 | 0 |
| “Haralick D=2mm mean inverse variance” | 3 | 0 | 0 |
| “Haralick D=3mm mean homogeneity” | 3 | 0 | 0 |
| “Haralick D=3mm mean inverse variance” | 3 | 0 | 0 |
| “Haralick D=2mm std correlation” | 3 | 0 | 0 |
| “Haralick D=1mm std correlation” | 2 | 1 | 0 |
| “Haralick D=3mm std inverse variance” | 2 | 1 | 0 |
| “Haralick D=2mm mean correlation” | 2 | 0 | 0 |
| “Haralick D=3mm mean correlation” | 2 | 0 | 0 |
| “Haralick D=2mm std inverse variance” | 2 | 0 | 0 |
| “Intensity Harmonic Mean” | 1 | 1 | 1 |
| “Intensity Mode” | 1 | 1 | 0 |
| “Haralick D=3mm std correlation” | 1 | 0 | 0 |
Abbreviation: ICC, Intra-class correlation.
The features mentioned were computed for each of the 7 digital biopsies (original, 3 erosions, and 3 dilations) and the number of times their ICC compared with the reference standard was higher than 0.9, 0.8, and 0.7, ranked by ICC. Features that never scored >0.7 are not shown.
List of Haralick Features Extracted from the GLCM
| Features | References |
|---|---|
| Energy | ( |
| Contrast | ( |
| Sum of Means | ( |
| Cluster Tendency | ( |
| Entropy | ( |
| Homogeneity | ( |
| Inertia | ( |
| Max Probability | ( |
| Correlation | ( |
| Variance | ( |
| Cluster Shade | ( |
| Inverse Variance | ( |
Abbreviation: GLCM, gray-level co-occurrence matrix.
Details of the implementation of each of these features can be found in the references mentioned alongside the features.