| Literature DB >> 28642480 |
Patrik Brynolfsson1, David Nilsson2, Turid Torheim3, Thomas Asklund4, Camilla Thellenberg Karlsson4, Johan Trygg2, Tufve Nyholm4, Anders Garpebring4.
Abstract
In recent years, texture analysis of medical images has become increasingly popular in studies investigating diagnosis, classification and treatment response assessment of cancerous disease. Despite numerous applications in oncology and medical imaging in general, there is no consensus regarding texture analysis workflow, or reporting of parameter settings crucial for replication of results. The aim of this study was to assess how sensitive Haralick texture features of apparent diffusion coefficient (ADC) MR images are to changes in five parameters related to image acquisition and pre-processing: noise, resolution, how the ADC map is constructed, the choice of quantization method, and the number of gray levels in the quantized image. We found that noise, resolution, choice of quantization method and the number of gray levels in the quantized images had a significant influence on most texture features, and that the effect size varied between different features. Different methods for constructing the ADC maps did not have an impact on any texture feature. Based on our results, we recommend using images with similar resolutions and noise levels, using one quantization method, and the same number of gray levels in all quantized images, to make meaningful comparisons of texture feature results between different subjects.Entities:
Year: 2017 PMID: 28642480 PMCID: PMC5481454 DOI: 10.1038/s41598-017-04151-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Changes in texture feature distributions with different imaging and pre-processing settings. The box plots show the distribution of contrast, correlation, energy, entropy and homogeneity for the 72 ROIs in the glioma data set. The box shows the first and third quartiles, with the median value indicated by the center line. The whiskers show the extreme values. An asterisk in the upper left corner indicates that at least one pair of settings is significantly different.
Figure 2Probabilities that texture features are unaffected by changes in imaging or pre-processing steps. Heatmaps showing the probability (p-value) that all settings for a given parameter give the same texture feature values. The dots represent significant changes at the α = 0.01 level, with Bonferroni corrections. (a) Shows the result from the glioma data set, (b) from the prostate cancer data set.
Haralick texture features calculated from GLCMs. There was an error in the definition of Sum variance in Haralick et al.[1], which has been corrected.
| Feature | Equation | Ref. |
|---|---|---|
| Autocorrelation |
|
|
| Cluster Prominence |
|
|
| Cluster shade |
|
|
| Contrast |
|
|
| Correlation |
|
|
| Difference entropy |
|
|
| Difference variance |
|
|
| Dissimilarity |
|
|
| Energy |
|
|
| Entropy |
|
|
| Homogeneity |
|
|
| Information measure of correlation 1 |
|
|
| Information measure of correlation 2 |
|
|
| Inverse difference |
|
|
| Maximum probability |
|
|
| Sum average, |
|
|
| Sum entropy |
|
|
| Sum of squares |
|
|
| Sum variance |
|
|
Figure 6The span of ADC values in the data sets. Boxplots of ADC minimum and maximum values as well as the range of ADC values within each tumor for the glioma data set and the prostate cancer data set respectively.
Percentage change in texture features when expanding the ROI by one voxel.
| AutoROI (%) | Manual (%) | Manual/AutoROI | |
|---|---|---|---|
| Autocorrelation | −40.8 | 4.59 | 0.112 |
| Cluster Prominence | −53.9 | 111 | 2.05 |
| Cluster shade | −8840 | 286 | −0.0323 |
| Contrast | −43.9 | 26.1 | −0.595 |
| Correlation | −5.91 | −5.31 | 0.898 |
| Difference entropy | −9.69 | 3.45 | −0.356 |
| Difference variance | −38.5 | 35.8 | −0.930 |
| Dissimilarity | −0.276 | 0.0886 | −0.321 |
| Energy | 57.8 | 30.3 | 0.525 |
| Entropy | −6.32 | 1.50 | −0.237 |
| Homogeneity | 38.8 | 3.53 | 0.0910 |
| Information measure of correlation 1 | −25.8 | −11.3 | 0.437 |
| Information measure of correlation 2 | −10.1 | −2.95 | 0.293 |
| Inverse difference | 25.1 | 1.13 | 0.0449 |
| Maximum probability | 0.422 | 25.5 | 60.5 |
| Sum average, | −22.5 | −4.19 | 0.186 |
| Sum entropy | −8.18 | 0.386 | −0.0472 |
| Sum of squares | −48.2 | 17.6 | −0.366 |
| Sum variance | −49.3 | 15.4 | −0.312 |
The sensitivity of AutoROI and a manual level of 500 and 1500 mm2/s to the definition of the ROI for a patient with a tumor close to the left lateral ventricle, in the slice shown in Fig. 3.
Figure 3The effect of ROI uncertainties to the texture features. The delineated glioma in a slice near the left lateral ventricle in a 73 year old male, from which the variations in the texture features were calculated in Table 1. The colormap shows the ADC map, fused on the T1-weighted contrast enhanced MPRAGE. An expansion or a shift by one voxel can include CSF in the ROI, which will increase the minimum and maximum values in the ROI, and will have an effect on the resulting texture features. The manual quantization method is less sensitive to this shift.
Figure 5The effect of using different minimum and maximum values when quantizing the image. The images show how different minimum and maximum values influence the result when quantizing the original image, prior to constructing the GLCM. (a) Shows the original image with 4096 gray levels. In (b) the image has been quantized to 8 gray levels, and the minimum and maximum gray levels have been set to that of the ROI, dashed outline. In (c), the image has been quantized to 8 gray levels and minimum and maximum gray levels have been set based on the entire image. There are large regions of uniform gray levels in (c), the texture is very different compared to (b), and the only difference is how the maximum and minimum gray levels were chosen.
Figure 4A description of how Haralick’s texture features are calculated. In an example 4 × 4 image ROI, three gray levels are represented by numerical values from 1 to 3. The GLCM is constructed by considering the relation of each voxel with its neighborhood. In this example we only look at the neighbor to the right. The GLCM acts like a counter for every combination of gray level pairs in the image. For each voxel, its value and the neighboring voxel value are counted in a specific GLCM element. The value of the reference voxel determines the column of the GLCM and the neighbor value determines the row. In this ROI, there are two instances when a reference voxel of 3 “co-occurs” with a neighbor voxel of 2, indicated in solid blue, and there is one instance of a reference voxel of 3 with a neighbor voxel of 1, indicated in dashed red. The normalized GLCM represents the frequency or probability of each combination to occur in the image. The Haralick texture features are functions of the normalized GLCM, where different aspects of the gray level distribution in the ROI are represented. For example, diagonal elements in the GLCM represent voxels pairs with equal gray levels. The texture feature “contrast” gives elements with similar gray level values a low weight but elements with dissimilar gray levels a high weight. It is common to add GLCMs from opposite neighbors (e.g. left-right or up-down) prior to normalization. This generates symmetric GLCMs, since each voxel has been the neighbor and the reference in both directions. The GLCMs and texture features then reflect the “horizontal” or “vertical” properties of the image. If all neighbors are considered when constructing the GLCM, the texture features are direction invariant.
Variables and notation used to calculate Haralick features.
The pre-processing work flow and settings for each investigated parameter for the glioma data.
| Investigated parameter | Step 1: Resolution of DWI | Step 2: Noise in DWI | Step 3: Calculate ADC | Step 4: Select GLCM size | Step 5: Quantize image |
|---|---|---|---|---|---|
| Resolution | 1.2 mm2 (1.0×) |
| 200–1000 s/mm2 | 32 | AutoROI |
| 1.8 mm2 (1.5×) | |||||
| 3.6 mm2 (3.0×) | |||||
| Noise | 1.2 mm2 |
| 200–1000 s/mm2 | 32 | AutoROI |
|
| |||||
|
| |||||
| Diffusion b-values | 1.2 mm2 |
| 200–1000 s/mm2 | 32 | AutoROI |
| 0–1000 s/mm2 | |||||
| 200,1000 s/mm2 | |||||
| 200,1000 s/mm2 | |||||
| Gray levels | 1.2 mm2 |
| 200–1000 s/mm2 | 8 | AutoROI |
| 16 | |||||
| 32 | |||||
| 64 | |||||
| 128 | |||||
| Quantization method | 1.2 mm2 |
| 200–1000 s/mm2 | 32 | AutoROI |
| AutoSlice | |||||
| Manual |
Each row represents the work flow of one investigated parameter. The prostate cancer data set used a similar work flow, where the native resolution was 1.625 mm2, the native noise standard deviation was σ = 2.5, and the ADC was calculated using 0, 800 s/mm2 only.