| Literature DB >> 35448725 |
Harald Keller1,2, Tina Shek2, Brandon Driscoll2, Yiwen Xu1, Brian Nghiem2, Sadek Nehmeh3, Milan Grkovski4, Charles Ross Schmidtlein4, Mikalai Budzevich5, Yoganand Balagurunathan5, John J Sunderland6, Reinhard R Beichel7, Carlos Uribe8,9, Ting-Yim Lee10, Fiona Li11, David A Jaffray1,2, Ivan Yeung1,2.
Abstract
For multicenter clinical studies, characterizing the robustness of image-derived radiomics features is essential. Features calculated on PET images have been shown to be very sensitive to image noise. The purpose of this work was to investigate the efficacy of a relatively simple harmonization strategy on feature robustness and agreement. A purpose-built texture pattern phantom was scanned on 10 different PET scanners in 7 institutions with various different image acquisition and reconstruction protocols. An image harmonization technique based on equalizing a contrast-to-noise ratio was employed to generate a "harmonized" alongside a "standard" dataset for a reproducibility study. In addition, a repeatability study was performed with images from a single PET scanner of variable image noise, varying the binning time of the reconstruction. Feature agreement was measured using the intraclass correlation coefficient (ICC). In the repeatability study, 81/93 features had a lower ICC on the images with the highest image noise as compared to the images with the lowest image noise. Using the harmonized dataset significantly improved the feature agreement for five of the six investigated feature classes over the standard dataset. For three feature classes, high feature agreement corresponded with higher sensitivity to the different patterns, suggesting a way to select suitable features for predictive models.Entities:
Keywords: PET radiomics features; feature agreement; image harmonization; repeatability; reproducibility
Mesh:
Year: 2022 PMID: 35448725 PMCID: PMC9025788 DOI: 10.3390/tomography8020091
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1Texture phantom used for this study. (A) Photographs of the manufactured cylindrical texture compartments inserted into a standard NEMA phantom shell. (B) Each compartment mimics a different distribution of radiotracer uptake due to a different structural pattern of blocked space. (C) Axial and coronal CT and PET images of the phantom.
GE = GE Medical Systems (Waukesha, WI, USA). Siemens = Siemens Medical Solutions USA, Inc. Malvern, PA, USA.
| Scanner | Manufacturer | Manufacturer Model Name | TOF Capability |
|---|---|---|---|
| 1 | GE | Discovery 600 | No |
| 2 | GE | Discovery 690 | Yes |
| 3 | GE | Discovery MI | Yes |
| 4 | GE | Discovery STE | No |
| 5 | GE | Discovery 710 | Yes |
| 6 | GE | Discovery MI DR | Yes |
| 7 | GE | Discovery 610 | No |
| 8 | Siemens | Biograph40_mCT | Yes |
| 9 | Siemens | Biograph_mMR | No |
| 10 | Siemens | Biograph64_mCT | Yes |
Reconstruction parameters and contrast-to-noise ratios for the standard dataset (10 PET images). ScNo = scanner number from Table 1. TOF = time of flight acquisition, ZFilter = post-reconstruction filter, NIt = number of iterations, NSubS = number of subsets, FOV = field of view, M = matrix size, P = pixel spacing (pixel size), S = slice thickness, Time = acquisition time (per bed position), CNR = contrast-to-noise ratio according to Equation (1).
| ScNo | TOF | ZFilter | NIt | NSubS | FOV (cm) | M | P (mm) | S (mm) | Time (min) | CNR |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | No | 6.4 | 2 | 32 | 70 | 192 | 3.65 | 3.27 | 2.5 | 4.62 |
| 2 | No | 6.4 | 2 | 32 | 70 | 192 | 3.65 | 3.27 | 2.5 | 4.50 |
| 3 | Yes | 5 | 3 | 16 | 50 | 256 | 1.95 | 2.79 | 3 | 4.38 |
| 4 | No | 4 | 2 | 20 | 50 | 128 | 3.91 | 3.27 | 5 | 3.47 |
| 5 | Yes | 6.4 | 2 | 16 | 70 | 128 | 5.47 | 3.27 | 3 | 10.11 |
| 6 | Yes | 6.4 | 2 | 32 | 70 | 192 | 3.65 | 3.27 | 3 | 7.02 |
| 7 | No | 6.4 | 2 | 32 | 50 | 256 | 1.95 | 3.27 | 5 | 6.39 |
| 8 | Yes | 5 | 3 | 21 | 81.5 | 256 | 3.18 | 5 | 2 | 4.37 |
| 9 | No | 6 | 3 | 21 | 71.8 | 256 | 2.80 | 2.03 | 10 | 8.39 |
| 10 | Yes | 5 | 2 | 21 | 81.5 | 200 | 4.07 | 5 | 5 | 9.26 |
| CNR Mean | 6.25 | |||||||||
| CNR Standard Deviation | 2.23 | |||||||||
Contrast-to-noise ratios for the repeatability dataset (12 PET images). Time = acquisition time (per bed position), R = reconstruction, CNR = contrast-to-noise ratio according to Equation (1).
| Time (min) | CNR | |||||
|---|---|---|---|---|---|---|
| R1 | R2 | R3 | R4 | Mean | Std Dev | |
| 10 | 7.74 | 8.09 | 8.41 | 8.01 | 8.06 | 0.28 |
| 5 | 5.61 | 5.83 | 5.80 | 5.45 | 5.67 | 0.18 |
| 2 | 3.86 | 3.71 | 3.70 | 3.53 | 3.70 | 0.13 |
Figure 2Single slices of the standard (A) and harmonized (B) datasets. The order of the images follows the order in Table 1 (scanner 1 to 10 from top left to bottom right). The images were normalized by the mean activity of their respective background (Pattern-ROI-5). The displayed color scale is identical for all images and ranges from 0 (black) to 1.22 (red).
Figure 3Intraclass coefficient ICC(1,1) for all radiomics features from the PET images in the repeatability dataset. For each feature, three values of ICC(1,1) are plotted: binning time 10 min (green), 5 min (red), and 2 min (black). Error bars indicate the 95% confidence intervals of the ICC values. The results are sorted in descending value of ICC(1,1) for the 2 min binning time (black circles). The feature class of each feature is indicated by the color of the tiles (dark blue: first order, light blue: GLCM, cyan: GLDM, yellow: GLRLM, orange: GLSZM, brown: NGTDM).
Summary of datasets for the reproducibility analysis.
| Label | Dataset | Scanners | Number PET Images | Average CNR +/− 1 SD |
|---|---|---|---|---|
| F | Full dataset | 1–10 | 68 | 7.53 ± 2.38 |
| S10 | Standard dataset | 1–10 | 10 | 6.25 +/− 2.23 |
| H10 | Harmonized Dataset | 1–10 | 10 (*) | 7.29 +/− 0.52 |
| S6 | Subset of S10 | 1,2,3,5,8,10 | 6 | 6.21 ± 2.71 |
| H6 | Subset of H10 | 1,2,3,5,8,10 | 6 (*) | 7.38 ± 0.25 |
(*) closest to average CNR of F.
Figure 4ICC(2,1) values for all features for the reproducibility datasets S10, H10, S6, and H6 (Table 4), ranked by decreasing value of the H10 dataset (solid black). The feature class of each feature is indicated by the color of the tiles (dark blue: first order, light blue: GLCM, cyan: GLDM, yellow: GLRLM, orange: GLSZM, brown: NGTDM).
Figure 5ICC(2,1) values grouped by radiomics feature class. A star (*) indicates statistical significance according to the paired Wilcoxon signed-rank test.
Figure 6Feature reproducibility across different scanners (ICC(2,1)) versus pattern sensitivity (inter-ROI standard deviation) for all features in the standard (A) and harmonized (B) datasets.
Reconstruction parameters and contrast-to-noise ratios for the harmonized dataset (10 PET images). ScNo = scanner number according to Table 1. TOF = time of flight scanner, ZFilter = post-reconstruction Zfilter, NIt = number of iterations, NSubS = number of subsets, FOV = field of view, M = matrix size, P = pixel spacing (pixel size), S = slice thickness, Time = acquisition time (per bed position), CNR = contrast-to-noise ratio according to Equation (1).
| ScNo | TOF | ZFilter | NIt | NSubS | FoV (cm) | M | P (mm) | S (mm) | Time (min) | CNR |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | No | 6.4 | 3 | 16 | 50 | 256 | 1.95 | 3.27 | 5 | 7.29 |
| 2 | Yes | 4.6 | 3 | 16 | 50 | 256 | 1.95 | 3.27 | 5 | 7.09 |
| 3 | Yes | 6.4 | 3 | 16 | 50 | 256 | 1.95 | 2.79 | 5 | 7.29 |
| 4 | No | 6 | 3 | 14 | 50 | 128 | 3.91 | 3.27 | 10 | 6.3 |
| 5 | Yes | 6.4 | 3 | 16 | 50 | 256 | 1.95 | 3.27 | 3 | 7.64 |
| 6 | Yes | 6.4 | 2 | 32 | 70 | 192 | 3.65 | 3.27 | 3 | 7.02 |
| 7 | No | 6.4 | 2 | 32 | 50 | 128 | 3.91 | 3.27 | 5 | 6.95 |
| 8 | Yes | 6 | 3 | 21 | 81.5 | 256 | 3.18 | 5 | 5 | 7.25 |
| 9 | No | 6 | 3 | 21 | 71.8 | 256 | 2.80 | 2.03 | 10 | 8.39 |
| 10 | Yes | 6 | 3 | 21 | 50.9 | 256 | 1.99 | 3 | 5 | 7.72 |
| CNR Mean | 7.29 | |||||||||
| CNR Standard Deviation | 0.52 | |||||||||