| Literature DB >> 30131973 |
Shih-Ying Huang1, Benjamin L Franc1, Roy J Harnish1, Gengbo Liu2, Debasis Mitra2, Timothy P Copeland1, Vignesh A Arasu1, John Kornak3, Ella F Jones1, Spencer C Behr1, Nola M Hylton1, Elissa R Price1, Laura Esserman1,4, Youngho Seo1,5,6.
Abstract
Radiomics is an emerging technology for imaging biomarker discovery and disease-specific personalized treatment management. This paper aims to determine the benefit of using multi-modality radiomics data from PET and MR images in the characterization breast cancer phenotype and prognosis. Eighty-four features were extracted from PET and MR images of 113 breast cancer patients. Unsupervised clustering based on PET and MRI radiomic features created three subgroups. These derived subgroups were statistically significantly associated with tumor grade (p = 2.0 × 10-6), tumor overall stage (p = 0.037), breast cancer subtypes (p = 0.0085), and disease recurrence status (p = 0.0053). The PET-derived first-order statistics and gray level co-occurrence matrix (GLCM) textural features were discriminative of breast cancer tumor grade, which was confirmed by the results of L2-regularization logistic regression (with repeated nested cross-validation) with an estimated area under the receiver operating characteristic curve (AUC) of 0.76 (95% confidence interval (CI) = [0.62, 0.83]). The results of ElasticNet logistic regression indicated that PET and MR radiomics distinguished recurrence-free survival, with a mean AUC of 0.75 (95% CI = [0.62, 0.88]) and 0.68 (95% CI = [0.58, 0.81]) for 1 and 2 years, respectively. The MRI-derived GLCM inverse difference moment normalized (IDMN) and the PET-derived GLCM cluster prominence were among the key features in the predictive models for recurrence-free survival. In conclusion, radiomic features from PET and MR images could be helpful in deciphering breast cancer phenotypes and may have potential as imaging biomarkers for prediction of breast cancer recurrence-free survival.Entities:
Year: 2018 PMID: 30131973 PMCID: PMC6095872 DOI: 10.1038/s41523-018-0078-2
Source DB: PubMed Journal: NPJ Breast Cancer ISSN: 2374-4677
A summary of patient demographic characteristics is shown
| Characteristics ( | Type | No. of patients (%) |
|---|---|---|
| Tumor Histology ( | Ductual or lobular carcinoma in situ | 5 (4.5) |
| Invasive ductal carcinoma (IDC) | 98 (88.3) | |
| Invasive lobular carcinoma (ILC) | 5 (4.5) | |
| Mixed IDC and ILC | 3 (2.7) | |
| Tumor Grade ( | 1 (well differentiated) | 15 (14.4) |
| 2 (moderately differentiated) | 57 (54.8) | |
| 3 (poorly differentiated) | 32 (30.8) | |
| T stage ( | T0 | 32 (31.4) |
| T1 | 33 (32.4) | |
| T2 | 27 (26.5) | |
| T3 | 10 (9.8) | |
| N stage ( | N0 | 62 (61.4) |
| N1 | 32 (31.7) | |
| N2 | 4 (4.0) | |
| N3 | 3 (3.0) | |
| Overall stage ( | 0 | 33 (31.7) |
| IA, IB, IIA | 42 (40.4) | |
| IIB | 14 (13.5) | |
| IIIA, IIIB, IIIC | 13 (12.5) | |
| IV | 2 (1.9) | |
| Breast cancer subtype ( | HR + /HER2− | 56 (52.3) |
| HR + /HER2+ | 15 (14.0) | |
| HR-/HER2+ | 15 (14.0) | |
| HR-/HER2− | 21 (19.6) | |
| Disease recurrence ( | No recurrence | 81 (71.1) |
| Recur | 23 (20.2) | |
| Never disease free | 10 (8.8) | |
| Recurrence site ( | No recurrence | 61 (84.7) |
| Local recurrence | 1 (1.4) | |
| Distant recurrence | 10 (14.9) | |
| Recurrence free in 1 year ( | Recurrence free | 75 (88.2) |
| Not Recurrence free | 10 (11.8) | |
| Recurrence free in 2 years ( | Recurrence free | 68 (80.0) |
| Not Recurrence free | 17 (20.0) | |
| Recurrence free in 3 years ( | Recurrence free | 67 (78.8) |
| Not Recurrence free | 18 (21.2) | |
| Recurrence free in 4 years ( | Recurrence free | 65 (76.5) |
| Not Recurrence free | 20 (23.5) | |
| Recurrence free in 5 years ( | Recurrence free | 60 (70.6) |
| Not Recurrence free | 25 (29.4) |
For breast cancer subtype definition, HR+ denotes tumors with ER+ or PR+
A summary of χ2 test statistics (p-value and Cramer’s V), median cluster consensus (CC), and the optimal clustering algorithm is listed to describe the degree of association between the patient clusters with a given clinical feature
| Clinical variable | Clustering algorithm | # of samples | Cramer’s V | Median CC | |
|---|---|---|---|---|---|
| Tumor grade | HC, Spearman | 104 | 2.02 × 10−6a | 0.39 | 0.72 |
| Tumor histology | PAM, Euc | 111 | 0.084 | 0.22 | 0.94 |
| T-stage | HC, Spearman | 102 | 0.19 | 0.21 | 0.77 |
| N-stage | KMdist, Spearman | 101 | 0.14 | 0.22 | 0.73 |
| Overall stage | PAM, Pearson | 104 | 0.037a | 0.28 | 0.83 |
| Breast cancer subtype | HC, Spearman | 107 | 0.0085a | 0.28 | 0.77 |
| Disease recurrence | KMdist, Spearman | 114 | 0.0053a | 0.25 | 0.73 |
| Recurrence site | PAM, Pearson | 72 | 0.19 | 0.21 | 0.86 |
aindicates there is statistical significance for the χ2 test of independence at the 5% level
Fig. 1PET and MR radiomics vs. tumor grade heatmap. a A heatmap of the PET and MR radiomic features is shown with the corresponding tumor grade and the tumor clusters resulted from the optimized consensus clustering. Each column represents a tumor and each row represents a radiomic feature. The PET and MR radiomic features are shown as z-scores. b The proportion of different grade tumors is shown for each tumor cluster. The frequency is shown with respect to the total number of tumors in each tumor cluster category. c The proportion of different tumor clusters is shown for each tumor grade category. The frequency is shown with respect to the total number of tumors in each tumor grade category
Fig. 2PET and MR radiomics vs. tumor overall stage heatmap. a A heatmap of the PET and MR radiomic features is shown with the corresponding tumor overall stage and the tumor clusters resulted from the optimized consensus clustering. b The proportion of different tumor overall stages is shown for each tumor cluster category. The frequency is shown with respect to the total number of tumors in each tumor cluster category. c The proportion of different tumor clusters is shown for each tumor overall stage category. The frequency is shown with respect to the total number of tumors in each tumor overall stage category
Fig. 3PET and MR radiomics vs. breast cancer subtype heatmap. a A heatmap of the PET and MR radiomic features is shown with the corresponding breast cancer subtype and the tumor clusters resulted from the optimized consensus clustering. b The proportion of breast cancer subtypes is shown for each tumor cluster. The frequency is shown with respect to the total number of tumors in each tumor cluster category. c The proportion of different tumor clusters is shown for each breast cancer subtype. The frequency is shown with respect to the total number of tumors in each breast cancer subtype category
Fig. 4PET and MR radiomics vs. disease recurrence status heatmap. a A heatmap of the PET and MR radiomic features is shown with the corresponding disease recurrence status and the tumor clusters resulted from the optimized consensus clustering. b The proportion of different disease recurrence categories is shown for each tumor cluster. The frequency is shown with respect to the total number of tumors in each tumor cluster category. c The proportion of different tumor clusters is shown for each disease recurrence category. The frequency is shown with respect to the total number of tumors in each disease recurrence category
Fig. 5Pairwise relationship of radiomics with breast cancer outcome. a A heatmap of Spearman’s rank correlation coefficients (ρ) between the PET and MR radiomic features and the ordered clinical outcome is shown. Only the radiomic features with |ρ| > 0.2 are displayed. b A heatmap of proportion of variance from multiple regression () between the PET and MR radiomic features and the unordered clinical outcome is illustrated. Only the radiomic features with > 0.04 are shown
Fig. 6Heatmap of the predictive performance of radiomics to breast cancer outcome. A heatmap depicts the classification performance in AUC and 95% confidence interval for several classification algorithms at predicting recurrence-free duration of 1–5 years and tumor grade. SVM denotes support vector machine. The classification name for logistic regression is defined as [Reg][Solver]LogReg, where [Reg] specifies the regularization scheme and [Solver] is the solver algorithm. For example, L1LiblinearLogReg denotes logistic regression with L1-regularization using Liblinear solver
The feature importance of the repeated nested cross-validation with optimal logistic regression algorithm with PET and MR radiomic features set is summarized
| Outcome | Important features |
|---|---|
| Disease free in 1 year (ElasticNet) | MR GLCM IDN (99.1%) |
| MR GLCM IDMN (84.1%) | |
| PET GLCM cluster prominence (83.0%) | |
| MR entropyHIST (81.5%) | |
| MRI mean intensity (77.5%) | |
| MR GLCM sum entropy (76.2%) | |
| MR GLCM sum average (74.7%) | |
| MR GLCM average intensity (74.7%) | |
| MR minimum intensity (73.9%) | |
| MR GLCM difference entropy (72.0%) | |
| Disease free in 2 years (ElasticNet) | MR mean intensity (98.2%) |
| MR GLCM sum average (98.1%) | |
| MR GLCM average intensity (98.1%) | |
| MR minimum intensity (96.6%) | |
| MR maximum intensity (89.4%) | |
| MR GLCM IDN (87.5%) | |
| MR GLCM difference average (87.1%) | |
| MR GLCM dissimilarity (87.1%) | |
| PET SUVmin (86.3%) | |
| MR tumor compactness2 (84.3%) | |
| Disease free in 3 years (ElasticNet) | MRI mean intensity (98.9%) |
| MR GLCM sum average (98.4%) | |
| MR GLCM average intensity (98.4%) | |
| MR minimum intensity (96.8%) | |
| MR GLCM difference average (85.0%) | |
| MR GLCM dissimilarity (85.0%) | |
| MR maximum intensity (84.8%) | |
| MR tumor compactness2 (83.6%) | |
| PET tumor compactness2 (83.2%) | |
| PET SUVmin (81.7%) | |
| Disease free in 4 years (ElasticNet) | MR minimum intensity (94.3%) |
| MR mean intensity (93.2%) | |
| MR GLCM sum average (91.3%) | |
| MR GLCM average intensity (91.3%) | |
| PET GLCM cluster prominence (85.6%) | |
| MR GLCM IMC2 (85.5%) | |
| PET tumor compactness2 (82.6%) | |
| MR maximum intensity (79.5%) | |
| MR tumor compactness2 (79.2%) | |
| MR GLCM IDN (77.9%) | |
| Disease free in 5 years (ElasticNet) | MR minimum intensity (92.0%) |
| PET GLCM cluster prominence (79.8%) | |
| PET GLCM IDN (78.7%) | |
| MR GLCM IMC2 (78.4%) | |
| PET tumor maximum 3D diameter (77.1%) | |
| MR mean intensity (74.6%) | |
| MR GLCM sum average (70.2%) | |
| MR GLCM average intensity (70.2%) | |
| MR GLCM IDN (69.8%) | |
| MR energyHIST (69.0%) | |
| Binary Tumor Grade (L2LbfgsLogReg) | PET GLCM inverse variance (90.6%) |
| PET GLCM homogeneity1 (85.6%) | |
| PET GLCM homogeneity2 (83.7%) | |
| PET EntropyHIST (79.5%) | |
| PET GLCM sum average (78.4%) | |
| PET GLCM average intensity (78.4%) | |
| PET SUVmean (78.2%) | |
| PET GLCM entropy (76.5%) | |
| PET GLCM sum entropy (72.4%) | |
| PET GLCM difference average (70.3%) |
The number in () is the proportion of the number of times that the feature was considered ‘important’ during the repeated nested CV out of the maximum number of CVs (3000)
A summary describing the radiomic features extracted from the PET and MR images are shown
| Feature type | Feature name | Description |
|---|---|---|
| First-order statistics (FOstats) | Min, max | Minimum and maximum of the image intensity values |
| Mean, variance | ||
| Skewness | Measure of lopsidedness of the intensity distribution | |
| Kurtosis | Measure of the heaviness of the tail of the intensity distribution | |
| EntropyHIST | Measure of randomness in an image | |
| EnergyHIST | ||
| UniformityHIST | Degree of image intensity having similar probability | |
| Shape and size (SS) | Volume | |
| Compactness1 and Compactness2 | As a function of volume and surface area | |
| Maximum 3D diameter | The largest pairwise Euclidean distance between voxels on the tumor surface | |
| Spherical disproportion | Degree of similarity in surface area between the shape and that with a radius of a sphere with the same volume as the tumor | |
| Sphericity | ||
| Surface area | ||
| Surface-to-volume ratio | ||
| Texture (TX) | Autocorrelation | Measure of texture fineness and coarseness |
| Cluster prominence | Measure of image asymmetry of the GLCM | |
| Cluster shade | Measure of the skewness of the GLCM | |
| Cluster tendency | Measure of voxel clusters of similar gray-level values | |
| Contrast | Measure of the local variations presented in the image | |
| Correlation | Measure of the linear dependency of image intensity of the neighboring voxels | |
| Difference entropy | Measure of the variability in neighboring intensity value differences | |
| Difference average | Relationships between voxel clusters with similar intensity values and voxel clusters with different intensity values | |
| Difference variance | Measure of heterogeneity | |
| Average intensity | The mean gray level intensity of the GLCM vertical or horizontal distribution | |
| dissimilarity | ||
| EnergyGLCM | Measure of homogeneity of an image | |
| EntropyGLCM | Measure of image texture randomness | |
| Homogeneity1 and Homogeneity2 | ||
| Inverse difference moment normalized (IDMN) and inverse difference normalized (IDN) | Measure of the local homogeneity of an image | |
| Inverse variance | ||
| Maximum probability | The number of most occurred pair of neighboring intensity values | |
| Sum average | Average value of the GLCM | |
| Sum entropy | Measure of randomness of the GLCM | |
| Sum variance | High weight on the elements different from the GLCM average value | |
| Sum squares | Measure of the neighboring intensity level pairs about the mean GLCM intensity level | |
| IMC1 and IMC2 |