| Literature DB >> 34885124 |
Hyun-Soo Park1, Kwang-Sig Lee2, Bo-Kyoung Seo1, Eun-Sil Kim1, Kyu-Ran Cho3, Ok-Hee Woo4, Sung-Eun Song3, Ji-Young Lee5, Jaehyung Cha1,6.
Abstract
This prospective study enrolled 147 women with invasive breast cancer who underwent low-dose breast CT (80 kVp, 25 mAs, 1.01-1.38 mSv) before treatment. From each tumor, we extracted eight perfusion parameters using the maximum slope algorithm and 36 texture parameters using the filtered histogram technique. Relationships between CT parameters and histological factors were analyzed using five machine learning algorithms. Performance was compared using the area under the receiver-operating characteristic curve (AUC) with the DeLong test. The AUCs of the machine learning models increased when using both features instead of the perfusion or texture features alone. The random forest model that integrated texture and perfusion features was the best model for prediction (AUC = 0.76). In the integrated random forest model, the AUCs for predicting human epidermal growth factor receptor 2 positivity, estrogen receptor positivity, progesterone receptor positivity, ki67 positivity, high tumor grade, and molecular subtype were 0.86, 0.76, 0.69, 0.65, 0.75, and 0.79, respectively. Entropy of pre- and postcontrast images and perfusion, time to peak, and peak enhancement intensity of hot spots are the five most important CT parameters for prediction. In conclusion, machine learning using texture and perfusion characteristics of breast cancer with low-dose CT has potential value for predicting prognostic factors and risk stratification in breast cancer patients.Entities:
Keywords: breast neoplasms; computed tomography; machine learning; perfusion analysis; prospective studies; texture analysis
Year: 2021 PMID: 34885124 PMCID: PMC8656976 DOI: 10.3390/cancers13236013
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Flowchart of study population.
Figure 2Perfusion and texture analyses on low-dose breast CT in a 47-year-old woman with invasive ductal cancer of the left breast. (A) Axial CT image shows an irregular shaped, irregular margined, heterogeneous enhancing mass (arrows). (B,C) Perfusion analysis was performed using the maximum slope algorithm. Regions of interest (ROIs) were drawn manually for the entire tumor extent (B) and hot spot (C) for each cancer and four perfusion parameters were measured at each ROI: perfusion, peak enhancement intensity (PEI), time to peak (TTP), and blood volume (BV). (D,E) Texture analysis was performed using a filtration histogram technique. The ROI was drawn manually for the entire tumor and a CT texture histogram was obtained (D). From the histogram, six statistical-based metrics were extracted (E). These six texture parameters were extracted from precontrast and postcontrast CT images for special scale filters (SSFs) 0, 2, and 5: mean, standard deviation (SD), entropy, mean of positive pixels (MPP), skewness, and kurtosis.
Tumor characteristics.
| Characteristic | Training Set | Test Set |
|---|---|---|
| Age | 52.2 ± 9 years | 52.5 ± 10 years |
| Tumor size | 25.3 ± 16 mm | 23.5 ± 14 mm |
| ER status | ||
| Negative | 39 (35%) | 9 (24%) |
| Positive | 71 (65%) | 28 (76%) |
| PR status | ||
| Negative | 41 (37%) | 9 (24%) |
| Positive | 69 (63%) | 28 (76%) |
| HER2 status | ||
| Negative | 93 (85%) | 30 (81%) |
| Positive | 17 (15%) | 7 (19%) |
| Ki67 status | ||
| Negative | 50 (45%) | 23 (62%) |
| Positive | 60 (55%) | 14 (38%) |
| Tumor grade | ||
| Low | 68 (62%) | 28 (76%) |
| High | 42 (38%) | 9 (24%) |
| Molecular subtype | ||
| Luminal | 75 (68%) | 28 (76%) |
| Non-luminal | 35(32%) | 9 (24%) |
ER: estrogen receptor, PR: progesterone receptor, HER2: human epidermal growth factor receptor 2. Data are numbers of cancers, with percentages in parentheses.
p values for association tests between CT perfusion parameters and histological factors in the training set.
| CT Perfusion Parameter | ER | PR | HER2 | Ki67 | Grade | Subtype |
|---|---|---|---|---|---|---|
|
| ||||||
| Perfusion | <0.001 | 0.001 | 0.01 | <0.001 | <0.001 | <0.001 |
| PEI | 0.001 | 0.06 | 0.01 | <0.001 | 0.001 | 0.001 |
| TTP | 0.001 | 0.02 | 0.04 | 0.02 | 0.01 | <0.001 |
| Blood volume | 0.06 | 0.23 | 0.09 | 0.001 | 0.004 | 0.001 |
|
| ||||||
| Perfusion | 0.001 | 0.10 | 0.11 | 0.01 | 0.001 | 0.01 |
| PEI | 0.01 | 0.17 | 0.08 | 0.01 | 0.04 | 0.01 |
| TTP | 0.02 | 0.28 | 0.43 | 0.37 | 0.01 | 0.23 |
| Blood volume | 0.04 | 0.14 | 0.35 | 0.001 | 0.01 | 0.001 |
ER: estrogen receptor, PR: progesterone receptor, HER2: human epidermal growth factor receptor 2, PEI: peak enhancement intensity, TTP: time-to-peak enhancement. A p value < 0.05 for perfusion features was considered significant.
p values for association between CT texture parameters and histological factors in the training set.
| CT Texture Parameter | ER | PR | HER2 | Ki67 | Grade | Subtype |
|---|---|---|---|---|---|---|
|
| ||||||
| Mean_precontrast | 0.11 | 0.41 | 0.003 | 0.02 | <0.001 | 0.01 |
| Standard deviation_precontrast | 0.55 | 0.18 | 0.15 | 0.07 | 0.30 | 0.01 |
| Entropy_precontrast | 0.002 | <0.001 | 0.09 | 0.33 | 0.07 | 0.01 |
| Mean of positive pixels_precontrast | 0.05 | 0.26 | 0.01 | 0.18 | <0.001 | 0.09 |
| Skewness_precontrast | 0.048 | 0.14 | 0.07 | 0.15 | 0.09 | 0.21 |
| Kurtosis_precontrast | 0.05 | 0.11 | 0.12 | 0.06 | 0.02 | 0.18 |
| Mean_postccontrast | 0.05 | 0.38 | 0.01 | 0.02 | 0.01 | 0.03 |
| Standard deviation_postcontrast | 0.56 | 0.41 | 0.16 | 0.24 | 0.51 | 0.05 |
| Entropy_postcontrast | <0.001 | <0.001 | 0.003 | 0.02 | 0.002 | <0.001 |
| Mean of positive pixels_postcontrast | 0.049 | 0.43 | 0.02 | 0.05 | 0.02 | 0.07 |
| Skewness_postcontrast | 0.01 | 0.15 | 0.06 | 0.09 | 0.03 | 0.05 |
| Kurtosis_postcontrast | 0.01 | 0.18 | 0.22 | 0.048 | 0.06 | 0.08 |
|
| ||||||
| Mean_precontrast | 0.046 | 0.12 | 0.12 | 0.01 | 0.06 | 0.01 |
| Standard deviation_precontrast | 0.82 | 0.19 | 0.19 | 0.92 | 0.68 | 0.32 |
| Entropy_precontrast | <0.001 | <0.001 | <0.001 | 0.01 | 0.002 | <0.001 |
| Mean of positive pixels_precontrast | 0.21 | 0.55 | 0.55 | 0.04 | 0.18 | 0.01 |
| Skewness_precontrast | 0.13 | 0.46 | 0.46 | 0.70 | 0.23 | 0.23 |
| Kurtosis_precontrast | 0.77 | 0.37 | 0.37 | 0.79 | 0.68 | 0.85 |
| Mean_postccontrast | 0.04 | 0.06 | 0.06 | 0.03 | 0.046 | 0.03 |
| Standard deviation_postcontrast | 0.63 | 0.72 | 0.72 | 0.37 | 0.85 | 0.78 |
| Entropy_postcontrast | <0.001 | <0.001 | <0.001 | 0.001 | <0.001 | <0.001 |
| Mean of positive pixels_postcontrast | 0.16 | 0.26 | 0.26 | 0.15 | 0.21 | 0.08 |
| Skewness_postcontrast | 0.97 | 0.50 | 0.50 | 0.18 | 0.35 | 0.02 |
| Kurtosis_postcontrast | 0.70 | 0.55 | 0.55 | 0.20 | 0.18 | 0.50 |
|
| ||||||
| Mean_precontrast | 0.18 | 0.29 | 0.21 | 0.02 | 0.27 | 0.04 |
| Standard deviation_precontrast | 0.88 | 0.55 | 0.10 | 0.84 | 0.86 | 0.79 |
| Entropy_precontrast | <0.001 | <0.001 | <0.001 | 0.004 | 0.003 | <0.001 |
| Mean of positive pixels_precontrast | 0.21 | 0.34 | 0.22 | 0.05 | 0.33 | 0.11 |
| Skewness_precontrast | 0.57 | 0.93 | 0.35 | 0.80 | 0.97 | 0.63 |
| Kurtosis_precontrast | 0.01 | 0.04 | 0.01 | 0.11 | 0.04 | 0.02 |
| Mean_postccontrast | 0.14 | 0.30 | 0.14 | 0.07 | 0.19 | 0.10 |
| Standard deviation_postcontrast | 0.67 | 0.99 | 0.63 | 0.87 | 0.71 | 0.73 |
| Entropy_postcontrast | <0.001 | <0.001 | <0.001 | 0.001 | <0.001 | |
| Mean of positive pixels_postcontrast | 0.20 | 0.41 | 0.16 | 0.10 | 0.24 | 0.12 |
| Skewness_postcontrast | 0.74 | 0.97 | 0.26 | 0.10 | 0.96 | 0.58 |
| Kurtosis_postcontrast | 0.03 | 0.12 | 0.07 | 0.15 | 0.59 | 0.12 |
ER: estrogen receptor, PR: progesterone receptor, HER2: human epidermal growth factor receptor 2, SSF: spatial scale filter. A p value < 0.05 for texture features was considered significant.
Diagnostic performance of five machine learning models using perfusion and texture features to predict histological factors.
| Machine Learning Model | Diagnostic Performance | Perfusion Features * | Texture Features † | Integrating Perfusion and Texture Features ‡ | |
|---|---|---|---|---|---|
| Decision tree | AUC median | 0.55 | 0.59 | 0.65 | 0.04 |
| AUC mean | 0.55 | 0.58 | 0.62 | ||
| AUC SD | 0.25 | 0.34 | 0.28 | ||
| AUC 95% CI | 0.35, 0.75 | 0.42, 0.74 | 0.40, 0.84 | ||
| accuracy | 59% | 66% | 73% | ||
| sensitivity | 68% | 38% | 51% | ||
| specificity | 47% | 64% | 73% | ||
| NPV | 42% | 49% | 61% | ||
| PPV | 73% | 57% | 58% | ||
| Naïve Bayes | AUC median | 0.69 | 0.54 | 0.73 | 0.63 |
| AUC mean | 0.69 | 0.59 | 0.71 | ||
| AUC SD | 0.31 | 0.18 | 0.32 | ||
| AUC 95% CI | 0.44, 0.94 | 0.51, 0.67 | 0.45, 0.97 | ||
| accuracy | 67% | 51% | 65% | ||
| sensitivity | 80% | 60% | 60% | ||
| specificity | 52% | 67% | 64% | ||
| NPV | 56% | 63% | 78% | ||
| PPV | 77% | 69% | 61% | ||
| Logistic regression | AUC median | 0.65 | 0.50 | 0.71 | 0.41 |
| AUC mean | 0.63 | 0.53 | 0.70 | ||
| AUC SD | 0.29 | 0.42 | 0.32 | ||
| AUC 95% CI | 0.40, 0.86 | 0.34, 0.72 | 0.44, 0.96 | ||
| accuracy | 63% | 62% | 73% | ||
| sensitivity | 80% | 24% | 46% | ||
| specificity | 43% | 68% | 71% | ||
| NPV | 53% | 45% | 64% | ||
| PPV | 74% | 25% | 27% | ||
| ANN | AUC median | 0.60 | 0.56 | 0.66 | 0.17 |
| AUC mean | 0.60 | 0.57 | 0.68 | ||
| AUC SD | 0.27 | 0.28 | 0.31 | ||
| AUC 95% CI | 0.38, 0.82 | 0.44, 0.70 | 0.43, 0.93 | ||
| accuracy | 61% | 59% | 68% | ||
| sensitivity | 70% | 39% | 55% | ||
| specificity | 43% | 67% | 72% | ||
| NPV | 38% | 47% | 71% | ||
| PPV | 74% | 55% | 61% | ||
| Random forest | AUC median | 0.65 | 0.61 | 0.76 | … |
| AUC mean | 0.66 | 0.61 | 0.75 | ||
| AUC SD | 0.30 | 0.32 | 0.34 | ||
| AUC 95% CI | 0.42, 0.90 | 0.46, 0.76 | 0.48, 1.00 | ||
| accuracy | 65% | 65% | 74% | ||
| sensitivity | 81% | 27% | 50% | ||
| specificity | 36% | 72% | 76% | ||
| NPV | 48% | 48% | 69% | ||
| PPV | 72% | 64% | 70% |
ANN: artificial neural network, AUC: area under the receiver operating characteristic curve, SD: standard deviation, CI: confidence interval, NPV: negative predictive value, PPV: positive predictive value. * Perfusion features were measured for both the hot spots of the tumor and the whole tumor. † Texture features were measured at SSF 0, 2, and 5. ‡ The AUC of integrated model was highest when perfusion features of hot spots and texture features at SSF 0 were used. p values are for comparing the median AUC values with the random forest model among the integrating models by the DeLong test.
AUCs and accuracies of integrated machine learning models using perfusion and texture features to predict each histological factor.
| Machine Learning Model | Diagnostic Performance | ER | PR | HER2 | Ki67 | Grade | Subtype |
|---|---|---|---|---|---|---|---|
| Decision tree | AUC median | 0.65 | 0.55 | 0.61 | 0.53 | 0.69 | 0.68 |
| accuracy | 77% | 59% | 83% | 52% | 73% | 73% | |
| Naïve Bayes | AUC median | 0.76 | 0.60 | 0.75 | 0.72 | 0.73 | 0.68 |
| accuracy | 65% | 59% | 62% | 68% | 73% | 65% | |
| Logistic regression | AUC median | 0.76 | 072 | 0.67 | 0.55 | 0.69 | 0.79 |
| accuracy | 70% | 73% | 89% | 49% | 73% | 81% | |
| ANN | AUC median | 0.66 | 0.60 | 0.67 | 0.66 | 0.73 | 0.75 |
| accuracy | 68% | 54% | 84% | 68% | 68% | 65% | |
| Random forest | AUC median | 0.76 | 0.69 | 0.86 | 0.65 | 0.75 | 0.79 |
| accuracy | 76% | 74% | 92% | 65% | 67% | 75% |
ER: estrogen receptor, HER2: human epidermal growth factor receptor 2, AUC: area under the receiver operating characteristic curve, ANN: artificial neural network. Integrating machine learning model was built using perfusion features of hot spots and texture features at SSF 0.
Top five important CT parameters from the integrated random forest model to predict histological biomarkers and molecular subtypes.
| Rank | Important CT Parameters |
|---|---|
| 1 | Entropy_postcontrast |
| 2 | Perfusion_hot spot |
| 3 | TTP_hot spot |
| 4 | PEI_hot spot |
| 5 | Entropy_precontrast |
TTP: time-to-peak enhancement, PEI: peak enhancement intensity. Integrated machine learning model was built using perfusion features of hot spots and texture features at SSF 0.
Values of the top five important CT parameters according to histological factors in the training set.
| Histological Facor | Entropy_ | Perfusion_ | TTP_ | PEI_ | Entropy_Precontrast |
|---|---|---|---|---|---|
|
| |||||
| − | 4.75 | 46.98 | 34.68 | 82.22 | 4.54 |
| + | 4.50 | 26.94 | 51.92 | 65.54 | 4.35 |
|
| |||||
| − | 4.55 | 31.33 | 47.63 | 68.69 | 4.40 |
| + | 4.76 | 48.88 | 35.82 | 86.61 | 4.54 |
|
| |||||
| low | 4.52 | 27.67 | 51.01 | 64.33 | 4.36 |
| high | 4.70 | 44.36 | 37.38 | 83.00 | 4.51 |
|
| |||||
| luminal A | 4.48 | 20.60 | 53.20 | 58.03 | 4.35 |
| luminal B | 4.54 | 35.46 | 54.59 | 80.88 | 4.35 |
| HER2-enriched | 4.78 | 59.22 | 21.06 | 86.98 | 4.58 |
| Triple-negative | 4.75 | 44.71 | 33.32 | 76.89 | 4.56 |
ER: estrogen receptor, HER2: human epidermal growth factor receptor 2, PEI: peak enhancement intensity, TTP: time-to-peak enhancement.
Comparison of AUCs according to number of CT parameters of integrated machine learning models to predict histological factors and treatment failure.
| CT Parameter | AUC | |
|---|---|---|
| All parameters for predicting ER, HER2, and molecular subtype | ||
| Decision tree | 0.59 | 0.002 |
| Naïve Bayes | 0.75 | 0.52 |
| Logistic regression | 0.76 | 0.63 |
| ANN | 0.67 | 0.04 |
| Random forest | 0.79 | … |
| Top five important parameters for predicting ER, HER2, and molecular subtype | ||
| Decision tree | 0.62 | 0.03 |
| Naïve Bayes | 0.76 | 0.95 |
| Logistic regression | 0.76 | 0.97 |
| ANN | 0.70 | 0.34 |
| Random forest | 0.76 | … |
| All parameters for predicting ER, HER2, molecular subtype, and treatment failure | ||
| Decision tree | 0.52 | |
| Naïve Bayes | 0.81 | 0.29 |
| Logistic regression | 0.69 | 0.26 |
| ANN | 0.70 | 0.36 |
| Random forest | 0.76 | … |
| Top five parameters for predicting ER, HER2, molecular subtype, and treatment failure | ||
| Decision tree | 0.52 | |
| Naïve Bayes | 0.83 | 0.06 |
| Logistic regression | 0.72 | 0.82 |
| ANN | 0.68 | 0.64 |
| Random forest | 0.74 | … |
* p values are for comparison with the random forest model among the integrating models by the DeLong test. ER: estrogen receptor, HER2: human epidermal growth factor receptor 2, AUC: area under the receiver operating characteristic curve, ANN: artificial neural network.