| Literature DB >> 34307153 |
Bin Zhang1, Lirong Song2, Jiandong Yin2.
Abstract
PURPOSE: To evaluate the potential of the texture features extracted from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) intratumoral subregions to distinguish benign from malignant breast tumors.Entities:
Keywords: DCE-MRI; breast tumors; machine learning; magnetic resonance imaging; texture analysis
Year: 2021 PMID: 34307153 PMCID: PMC8299951 DOI: 10.3389/fonc.2021.688182
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Clinical characteristics of the patients selected for this study.
| Characteristic | Training cohort | Validation cohort | ||
|---|---|---|---|---|
| Number | % | Number | % | |
|
| 209 | 90 | ||
| Benign (age range, 25–82 years) | 84 | 40.2 | 40 | 44.4 |
| Malignant (age range, 29–84 years) | 125 | 59.8 | 50 | 55.6 |
|
| ||||
| 3 | 18 | 8.6 | 7 | 7.8 |
| 4A | 56 | 26.8 | 27 | 30 |
| 4B | 43 | 20.6 | 16 | 17.8 |
| 4C | 68 | 32.5 | 35 | 38.9 |
| 5 | 24 | 11.5 | 5 | 5.5 |
|
| ||||
| Benign | 84 | 40.2 | 40 | 44.4 |
| Adenosis | 48 | 23.0 | 23 | 25.5 |
| Fibroadenoma | 32 | 15.3 | 14 | 15.5 |
| Papilloma | 4 | 1.9 | 3 | 3.4 |
| Malignant | 125 | 59.8 | 50 | 55.6 |
| Invasive carcinoma of no special type | 116 | 55.5 | 41 | 45.6 |
| Ductal carcinoma in situ | 6 | 2.8 | 5 | 5.6 |
| Invasive micropapillary carcinoma | 2 | 1.0 | 3 | 3.3 |
| Invasive lobular carcinoma | 1 | 0.5 | 1 | 1.1 |
Figure 1The flowchart adopted in this study.
Detailed information on the extracted features.
| Methods | Texture features | Number |
|---|---|---|
| Histogram | Mean, Kurtosis, Skewness, Variance | 4 |
| GLCM | Autocorrelation, Contrast, Correlation, Cluster prominence, Cluster shadow, Dissimilarity, Energy, Entropy, Homogeneity, Maximum probability, Sum of square, Sum average, Sum variance, Sum entropy, Difference square, Difference entropy, Information measure of correlation, Inverse difference normalized, Inverse difference moment normalized | 380 |
| GRLM | Short run emphasis, Long run emphasis, Gray-level non-uniformity, Run length non-uniformity, Fraction of image in runs, Low gray-level run emphasis, High gray-level run emphasis, Short run low gray-level emphasis, Short run high gray-level emphasis, Long run low gray-level emphasis, Long run high gray-level emphasis | 44 |
| DWT | Harr parameters | 13 |
| Deubechies2 parameters | 13 | |
| Symlet4 parameters | 13 | |
| Total | 467 |
GLCM, gray-level co-occurrence matrix; GRLM, gray-level run length matrix; DWT, discrete wavelet transform.
Figure 2Results of whole tumor segmentation and intratumoral subregion partition. The first row shows the results of a benign case: (A) subtraction image with the maximum tumor diameter; (B) result of the whole tumor area segmented with a semi-automatic method; (C) result of intratumoral subregion partition, in which red, green, and blue represent the early, moderate, and late subregions, respectively. The second row shows the results of a malignant case: (D) subtraction image; (E) result of the whole tumor area; (F) result of intratumoral subregion partition.
Univariate analysis for predicting benign and malignant breast tumors.
| Methods | Subregions | Features | AUC | 95% CI |
|
|---|---|---|---|---|---|
| Intratumoral subregions | Early | Run length nonuniformity (1, 0) | 0.886 | 0.836–0.926 | <0.001 |
| Difference square (0, 1) | 0.877 | 0.825–0.918 | 0.004 | ||
| Short run emphasis (1, 0) | 0.870 | 0.817–0.913 | <0.001 | ||
| Correlation (−1, 0) | 0.836 | 0.779–0.884 | 0.081 | ||
| Information measure of correlation (−2, 0) | 0.820 | 0.761–0.870 | 0.391 | ||
| Deubechies2_2HH | 0.787 | 0.725–0.840 | 0.186 | ||
| Moderate | Gray-level non-uniformity (1, 0) | 0.777 | 0.715–0.832 | <0.001 | |
| Deubechies2_1VH | 0.740 | 0.675–0.798 | 0.357 | ||
| Haar_1DH | 0.736 | 0.671–0.795 | <0.001 | ||
| Symlet4_1DH | 0.729 | 0.664–0.788 | 0.016 | ||
| Deubechies2_1DH | 0.718 | 0.651–0.778 | 0.003 | ||
| Mean | 0.715 | 0.648–0.775 | 0.238 | ||
| Late | Information measure of correlation (0,1) | 0.884 | 0.833–0.924 | 0.002 | |
| Information measure of correlation (−1,0) | 0.853 | 0.798–0.898 | 0.059 | ||
| Deubechies2_2VH | 0.849 | 0.797–0.898 | 0.001 | ||
| Haar_1HH | 0.840 | 0.784–0.887 | 0.001 | ||
| Haar_4HH | 0.724 | 0.658–0.783 | <0.001 | ||
| Mean | 0.685 | 0.617–0.747 | 0.157 | ||
| Whole tumor area | / | Deubechies2_2DH | 0.786 | 0.725–0.840 | / |
| Haar_2DH | 0.779 | 0.717–0.833 | / | ||
| Symlet4_2VH | 0.776 | 0.713–0.831 | / | ||
| Symlet4_2HH | 0.747 | 0.682–0.804 | / | ||
| Deubechies2_3DH | 0.734 | 0.669–0.793 | / | ||
| Mean | 0.732 | 0.667–0.791 | / |
AUC, area under the receiver operating characteristic curve; CI, confidence interval.
P-value represents the comparison results of the features from the three intratumoral subregions and the same features from the whole tumor area.
The symbol ("/") represents null.
Performance of classification models for identifying benign and malignant breast tumors.
| Models | Cohort | AUC | 95% CI | Sensitivity | Specificity | Accuracy |
| |
|---|---|---|---|---|---|---|---|---|
| DT | Early | Training | 0.863 | 0.808–0.906 | 80.0% | 91.7% | 79.8% | 0.004 |
| Validation | 0.839 | 0.747–0.908 | 90.0% | 80.0% | 77.8% | 0.006 | ||
| Moderate | Training | 0.777 | 0.715–0.832 | 79.2% | 76.2% | 76.5% | 0.473 | |
| Validation | 0.718 | 0.613–0.808 | 70.0% | 75.0% | 74.4% | 0.406 | ||
| Late | Training | 0.860 | 0.806–0.904 | 80.8% | 84.5% | 78.5% | 0.015 | |
| Validation | 0.784 | 0.601–0.798 | 82.0% | 77.5% | 76.7% | 0.043 | ||
| Whole | Training | 0.744 | 0.679–0.802 | 86.4% | 67.9% | 74.2% | / | |
| Validation | 0.670 | 0.563–0.766 | 74.0% | 65.0% | 67.8% | / | ||
| SVM | Early | Training | 0.934 | 0.891–0.963 | 89.6% | 86.9% | 88.5% | 0.002 |
| Validation | 0.890 | 0.806–0.946 | 84.0% | 85.0% | 83.3% | 0.001 | ||
| Moderate | Training | 0.868 | 0.814–0.911 | 81.6% | 84.5% | 80.4% | 0.078 | |
| Validation | 0.737 | 0.634–0.824 | 80.0% | 73.5% | 72.2% | 0.664 | ||
| Late | Training | 0.921 | 0.876–0.954 | 86.4% | 85.7% | 84.5% | 0.002 | |
| Validation | 0.865 | 0.777–0.928 | 82.0% | 80.0% | 80.0% | 0.007 | ||
| Whole | Training | 0.806 | 0.746–0.857 | 69.6% | 83.3% | 65.5% | / | |
| Validation | 0.708 | 0.602–0.799 | 88.0% | 67.5% | 61.1% | / | ||
AUC, area under the receiver operating characteristic curve; CI, confidence interval; DT, decision tree; SVM, support vector machine.
P-value represents the comparison results of the AUC value of the same model established by features from intratumoral subregions and the whole tumor area..
The symbol ("/") represents null.
Figure 3ROC curves of the DT classification models established by using the features extracted from the three intratumoral subregions and the whole tumor area. (A) ROC curves from the training cohort. (B) ROC curve from the external validation cohort.
Figure 4ROC curves of the SVM classification models established by using the features extracted from the three intratumoral subregions and the whole tumor area. (A) ROC curves from the training cohort. (B) ROC curves from the external validation cohort.
P-values of DeLong tests between subregion models in the training cohort.
| Classifier | DT_Early | DT_Moderate | DT_Late | SVM_Early | SVM_Moderate | SVM_Late |
|---|---|---|---|---|---|---|
| DT_Early | / | 0.013 | 0.945 | 0.003 | 0.002 | 0.034 |
| DT_Moderate | 0.013 | / | 0.035 | 0.001 | 0.004 | 0.001 |
| DT_Late | 0.945 | 0.035 | / | 0.008 | 0.843 | 0.026 |
| SVM_Early | 0.003 | 0.001 | 0.008 | / | 0.020 | 0.524 |
| SVM_Moderate | 0.002 | 0.004 | 0.843 | 0.020 | / | 0.091 |
| SVM_Late | 0.034 | 0.001 | 0.026 | 0.524 | 0.091 | / |
DT, decision tree; SVM, support vector machine.
The symbol ("/") represents null.
P-values of DeLong tests between subregion models in the validation cohort.
| Classifier | DT_Early | DT_Moderate | DT_Late | SVM_Early | SVM_Moderate | SVM_Late |
|---|---|---|---|---|---|---|
| DT_Early | / | 0.068 | 0.332 | 0.018 | 0.111 | 0.047 |
| DT_Moderate | 0.068 | / | 0.370 | 0.006 | 0.760 | 0.012 |
| DT_Late | 0.332 | 0.370 | / | 0.035 | 0.511 | 0.029 |
| SVM_Early | 0.018 | 0.006 | 0.035 | / | 0.007 | 0.523 |
| SVM_Moderate | 0.111 | 0.760 | 0.511 | 0.007 | / | 0.032 |
| SVM_Late | 0.047 | 0.012 | 0.029 | 0.523 | 0.032 | / |
The symbol ("/") represents null.
Texture features extracted from early and late subregions selected with LASSO.
| Features | Benign | Malignant |
|---|---|---|
|
| ||
| Mean | 133.642 ± 41.162 | 168.686 ± 42.720 |
| Variance | 27.638 ± 10.281 | 33.551 ± 8.434 |
| Difference square (0, 1) | 0.220 ± 0.104 | 0.991 ± 0.357 |
| Correlation (−2, 0) | 0.654 ± 0.259 | 0.823 ± 0.0987 |
| Information measure of correlation (0, 1) | 0.6131 ± 0.186 | 0.822 ± 0.0691 |
| Short run emphasis (1, 0) | 0.897 ± 0.0944 | 0.622 ± 0.114 |
| Run length non-uniformity (1, 0) | 560.054 ± 13.620 | 426.963 ± 52.547 |
| Deubechies2_2HH | 7.810 ± 2.364 | 4.399 ± 1.231 |
| Deubechies2_1VH | 9.040 ± 4.241 | 4.853 ± 1.680 |
| Symlet4_1VH | 8.174 ± 3.807 | 4.359 ± 1.445 |
| Haar_4HH | 3.231 ± 1.749 | 5.317 ± 1.992 |
| Deubechies2_3HH | 4.963 ± 1.313 | 3.826 ± 0.831 |
| Symlet4_4VH | 3.089 ± 1.469 | 4.938 ± 1.845 |
| Symlet4_1DH | 5.212 ± 2.273 | 2.517 ± 0.964 |
|
| ||
| Mean | 117.859 ± 29.076 | 136.495 ± 29.933 |
| Variance | 30.496 ± 7.022 | 35.016 ± 7.631 |
| Contrast (0,1) | 0.384 ± 0.203 | 0.716 ± 0.252 |
| Information measure of correlation (0, 1) | −0.595 ± 0.099 | −0.441 ± 0.0807 |
| Information measure of correlation (−1, 0) | −0.594 ± 0.0807 | −0.462 ± 0.0492 |
| Short run emphasis (1, 0) | 0.672 ± 0.117 | 0.810 ± 0.149 |
| Haar_1HH | 7.709 ± 4.446 | 13.814 ± 4.073 |
| Deubechies2_2VH | 5.455 ± 2.949 | 9.708 ± 2.897 |
| Haar_2HH | 6.651 ± 4.085 | 8.691 ± 2.746 |
| Haar_4HH | 5.029 ± 1.993 | 2.895 ± 0.978 |
| Haar_3VH | 4.496 ± 1.049 | 5.425 ± 1.343 |
| Haar_4DH | 2.048 ± 0.881 | 1.567 ± 0.569 |
| Deubechies2_3VH | 4.640 ± 1.481 | 5.489 ± 1.427 |
| Deubechies2_4VH | 4.287 ± 1.630 | 3.103 ± 1.293 |
| Deubechies2_4DH | 1.292 ± 0.393 | 1.835 ± 0.720 |
| Symlet4_3HH | 3.335 ± 1.161 | 4.755 ± 1.129 |
| Symlet4_3VH | 3.722 ± 1.434 | 5.611 ± 1.262 |
The data are means ± SD.
The data are medians ± interquartile range.