| Literature DB >> 31676937 |
Magda Marcon1, Alexander Ciritsis2, Cristina Rossi2, Anton S Becker2, Nicole Berger2, Moritz C Wurnig2, Matthias W Wagner2, Thomas Frauenfelder2, Andreas Boss2.
Abstract
BACKGROUND: Our aims were to determine if features derived from texture analysis (TA) can distinguish normal, benign, and malignant tissue on automated breast ultrasound (ABUS); to evaluate whether machine learning (ML) applied to TA can categorise ABUS findings; and to compare ML to the analysis of single texture features for lesion classification.Entities:
Keywords: Breast neoplasms; Machine learning; Ultrasonography
Mesh:
Year: 2019 PMID: 31676937 PMCID: PMC6825080 DOI: 10.1186/s41747-019-0121-6
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
First order and second and high order texture features
| Histogram-derived | GLCM | GLRLM | GLSZM |
|---|---|---|---|
| Entropy | Contrast | Short-run emphasis (SRE) | Small zone emphasis (SZE) |
| Variance | Correlation | Long-run emphasis (LRE) | Large zone emphasis (LZE) |
| Skewness | Energy | Grey-level non-uniformity (GLN) | Grey-level non-uniformity (GLN) |
| Kurtosis | Homogeneity | Run length non-uniformity (RLN) | Zone-size non-uniformity (ZSN) |
| Contrast | Short-run emphasis (SRE) | Run percentage (RP) | Zone percentage (ZP) |
| Correlation | Long-run emphasis (LRE) | Low grey-level run emphasis (LGRE) | Low grey-level zone emphasis (LGZE) |
| Energy | Grey-level non-uniformity (GLN) | High grey-level run emphasis (HGRE) | High grey-level zone emphasis (HGZE) |
| Homogeneity | Run length non-uniformity (RLN) | Short-run low grey-level emphasis (m_SRLGE) | Small zone low-grey level emphasis (SZLGE) |
| Run percentage (RP) | Short-run high grey-level emphasis (SRHGE) | Small zone high grey level emphasis (SHZGE) | |
| Low grey-level run emphasis (LGRE) | Long-run low grey-level emphasis (LRLGE) | Large zone low grey-level emphasis (LZLGE) | |
| High grey-level run emphasis (HGRE) | Long-run high-grey level emphasis (LRHGE) | Large zone high grey-level (LZHGE) | |
| Short-run low grey-level emphasis (SRLGE) | Grey-level variance (GLV) | ||
| Short-run high grey-level emphasis (SRHGE) | Zone-size variance (ZSV) | ||
| Long-run low grey-level emphasis (LRLGE) | |||
| Long-run high grey-level emphasis (LRHGE) |
GLCM Grey-level co-occurrence matrix, GLRLM Grey-level run length matrix, GLSZM Grey-level size zone matrix
Fig. 1Axial images obtained with automated breast ultrasound (ABUS). A region of interest was drawn freehand marking the outer edge of the lesion or the maximal continuous area of fat or fibroglandular tissue included in the image. Invasive ductal carcinoma in a 53-year-old patient undergoing screening mammography and ABUS (a). Stable benign solid lesion after a 48-month follow-up in a 35-year-old woman undergoing routine control (b). Fatty tissue (c). Fibroglandular tissue (d)
Texture features that showed significantly different mean values comparing lesions (benign and malignant) versus normal tissue (fat and fibroglandular) and corresponding area under the curve (AUC)
| Feature | Lesions (mean ± standard deviation) | Normal tissue (mean ± standard deviation) | AUC (95% confidence interval) | |
|---|---|---|---|---|
| Entropy | 5.48 ± 0.35 | 5.65 ± 0.24 | < 0.00001 for all | 0.67 (0.63–0.71) |
| Variance | 153.53 ± 23.59 | 137.75 ± 20.12 | 0.70 (0.66–0.73) | |
| Contrast | 24.10 ± 9.87 | 40.10 ± 12.5 | 0.84 (0.81–0.87) | |
| Correlation | 0.88 ± 0.06 | 0.81 ± 0.06 | 0.80 (0.65–0.83) | |
| Energy | 3.4 × 10−3 ± 1.5 × 10−3 | 2.2 × 10−3 ± 0.7 × 10−3 | 0.83 (0.80–0.86) | |
| Homogeneity | 0.36 ± 0.05 | 0.33 ± 0.04 | 0.72 (0.69–0.76) | |
| Contrast | 26.43 ± 12.38 | 40.94 ± 13.41 | 0.80 (0.76–0.83) | |
| Correlation | 0.87 ± 0.06 | 0.81 ± 0.07 | 0.78 (0.74–0.82) | |
| Energy | 3.9 × 10−3 ± 1.4 × 10−3 | 2.6 × 10−3 ± 0.7 × 10−3 | 0.86 (0.82–0.88) | |
| Homogeneity (GLCM) | 0.36 ± 0.05 | 0.33 ± 0.04 | 0.71 (0.66–0.74) | |
| GLN (GLCM) | 78.42 ± 77.69 | 156.28 ± 104.76 | 0.79 (0.76–0.83) | |
| RLN (GLCM) | 1,870.43 ± 1,792.01 | 4,085.00 ± 2,628.83 | 0.82 (0.79–0.85) | |
| LRHGE (GLCM) | 1,706.53 ± 209.23 | 1,635.50 ± 197.26 | 0.61 (0.57–0.65) | |
| SRE (GLRLM) | 0.91 ± 0.02 | 0.92 ± 0.04 | 0.69 (0.65–0.73) | |
| LRE (GLRLM) | 1.47 ± 0.17 | 1.39 ± 0.12 | 0.65 (0.61–0.69) | |
| GLN (GLRLM) | 78.53 ± 77.67 | 156.36 ± 104.73 | 0.79 (0.76–0.83) | |
| RLN (GLRLM) | 1,871.42 ± 1,786.51 | 4,080.90 ± 2,622.01 | 0.82 (0.79–0.85) | |
| LRHGE (GLRLM) | 1,677.30 ± 196.74 | 1,598.70 ± 168.52 | 0.61 (0.57–0.65) | |
| LZE | 5.38 ± 3.34 | 4.07 ± 2.38 | 0.66 (0.62–0.70) | |
| HGZE | 1,221.08 ± 45.52 | 1,191.37 ± 86.03 | 0.67 (0.63–0.71) |
GLCM Grey-level co-occurrence matrix, GLN Grey-level non-uniformity, RLN Run length non-uniformity, LRHGE Long-run high grey-level emphasis, GLRLM Grey-level run length matrix, SRE Short-run emphasis, LRE Long-run emphasis, LZE Large zone emphasis, HGZE High grey-level zone emphasis
Texture features that showed significantly different mean values comparing malignant versus benign solid lesions and corresponding area under the curve (AUC)
| Feature | Malignant lesions (mean ± standard deviation) | Benign lesions (mean ± standard deviation) | AUC (95% confidence interval) | |
|---|---|---|---|---|
| Entropy | 5.28 ± 0.38 | 5.67 ± 0.16 | < 0.00001 for all | 0.86 (0.82–0.89) |
| Skewness | 0.74 ± 0.33 | 0.54 ± 0.41 | 0.66 (0.61–0.70) | |
| Kurtosis | 0.53 ± 0.67 | 0.31 ± 0.66 | 0.61 (0.56–0.65) | |
| Contrast | 24.45 ± 10.61 | 28.40 ± 13.66 | 0.58 (0.53–0.63) | |
| GLN (GLCM) | 96.33 ± 87.47 | 60.58 ± 61.73 | 0.67 (0.62–0.71) | |
| RLN (GLCM) | 2,218.97 ± 1,834.77 | 1,523.25 ± 1,681.38 | 0.66 (0.62–0.71) | |
| HGRE (GLCM) | 1,163.54 ± 20.49 | 1,171.63 ± 19.35 | 0.62 (0.57–0.67) | |
| SRHGE (GLCM) | 1,067.79 ± 32.58 | 1,080.79 ± 29.69 | 0.63 (0.58–0.67) | |
| GLN (GLRLM) | 96.43 ± 87.45 | 60.70 ± 61.72 | 0.67 (0.62–0.71) | |
| RLN (GLRLM) | 2,218.89 ± 1,828.26 | 1,525.31 ± 1,677.17 | 0.67 (0.62–0.71) | |
| HGRE (GLRLM) | 1,165.00 ± 20.59 | 1,173.06 ± 19.57 | 0.62 (0.57–0.67) | |
| SRHGE (GLRLM) | 1,069.20 ± 33.13 | 1,082.28 ± 29.75 | 0.62 (0.58–0.67) |
GLN Grey-level non-uniformity, GLCM Grey-level co-occurrence matrix, RLN Run length non-uniformity, HGRE High grey-level run emphasis, SRHGE Short-run high grey-level emphasis, GLN Grey-level non-uniformity, GLRLM Grey-level run length matrix
Fig. 2Receiver operating characteristic (ROC) curves of texture analysis features with the highest area under the curve values (Table 2) as well as ROC curves obtained with machine learning support vector machine (SVM) algorithm with full and reduced features when used to compare lesions versus normal tissue on automated breast ultrasound
Fig. 3Receiver operating characteristic (ROC) curve obtained from the texture analysis for entropy as well as ROC curves obtained with machine learning support vector machine (SVM) algorithm with full and reduced features when used to compare malignant versus benign solid breast lesions on automated breast ultrasound
Area under the curve (AUC), accuracy, sensitivity, and specificity achieved with the validation set in the classification of lesions versus normal tissue and malignant versus benign solid lesions using the full texture feature set and the reduced feature set
| Sub-dataset | AUC (95% confidence interval) | Accuracy (%) | Sensitivity (%) | Specificity (%) |
|---|---|---|---|---|
| Lesions | ||||
| Full feature set | 0.96 (0.89–0.98) | 93.3 | 92.6 | 94.1 |
| Reduced feature set | 0.98 (0.92–0.99) | 94.4 | 96.3 | 92.1 |
| Malignant | ||||
| Full feature set | 0.98 (0.81–0.99) | 90.7 | 85.2 | 96.3 |
| Reduced feature set | 0.96 (0.90–0.99) | 87.1 | 81.5 | 92.6 |
Results in the validation set for the classification of lesions versus normal tissue and malignant versus benign solid lesions using the full texture feature and the reduced feature set
| Predicted | |||
|---|---|---|---|
| Actual | Lesions | Lesions (%) | Normal tissue (%) |
| Lesions ( | |||
| Full feature set | 50 (92.6) | 4 (7.4) | |
| Reduced feature set | 52 (96.3) | 2 (3.7) | |
| Normal tissue ( | |||
| Full feature set | 3 (5.9) | 48 (94.1) | |
| Reduced feature set | 4 (7.8) | 47 (92.2) | |
| Malignant | Malignant (%) | Benign (%) | |
| Malignant (tot = 27) | |||
| Full feature set | 23 (85.2) | 4 (14.8) | |
| Reduced feature set | 22 (81.5) | 5 (18.5) | |
| Benign (tot = 27) | |||
| Full feature set | 1 (3.7) | 26 (96.3) | |
| Reduced feature set | 2 (7.4) | 25 (92.6) | |
Fig. 4Ranked feature score in subset i (a) and in subset ii (b) (see text)
Fig. 5Lesions falsely classified as normal tissue using machine learning with the reduced feature set but correctly classified with the full feature set: invasive ductal carcinoma (maximal diameter 10 mm) in a 74-year-old patient (a) and fibroadenoma (maximal diameter 9 mm) in a 46-year-old patient (b). Lesion (a) was also falsely classified as benign in the comparison between malignant and benign lesions