| Literature DB >> 35148703 |
Redona Brahimetaj1, Inneke Willekens2, Annelien Massart2, Ramses Forsyth3, Jan Cornelis4, Johan De Mey2, Bart Jansen4,5.
Abstract
BACKGROUND: The detection of suspicious microcalcifications on mammography represents one of the earliest signs of a malignant breast tumor. Assessing microcalcifications' characteristics based on their appearance on 2D breast imaging modalities is in many cases challenging for radiologists. The aims of this study were to: (a) analyse the association of shape and texture properties of breast microcalcifications (extracted by scanning breast tissue with a high resolution 3D scanner) with malignancy, (b) evaluate microcalcifications' potential to diagnose benign/malignant patients.Entities:
Keywords: Breast Cancer; Computer aided detection and diagnosis systems; Machine learning; Microcalcifications; Radiomics; X-ray micro-CT
Mesh:
Year: 2022 PMID: 35148703 PMCID: PMC8832731 DOI: 10.1186/s12885-021-09133-4
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Patients’ clinicopathological characteristics. BI-RADS breast density assessment is expressed from A-D scaling: A (<25% glandular), B (25% - 50% glandular), C (51% - 75% glandular, D (>75% glandular). Patient reproductive history is expressed using Gravida-Para (GP) terminology (’has children’ label refers to patient with children but exact number was not specified/saved). The label ’undefined’ indicates cases for which information could not be retrieved from the hospital’ archives or the patient did not provide it
| 57.2 ±9.7 | 56.7 ±9.4 | |
| A ( | A ( | |
| B ( | B ( | |
| C ( | C ( | |
| D ( | D ( | |
| No ( | No ( | |
| Yes ( | Yes ( | |
| No ( | No ( | |
| Yes ( | Yes ( | |
| G0P0 ( | G0P0 ( | |
| G1P1 ( | G1P1 ( | |
| G2P1 ( | G2P1 ( | |
| G2P2 ( | G2P2 ( | |
| G3P1 ( | G3P3 ( | |
| G3P2 ( | G4P3 ( | |
| G3P3 ( | G6P6 ( | |
| G4P3 ( | G9P9 ( | |
| G6P6 ( | Has children ( | |
| G8P7 ( | Undefined ( | |
| Has children ( | - | |
| Undefined ( | - | |
| No ( | No ( | |
| Yes ( | Yes ( | |
| Undefined ( | Undefined ( | |
| No ( | No ( | |
| Yes ( | Yes ( | |
| Undefined ( | Undefined ( |
Number of extracted features (extracted on original images and transform domains) per each feature class (shape, first order, GLCM, GLRLM, GLSZM, GLDM, NGTDM)
| 17 | 19 | 24 | 15 | 16 | 14 | 5 | |
| 0 | 19 | 24 | 15 | 16 | 14 | 5 | |
| 0 | 418 | 528 | 330 | 352 | 308 | 110 | |
Gray Level Co-occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), Gray Level Size Zone Matrix (GLSZM), Gray Level Dependence Matrix (GLDM), Neighbouring Gray Tone Difference Matrix (NGTDM), Laplacian of Gaussian (LoG)
Results (expressed in percentage) of individual MCs classification experiments among 30 repetitions, no feature selection method applied
| MLP | 71.79 ±1.05 | 64.65 ±1.21 | 77.28 ±1.34 | 77.16 ±0.93 | 71.68 ±0.01 |
| SVM | 73.80 ±0.0 | 61.39 ±0.0 | 83.34 ±0.0 | 77.87 ±0.0 | 73.39 ±0.0 |
| AdaBoost | 75.68 ±0.0 | 61.58 ±0.0 | 86.52 ±0.0 | 77.89 ±0.0 | 75.17 ±0.0 |
Area Under the Curve (AUC), Multi Layer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM)
Results (expressed in percentage) of individual MCs classification experiments among 30 repetitions, RFE feature selection method applied
| MLP | 75.46 ±0.65 | 63.44 ±1.20 | 84.70 ±0.74 | 80.57 ±0.58 | 75.08 ±0.01 | 300 |
| SVM | 75.74 ±0.0 | 60.93 ±0.0 | 87.12 ±0.0 | 78.24 ±0.0 | 75.17 ±0.0 | 80 |
| AdaBoost | 76.42 ±0.0 | 63.09 ±0.0 | 86.67 ±0.0 | 77.40 ±0.0 | 75.97 ±0.0 | 300 |
Area Under the Curve (AUC), Multi Layer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM)
Fig. 1ROC curves and AUC values corresponding to experimental results reported in Tables 3, 4. The green points represent the decision threshold for the reported results in the corresponding tables
Sample classification, thresholding approach results (expressed in percentage), no feature selection
| 50% | MLP | 78.72 ±41.15 | 70.59 | 88.37 | 78.67 |
| 45% | SVM | 78.72 ±41.15 | 74.51 | 83.72 | 78.75 |
| 35% | AdaBoost | 79.79 ±40.37 | 72.55 | 88.37 | 79.77 |
| 35% | MLP | 78.72 ±41.15 | 80.39 | 76.74 | 78.72 |
| 30% | SVM | 78.72 ±41.15 | 76.47 | 81.36 | 78.76 |
| 30% | AdaBoost | 77.66 ±41.88 | 74.51 | 81.40 | 77.70 |
| 25% | RF | 80.85 ±39.56 | 76.47 | 86.04 | 80.88 |
| 25% | AdaBoost | 78.72 ±41.15 | 80.39 | 76.74 | 78.72 |
| 20% | RF | 78.72 ±41.15 | 76.47 | 81.39 | 78.76 |
| 15% | RF | 78.72 ±41.15 | 82.35 | 74.41 | 78.67 |
| 10% | RF | 75.53 ±43.22 | 86.27 | 62.79 | 75.09 |
| 10% | AdaBoost | 71.28 ±45.49 | 92.16 | 46.51 | 69.46 |
| 5% | RF | 74.47 ±43.84 | 96.07 | 48.83 | 72.69 |
Multi Layer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM)
Sample classification, thresholding approach results (expressed in percentage), RFE feature selection
| 50% | MLP | 76.6 ±42.57 | 64.70 | 90.69 | 76.37 |
| 45% | MLP | 77.66 ±41.88 | 68.62 | 88.37 | 77.58 |
| 40% | AdaBoost | 79.79 ±40.37 | 70.59 | 90.70 | 79.71 |
| 35% | AdaBoost | 80.85 ±39.56 | 74.51 | 88.37 | 80.85 |
| 35% | SVM | 78.72 ±41.15 | 70.58 | 88.37 | 78.67 |
| 30% | AdaBoost | 80.85 ±39.56 | 78.43 | 83.72 | 80.89 |
| 30% | SVM | 78.72 ±41.15 | 72.55 | 86.05 | 78.72 |
| 25% | RF | 78.72 ±41.15 | 76.47 | 81.39 | 78.76 |
| 20% | AdaBoost | 80.85 ±39.56 | 88.24 | 72.09 | 80.66 |
| 15% | RF | 77.66 ±41.88 | 82.35 | 72.09 | 77.57 |
| 10% | RF | 75.53 ±43.22 | 86.27 | 62.79 | 75.09 |
| 10% | SVM | 74.47 ±43.84 | 88.24 | 58.14 | 73.74 |
| 5% | RF | 73.4 ±44.42 | 92.15 | 51.16 | 72.03 |
Multi Layer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM)
Sample classification, multiple instance-learning algorithms results (expressed in percentage)
| 150 | ||||||
| STK | 65.95 | 82.35 | 46.51 | 72.87 | 64.70 | 150 |
| sMIL | 73.40 | 78.43 | 67.44 | 79.66 | 73.30 | 50 |
| MISVM | 73.40 | 62.75 | 86.04 | 70.59 | 73.21 | 150 |
| miSVM | 72.34 | 64.70 | 81.39 | 70.63 | 72.28 | 150 |
| MissSVM | 72.34 | 58.82 | 88.37 | 74.33 | 71.93 | 30 |
Area Under the Curve (AUC), Normalized set kernel (NSK), Statistics kernel (STK), Sparse multiple instance learning (sMIL), Maximum bag margin support vector machine (MISVM), Maximum pattern margin support vector machine (miSVM), Multi instance learning by semi-supervised support vector machine (MissSVM)
Fig. 2ROC curves and AUC values, multiple instance-learning algorithms. The green points represent the decision threshold for the reported results in the Table 7