| Literature DB >> 35341421 |
Alexandros Vamvakas1, Dimitra Tsivaka1, Andreas Logothetis2, Katerina Vassiou3, Ioannis Tsougos1.
Abstract
Introduction: This study aims to assess the utility of Boosting ensemble classification methods for increasing the diagnostic performance of multiparametric Magnetic Resonance Imaging (mpMRI) radiomic models, in differentiating benign and malignant breast lesions.Entities:
Keywords: AdaBoost; Boruta; Gradient Boosting; LightGBM; Radiomics; XGBoost
Mesh:
Substances:
Year: 2022 PMID: 35341421 PMCID: PMC8966070 DOI: 10.1177/15330338221087828
Source DB: PubMed Journal: Technol Cancer Res Treat ISSN: 1533-0338
Figure 1.The radiomic analysis workflow.
Figure 2.Flow diagram of the study participants selection.
Demographic and Clinical Characteristics of the Patient Sample.
| Benign | Malignant | |
|---|---|---|
|
| 70 females (50%) | 70 females (50%) |
|
| 70 (50%) | 70 (50%) |
|
| 44.6 ± 11.8 | 57.4 ± 12.5 |
|
| 1.8 ± 1.6 | 4.0 ± 2.4 |
|
| ||
| FA | 70 (100%) | |
| IDC | 53 (76%) | |
| ILC | 12 (17%) | |
| DCIS | 4 (6%) | |
| LCIS | 1 (1%) | |
|
| ||
| I | 10 (14%) | |
| II | 39 (56%) | |
| III | 21 (30%) | |
|
| ||
| 2 | 48 (69%) | |
| 3 | 19 (27%) | |
| 4 | 3 (4%) | 14 (20%) |
| 5 | 56 (80%) |
FA, Fibroadenoma; IDC, Invasive Ductal Carcinoma; ILC, Invasive Lobular Carcinoma; DCIS, Ductal Carcinoma In-Situ; LCIS, Lobular Carcinoma In-Situ
Boruta Selected Features.
| Index | Feature name | Importance (mean ± std) | Index | Feature name | Importance (mean ± std) |
|---|---|---|---|---|---|
| 0 | original_shape_LeastAxisLength | 1.43 ± 0.76 | 16 | DCE_original_firstorder_Skewness | 0.72 ± 0.35 |
| 1 | original_shape_Maximum2DDiameterColumn | 2.14 ± 0.80 | 17 | DCE_original_firstorder_TotalEnergy | 1.81 ± 0.70 |
| 2 | original_shape_Maximum2DDiameterRow | 1.52 ± 0.82 | 18 | DCE_original_gldm_DependenceNonUniformity | 1.06 ± 0.58 |
| 3 | original_shape_Maximum2DDiameterSlice | 2.22 ± 1.10 | 19 | DCE_original_glrlm_GrayLevelNonUniformity | 0.70 ± 0.44 |
| 4 | original_shape_Maximum3DDiameter | 0.94 ± 0.54 | 20 | DCE_original_glrlm_RunLengthNonUniformity | 1.35 ± 0.60 |
| 5 | original_shape_MeshVolume | 1.65 ± 0.63 | 21 | DCE_original_glszm_GrayLevelNonUniformity | 1.12 ± 0.48 |
| 6 | original_shape_MinorAxisLength | 1.38 ± 0.58 | 22 | DCE_original_glszm_SizeZoneNonUniformity | 1.55 ± 0.49 |
| 7 | original_shape_Sphericity | 2.46 ± 0.72 | 23 | ADC_original_firstorder_10Percentile | 4.91 ± 1.75 |
| 8 | original_shape_SurfaceArea | 3.51 ± 1.37 | 24 | ADC_original_firstorder_90Percentile | 5.36 ± 1.58 |
| 9 | original_shape_VoxelVolume | 1.50 ± 0.58 | 25 | ADC_original_firstorder_Maximum | 2.74 ± 0.87 |
| 10 | DCE_original_firstorder_90Percentile | 0.82 ± 0.36 | 26 | ADC_original_firstorder_Mean | 6.65 ± 2.18 |
| 11 | DCE_original_firstorder_Energy | 1.33 ± 0.61 | 27 | ADC_original_firstorder_Median | 6.22 ± 1.59 |
| 12 | DCE_original_firstorder_Maximum | 0.89 ± 0.40 | 28 | ADC_original_firstorder_Minimum | 3.86 ± 1.41 |
| 13 | DCE_original_firstorder_Mean | 0.94 ± 0.44 | 29 | ADC_original_firstorder_RootMeanSquared | 7.10 ± 1.98 |
| 14 | DCE_original_firstorder_Median | 0.90 ± 0.37 | 30 | ADC_original_firstorder_TotalEnergy | 1.26 ± 0.59 |
| 15 | DCE_original_firstorder_RootMeanSquared | 0.87 ± 0.43 | 31 | ADC_original_ngtdm_Busyness | 0.67 ± 0.45 |
Figure 3.Feature importance in terms of z-score distributions of RF permutations for the highly important feature subset nominated by Boruta.
Figure 4.The Hierarchical Clustering dendrogram (a) illustrating clusters arrangement as informed by the correlation plot (b) of Boruta selected features.
Classification Model Results (Mean [95% Confidence Intervals]).
| XGBoost | LGBM | AdaBoost | GB | SVM | |
|---|---|---|---|---|---|
| Acc | 0.88 [0.84-0.92] | 0.87 [0.83-0.91] | 0.83 [0.80-0.86] | 0.83 [0.80-0.86] | 0.84 [0.80-0.88] |
| Se | 0.91 [0.85-0.97] | 0.90 [0.84-0.96] | 0.83 [0.78-0.88] | 0.82 [0.77-0.87] | 0.80 [0.77-0.88] |
| Sp | 0.90 [0.82-0.98] | 0.89 [0.81-0.97] | 0.82 [0.75-0.89] | 0.80 [0.73-0.87] | 0.79 [0.70-0.88] |
| AUC | 0.95 [0.91-0.99] | 0.94 [0.90-0.98] | 0.90 [0.87-0.93] | 0.89 [0.86-0.92] | 0.88 [0.84-0.92] |
P-values of the Pairwise Statistical Comparisons of the Classification Models AUC Values Derived From DeLong's Test.
| XGBoost | LGBM | AdaBoost | GB | SVM | |
|---|---|---|---|---|---|
| XGBoost | 0.77 | 0.029 | 0.022 | 0.017 | |
| LGBM | 0.032 | 0.026 | 0.020 | ||
| AdaBoost | 0.93 | 0.71 | |||
| GB | 0.81 | ||||
| SVM |
Figure 5.Receiver operating characteristic (ROC) curves of the classification models.