| Literature DB >> 35655690 |
Laila Akhouayri1,2,3, Meriem Regragui4, Samira Benayad4, Nisrine Bennani Guebessi4, Farida Marnissi4, Giovanna Chiorino2, Mehdi Karkouri1,4.
Abstract
Introduction: breast cancer (BC) is a malignancy with very high incidence and mortality in Africa, especially in Western Africa, where more than 25 thousand deaths are registered every year. Not all BC have the same prognosis, and being able to personalize treatment and predict aggressiveness is of crucial importance. The purpose of our study is to explore further subdivisions associated with prognosis, beyond breast cancer molecular classification that is routinely established in pathology departments.Entities:
Keywords: Ki-67; clustering; estrogen; immunohistochemistry; progesterone; prognosis
Mesh:
Substances:
Year: 2022 PMID: 35655690 PMCID: PMC9120749 DOI: 10.11604/pamj.2022.41.170.31239
Source DB: PubMed Journal: Pan Afr Med J
Figure 1estimation maximization clustering outcome on the Moroccan dataset
Ki-67 distributions in cluster 1 and cluster 2 according to ER/PgR/HER2 status
| ER-PgR- | HER2 0+ C1 | HER2 0+ C2 | HER2 1+ C1 | HER2 1+ C2 | HER2 3+ C1 | HER2+ 3+ C2 |
|---|---|---|---|---|---|---|
| Patients (%) | 40.6 | 59.3 | 52.3 | 47.6 | 23.4 | 76.5 |
| Minimal-maximal values | 0-40 | 45-100 | 1-50 | 60-90 | 0-70 | 75-100 |
| Mean Ki-67 | 16.4 | 73 | 20.4 | 71.7 | 42.3 | 92.5 |
| Standard deviation | 13 | 15.4 | 16.3 | 8.2 | 21.4 | 9.2 |
| Total | 155 | 42 | 161 | |||
|
|
|
|
|
|
|
|
| ER+PgR- | HER2 0+ C1 | HER2 0+ C2 | HER2 1+ C1 | HER2 1+ C2 | HER2 3+ C1 | HER2 3+ C2 |
| Patients (%) | 83.3% | 16.6% | 88.8% | 11.1% | 77 | 23 |
| Minimal-maximal values | 0-40 | 50-90 | 0-40 | 60-80 | 0-65 | 100-100 |
| Mean Ki-67 | 16.86 | 68.33 | 14.7 | 70 | 26.3 | 100 |
| Standard deviation | 13.32 | 15.7 | 13.7 | 14 | 20.2 | 0 |
| Total | 36 | 18 | 39 | |||
|
|
|
|
|
|
|
|
| ER+PgR+ | HER 2 0+ C1 | HER2 0+ C2 | HER2 1+ C1 | HER2 1+ C2 | HER2 3+ C1 | HER2 3+ C2 |
| EM clusters | C1 | C2 | C1 | C2 | C1 | C2 |
| Patients (%) | 85.8 | 14.1 | 90.3 | 9.7 | 79 | 21 |
| Minimal-maximal values | 0-20 | 25-90 | 0-30 | 40-100 | 0-40 | 45-100 |
| Mean Ki-67 | 9.5 | 45 | 14.8 | 51.7 | 17.5 | 71.1 |
| Standard deviation | 6.3 | 18 | 9.1 | 15.1 | 12.5 | 16.4 |
| Total | 340 | 165 | 171 | |||
| Molecular subgroup | Luminal A/B HER2- | Luminal A/B HER2- | Luminal A/B HER2- | Luminal A/B HER2- | Luminal B HER2+ | Luminal B HER2+ |
ER: estrogen receptor; PgR: progesteron receptor; HER2: human epidermal growth factor receptor 2; EM: estimation-maximisation; C1: cluster1; C2: cluster2 (For convenience and clarity of the clutter-free table, the luminal A HER2- subgroup has been combined in the same cell as the Luminal B HER2- subgroup because they both form the luminal type)
evaluation summary of eight prediction algorithms for clusters membership prediction
| Prediction algorithms | Accuracy (%) | AUC (%) | Precision (%) | Recall (%) | F- measure (%) | Sensitivity (%) | Specificity (%) | Classification error (%) | Scoring time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| NB | 80±2.4 | 80 | 81.4 | 91.9 | 86.2 | 91.9 | 54.1 | 20 | 190 |
| GLM | 80.8±2.2 | 83.1 | 79.2 | 97.7 | 87.4 | 97.7 | 44.9 | 19.2 | 246 |
| FLM | 80.5±3.4 | 78.4 | 85.3 | 86.4 | 85.8 | 86.4 | 68.1 | 19.5 | 159 |
| DL | 79±1.7 | 78.6 | 78.4 | 96.0 | 86.2 | 96 | 41.8 | 21 | 429 |
| RF | 80±4.3 | 80.3 | 82.9 | 89.1 | 85.9 | 89.1 | 60.4 | 20 | 657 |
| GBT | 81.4±2.5 | 81.1 | 82.7 | 92.4 | 87.2 | 92.4 | 57.5 | 18.6 | 1000 |
| SVM | 68.4±0.7 | 52.5 | 68.4 | 100.0 | 81.2 | 100 | 0 | 31.6 | 2000 |
| DT | 81.4±3 | 81.6 | 80.9 | 95.5 | 87.6 | 95.5 | 51.1 | 18.6 | 166 |
Rows: prediction algorithms; NB: naive bayes; GLM: generalised linear model; FLM: fast large margin; DL: deep learning; DT: decision tree; RF: random forest; GBT: gradient boosted trees; SVM: support vector machine; evaluation metrics (columns): accuracy, area under the curve receiver operating characteristic (ROC-AUC); precision, recall, F-measure, sensitivity, specificity, classification error and total running time
Figure 2five (5) years overall survival analysis for Moroccan dataset patients
Figure 3comparing minimal depth and vimp rankings
summary of clustering methods and internal/stability measures for optimal number of clusters calculation in TCGA-BRCA dataset
| Clustering method | Internal measures | 2 clusters | 3 clusters | 4 clusters | 5 clusters |
|---|---|---|---|---|---|
| Hierarchical | Connectivity | 2.9290 | 10.4544 | 24.9115 | 28.7694 |
| Dunn | 0.2603 | 0.0933 | 0.0915 | 0.0915 | |
| Silhouette | 0.463 | 0.2450 | 0.2619 | 0.2328 | |
| K-means | Connectivity | 62.1687 | 152.5885 | 133.0683 | 149.8202 |
| Dunn | 0.0349 | 0.0183 | 0.0315 | 0.0335 | |
| Silhouette | 0.4114 | 0.2430 | 0.3040 | 0.3036 | |
| PAM | Connectivity | 63.6940 | 69.7341 | 161.0004 | 194.0429 |
| Dunn | 0.0437 | 0.0418 | 0.0214 | 0.0328 | |
| Silhouette | 0.4163 | 0.4331 | 0.2987 | 0.2888 | |
| Stability measures | APN | AD | ADM | FOM | |
| Score | 0.0808 | 1.6819 | 0.3595 | 0.8451 | |
| Clustering method | PAM | K-means | PAM | K-means | |
| Optimal number of clusters | 2 | 5 | 2 | 5 |
PAM: partition around medoids; APN: average proportion of non-overlap; AD: average distance; ADM: average distance between means; FOM: figure of merit