| Literature DB >> 31992350 |
Mustafa I Jaber1, Bing Song2, Clive Taylor3, Charles J Vaske4, Stephen C Benz4, Shahrooz Rabizadeh1,2, Patrick Soon-Shiong2, Christopher W Szeto5.
Abstract
BACKGROUND: Breast cancer intrinsic molecular subtype (IMS) as classified by the expression-based PAM50 assay is considered a strong prognostic feature, even when controlled for by standard clinicopathological features such as age, grade, and nodal status, yet the molecular testing required to elucidate these subtypes is not routinely performed. Furthermore, when such bulk assays as RNA sequencing are performed, intratumoral heterogeneity that may affect prognosis and therapeutic decision-making can be missed.Entities:
Keywords: Breast cancer; Deep learning algorithm; Intrinsic molecular subtype (IMS); Whole-slide imaging (WSI)
Year: 2020 PMID: 31992350 PMCID: PMC6988279 DOI: 10.1186/s13058-020-1248-3
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Fig. 1Proposed WSI-based IMS classifier and heterogeneity detection system. WSIs are broken into multiscale 400px × 400px patches and converted to descriptive tensors using the Inception v3 neural net architecture. A subset of cancer-enriched patches is selected to summarize WSI tumor content. Each patch is assigned a subtype in a 4-way classifier (Basal-like, HER2-enriched, Luminal A, and Luminal B). WSI-based subtype classifications can be made by employing a voting mechanism upon the patch-based results. Heterogeneity analysis is further performed on WSIs displaying significant concurrent Basal-like and Luminal A image-based predictions
Molecular subtyping accuracy across folds. Sample size and performance statistics within the held-out test set across fivefold cross-validation
| No. of patches | No. of WSIs | No. of patients | Patch-level accuracy (%) | WSI-level accuracy (%) | Patient-level accuracy (%) | |
|---|---|---|---|---|---|---|
| Fold1 | 7505 | 95 | 92 | 60.47 | 70.53 | 71.74 |
| Fold2 | 7501 | 94 | 89 | 56.97 | 67.02 | 67.42 |
| Fold3 | 7581 | 95 | 88 | 57 | 67.37 | 69.32 |
| Fold4 | 7564 | 95 | 86 | 59.56 | 65.26 | 65.12 |
| Fold5 | 7420 | 93 | 88 | 59.1 | 60.22 | 62.5 |
| Average | 7514.2 | 94.4 | 88.6 | 58.62 | 66.1 | 67.27 |
Molecular subtyping error and accuracy in two test settings. Confusion matrices between true labels (RNA-seq-based IMS in columns) and predicted labels (WSI-based IMS in rows) at the patient-level for unselected (left) and low-confidence (right) by RNA-seq-based classification
| Unselected test patients ( | Low-confidence test patients ( | |||||||
|---|---|---|---|---|---|---|---|---|
| Basal-like | HER2-enriched | Luminal A | Luminal B | Basal-like | HER2-enriched | Luminal A | Luminal B | |
| Basal-like | 13 | 0.9 | 0.9 | 0 | 0.96 | 0 | 0 | 0 |
| HER2-enriched | 4.48 | 1.79 | 2.24 | 0.45 | 0.96 | 0 | 0 | 0 |
| Luminal A | 1.35 | 1.79 | 46.19 | 2.69 | 4.81 | 1.92 | 43.27 | 1.92 |
| Luminal B | 6.73 | 0.9 | 11.66 | 4.93 | 6.73 | 2.88 | 24.04 | 12.5 |
| Patient-level accuracy 65.92% | Patient-level accuracy 56.73% | |||||||
Fig. 2Subtyping cancer-enriched multiscale patches. Four examples of patch-level subtype classifications: a Basal-like, b HER2-enriched, c Luminal A, and d Luminal B. Below each WSI are 4 example multiscale patch representations from the 80 selected. The bottom table shows the percentages for each predicted subtype within the selected cancer-rich multiscale patches
Fig. 3WSI-based IMS vs. RNA-seq-based molecular PAM50. a Kaplan-Meier curves for Luminal A and Basal-like based on molecular PAM50 calls with HR = 1.25 and log-rank tests p = 0.39 (n = 533). b Kaplan-Meier curves for Luminal A and Basal-like based on WSI-IMS calls with HR = 1.59 and log-rank tests p = 0.06 (n = 488). c All the cases analyzed were molecularly classified as LumA, but the WSI-based system classified some of these (n = 31) as Basal (yellow); the expression levels of ESR1 and PGR for cases WSI-subtyped as Basal were lower compared to confirmed LumA (blue). d Conversely, the receptor levels of molecularly subtyped Basal cases WSI-subtyped to be LumA (n = 15) are higher than confirmed Basal cases
Fig. 4Evidence for heterogeneity. a An example of a HET WSI with markup on patches predicted as Basal-like and LumA. b Expression levels of key hormone receptors ESR1 and PGR in the three settings. Mann-Whitney U p values of being drawn from the same distribution are reported for each pair of settings. Inputs are BasalIMG, HET, and LumAIMG cohorts as defined by the WSI-based IMS system. c Kaplan-Meier curves for BasalIMG, HET, and LumAIMG cohorts show HET survival to be intermediate between the other two. Cox proportional hazard test is included