| Literature DB >> 34845230 |
Yasha Ektefaie1, William Yuan1, Deborah A Dillon2, Nancy U Lin3, Jeffrey A Golden4,5, Isaac S Kohane1, Kun-Hsing Yu6,7.
Abstract
Histopathologic evaluation of biopsy slides is a critical step in diagnosing and subtyping breast cancers. However, the connections between histology and multi-omics status have never been systematically explored or interpreted. We developed weakly supervised deep learning models over hematoxylin-and-eosin-stained slides to examine the relations between visual morphological signal, clinical subtyping, gene expression, and mutation status in breast cancer. We first designed fully automated models for tumor detection and pathology subtype classification, with the results validated in independent cohorts (area under the receiver operating characteristic curve ≥ 0.950). Using only visual information, our models achieved strong predictive performance in estrogen/progesterone/HER2 receptor status, PAM50 status, and TP53 mutation status. We demonstrated that these models learned lymphocyte-specific morphological signals to identify estrogen receptor status. Examination of the PAM50 cohort revealed a subset of PAM50 genes whose expression reflects cancer morphology. This work demonstrates the utility of deep learning-based image models in both clinical and research regimes, through its ability to uncover connections between visual morphology and genetic statuses.Entities:
Year: 2021 PMID: 34845230 PMCID: PMC8630188 DOI: 10.1038/s41523-021-00357-y
Source DB: PubMed Journal: NPJ Breast Cancer ISSN: 2374-4677
Demographics summary of TCGA, Sunnybrook, University of Pennsylvania and the Cancer Institute of New Jersey (UPenn and CINJ) cohorts.
| TCGA | Sunnybrook | UPenn and CINJ | |
|---|---|---|---|
| Total number of patients | 1099 | 54 | 162 |
| Total number of slides | 1983 | 96 | 279 |
| Total number of image tiles | 395116 | 19200 | 277524 |
| Average age at diagnosis | 59.49 | 51.23 | |
| Stdev age at diagnosis | 13.22 | 12.33 | |
| % Post menopause | 76.05 | 37.50 | |
| % ER+ | 77.79 | 64.29 | |
| % PR+ | 74.52 | 53.57 | |
| % HER2+ | 22.89 | 24.44 | |
| % Lobular (vs Ductal) | 21.58 | 8.93 | 0 |
Validation ROC-AUC (binary tasks)/accuracy (non-binary tasks) values for all image-based classifiers.
| Image classifier validation set results | ||
|---|---|---|
| Patient-level ROC-AUC/accuracy (95% CI) | Tile-level ROC-AUC/accuracy (95% CI) | |
Tumor vs. normal held-out test set | 0.985 (0.968–0.995) | 0.921 (0.919–0.924) |
| Independent validation set | 0.950 (0.935–0.964) | 0.859 (0.857–0.860) |
Histological subtype held-out test set | 0.920 (0.845–0.993) | 0.800 (0.790–0.803) |
| Independent validation set | 0.996 (0.984–1.0) | 0.843 (0.836–0.852) |
| Estrogen receptor (ER) status +/− | 0.982 (0.97–0.994) | 0.929 (0.912–0.946) |
| Progesterone receptor (PR) status +/− | 0.983 (0.977–0.989) | 0.908 (0.90–0.916) |
| HER2 Receptor status +/− | 0.979 (0.971–0.981) | 0.829 (0.80–0.858) |
| PAM50 Status (4 Class, Top-1) | 0.654 (0.636–0.672) | 0.406 (0.361–0.451) |
| PAM50 Status (4 Class, Top-2) | 0.790 (0.763–0.817) | 0.609 (0.596–0.622) |
| TP53 Mutation status | 0.833 (0.829–0.837) | 0.658 (0.634–0.682) |
Comparison of average image-classifier confidence for ER/PR status tasks given concordant (ER+/PR+ or ER−/PR−) or discordant (ER+/PR− or ER−/PR+) patients.
| ER+ (Patient count) | ER− (Patient count) | |
|---|---|---|
| Concordant | 0.775 (118) | 0.250 (54) |
| Discordant | 0.784 (14) | 0.101 (1) |
| KS test | 0.758 | 0.255 |
| PR+ (Patient count) | PR− (Patient count) | |
| Concordant | 0.728 (118) | 0.255 (54) |
| Discordant | 0.803 (1) | 0.254 (14) |
| KS test | 0.706 | 0.784 |
Fig. 2Lymphocyte infiltration patterns distinguished progesterone receptor and estrogen receptor statuses.
The distributions of visible lymphocytes for progesterone receptor (left) status + /− and estrogen receptor (right) status +/− are shown.
Fig. 1Associations between lymphocytes and convolutional image filters.
A H&E stained image with annotated lymphocytes, B lymphocyte mask, C thresholded convolutional activations to 2A, D comparison of filter colocalization with lymphocyte and nuclei masks, the line represents equality.
The distinction between receptor status positive/negative patients: comparative performance of image/gene expression-based measurements of immune infiltration for separating receptor status.
| KS Test | KS Test | Bootstrap Δmean | Bootstrap Δmean | |
|---|---|---|---|---|
| Visible lymphocyte signal | <2.2E−16 | 4.00E−15 | <1E−05 | <1E−05 |
| ESTIMATE Immune infiltration (Yoshihara, et al.) | 5.36E−08 | 2.14E−03 | <1E−05 | 3.40E−03 |
| TIMER Immune infiltration (Li, et al.) | 3.75E−07 | 1.40E−02 | <1E−05 | 1.43E−02 |
| Pathologic stage (discontinuous distribution) | n/a | n/a | 0.2509 | 0.981 |