| Literature DB >> 30182055 |
Heather D Couture1, Lindsay A Williams2, Joseph Geradts3, Sarah J Nyante4, Ebonee N Butler2, J S Marron5,6, Charles M Perou5,7, Melissa A Troester2,5, Marc Niethammer1,8.
Abstract
RNA-based, multi-gene molecular assays are available and widely used for patients with ER-positive/HER2-negative breast cancers. However, RNA-based genomic tests can be costly and are not available in many countries. Methods for inferring molecular subtype from histologic images may identify patients most likely to benefit from further genomic testing. To identify patients who could benefit from molecular testing based on H&E stained histologic images, we developed an image analysis approach using deep learning. A training set of 571 breast tumors was used to create image-based classifiers for tumor grade, ER status, PAM50 intrinsic subtype, histologic subtype, and risk of recurrence score (ROR-PT). The resulting classifiers were applied to an independent test set (n = 288), and accuracy, sensitivity, and specificity of each was assessed on the test set. Histologic image analysis with deep learning distinguished low-intermediate vs. high tumor grade (82% accuracy), ER status (84% accuracy), Basal-like vs. non-Basal-like (77% accuracy), Ductal vs. Lobular (94% accuracy), and high vs. low-medium ROR-PT score (75% accuracy). Sampling considerations in the training set minimized bias in the test set. Incorrect classification of ER status was significantly more common for Luminal B tumors. These data provide proof of principle that molecular marker status, including a critical clinical biomarker (i.e., ER status), can be predicted with accuracy >75% based on H&E features. Image-based methods could be promising for identifying patients with a greater need for further genomic testing, or in place of classically scored variables typically accomplished using human-based scoring.Entities:
Year: 2018 PMID: 30182055 PMCID: PMC6120869 DOI: 10.1038/s41523-018-0079-1
Source DB: PubMed Journal: NPJ Breast Cancer ISSN: 2374-4677
Patient and tumor characteristics for the image analysis training and test set, CBCS3
| Training set | Test set | Chi-square | |
|---|---|---|---|
| Age | |||
| ≤50 years | 280 (29.6) | 133 (28.0) | 0.64 |
| >50 years | 291 (70.4) | 155 (72.0) | |
| Race | |||
| White | 298 (79.0) | 150 (78.7) | 0.90 |
| African-American | 272 (21.0) | 138 (21.3) | |
| Missing | 1 | ||
| Grade | |||
| Low-intermediate | 330 (65.8) | 162 (66.5) | 0.85 |
| High | 240 (34.2) | 125 (33.5) | |
| Missing | 1 | 1 | |
| Stage | |||
| I, II | 485 (86.4) | 259 (90.2) | 0.17 |
| III, IV | 85 (13.6) | 29 (9.8) | |
| Missing | 1 | ||
| Node status | |||
| Negative | 354 (65.2) | 191 (69.1) | 0.35 |
| Positive | 214 (34.8) | 97 (30.9) | |
| Missing | 3 | ||
| Tumor size | |||
| ≤2 cm | 334 (62.5) | 174 (67.2) | 0.26 |
| >2 cm | 235 (37.5) | 114 (32.8) | |
| Missing | 2 | ||
| ER status | |||
| Negative | 164 (24.9) | 91 (23.1) | 0.62 |
| Positive | 405 (75.1) | 197 (76.9) | |
| Missing | 2 | ||
| PAM50 subtype | |||
| Luminal A | 149 (46.1) | 74 (47.1) | 0.27 |
| Luminal B | 78 (18.2) | 33 (20.9) | |
| Basal-like | 92 (20.9) | 49 (21.6) | |
| HER2 | 46 (11.9) | 15 (5.9) | |
| Normal-like | 9 (2.9) | 9 (4.5) | |
| Missing | 197 | 108 | |
aAll percentages weighted for sampling design
Fig. 1a. Histogram for probability of high-grade tumor by image analysis according to proportion of pathologist-classified low-intermediate (black) or high grade (red) in the test set. The cut point of >0.80 was selected. b. Bee Swarm plot displaying pathologist classification of tumor grade as a function of the image grade score in the test set. Points within each grade group are adjusted horizontally to avoid overlap. The black dots indicate image analysis classified low-intermediate tumor grade and the red dots indicate image analysis classified high-grade tumors
Agreement between pathologists and between pathologists and image analysis in the test set for low-intermediate grade and high-grade tumors, CBCS3
| Pathologist agreement on tumor grade classificationa ( | Image analysis agreement with pathologist tumor grade classificationb ( | ||||
|---|---|---|---|---|---|
| Pathologist 2 | Clinical grade | ||||
| Pathologist 1 | Low-intermediate grade | High grade | Patient average grade | Low-intermediate grade | High grade |
| Low-intermediate grade | 113 | 23 | Low-intermediate grade | 118 | 8 |
| High grade | 4 | 102 | High grade | 45 | 117 |
| % Agreement | 89 | % Agreement | 82 | ||
| 0.78 (0.70–0.86) | 0.64 (0.55-0.72) | ||||
aTo assess agreement between two pathologists, patients were sampled from CBCS Phases 1, 2, and 3 for second pathology review
bTo assess agreement between image analysis and a pathologist, only samples with digital image data (CBCS3 only) were included
Impact of weighting by grade on accuracy, sensitivity, and specificity of ER status[1] in the test set, CBCS3
| Unweighted | Grade-trained | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IHC ER status | IHC ER status | |||||||||||
| Image analysis | Negative | Positive | Sensitivity (%) | Specificity (%) | Accuracy (%) | Negative | Positive | Sensitivity (%) | Specificity (%) | Accuracy (%) | ||
| Overall | ||||||||||||
| ER negative | 260 | 80 | 88 | 76 | 84 | 0.64 (0.59–0.69) | 246 | 104 | 84 | 72 | 80 | 0.55 (0.50–0.61) |
| ER positive | 83 | 572 | 97 | 548 | ||||||||
| Low-intermediate grade | ||||||||||||
| ER negative | 21 | 24 | 95 | 46 | 91 | 0.41 (0.28–0.55) | 28 | 69 | 86 | 61 | 84 | 0.31 (0.21–0.42) |
| ER positive | 25 | 467 | 18 | 422 | ||||||||
| High grade | ||||||||||||
| ER negative | 239 | 46 | 69 | 80 | 77 | 0.49 (0.40-.57) | 218 | 35 | 78 | 73 | 75 | 0.48 (0.44–0.56) |
| ER positive | 58 | 104 | 79 | 125 | ||||||||
aNumbers represent individual cores (n = 995) from 288 patients, with up to four cores per patient; H&E cores were excluded if missing IHC data (n = 11)
Accuracy, sensitivity, and specificity of non-Basal-like intrinsic subtype, ROR-PT, and histologic subtype based on image analysisa in the test set, CBCS3
| Image analysis |
| |||||
|---|---|---|---|---|---|---|
| Basal-like | Non-Basal-like | Sensitivity (%) | Specificity (%) | Accuracy (%) | ||
| Overall | ||||||
| Basal-like | 131 | 101 | 78 | 73 | 77 | 0.47 (0.32–0.54) |
| Non-Basal-like | 48 | 368 | ||||
| Low-intermediate grade | ||||||
| Basal-like | 11 | 41 | 86 | 73 | 85 | 0.27 (0.13–0.41) |
| Non-Basal-like | 4 | 245 | ||||
| High grade | ||||||
| Basal-like | 120 | 60 | 67 | 73 | 70 | 0.40 (0.31–0.50) |
| Non-Basal-like | 44 | 123 | ||||
|
| ||||||
| Low-Med | High | Sensitivity (%) | Specificity (%) | Accuracy (%) | ||
| Overall | ||||||
| Low-Med | 342 | 40 | 79 | 74 | 76 | 0.47 (0.40–0.54) |
| High | 118 | 148 | ||||
| Low-intermediate grade | ||||||
| Low-med | 245 | 16 | 47 | 90 | 86 | 0.32 (0.17–0.48) |
| High | 26 | 14 | ||||
| High grade | ||||||
| Low-med | 97 | 24 | 85 | 51 | 67 | 0.35 (0.26–0.44) |
| High | 92 | 134 | ||||
|
| ||||||
| Ductal | Lobular | Sensitivity (%) | Specificity (%) | Accuracy (%) | ||
| Overall | ||||||
| Ductal | 710 | 24 | 71 | 96 | 94 | 0.66 (0.57–0.74) |
| Lobular | 28 | 58 | ||||
| Low-intermediate grade | ||||||
| Ductal | 268 | 24 | 71 | 94 | 89 | 0.63 (0.53–0.73) |
| Lobular | 23 | 58 | ||||
| High grade | ||||||
| Ductal | 442 | 0 | N/A | 99 | 99 | N/A |
| Lobular | 5 | 0 | ||||
aNumbers represent individual cores from patients where 1–4 cores were available. Cores were excluded if RNA data (n = 358) was missing
bOne-hundred eighty patients with 648 cores for intrinsic subtype and ROR-PT
cTwo-hundred thirty-three patients with 820 cores for histologic subtype
Fig. 2Four H&E cores from a single patient and heat maps indicating the class predictions over different regions of the image. Class probabilities are indicated by the intensity of red/blue color with greater intensity for higher probabilities. Uncertainty in the prediction is indicated by white. This patient was labeled as high grade, ER negative, Basal-like intrinsic subtype, ductal histologic subtype, and high ROR