| Literature DB >> 34580357 |
Anna Ray Laury1, Sami Blom2, Tuomas Ropponen2, Anni Virtanen3, Olli Mikael Carpén4,5,3.
Abstract
High-grade extrauterine serous carcinoma (HGSC) is an aggressive tumor with high rates of recurrence, frequent chemotherapy resistance, and overall 5-year survival of less than 50%. Beyond determining and confirming the diagnosis itself, pathologist review of histologic slides provides no prognostic or predictive information, which is in sharp contrast to almost all other carcinoma types. Deep-learning based image analysis has recently been able to predict outcome and/or identify morphology-based representations of underlying molecular alterations in other tumor types, such as colorectal carcinoma, lung carcinoma, breast carcinoma, and melanoma. Using a carefully stratified HGSC patient cohort consisting of women (n = 30) with similar presentations who experienced very different treatment responses (platinum free intervals of either ≤ 6 months or ≥ 18 months), we used whole slide images (WSI, n = 205) to train a convolutional neural network. The neural network was trained, in three steps, to identify morphologic regions (digital biomarkers) that are highly associating with one or the other treatment response group. We tested the classifier using a separate 22 slide test set, and 18/22 slides were correctly classified. We show that a neural network based approach can discriminate extremes in patient response to primary platinum-based chemotherapy with high sensitivity (73%) and specificity (91%). These proof-of-concept results are novel, because for the first time, prospective prognostic information is identified specifically within HGSC tumor morphology.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34580357 PMCID: PMC8476598 DOI: 10.1038/s41598-021-98480-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Neural network training. (a) Neural Net 1. Supervised learning of tumor segmentation. Manual semantic segmentation annotations are performed, and the neural network learns tumor segmentation. (b) Neural Net 2. Weakly supervised learning, using patient outcome group as the label. Tumor annotations from Neural Net 1 are relabeled as PFI-S or PFI-L, and the neural network learns tumor features associating with the two outcome groups. (c) Neural Net 3. Supervised learning based on annotations of digital biomarkers. The results of Neural Net 2 are visualized and confidence filtered. Features within the high-confidence masks (digital biomarkers) are reviewed and new semantic segmentation annotations of these regions are performed. The neural net learns tumor features that are strongly associating with outcome group. (d) Combined Inference Pipeline. Neural Net 1 and Neural Net 3 were combined in a single inference pipeline. The neural nets are applied to the validation test set; output is visualized and classified. Yellow slides represent WSI classified as PFI-S by the neural network, with the blue star indicating the misclassified WSI. Blue slides represent WSI classified as PFI-L by the neural network; those with yellow stars represent misclassified slides.
Clinicopathologic features summary.
| Training set | Validation test set | |||
|---|---|---|---|---|
| Short PFI (n = 17) | Long PFI (n = 13) | Short PFI (n = 11) | Long PFI (n = 11) | |
| Mean (range) | 63.6 (54–75) | 64.9 (51–73) | 61.6 (43–78) | 61.1 (50–73) |
| ≥ 64 | 7 | 8 | 4 | 5 |
| < 64 | 10 | 5 | 7 | 6 |
| IIIB | 1 | 2 | 0 | 4 |
| IIIC | 12 | 10 | 10 | 7 |
| IVB | 4 | 1 | 1 | 0 |
| R0 | 1 | 3 | 0 | 2 |
| R1 | 0 | 2 | 0 | 2 |
| R2 | 16 | 8 | 11 | 7 |
| Mean (range) | 2.8 (0–5) | 46.9 (19–149) | 3.6 (0–6) | 41.4 (18–87) |
| 2006–2012 | 2007–2013 | 2006–2011 | 2006–2013 | |
| 105 | 100 | 11 | 11 | |
| Mean per tumor (range) | 6.2 (1–13) | 7.7 (2–12) | 1 | 1 |
Morphologic classification of the training and testing sets.
| Tumor morphology | Training set | Test set | ||
|---|---|---|---|---|
| PFI-S (n = 17) | PFI-L (n = 13) | PFI-S (n = 11) | PFI-L (n = 11) | |
| Classic papillary (uniform) | 6 (0) | 4 (0) | 3(1) | 2(0) |
| Non-classic (uniform) | 11 (2) | 9 (2) | 8(1) | 9(1) |
| Classic foci, any amount | 14 | 10 | 9 | 10 |
Results and classification of validation test set replicates.
| Replicate 1 | Replicate 2 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Test set slide | Long- DBM area % | Short- DBM area % | Short/Long ratio | Outcome | AI prediction | Test set slide | Long-DBM area % | Short-DBM area % | Short/Long ratio | Outcome | AI prediction |
| T13 | 9.4 | 4.2 | 0.4 | Long | Long | ||||||
| T15 | 7.0 | 3.4 | 0.5 | Long | Long | T15 | 15.8 | 7.4 | 0.5 | Long | Long |
| T17 | 2.3 | 1.8 | 0.8 | Long | Long | T16 | 16.2 | 9.7 | 0.6 | Long | Long |
| T16 | 9.3 | 9.5 | 1.0 | Long | Long | T13 | 8.1 | 7.9 | 1.0 | Long | Long |
| T17 | 6.5 | 8.6 | 1.3 | Long | Long | ||||||
| T12 | 4.1 | 11.4 | 2.7 | Long | Long | T14 | 6.2 | 17.0 | 2.8 | Long | Long |
| T12 | 8.0 | 28.8 | 3.6 | Long | Long | ||||||
| T14 | 4.2 | 28.7 | 6.8 | Long | Long | T20 | 4.5 | 18.5 | 4.1 | Long | Long |
| T20 | 1.2 | 19.6 | 16.5 | Long | Long | ||||||
| T18 | 0.9 | 16.9 | 18.0 | Long | Long | T19 | 1.2 | 22.0 | 18.7 | Long | Long |
| T04 | 0.6 | 23.0 | 35.9 | Short | Short | T18 | 1.5 | 38.2 | 25.1 | Long | Long |
| T21 | 1.9 | 49.6 | 25.7 | Long | Long | ||||||
| T03 | 2.2 | 66.5 | 30.2 | Short | Short | ||||||
| T03 | 1.2 | 57.5 | 49.6 | Short | Short | T09 | 0.2 | 30.2 | 124.7 | Short | Short |
| T11 | 0.5 | 78.2 | 146.5 | Short | Short | ||||||
| T01 | 0.4 | 72.9 | 166.6 | Short | Short | T06 | 0.4 | 63.8 | 181.0 | Short | Short |
| T09 | 0.1 | 18.3 | 198.3 | Short | Short | T04 | 0.3 | 48.9 | 186.3 | Short | Short |
| T06 | 0.1 | 36.4 | 347.1 | Short | Short | ||||||
| T11 | 0.1 | 63.4 | 537.6 | Short | Short | T01 | 0.2 | 87.4 | 374.5 | Short | Short |
| T05 | 0.0* | 56.6 | 1446.4 | Short | Short | T05 | 0.1 | 68.8 | 523.1 | Short | Short |
| T10 | 0.0* | 56.5 | 8506.6 | Short | Short | T10 | 0.0* | 70.0 | 8150.6 | Short | Short |
Digital biomarker (DBM) area refers to percent of total tumor area classified as a biomarker. Outcome refers to the actual PFI group. Italicized rows are slides incorrectly classified by the neural network.
*Does not represent true 0%.
Figure 2(a) Representative digital biomarkers for short PFI. Each tile (970 × 996px) is from a separate training set WSI and separate tumor. Each tile represents a region identified by NN2 as being highly associated with PFI-S, and that was annotated for inclusion in NN3 training. (b) Representative digital biomarkers for long PFI. Each tile (970 × 996px) is from a separate training set WSI; tiles from the same tumor are indicated within boxes. Each tile represents a region identified by NN2 as being highly associated with PFI-L, and that was annotated for inclusion in NN3 training.