| Literature DB >> 35644123 |
Matthew Brendel1, Vanesa Getseva2, Majd Al Assaad3, Michael Sigouros4, Alexandros Sigaras5, Troy Kane4, Pegah Khosravi6, Juan Miguel Mosquera3, Olivier Elemento7, Iman Hajirasouliha8.
Abstract
BACKGROUND: Estimating tumor purity is especially important in the age of precision medicine. Purity estimates have been shown to be critical for correction of tumor sequencing results, and higher purity samples allow for more accurate interpretations from next-generation sequencing results. Molecular-based purity estimates using computational approaches require sequencing of tumors, which is both time-consuming and expensive.Entities:
Keywords: Computational pathology; Deep Learning; Precision medicine; Tumor purity estimation
Mesh:
Year: 2022 PMID: 35644123 PMCID: PMC9157012 DOI: 10.1016/j.ebiom.2022.104067
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 11.205
Figure 1(a) Workflow of wsPurity. To get a slide output, the original svs file is tiled, passed through a deep learning model with an attention mechanism to combine information from all tiles to perform two tasks, cancer type prediction and tumor purity prediction. (b) Schematic of the multi-attention multi-task MIL approach. The model uses the structure of Resnet-34-IBN, which is a modified Resnet model using InstanceNorm. A gated attention mechanism generates two feature representations, which pass through a set of linear and dropout layers for the final predictions. (c) Schematic of a residual block from the Resnet-34-IBN-b model (Right - Adapted from Pan et al.).
Figure 2Representative test set distribution for tumor purity data stratified by tissue type. Red lines show the thresholds used for identifying low vs high tumor purity.
Reported values of F1-score, precision and recall for the tissue type prediction for the validation set, test set (TCGA cohort), and test set (WCM independent cohort).
| Metric | Validation set | Testing set (TCGA) | Testing set (WCM) | |
|---|---|---|---|---|
| ACC | F1-score | 0.84 | 0.77 | - |
| Precision | 0.93 | 0.83 | - | |
| Recall | 0.76 | 0.71 | - | |
| BRCA | F1-score | 0.96 | 0.96 | 0.67 |
| Precision | 0.94 | 0.94 | 1.00 | |
| Recall | 0.99 | 0.97 | 0.50 | |
| HNSC | F1-score | 0.86 | 0.87 | - |
| Precision | 0.80 | 0.79 | - | |
| Recall | 0.93 | 0.96 | - | |
| LUAD and LUSC | F1-score | 0.93 | 0.93 | - |
| Precision | 0.98 | 0.97 | - | |
| Recall | 0.89 | 0.88 | - | |
| OV | F1-score | 0.89 | 0.91 | - |
| Precision | 0.87 | 0.92 | - | |
| Recall | 0.91 | 0.90 | - | |
| PRAD | F1-score | 0.96 | 1.00 | 0.83 |
| Precision | 1.00 | 0.99 | 0.93 | |
| Recall | 0.93 | 1.00 | 0.75 |
Figure 4(a, b) tSNE Plots using the feature embedding from the tissue type prediction (compared true vs. predicted, respectively) (c, d) tSNE Plots using the feature embedding from the tumor purity score prediction (compared true vs. predicted, respectively).
Figure 3ROC curves of high vs low tumor purity. We set the thresholds at 10% tumor purity, 60% tumor purity, and 70% tumor purity to identify model performance comparing normal vs. tumor tissue.
Figure 5(a) (Left) A view of the overall tissue architecture. (Middle-Left) The distribution of the tumor purity predictions. (Middle-Right) Attention Maps based on wsPurity. (Right) Pathologist-derived annotations for the tumor region within the slide. (b) Display of the top four (green) and bottom four (red). Rows in A correspond to rows in B. The patches were chosen by normalizing the attention weights using the maximum and minimum values per 120-patch bag, multiplying this normalized attention weight by the predicted purity, and then ranking the values and taking the top four and bottom four patches from this ranked list.