| Literature DB >> 35251119 |
Chiara Maria Lavinia Loeffler1,2, Nadine T Gaisa3,2, Hannah Sophie Muti1,2, Marko van Treeck1,2, Amelie Echle1,2, Narmin Ghaffari Laleh1,2, Christian Trautwein1,2, Lara R Heij3,4,5,2, Heike I Grabsch6,7, Nadina Ortiz Bruechle3,2, Jakob Nikolas Kather1,7,8,2.
Abstract
In the last four years, advances in Deep Learning technology have enabled the inference of selected mutational alterations directly from routine histopathology slides. In particular, recent studies have shown that genetic changes in clinically relevant driver genes are reflected in the histological phenotype of solid tumors and can be inferred by analysing routine Haematoxylin and Eosin (H&E) stained tissue sections with Deep Learning. However, these studies mostly focused on selected individual genes in selected tumor types. In addition, genetic changes in solid tumors primarily act by changing signaling pathways that regulate cell behaviour. In this study, we hypothesized that Deep Learning networks can be trained to directly predict alterations of genes and pathways across a spectrum of solid tumors. We manually outlined tumor tissue in H&E-stained tissue sections from 7,829 patients with 23 different tumor types from The Cancer Genome Atlas. We then trained convolutional neural networks in an end-to-end way to detect alterations in the most clinically relevant pathways or genes, directly from histology images. Using this automatic approach, we found that alterations in 12 out of 14 clinically relevant pathways and numerous single gene alterations appear to be detectable in tissue sections, many of which have not been reported before. Interestingly, we show that the prediction performance for single gene alterations is better than that for pathway alterations. Collectively, these data demonstrate the predictability of genetic alterations directly from routine cancer histology images and show that individual genes leave a stronger morphological signature than genetic pathways.Entities:
Keywords: TCGA; artificail intelligence (AI); cancer pathway; cancer pathway genes; deep learning; genetic
Year: 2022 PMID: 35251119 PMCID: PMC8889144 DOI: 10.3389/fgene.2021.806386
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1(A) Biological hypothesis of this study. TME: tumor microenvironment. (B) Workflow for selection of data and Deep Learning methods. (1) Tumors from TCGA were analyzed. (2) Genes were selected based on the MSKCC cohort and OnkoKB platforms. (3) Alterations were grouped based on different sources. (4) Genes were grouped into pathways (see Supplementary Table S1). (5) Processing of images and training of the network for genes alone and grouped into pathways. (images from https://smart.servier.com, and Twitter Twemoji under a CC-BY license). TME: tumor microenvironment, TCGA: The Cancer Genome Atlas, MSKCC, OnkoKB, WT: wild type, MUT: mutation present.
FIGURE 2Heatmap comparing the area under the receiver operating curve (AUROC) between the different tumor types. On the y-axis all tumor types are listed and sorted by tumor with most significant results from top to bottom. Number of patients indicated in brackets behind. Pathways are ordered on the x-axis from most (left) to least (right) significant results. AUROC values for (A) the twelve pathway analysis and (B) for the 69 gene analysis. Coloured values stand for significant detected (p > 0.05) pathways and grey for not significantly (p > 0.05). TCGA tumor type abbreviations are used (https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/tcga-study-abbreviations).
Top genes result overview. Single gene analysis results with area under the receiver operating curve (AUROC), confidence interval and p-Value. Selected genes were very well predicted by the neural network with AUROCs at least above 0.75. (1–3) FGFR3, PBRM1, IDH1 are clinically relevant, (4–6) CDH1, BRAF is associated with different morphological features and (7–9) SETD2, NOTCH2 have prognostic value.
| ID | Tumor type | Gene | AUROC |
|
|---|---|---|---|---|
| 1 | BLCA |
| 0.78 [0.72–0.822] | <0.001 |
| 2 | LGG |
| 0.764 [0.735–0.805] | <0.001 |
| 3 | HNSC |
| 0.79 [0.739–0.977] | =0.001 |
| 4 | THCA |
| 0.86 [0.816–0.886] | <0.001 |
| 5 | BRCA |
| 0.81 [0.758–0.849] | <0.001 |
| 6 | PRAD |
| 0.895 [0.827–0.951] | =0.005 |
| 7 | KIRP |
| 0.752 [0.571–0.939] | =0.006 |
| 8 | CRC |
| 0.934 [0.893–0.978] | <0.001 |
| 9 | STAD |
| 0.919 [0.846–0.982] | <0.001 |
FIGURE 3Comparison of the performance of the single gene area under the receiver operating curve (AUROC) vs. pathway AUROC. The top three pathways (A) MAPK, (B) PI3K, (C) TP53 AUROC results for the three top tumor cohorts (STAD, CRC, UCEC) are illustrated. AUROC values are compared between single gene vs. whole pathway. Coloured values stand for significantly detected pathways and grey for not significantly detected (p > 0.05).
FIGURE 4Prediction performance for single gene alterations, representative genes in nine tumor types. Receiver operating curve for: (A) FGFR3 alterations in bladder cancer (BLCA), (B) IDH1 alterations in low grade glioma (LGG), (C) BRAF alterations in head and neck squamous cell carcinoma (HNSC), (D) BRAF alterations in thyroid carcinoma (THCA), (E) CDH1 alterations in invasive breast carcinoma (BRCA), (F) SETD2 alterations in prostate adenocarcinoma (PRAD), (G) PBRM1 alterations in renal cell carcinoma (KIRP), (H) NOTCH2 alterations in colorectal adenocarcinoma (CRC), (I) NOTCH2 alterations in stomach adenocarcinoma (STAD), MUT: mutated, WT: wild type
FIGURE 5Deep learning predicted heatmaps. Visualization of manually annotated histological slides hematoxylin & eosin (H&E) with corresponding prediction maps for altered genes. Blue areas are wild type (WT) predicted regions and red areas are identified as mutated (MUT) parts by the neural network. (A) H&E slide of a BRAF WT patient (ID: TCGA-CQ-5333) from the head and neck squamous cell carcinoma (HNSC) cohort. The homogenous blue heatmap is consistent with the wild type status of the patient. (B) H&E slide of a BRAF mutated patient (ID: TCGA-EL-A3H7) from the thyroid carcinoma (THCA) cohort. The heatmap is more than 50% red, which means the patient was correctly classified as MUT. Intermingled blue areas in tumor regions reflect stroma and artifacts that disturb these areas. (C) H&E slide of a CDH1 mutated patient (ID: TCGA-PE-A5DD) of the breast invasive carcinoma (BRCA) cohort. The prediction heatmap shows that stroma tissue is mostly predicted as WT (blue areas condensed connective tissue) and diffuse invasive-lobular cancer is mostly red.