Hryhorii Chereda1, Annalen Bleckmann2, Kerstin Menck2, Júlia Perera-Bel3, Philip Stegmaier4, Florian Auer5, Frank Kramer5, Andreas Leha6, Tim Beißbarth7,8. 1. Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany. 2. Dept. of Medicine A (Hematology, Oncology, Hemostaseology and Pulmonology), University Hospital Münster, Münster, Germany. 3. Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain. 4. geneXplain GmbH, Wolfenbüttel, Germany. 5. IT Infrastructure for Translational Medical Research, University of Augsburg, Augsburg, Germany. 6. Medical Statistics, University Medical Center Göttingen, Göttingen, Germany. 7. Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany. tim.beissbarth@bioinf.med.uni-goettingen.de. 8. Campus-Institute Data Science (CIDAS), University of Göttingen, Göttingen, Germany. tim.beissbarth@bioinf.med.uni-goettingen.de.
Abstract
BACKGROUND: Contemporary deep learning approaches show cutting-edge performance in a variety of complex prediction tasks. Nonetheless, the application of deep learning in healthcare remains limited since deep learning methods are often considered as non-interpretable black-box models. However, the machine learning community made recent elaborations on interpretability methods explaining data point-specific decisions of deep learning techniques. We believe that such explanations can assist the need in personalized precision medicine decisions via explaining patient-specific predictions. METHODS: Layer-wise Relevance Propagation (LRP) is a technique to explain decisions of deep learning methods. It is widely used to interpret Convolutional Neural Networks (CNNs) applied on image data. Recently, CNNs started to extend towards non-Euclidean domains like graphs. Molecular networks are commonly represented as graphs detailing interactions between molecules. Gene expression data can be assigned to the vertices of these graphs. In other words, gene expression data can be structured by utilizing molecular network information as prior knowledge. Graph-CNNs can be applied to structured gene expression data, for example, to predict metastatic events in breast cancer. Therefore, there is a need for explanations showing which part of a molecular network is relevant for predicting an event, e.g., distant metastasis in cancer, for each individual patient. RESULTS: We extended the procedure of LRP to make it available for Graph-CNN and tested its applicability on a large breast cancer dataset. We present Graph Layer-wise Relevance Propagation (GLRP) as a new method to explain the decisions made by Graph-CNNs. We demonstrate a sanity check of the developed GLRP on a hand-written digits dataset and then apply the method on gene expression data. We show that GLRP provides patient-specific molecular subnetworks that largely agree with clinical knowledge and identify common as well as novel, and potentially druggable, drivers of tumor progression. CONCLUSIONS: The developed method could be potentially highly useful on interpreting classification results in the context of different omics data and prior knowledge molecular networks on the individual patient level, as for example in precision medicine approaches or a molecular tumor board.
BACKGROUND: Contemporary deep learning approaches show cutting-edge performance in a variety of complex prediction tasks. Nonetheless, the application of deep learning in healthcare remains limited since deep learning methods are often considered as non-interpretable black-box models. However, the machine learning community made recent elaborations on interpretability methods explaining data point-specific decisions of deep learning techniques. We believe that such explanations can assist the need in personalized precision medicine decisions via explaining patient-specific predictions. METHODS: Layer-wise Relevance Propagation (LRP) is a technique to explain decisions of deep learning methods. It is widely used to interpret Convolutional Neural Networks (CNNs) applied on image data. Recently, CNNs started to extend towards non-Euclidean domains like graphs. Molecular networks are commonly represented as graphs detailing interactions between molecules. Gene expression data can be assigned to the vertices of these graphs. In other words, gene expression data can be structured by utilizing molecular network information as prior knowledge. Graph-CNNs can be applied to structured gene expression data, for example, to predict metastatic events in breast cancer. Therefore, there is a need for explanations showing which part of a molecular network is relevant for predicting an event, e.g., distant metastasis in cancer, for each individual patient. RESULTS: We extended the procedure of LRP to make it available for Graph-CNN and tested its applicability on a large breast cancer dataset. We present Graph Layer-wise Relevance Propagation (GLRP) as a new method to explain the decisions made by Graph-CNNs. We demonstrate a sanity check of the developed GLRP on a hand-written digits dataset and then apply the method on gene expression data. We show that GLRP provides patient-specific molecular subnetworks that largely agree with clinical knowledge and identify common as well as novel, and potentially druggable, drivers of tumor progression. CONCLUSIONS: The developed method could be potentially highly useful on interpreting classification results in the context of different omics data and prior knowledge molecular networks on the individual patient level, as for example in precision medicine approaches or a molecular tumor board.
Entities:
Keywords:
Classification of cancer; Deep learning; Explainable AI; Gene expression data; Molecular networks; Personalized medicine; Precision medicine; Prior knowledge
Authors: Marc Johannes; Jan C Brase; Holger Fröhlich; Stephan Gade; Mathias Gehrmann; Maria Fälth; Holger Sültmann; Tim Beissbarth Journal: Bioinformatics Date: 2010-06-30 Impact factor: 6.937
Authors: Deena M A Gendoo; Natchar Ratanasirigulchai; Markus S Schröder; Laia Paré; Joel S Parker; Aleix Prat; Benjamin Haibe-Kains Journal: Bioinformatics Date: 2015-11-24 Impact factor: 6.937
Authors: F Klauschen; K-R Müller; A Binder; M Bockmayr; M Hägele; P Seegerer; S Wienert; G Pruneri; S de Maria; S Badve; S Michiels; T O Nielsen; S Adams; P Savas; F Symmans; S Willis; T Gruosso; M Park; B Haibe-Kains; B Gallas; A M Thompson; I Cree; C Sotiriou; C Solinas; M Preusser; S M Hewitt; D Rimm; G Viale; S Loi; S Loibl; R Salgado; C Denkert Journal: Semin Cancer Biol Date: 2018-07-07 Impact factor: 15.707
Authors: Philipp Keyl; Michael Bockmayr; Daniel Heim; Gabriel Dernbach; Grégoire Montavon; Klaus-Robert Müller; Frederick Klauschen Journal: NPJ Precis Oncol Date: 2022-06-07
Authors: Khoa A Tran; Olga Kondrashova; Andrew Bradley; Elizabeth D Williams; John V Pearson; Nicola Waddell Journal: Genome Med Date: 2021-09-27 Impact factor: 11.117