Literature DB >> 34402865

XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data.

Eloise Withnell1,2, Xiaoyu Zhang1, Kai Sun1, Yike Guo1,3.   

Abstract

The lack of explainability is one of the most prominent disadvantages of deep learning applications in omics. This 'black box' problem can undermine the credibility and limit the practical implementation of biomedical deep learning models. Here we present XOmiVAE, a variational autoencoder (VAE)-based interpretable deep learning model for cancer classification using high-dimensional omics data. XOmiVAE is capable of revealing the contribution of each gene and latent dimension for each classification prediction and the correlation between each gene and each latent dimension. It is also demonstrated that XOmiVAE can explain not only the supervised classification but also the unsupervised clustering results from the deep learning network. To the best of our knowledge, XOmiVAE is one of the first activation level-based interpretable deep learning models explaining novel clusters generated by VAE. The explainable results generated by XOmiVAE were validated by both the performance of downstream tasks and the biomedical knowledge. In our experiments, XOmiVAE explanations of deep learning-based cancer classification and clustering aligned with current domain knowledge including biological annotation and academic literature, which shows great potential for novel biomedical knowledge discovery from deep learning models.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  cancer classification; deep learning; explainable artificial intelligence; gene expression; omics data

Mesh:

Substances:

Year:  2021        PMID: 34402865      PMCID: PMC8575033          DOI: 10.1093/bib/bbab315

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  33 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  What is principal component analysis?

Authors:  Markus Ringnér
Journal:  Nat Biotechnol       Date:  2008-03       Impact factor: 54.908

3.  The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses.

Authors:  Gil Stelzer; Naomi Rosen; Inbar Plaschkes; Shahar Zimmerman; Michal Twik; Simon Fishilevich; Tsippi Iny Stein; Ron Nudel; Iris Lieder; Yaron Mazor; Sergey Kaplan; Dvir Dahary; David Warshawsky; Yaron Guan-Golan; Asher Kohn; Noa Rappaport; Marilyn Safran; Doron Lancet
Journal:  Curr Protoc Bioinformatics       Date:  2016-06-20

4.  Visualizing and interpreting cancer genomics data via the Xena platform.

Authors:  Mary J Goldman; Brian Craft; Mim Hastie; Kristupas Repečka; Fran McDade; Akhil Kamath; Ayan Banerjee; Yunhai Luo; Dave Rogers; Angela N Brooks; Jingchun Zhu; David Haussler
Journal:  Nat Biotechnol       Date:  2020-06       Impact factor: 54.908

5.  Toward a Shared Vision for Cancer Genomic Data.

Authors:  Robert L Grossman; Allison P Heath; Vincent Ferretti; Harold E Varmus; Douglas R Lowy; Warren A Kibbe; Louis M Staudt
Journal:  N Engl J Med       Date:  2016-09-22       Impact factor: 91.245

6.  Oncogenic Signaling Pathways in The Cancer Genome Atlas.

Authors:  Francisco Sanchez-Vega; Marco Mina; Joshua Armenia; Walid K Chatila; Augustin Luna; Konnor C La; Sofia Dimitriadoy; David L Liu; Havish S Kantheti; Sadegh Saghafinia; Debyani Chakravarty; Foysal Daian; Qingsong Gao; Matthew H Bailey; Wen-Wei Liang; Steven M Foltz; Ilya Shmulevich; Li Ding; Zachary Heins; Angelica Ochoa; Benjamin Gross; Jianjiong Gao; Hongxin Zhang; Ritika Kundra; Cyriac Kandoth; Istemi Bahceci; Leonard Dervishi; Ugur Dogrusoz; Wanding Zhou; Hui Shen; Peter W Laird; Gregory P Way; Casey S Greene; Han Liang; Yonghong Xiao; Chen Wang; Antonio Iavarone; Alice H Berger; Trever G Bivona; Alexander J Lazar; Gary D Hammer; Thomas Giordano; Lawrence N Kwong; Grant McArthur; Chenfei Huang; Aaron D Tward; Mitchell J Frederick; Frank McCormick; Matthew Meyerson; Eliezer M Van Allen; Andrew D Cherniack; Giovanni Ciriello; Chris Sander; Nikolaus Schultz
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

7.  Integrated multi-omics analysis of ovarian cancer using variational autoencoders.

Authors:  Muta Tah Hira; M A Razzaque; Claudio Angione; James Scrivens; Saladin Sawan; Mosharraf Sarkar
Journal:  Sci Rep       Date:  2021-03-18       Impact factor: 4.379

Review 8.  Role of Extracellular Matrix in Development and Cancer Progression.

Authors:  Cameron Walker; Elijah Mojares; Armando Del Río Hernández
Journal:  Int J Mol Sci       Date:  2018-10-04       Impact factor: 5.923

9.  PathME: pathway based multi-modal sparse autoencoders for clustering of patient-level multi-omics data.

Authors:  Amina Lemsara; Salima Ouadfel; Holger Fröhlich
Journal:  BMC Bioinformatics       Date:  2020-04-16       Impact factor: 3.169

10.  Automatic identification of relevant genes from low-dimensional embeddings of single-cell RNA-seq data.

Authors:  Philipp Angerer; David S Fischer; Fabian J Theis; Antonio Scialdone; Carsten Marr
Journal:  Bioinformatics       Date:  2020-08-01       Impact factor: 6.937

View more
  2 in total

1.  A novel liver cancer diagnosis method based on patient similarity network and DenseGCN.

Authors:  Ge Zhang; Zhen Peng; Chaokun Yan; Jianlin Wang; Junwei Luo; Huimin Luo
Journal:  Sci Rep       Date:  2022-04-26       Impact factor: 4.996

2.  Deep learning explains the biology of branched glycans from single-cell sequencing data.

Authors:  Rui Qin; Lara K Mahal; Daniel Bojar
Journal:  iScience       Date:  2022-09-19
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.