| Literature DB >> 34093987 |
Lianhe Zhao1,2, Qiongye Dong1, Chunlong Luo1,2, Yang Wu1, Dechao Bu1, Xiaoning Qi1,2, Yufan Luo1,2, Yi Zhao1,3.
Abstract
Integrative analysis of multi-omics data can elucidate valuable insights into complex molecular mechanisms for various diseases. However, due to their different modalities and high dimension, utilizing and integrating different types of omics data suffers from great challenges. There is an urgent need to develop a powerful method to improve survival prediction and detect functional gene modules from multi-omics data. To deal with these problems, we present DeepOmix (a scalable and interpretable multi-Omics Deep learning framework and application in cancer survival analysis), a flexible, scalable, and interpretable method for extracting relationships between the clinical survival time and multi-omics data based on a deep learning framework. DeepOmix enables the non-linear combination of variables from different omics datasets and incorporates prior biological information defined by users (such as signaling pathways and tissue networks). Benchmark experiments demonstrate that DeepOmix outperforms the other five cutting-edge prediction methods. Besides, Lower Grade Glioma (LGG) is taken as the case study to perform the prognosis prediction and illustrate the functional module nodes which are associated with the prognostic result in the prediction model.Entities:
Keywords: Deep learning; Interpretable model; Multi-omics; Prognosis prediction; Survival analysis
Year: 2021 PMID: 34093987 PMCID: PMC8131983 DOI: 10.1016/j.csbj.2021.04.067
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Workflow of DeepOmix. A. Multi-omics data at the gene-level was used as the input data on the input gene layer. B. Functional module gene sets defined by users determines the number of nodes in the functional module layer and the edges between this layer and the input gene layer. C. The framework in DeepOmix. It includes five layers, namely the input gene layer, functional module layer, two hidden layers, and the output layer. The functional module layer is the low-dimensional representations of the gene layer, which is a non-linear function of the gene nodes. D. Samples were classified into high and low risk subgroups according to the output layer, treated as the prognostic values. Kolmogorov-Smirnov test was performed to rank the meaningful pathways and visualization of nodes on hidden layers among two groups.
Fig. 2Performance comparison between DeepOmix and five other methods (A) Boxplot of Differences between the average C-index of other methods and DeepMusicsBoxplot of C-index difference between other methods and DeepOmix. One-tailed t-test was applied to test the difference of the means of the differential C-index for each comparison. (B) Differences between the average C-index of other methods and DeepOmix on each cancer dataset.
Fig. 3Prognosis prediction result of DeepOmix on LGG. A. Kaplan-Meier plot for two different survival risk groups. B. Visualization of Multi-Dimensional Scaling (MDS) result of the second hidden layer in the two different prognostic groups.
Ten top-ranked pathways in LGG.
| Pathway name | Bonferroni adjusted | |
|---|---|---|
| Formation of Incision Complex In GG-NER | 1.90E-14 | 1.63E-11 |
| Glycoprotein Hormones | 5.73E-14 | 4.93E-11 |
| KEGG of Colorectal Cancer | 2.69E-12 | 2.32E-09 |
| Adenylate Cyclase Inhibitory Pathway | 3.94E-11 | 3.39E-08 |
| Advanced Glycosylation End-product Receptor Signalling | 1.03E-10 | 8.89E-08 |
| KEGG of Acute Myeloid Leukaemia | 5.61E-09 | 4.83E-06 |
| RNA Pol III Chain Elongation | 7.42E-09 | 6.38E-06 |
| Folate Biosynthesis | 6.31E-08 | 5.43E-05 |
| KEGG of Alpha Linolenic Acid Metabolism | 6.36E-08 | 5.47E-05 |
| HS GAG Biosynthesis | 9.85E-08 | 8.47E-05 |
Fig. 4The pre-process of multi-omics data and the pipeline of performance evaluation experiments.