Yuqi Wen1, Xinyu Song2, Song He3, Xiaochen Bo4, Bowei Yan1, Xiaoxi Yang5, Lianlian Wu1,6, Dongjin Leng1. 1. Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, 100850, People's Republic of China. 2. Department of Biomedical Engineering, Chinese PLA General Hospital, Beijing, 100853, People's Republic of China. 3. Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, 100850, People's Republic of China. hes1224@163.com. 4. Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, 100850, People's Republic of China. boxc@bmi.ac.cn. 5. Experimental Center, Beijing Friendship Hospital, Capital Medical University, Beijing, 100069, People's Republic of China. 6. Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, 300072, People's Republic of China.
Abstract
BACKGROUND: The accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine. However, the algorithm development for multi-omics data integration remains a pressing challenge. RESULTS: Here, we propose a multi-omics data integration algorithm based on random walk with restart (RWR) on multiplex network. We call the resulting methodology Random Walk with Restart for multi-dimensional data Fusion (RWRF). RWRF uses similarity network of samples as the basis for integration. It constructs the similarity network for each data type and then connects corresponding samples of multiple similarity networks to create a multiplex sample network. By applying RWR on the multiplex network, RWRF uses stationary probability distribution to fuse similarity networks. We applied RWRF to The Cancer Genome Atlas (TCGA) data to identify subtypes in different cancer data sets. Three types of data (mRNA expression, DNA methylation, and microRNA expression data) are integrated and network clustering is conducted. Experiment results show that RWRF performs better than single data type analysis and previous integrative methods. CONCLUSIONS: RWRF provides powerful support to users to decipher the cancer molecular subtypes, thus may benefit precision treatment of specific patients in clinical practice.
BACKGROUND: The accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine. However, the algorithm development for multi-omics data integration remains a pressing challenge. RESULTS: Here, we propose a multi-omics data integration algorithm based on random walk with restart (RWR) on multiplex network. We call the resulting methodology Random Walk with Restart for multi-dimensional data Fusion (RWRF). RWRF uses similarity network of samples as the basis for integration. It constructs the similarity network for each data type and then connects corresponding samples of multiple similarity networks to create a multiplex sample network. By applying RWR on the multiplex network, RWRF uses stationary probability distribution to fuse similarity networks. We applied RWRF to The Cancer Genome Atlas (TCGA) data to identify subtypes in different cancer data sets. Three types of data (mRNA expression, DNA methylation, and microRNA expression data) are integrated and network clustering is conducted. Experiment results show that RWRF performs better than single data type analysis and previous integrative methods. CONCLUSIONS: RWRF provides powerful support to users to decipher the cancer molecular subtypes, thus may benefit precision treatment of specific patients in clinical practice.
Entities:
Keywords:
Cancer subtyping; Multi-dimensional data integration; Multiplex network; Random walk with restart
Authors: Jianfang Liu; Tara Lichtenberg; Katherine A Hoadley; Laila M Poisson; Alexander J Lazar; Andrew D Cherniack; Albert J Kovatich; Christopher C Benz; Douglas A Levine; Adrian V Lee; Larsson Omberg; Denise M Wolf; Craig D Shriver; Vesteinn Thorsson; Hai Hu Journal: Cell Date: 2018-04-05 Impact factor: 41.582
Authors: Jessica Gliozzo; Marco Mesiti; Marco Notaro; Alessandro Petrini; Alex Patak; Antonio Puertas-Gallardo; Alberto Paccanaro; Giorgio Valentini; Elena Casiraghi Journal: Brief Bioinform Date: 2022-07-18 Impact factor: 13.994