Literature DB >> 33639858

Multi-dimensional data integration algorithm based on random walk with restart.

Yuqi Wen1, Xinyu Song2, Song He3, Xiaochen Bo4, Bowei Yan1, Xiaoxi Yang5, Lianlian Wu1,6, Dongjin Leng1.   

Abstract

BACKGROUND: The accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine. However, the algorithm development for multi-omics data integration remains a pressing challenge.
RESULTS: Here, we propose a multi-omics data integration algorithm based on random walk with restart (RWR) on multiplex network. We call the resulting methodology Random Walk with Restart for multi-dimensional data Fusion (RWRF). RWRF uses similarity network of samples as the basis for integration. It constructs the similarity network for each data type and then connects corresponding samples of multiple similarity networks to create a multiplex sample network. By applying RWR on the multiplex network, RWRF uses stationary probability distribution to fuse similarity networks. We applied RWRF to The Cancer Genome Atlas (TCGA) data to identify subtypes in different cancer data sets. Three types of data (mRNA expression, DNA methylation, and microRNA expression data) are integrated and network clustering is conducted. Experiment results show that RWRF performs better than single data type analysis and previous integrative methods.
CONCLUSIONS: RWRF provides powerful support to users to decipher the cancer molecular subtypes, thus may benefit precision treatment of specific patients in clinical practice.

Entities:  

Keywords:  Cancer subtyping; Multi-dimensional data integration; Multiplex network; Random walk with restart

Mesh:

Substances:

Year:  2021        PMID: 33639858      PMCID: PMC7912853          DOI: 10.1186/s12859-021-04029-3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  37 in total

Review 1.  Integrative methods for analyzing big data in precision medicine.

Authors:  Vladimir Gligorijević; Noël Malod-Dognin; Nataša Pržulj
Journal:  Proteomics       Date:  2016-03       Impact factor: 3.984

2.  ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking.

Authors:  Matthew D Wilkerson; D Neil Hayes
Journal:  Bioinformatics       Date:  2010-04-28       Impact factor: 6.937

3.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Authors:  Ronglai Shen; Adam B Olshen; Marc Ladanyi
Journal:  Bioinformatics       Date:  2009-09-16       Impact factor: 6.937

4.  Integrative clustering methods for high-dimensional molecular data.

Authors:  Prabhakar Chalise; Devin C Koestler; Milan Bimali; Qing Yu; Brooke L Fridley
Journal:  Transl Cancer Res       Date:  2014-06-01       Impact factor: 1.241

Review 5.  The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge.

Authors:  Katarzyna Tomczak; Patrycja Czerwińska; Maciej Wiznerowicz
Journal:  Contemp Oncol (Pozn)       Date:  2015

6.  Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification.

Authors:  Dingming Wu; Dongfang Wang; Michael Q Zhang; Jin Gu
Journal:  BMC Genomics       Date:  2015-12-01       Impact factor: 3.969

7.  Integrative clustering of multi-level 'omic data based on non-negative matrix factorization algorithm.

Authors:  Prabhakar Chalise; Brooke L Fridley
Journal:  PLoS One       Date:  2017-05-01       Impact factor: 3.240

8.  Tensorial blind source separation for improved analysis of multi-omic data.

Authors:  Andrew E Teschendorff; Han Jing; Dirk S Paul; Joni Virta; Klaus Nordhausen
Journal:  Genome Biol       Date:  2018-06-08       Impact factor: 13.583

9.  An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics.

Authors:  Jianfang Liu; Tara Lichtenberg; Katherine A Hoadley; Laila M Poisson; Alexander J Lazar; Andrew D Cherniack; Albert J Kovatich; Christopher C Benz; Douglas A Levine; Adrian V Lee; Larsson Omberg; Denise M Wolf; Craig D Shriver; Vesteinn Thorsson; Hai Hu
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

Review 10.  microRNAs as Potential Biomarkers in Adrenocortical Cancer: Progress and Challenges.

Authors:  Nadia Cherradi
Journal:  Front Endocrinol (Lausanne)       Date:  2016-01-20       Impact factor: 5.555

View more
  5 in total

Review 1.  Heterogeneous data integration methods for patient similarity networks.

Authors:  Jessica Gliozzo; Marco Mesiti; Marco Notaro; Alessandro Petrini; Alex Patak; Antonio Puertas-Gallardo; Alberto Paccanaro; Giorgio Valentini; Elena Casiraghi
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  In silico drug repositioning based on integrated drug targets and canonical correlation analysis.

Authors:  Hailin Chen; Zuping Zhang; Jingpu Zhang
Journal:  BMC Med Genomics       Date:  2022-03-06       Impact factor: 3.063

Review 3.  Unsupervised Multi-Omics Data Integration Methods: A Comprehensive Review.

Authors:  Nasim Vahabi; George Michailidis
Journal:  Front Genet       Date:  2022-03-22       Impact factor: 4.599

4.  A benchmark study of deep learning-based multi-omics data fusion methods for cancer.

Authors:  Dongjin Leng; Linyi Zheng; Yuqi Wen; Yunhao Zhang; Lianlian Wu; Jing Wang; Meihong Wang; Zhongnan Zhang; Song He; Xiaochen Bo
Journal:  Genome Biol       Date:  2022-08-09       Impact factor: 17.906

Review 5.  Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context.

Authors:  Vivian Robin; Antoine Bodein; Marie-Pier Scott-Boyer; Mickaël Leclercq; Olivier Périn; Arnaud Droit
Journal:  Front Mol Biosci       Date:  2022-09-08
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.