Literature DB >> 30295871

Multi-omic and multi-view clustering algorithms: review and cancer benchmark.

Nimrod Rappoport1, Ron Shamir1.   

Abstract

Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of multi-omic data. Clustering multi-omic data has the potential to reveal further systems-level insights, but raises computational and biological challenges. Here, we review algorithms for multi-omics clustering, and discuss key issues in applying these algorithms. Our review covers methods developed specifically for omic data as well as generic multi-view methods developed in the machine learning community for joint clustering of multiple data types. In addition, using cancer data from TCGA, we perform an extensive benchmark spanning ten different cancer types, providing the first systematic comparison of leading multi-omics and multi-view clustering algorithms. The results highlight key issues regarding the use of single- versus multi-omics, the choice of clustering strategy, the power of generic multi-view methods and the use of approximated p-values for gauging solution quality. Due to the growing use of multi-omics data, we expect these issues to be important for future progress in the field.

Entities:  

Mesh:

Year:  2018        PMID: 30295871      PMCID: PMC6237755          DOI: 10.1093/nar/gky889

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  56 in total

Review 1.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

2.  Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis.

Authors:  Jun Chen; Frederic D Bushman; James D Lewis; Gary D Wu; Hongzhe Li
Journal:  Biostatistics       Date:  2012-10-15       Impact factor: 5.899

3.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

Authors:  Eric F Lock; Katherine A Hoadley; J S Marron; Andrew B Nobel
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

4.  Discovering transcriptional modules by Bayesian data integration.

Authors:  Richard S Savage; Zoubin Ghahramani; Jim E Griffin; Bernard J de la Cruz; David L Wild
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

5.  Accurate computation of survival statistics in genome-wide studies.

Authors:  Fabio Vandin; Alexandra Papoutsaki; Benjamin J Raphael; Eli Upfal
Journal:  PLoS Comput Biol       Date:  2015-05-07       Impact factor: 4.475

6.  Clusternomics: Integrative context-dependent clustering for heterogeneous datasets.

Authors:  Evelina Gabasova; John Reid; Lorenz Wernisch
Journal:  PLoS Comput Biol       Date:  2017-10-16       Impact factor: 4.475

7.  Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer.

Authors:  Katherine A Hoadley; Christina Yau; Toshinori Hinoue; Denise M Wolf; Alexander J Lazar; Esther Drill; Ronglai Shen; Alison M Taylor; Andrew D Cherniack; Vésteinn Thorsson; Rehan Akbani; Reanne Bowlby; Christopher K Wong; Maciej Wiznerowicz; Francisco Sanchez-Vega; A Gordon Robertson; Barbara G Schneider; Michael S Lawrence; Houtan Noushmehr; Tathiane M Malta; Joshua M Stuart; Christopher C Benz; Peter W Laird
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

8.  Sparse canonical methods for biological data integration: application to a cross-platform study.

Authors:  Kim-Anh Lê Cao; Pascal G P Martin; Christèle Robert-Granié; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2009-01-26       Impact factor: 3.169

9.  Group sparse canonical correlation analysis for genomic data integration.

Authors:  Dongdong Lin; Jigang Zhang; Jingyao Li; Vince D Calhoun; Hong-Wen Deng; Yu-Ping Wang
Journal:  BMC Bioinformatics       Date:  2013-08-12       Impact factor: 3.169

Review 10.  More Is Better: Recent Progress in Multi-Omics Data Integration Methods.

Authors:  Sijia Huang; Kumardeep Chaudhary; Lana X Garmire
Journal:  Front Genet       Date:  2017-06-16       Impact factor: 4.599

View more
  78 in total

1.  web-rMKL: a web server for dimensionality reduction and sample clustering of multi-view data based on unsupervised multiple kernel learning.

Authors:  Benedict Röder; Nicolas Kersten; Marius Herr; Nora K Speicher; Nico Pfeifer
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

2.  An Integrated Workflow for Global, Glyco-, and Phospho-proteomic Analysis of Tumor Tissues.

Authors:  Yangying Zhou; Tung-Shing Mamie Lih; Ganglong Yang; Shao-Yung Chen; Lijun Chen; Daniel W Chan; Hui Zhang; Qing Kay Li
Journal:  Anal Chem       Date:  2020-01-03       Impact factor: 6.986

3.  MOSClip: multi-omic and survival pathway analysis for the identification of survival associated gene and modules.

Authors:  Paolo Martini; Monica Chiogna; Enrica Calura; Chiara Romualdi
Journal:  Nucleic Acids Res       Date:  2019-08-22       Impact factor: 16.971

Review 4.  Leveraging -omics for asthma endotyping.

Authors:  Scott R Tyler; Supinda Bunyavanich
Journal:  J Allergy Clin Immunol       Date:  2019-07       Impact factor: 10.793

5.  DLRAPom: a hybrid pipeline of Optimized XGBoost-guided integrative multiomics analysis for identifying targetable disease-related lncRNA-miRNA-mRNA regulatory axes.

Authors:  Chen Shen; Huiyu Li; Miao Li; Yu Niu; Jing Liu; Li Zhu; Hongsheng Gui; Wei Han; Huiying Wang; Wenpei Zhang; Xiaochen Wang; Xiao Luo; Yu Sun; Jiangwei Yan; Fanglin Guan
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

6.  MONET: Multi-omic module discovery by omic selection.

Authors:  Nimrod Rappoport; Roy Safra; Ron Shamir
Journal:  PLoS Comput Biol       Date:  2020-09-15       Impact factor: 4.475

7.  Integrating Clinical Data and Imputed Transcriptome from GWAS to Uncover Complex Disease Subtypes: Applications in Psychiatry and Cardiology.

Authors:  Liangying Yin; Carlos K L Chau; Pak-Chung Sham; Hon-Cheong So
Journal:  Am J Hum Genet       Date:  2019-11-27       Impact factor: 11.025

Review 8.  Review of multi-omics data resources and integrative analysis for human brain disorders.

Authors:  Xianjun Dong; Chunyu Liu; Mikhail Dozmorov
Journal:  Brief Funct Genomics       Date:  2021-07-17       Impact factor: 4.241

9.  Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes.

Authors:  Scott R Tyler; Yoojin Chun; Victoria M Ribeiro; Galina Grishina; Alexander Grishin; Gabriel E Hoffman; Anh N Do; Supinda Bunyavanich
Journal:  Cell Rep       Date:  2021-04-13       Impact factor: 9.423

Review 10.  Requirements and reliability of AI in the medical context.

Authors:  Yoganand Balagurunathan; Ross Mitchell; Issam El Naqa
Journal:  Phys Med       Date:  2021-03-13       Impact factor: 2.685

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.