Literature DB >> 34229612

Consensus clustering applied to multi-omics disease subtyping.

Patricia Thébault1, Raluca Uricaru1, Galadriel Brière2,3, Élodie Darbo1,4.   

Abstract

BACKGROUND: Facing the diversity of omics data and the difficulty of selecting one result over all those produced by several methods, consensus strategies have the potential to reconcile multiple inputs and to produce robust results.
RESULTS: Here, we introduce ClustOmics, a generic consensus clustering tool that we use in the context of cancer subtyping. ClustOmics relies on a non-relational graph database, which allows for the simultaneous integration of both multiple omics data and results from various clustering methods. This new tool conciliates input clusterings, regardless of their origin, their number, their size or their shape. ClustOmics implements an intuitive and flexible strategy, based upon the idea of evidence accumulation clustering. ClustOmics computes co-occurrences of pairs of samples in input clusters and uses this score as a similarity measure to reorganize data into consensus clusters.
CONCLUSION: We applied ClustOmics to multi-omics disease subtyping on real TCGA cancer data from ten different cancer types. We showed that ClustOmics is robust to heterogeneous qualities of input partitions, smoothing and reconciling preliminary predictions into high-quality consensus clusters, both from a computational and a biological point of view. The comparison to a state-of-the-art consensus-based integration tool, COCA, further corroborated this statement. However, the main interest of ClustOmics is not to compete with other tools, but rather to make profit from their various predictions when no gold-standard metric is available to assess their significance. AVAILABILITY: The ClustOmics source code, released under MIT license, and the results obtained on TCGA cancer data are available on GitHub: https://github.com/galadrielbriere/ClustOmics .

Entities:  

Keywords:  Consensus clustering; Data integration; Disease subtyping; Multi-omic data

Year:  2021        PMID: 34229612     DOI: 10.1186/s12859-021-04279-1

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Properties of the Hubert-Arabie adjusted Rand index.

Authors:  Douglas Steinley
Journal:  Psychol Methods       Date:  2004-09

2.  Combining multiple clusterings using evidence accumulation.

Authors:  Ana L N Fred; Anil K Jain
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-06       Impact factor: 6.226

3.  Bayesian consensus clustering.

Authors:  Eric F Lock; David B Dunson
Journal:  Bioinformatics       Date:  2013-08-28       Impact factor: 6.937

4.  Multi-omics integration-a comparison of unsupervised clustering methodologies.

Authors:  Giulia Tini; Luca Marchetti; Corrado Priami; Marie-Pier Scott-Boyer
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

5.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

Authors:  Eric F Lock; Katherine A Hoadley; J S Marron; Andrew B Nobel
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

6.  Similarity network fusion for aggregating data types on a genomic scale.

Authors:  Bo Wang; Aziz M Mezlini; Feyyaz Demir; Marc Fiume; Zhuowen Tu; Michael Brudno; Benjamin Haibe-Kains; Anna Goldenberg
Journal:  Nat Methods       Date:  2014-01-26       Impact factor: 28.547

7.  Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin.

Authors:  Katherine A Hoadley; Christina Yau; Denise M Wolf; Andrew D Cherniack; David Tamborero; Sam Ng; Max D M Leiserson; Beifang Niu; Michael D McLellan; Vladislav Uzunangelov; Jiashan Zhang; Cyriac Kandoth; Rehan Akbani; Hui Shen; Larsson Omberg; Andy Chu; Adam A Margolin; Laura J Van't Veer; Nuria Lopez-Bigas; Peter W Laird; Benjamin J Raphael; Li Ding; A Gordon Robertson; Lauren A Byers; Gordon B Mills; John N Weinstein; Carter Van Waes; Zhong Chen; Eric A Collisson; Christopher C Benz; Charles M Perou; Joshua M Stuart
Journal:  Cell       Date:  2014-08-07       Impact factor: 41.582

Review 8.  Multi-omic and multi-view clustering algorithms: review and cancer benchmark.

Authors:  Nimrod Rappoport; Ron Shamir
Journal:  Nucleic Acids Res       Date:  2018-11-16       Impact factor: 16.971

9.  A novel approach for data integration and disease subtyping.

Authors:  Tin Nguyen; Rebecca Tagett; Diana Diaz; Sorin Draghici
Journal:  Genome Res       Date:  2017-10-24       Impact factor: 9.043

10.  NEMO: cancer subtyping by integration of partial multi-omic data.

Authors:  Nimrod Rappoport; Ron Shamir
Journal:  Bioinformatics       Date:  2019-09-15       Impact factor: 6.937

View more
  2 in total

1.  m6A Regulator-Mediated Methylation Modification Patterns and Characteristics in COVID-19 Patients.

Authors:  Xin Qing; Qian Chen; Ke Wang
Journal:  Front Public Health       Date:  2022-05-17

2.  Machine learning approach identified clusters for patients with low cardiac output syndrome and outcomes after cardiac surgery.

Authors:  Xu Zhao; Bowen Gu; Qiuying Li; Jiaxin Li; Weiwei Zeng; Yagang Li; Yanping Guan; Min Huang; Liming Lei; Guoping Zhong
Journal:  Front Cardiovasc Med       Date:  2022-08-18
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.