Literature DB >> 26917056

Integrative clustering of high-dimensional data with joint and individual clusters.

Kristoffer H Hellton1, Magne Thoresen2.   

Abstract

When measuring a range of genomic, epigenomic, and transcriptomic variables for the same tissue sample, an integrative approach to analysis can strengthen inference and lead to new insights. This is also the case when clustering patient samples, and several integrative cluster procedures have been proposed. Common for these methodologies is the restriction to a joint cluster structure, equal in all data layers. We instead present a clustering extension of the Joint and Individual Variance Explained algorithm (JIVE), Joint and Individual Clustering (JIC), enabling the construction of both joint and data type-specific clusters simultaneously. The procedure builds on the connection between k-means clustering and principal component analysis, and hence, the number of clusters can be determined by the number of relevant principal components. The proposed procedure is compared with iCluster, a method restricted to only joint clusters, and simulations show that JIC is clearly advantageous when both individual and joint clusters are present. The procedure is illustrated using gene expression and miRNA levels measured in breast cancer tissue from The Cancer Genome Atlas. The analysis suggests a division into three joint clusters common for both data types and two expression-specific clusters.
© The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Clustering; Integrative genomics; Principal component analysis; Singular value decomposition

Mesh:

Substances:

Year:  2016        PMID: 26917056     DOI: 10.1093/biostatistics/kxw005

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  7 in total

1.  Integrative factorization of bidimensionally linked matrices.

Authors:  Jun Young Park; Eric F Lock
Journal:  Biometrics       Date:  2019-11-10       Impact factor: 2.571

2.  Assisted gene expression-based clustering with AWNCut.

Authors:  Yang Li; Ruofan Bie; Sebastian J Teran Hidalgo; Yichen Qin; Mengyun Wu; Shuangge Ma
Journal:  Stat Med       Date:  2018-08-09       Impact factor: 2.373

3.  R.JIVE for exploration of multi-source molecular data.

Authors:  Michael J O'Connell; Eric F Lock
Journal:  Bioinformatics       Date:  2016-06-06       Impact factor: 6.937

4.  Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data.

Authors:  Minjie Wang; Genevera I Allen
Journal:  J Mach Learn Res       Date:  2021-01       Impact factor: 5.177

5.  BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors:  Eric F Lock; Jun Young Park; Katherine A Hoadley
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

6.  Clusternomics: Integrative context-dependent clustering for heterogeneous datasets.

Authors:  Evelina Gabasova; John Reid; Lorenz Wernisch
Journal:  PLoS Comput Biol       Date:  2017-10-16       Impact factor: 4.475

7.  Integrative, multi-omics, analysis of blood samples improves model predictions: applications to cancer.

Authors:  Erica Ponzi; Magne Thoresen; Therese Haugdahl Nøst; Kajsa Møllersen
Journal:  BMC Bioinformatics       Date:  2021-08-05       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.