Literature DB >> 25243110

Integrative clustering methods for high-dimensional molecular data.

Prabhakar Chalise1, Devin C Koestler1, Milan Bimali1, Qing Yu1, Brooke L Fridley1.   

Abstract

High-throughput 'omic' data, such as gene expression, DNA methylation, DNA copy number, has played an instrumental role in furthering our understanding of the molecular basis in states of human health and disease. As cells with similar morphological characteristics can exhibit entirely different molecular profiles and because of the potential that these discrepancies might further our understanding of patient-level variability in clinical outcomes, there is significant interest in the use of high-throughput 'omic' data for the identification of novel molecular subtypes of a disease. While numerous clustering methods have been proposed for identifying of molecular subtypes, most were developed for single "omic' data types and may not be appropriate when more than one 'omic' data type are collected on study subjects. Given that complex diseases, such as cancer, arise as a result of genomic, epigenomic, transcriptomic, and proteomic alterations, integrative clustering methods for the simultaneous clustering of multiple 'omic' data types have great potential to aid in molecular subtype discovery. Traditionally, ad hoc manual data integration has been performed using the results obtained from the clustering of individual 'omic' data types on the same set of patient samples. However, such methods often result in inconsistent assignment of subjects to the molecular cancer subtypes. Recently, several methods have been proposed in the literature that offers a rigorous framework for the simultaneous integration of multiple 'omic' data types in a single comprehensive analysis. In this paper, we present a systematic review of existing integrative clustering methods.

Entities:  

Keywords:  Consensus clustering; cophenetic correlation; latent models; mixture models; non-negative matrix factorization

Year:  2014        PMID: 25243110      PMCID: PMC4166480          DOI: 10.3978/j.issn.2218-676X.2014.06.03

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


  39 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

3.  A mixture model-based approach to the clustering of microarray expression data.

Authors:  G J McLachlan; R W Bean; D Peel
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

4.  Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

Authors:  George C Tseng; Wing H Wong
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

5.  Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.

Authors:  Devin C Koestler; Carmen J Marsit; Brock C Christensen; Margaret R Karagas; Raphael Bueno; David J Sugarbaker; Karl T Kelsey; E Andres Houseman
Journal:  Bioinformatics       Date:  2010-08-16       Impact factor: 6.937

6.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

Review 7.  CpG island methylator phenotype in cancer.

Authors:  Jean-Pierre Issa
Journal:  Nat Rev Cancer       Date:  2004-12       Impact factor: 60.716

8.  Comprehensive profiling of DNA methylation in colorectal cancer reveals subgroups with distinct clinicopathological and molecular features.

Authors:  Pei Woon Ang; Marie Loh; Natalia Liem; Pei Li Lim; Fabienne Grieu; Aparna Vaithilingam; Cameron Platell; Wei Peng Yong; Barry Iacopetta; Richie Soong
Journal:  BMC Cancer       Date:  2010-05-21       Impact factor: 4.430

9.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

10.  Gene expression profiling identifies clinically relevant subtypes of prostate cancer.

Authors:  Jacques Lapointe; Chunde Li; John P Higgins; Matt van de Rijn; Eric Bair; Kelli Montgomery; Michelle Ferrari; Lars Egevad; Walter Rayford; Ulf Bergerheim; Peter Ekman; Angelo M DeMarzo; Robert Tibshirani; David Botstein; Patrick O Brown; James D Brooks; Jonathan R Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-07       Impact factor: 11.205

View more
  11 in total

1.  Assisted gene expression-based clustering with AWNCut.

Authors:  Yang Li; Ruofan Bie; Sebastian J Teran Hidalgo; Yichen Qin; Mengyun Wu; Shuangge Ma
Journal:  Stat Med       Date:  2018-08-09       Impact factor: 2.373

2.  Network-based integrative clustering of multiple types of genomic data using non-negative matrix factorization.

Authors:  Prabhakar Chalise; Yonghui Ni; Brooke L Fridley
Journal:  Comput Biol Med       Date:  2020-01-23       Impact factor: 4.589

3.  InterSIM: Simulation tool for multiple integrative 'omic datasets'.

Authors:  Prabhakar Chalise; Rama Raghavan; Brooke L Fridley
Journal:  Comput Methods Programs Biomed       Date:  2016-02-27       Impact factor: 5.428

4.  Multiplex methods provide effective integration of multi-omic data in genome-scale models.

Authors:  Claudio Angione; Max Conway; Pietro Lió
Journal:  BMC Bioinformatics       Date:  2016-03-02       Impact factor: 3.169

5.  Integrative clustering of multi-level 'omic data based on non-negative matrix factorization algorithm.

Authors:  Prabhakar Chalise; Brooke L Fridley
Journal:  PLoS One       Date:  2017-05-01       Impact factor: 3.240

6.  A Multi-Objective Approach for Anti-Osteosarcoma Cancer Agents Discovery through Drug Repurposing.

Authors:  Alejandro Cabrera-Andrade; Andrés López-Cortés; Gabriela Jaramillo-Koupermann; Humberto González-Díaz; Alejandro Pazos; Cristian R Munteanu; Yunierkis Pérez-Castillo; Eduardo Tejera
Journal:  Pharmaceuticals (Basel)       Date:  2020-11-22

7.  Multi-dimensional data integration algorithm based on random walk with restart.

Authors:  Yuqi Wen; Xinyu Song; Song He; Xiaochen Bo; Bowei Yan; Xiaoxi Yang; Lianlian Wu; Dongjin Leng
Journal:  BMC Bioinformatics       Date:  2021-02-27       Impact factor: 3.169

8.  PIntMF: Penalized Integrative Matrix Factorization method for Multi-omics data.

Authors:  Morgane Pierre-Jean; Florence Mauger; Jean-François Deleuze; Edith Le Floch
Journal:  Bioinformatics       Date:  2021-11-26       Impact factor: 6.937

9.  Weighted Cox regression for the prediction of heterogeneous patient subgroups.

Authors:  Katrin Madjar; Jörg Rahnenführer
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-07       Impact factor: 2.796

10.  Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study.

Authors:  H Ross-Adams; A D Lamb; M J Dunning; S Halim; J Lindberg; C M Massie; L A Egevad; R Russell; A Ramos-Montoya; S L Vowler; N L Sharma; J Kay; H Whitaker; J Clark; R Hurst; V J Gnanapragasam; N C Shah; A Y Warren; C S Cooper; A G Lynch; R Stark; I G Mills; H Grönberg; D E Neal
Journal:  EBioMedicine       Date:  2015-07-29       Impact factor: 8.143

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.