Literature DB >> 30116772

Data-Driven Tree Transforms and Metrics.

Gal Mishne1, Ronen Talmon1, Israel Cohen1, Ronald R Coifman2, Yuval Kluger3.   

Abstract

We consider the analysis of high dimensional data given in the form of a matrix with columns consisting of observations and rows consisting of features. Often the data is such that the observations do not reside on a regular grid, and the given order of the features is arbitrary and does not convey a notion of locality. Therefore, traditional transforms and metrics cannot be used for data organization and analysis. In this paper, our goal is to organize the data by defining an appropriate representation and metric such that they respect the smoothness and structure underlying the data. We also aim to generalize the joint clustering of observations and features in the case the data does not fall into clear disjoint groups. For this purpose, we propose multiscale data-driven transforms and metrics based on trees. Their construction is implemented in an iterative refinement procedure that exploits the co-dependencies between features and observations. Beyond the organization of a single dataset, our approach enables us to transfer the organization learned from one dataset to another and to integrate several datasets together. We present an application to breast cancer gene expression analysis: learning metrics on the genes to cluster the tumor samples into cancer sub-types and validating the joint organization of both the genes and the samples. We demonstrate that using our approach to combine information from multiple gene expression cohorts, acquired by different profiling technologies, improves the clustering of tumor samples.

Entities:  

Keywords:  gene expression; geometric analysis; graph signal processing; multiscale representations; partition trees

Year:  2017        PMID: 30116772      PMCID: PMC6089386          DOI: 10.1109/TSIPN.2017.2743561

Source DB:  PubMed          Journal:  IEEE Trans Signal Inf Process Netw        ISSN: 2373-776X


  15 in total

1.  Biclustering of expression data.

Authors:  Y Cheng; G M Church
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

2.  Sparse Biclustering of Transposable Data.

Authors:  Kean Ming Tan; Daniela M Witten
Journal:  J Comput Graph Stat       Date:  2014       Impact factor: 2.302

3.  Link communities reveal multiscale complexity in networks.

Authors:  Yong-Yeol Ahn; James P Bagrow; Sune Lehmann
Journal:  Nature       Date:  2010-06-20       Impact factor: 49.962

4.  Functional annotation and network reconstruction through cross-platform integration of microarray data.

Authors:  Xianghong Jasmine Zhou; Ming-Chih J Kao; Haiyan Huang; Angela Wong; Juan Nunez-Iglesias; Michael Primig; Oscar M Aparicio; Caleb E Finch; Todd E Morgan; Wing Hung Wong
Journal:  Nat Biotechnol       Date:  2005-01-16       Impact factor: 54.908

5.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R.

Authors:  Peter Langfelder; Bin Zhang; Steve Horvath
Journal:  Bioinformatics       Date:  2007-11-16       Impact factor: 6.937

6.  Spectral biclustering of microarray data: coclustering genes and conditions.

Authors:  Yuval Kluger; Ronen Basri; Joseph T Chang; Mark Gerstein
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

7.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

8.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

9.  Biomolecular events in cancer revealed by attractor metagenes.

Authors:  Wei-Yi Cheng; Tai-Hsien Ou Yang; Dimitris Anastassiou
Journal:  PLoS Comput Biol       Date:  2013-02-21       Impact factor: 4.475

10.  Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset.

Authors:  Heloisa H Milioli; Renato Vimieiro; Inna Tishchenko; Carlos Riveros; Regina Berretta; Pablo Moscato
Journal:  BioData Min       Date:  2016-01-13       Impact factor: 2.522

View more
  2 in total

1.  Multiway Graph Signal Processing on Tensors: Integrative analysis of irregular geometries.

Authors:  Jay S Stanley; Eric C Chi; Gal Mishne
Journal:  IEEE Signal Process Mag       Date:  2020-10-29       Impact factor: 12.551

2.  Multi-scale affinities with missing data: Estimation and applications.

Authors:  Min Zhang; Gal Mishne; Eric C Chi
Journal:  Stat Anal Data Min       Date:  2021-11-05       Impact factor: 1.247

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.