
Diffusion maps and coarse-graining: A unified framework for dimensionality reduction, graph partitioning, and data set parameterization.

Stéphane Lafon, Ann B Lee.

Abstract

We provide evidence that nonlinear dimensionality reduction, clustering, and data set parameterization can be solved within one and the same framework. The main idea is to define a system of coordinates with an explicit metric that reflects the connectivity of a given data set and that is robust to noise. Our construction, which is based on a Markov random walk on the data, offers a general scheme of simultaneously reorganizing and subsampling graphs and arbitrarily shaped data sets in high dimensions using intrinsic geometry. We show that clustering in embedding spaces is equivalent to compressing operators. The objective of data partitioning and clustering is to coarse-grain the random walk on the data while at the same time preserving a diffusion operator for the intrinsic geometry or connectivity of the data set up to some accuracy. We show that the quantization distortion in diffusion space bounds the error of compression of the operator, thus giving a rigorous justification for k-means clustering in diffusion space and a precise measure of the performance of general clustering algorithms.
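As an illustration of the pipeline the abstract describes (a Markov random walk on the data, an embedding into diffusion coordinates, then k-means in that space), a minimal sketch follows. It is not the authors' implementation. The Gaussian kernel, the bandwidth `eps`, the diffusion time `t`, and the function names `diffusion_map` and `kmeans` are all assumptions made for this example, and the k-means step is plain unweighted Lloyd iteration, which may differ from the weighting used in the paper.

```python
import numpy as np

def diffusion_map(X, eps, n_coords=1, t=1):
    """Embed the rows of X into diffusion coordinates (illustrative sketch).

    Assumes a Gaussian kernel exp(-||x - y||^2 / eps); eps and t are
    hypothetical tuning parameters chosen by the caller.
    """
    # Pairwise squared Euclidean distances, clipped at 0 for round-off.
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)
    W = np.exp(-d2 / eps)                     # symmetric affinity matrix
    deg = W.sum(axis=1)                       # node degrees
    # P = D^{-1} W is the random-walk matrix. It is conjugate to the
    # symmetric matrix S = D^{-1/2} W D^{-1/2}, which eigh can handle.
    S = W / np.sqrt(np.outer(deg, deg))
    vals, vecs = np.linalg.eigh(S)
    order = np.argsort(vals)[::-1]            # sort eigenvalues descending
    vals, vecs = vals[order], vecs[:, order]
    # Right eigenvectors of P, scaled so the trivial one is constant (+-1).
    psi = vecs * np.sqrt(deg.sum()) / np.sqrt(deg)[:, None]
    # Drop the trivial eigenvector; weight the rest by eigenvalue^t.
    coords = psi[:, 1:n_coords + 1] * vals[1:n_coords + 1] ** t
    return coords, vals

def kmeans(Y, k, n_iter=100, seed=0):
    """Plain (unweighted) Lloyd k-means on the rows of Y."""
    rng = np.random.default_rng(seed)
    centers = Y[rng.choice(len(Y), size=k, replace=False)]
    for _ in range(n_iter):
        dist = ((Y[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = np.argmin(dist, axis=1)
        # Recompute each center; keep the old one if its cluster is empty.
        new = np.array([Y[labels == j].mean(axis=0) if np.any(labels == j)
                        else centers[j] for j in range(k)])
        if np.allclose(new, centers):
            break
        centers = new
    return labels, centers

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Two synthetic Gaussian blobs, used only to exercise the sketch.
    X = np.vstack([rng.normal([0, 0], 0.3, size=(20, 2)),
                   rng.normal([3, 0], 0.3, size=(20, 2))])
    Y, vals = diffusion_map(X, eps=1.0, n_coords=1)
    labels, _ = kmeans(Y, k=2)
    print(labels)
```

In this sketch the cluster labels come from distances in the embedded space rather than in the original coordinates, which is the setting in which the abstract's quantization-distortion bound is stated. The choice of one diffusion coordinate for two clusters is a convenience for the toy data, not a rule taken from the source.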

Year: 2006  PMID: 16929727  DOI: 10.1109/TPAMI.2006.184

Source DB: PubMed  Journal: IEEE Trans Pattern Anal Mach Intell  ISSN: 0098-5589  Impact factor: 6.226


  30 in total

1.  An information-theoretic derivation of min-cut-based clustering.

Authors:  Anil Raj; Chris H Wiggins
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2010-06       Impact factor: 6.226

2.  An optimal transportation approach for nuclear structure-based pathology.

Authors:  Wei Wang; John A Ozolek; Dejan Slepčev; Ann B Lee; Cheng Chen; Gustavo K Rohde
Journal:  IEEE Trans Med Imaging       Date:  2010-10-25       Impact factor: 10.048

3.  Optimal partition and effective dynamics of complex networks.

Authors:  Weinan E; Tiejun Li; Eric Vanden-Eijnden
Journal:  Proc Natl Acad Sci U S A       Date:  2008-02-26       Impact factor: 11.205

4.  Improved classifier for computer-aided polyp detection in CT colonography by nonlinear dimensionality reduction.

Authors:  Shijun Wang; Jianhua Yao; Ronald M Summers
Journal:  Med Phys       Date:  2008-04       Impact factor: 4.071

5.  The virtual brain integrates computational modeling and multimodal neuroimaging.

Authors:  Petra Ritter; Michael Schirner; Anthony R McIntosh; Viktor K Jirsa
Journal:  Brain Connect       Date:  2013

6.  Diffusion maps, clustering and fuzzy Markov modeling in peptide folding transitions.

Authors:  Lilia V Nedialkova; Miguel A Amat; Ioannis G Kevrekidis; Gerhard Hummer
Journal:  J Chem Phys       Date:  2014-09-21       Impact factor: 3.488

7.  Missing data and technical variability in single-cell RNA-sequencing experiments.

Authors:  Stephanie C Hicks; F William Townes; Mingxiang Teng; Rafael A Irizarry
Journal:  Biostatistics       Date:  2018-10-01       Impact factor: 5.899

8.  MicroRNA-integrated and network-embedded gene selection with diffusion distance.

Authors:  Di Huang; Xiaobo Zhou; Christopher J Lyon; Willa A Hsueh; Stephen T C Wong
Journal:  PLoS One       Date:  2010-10-29       Impact factor: 3.240

9.  Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data.

Authors:  Christoph Bartenhagen; Hans-Ulrich Klein; Christian Ruckert; Xiaoyi Jiang; Martin Dugas
Journal:  BMC Bioinformatics       Date:  2010-11-18       Impact factor: 3.169

10.  Decoupling function and anatomy in atlases of functional connectivity patterns: language mapping in tumor patients.

Authors:  Georg Langs; Andrew Sweet; Danial Lashkari; Yanmei Tie; Laura Rigolo; Alexandra J Golby; Polina Golland
Journal:  Neuroimage       Date:  2014-08-27       Impact factor: 6.556
