Sampling from Determinantal Point Processes for Scalable Manifold Learning.

Christian Wachinger, Polina Golland.   

Abstract

The high computational cost of manifold learning prohibits its application to large datasets. A common strategy to overcome this problem is to perform dimensionality reduction on selected landmarks and then to embed the entire dataset with the Nyström method. Two main challenges arise: (i) the landmarks selected in non-Euclidean geometries must result in a low reconstruction error, and (ii) the graph constructed from sparsely sampled landmarks must approximate the manifold well. We propose to sample the landmarks from determinantal distributions on non-Euclidean spaces. Since current determinantal sampling algorithms have the same complexity as those for manifold learning, we present an efficient approximation with linear complexity. Further, we recover the local geometry after the sparsification by assigning each landmark a local covariance matrix, estimated from the original point set. The resulting neighborhood selection based on the Bhattacharyya distance improves the embedding of sparsely sampled manifolds. Our experiments show a significant performance improvement compared to state-of-the-art landmark selection techniques on synthetic and medical data.
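The abstract does not spell out how the Bhattacharyya distance between landmarks is computed. As a minimal sketch, assume each landmark is summarized by a Gaussian with a local mean and a local covariance matrix; the distance then has a well-known closed form. The function names (`cholesky`, `log_det`, `bhattacharyya_gaussian`) and the Gaussian assumption are illustrative and not taken from the paper.

```python
import math


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def log_det(a):
    """Log-determinant via the Cholesky factor: log det(A) = 2 * sum(log L_ii)."""
    return 2.0 * sum(math.log(row[i]) for i, row in enumerate(cholesky(a)))


def bhattacharyya_gaussian(mu1, cov1, mu2, cov2):
    """Bhattacharyya distance between N(mu1, cov1) and N(mu2, cov2).

    Assumption (not from the source): landmarks are modeled as Gaussians.
    D = 1/8 d^T S^-1 d + 1/2 ln(det S / sqrt(det S1 * det S2)), S = (S1 + S2) / 2.
    """
    n = len(mu1)
    avg = [[0.5 * (cov1[i][j] + cov2[i][j]) for j in range(n)] for i in range(n)]
    low = cholesky(avg)
    diff = [a - b for a, b in zip(mu1, mu2)]
    # Forward substitution solves low @ z = diff, so d^T S^-1 d = |z|^2.
    z = []
    for i in range(n):
        s = sum(low[i][k] * z[k] for k in range(i))
        z.append((diff[i] - s) / low[i][i])
    mahalanobis_sq = sum(v * v for v in z)
    log_term = log_det(avg) - 0.5 * (log_det(cov1) + log_det(cov2))
    return 0.125 * mahalanobis_sq + 0.5 * log_term


# Hypothetical landmarks whose local covariance is elongated along the x-axis.
cov = [[4.0, 0.0], [0.0, 0.25]]
along = bhattacharyya_gaussian([0.0, 0.0], cov, [2.0, 0.0], cov)
across = bhattacharyya_gaussian([0.0, 0.0], cov, [0.0, 2.0], cov)
print(along, across)
```

In this toy setup both candidate neighbors lie at the same Euclidean distance from the landmark, yet the one along the elongated direction receives the smaller Bhattacharyya distance. This is the kind of covariance-aware neighborhood selection the abstract describes, under the stated Gaussian assumption.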

Year: 2015; PMID: 26221713; PMCID: PMC4524741; DOI: 10.1007/978-3-319-19992-4_54

Source DB: PubMed; Journal: Inf Process Med Imaging; ISSN: 1011-2499


