Literature DB >> 18220179

DD-HDS: A method for visualization and exploration of high-dimensional data.

Sylvain Lespinats1, Michel Verleysen, Alain Giron, Bernard Fertil.   

Abstract

Mapping high-dimensional data in a low-dimensional space, for example, for visualization, is a problem of increasingly major concern in data analysis. This paper presents data-driven high-dimensional scaling (DD-HDS), a nonlinear mapping method that follows the line of multidimensional scaling (MDS) approach, based on the preservation of distances between pairs of data. It improves the performance of existing competitors with respect to the representation of high-dimensional data, in two ways. It introduces (1) a specific weighting of distances between data taking into account the concentration of measure phenomenon and (2) a symmetric handling of short distances in the original and output spaces, avoiding false neighbor representations while still allowing some necessary tears in the original distribution. More precisely, the weighting is set according to the effective distribution of distances in the data set, with the exception of a single user-defined parameter setting the tradeoff between local neighborhood preservation and global mapping. The optimization of the stress criterion designed for the mapping is realized by "force-directed placement" (FDP). The mappings of low- and high-dimensional data sets are presented as illustrations of the features and advantages of the proposed algorithm. The weighting function specific to high-dimensional data and the symmetric handling of short distances can be easily incorporated in most distance preservation-based nonlinear dimensionality reduction methods.

Entities:  

Mesh:

Year:  2007        PMID: 18220179     DOI: 10.1109/tnn.2007.891682

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  6 in total

1.  Networking metabolites and diseases.

Authors:  Pascal Braun; Edward Rietman; Marc Vidal
Journal:  Proc Natl Acad Sci U S A       Date:  2008-07-16       Impact factor: 11.205

2.  Force feature spaces for visualization and classification.

Authors:  Dragana Veljkovic; Kay A Robbins
Journal:  Int Conf Digit Signal Process Proc       Date:  2008-12-11

3.  Locally linear embedding (LLE) for MRI based Alzheimer's disease classification.

Authors:  Xin Liu; Duygu Tosun; Michael W Weiner; Norbert Schuff
Journal:  Neuroimage       Date:  2013-06-21       Impact factor: 6.556

4.  Molecular and evolutionary bases of within-patient genotypic and phenotypic diversity in Escherichia coli extraintestinal infections.

Authors:  Maxime Levert; Oana Zamfir; Olivier Clermont; Odile Bouvet; Sylvain Lespinats; Marie Claire Hipeaux; Catherine Branger; Bertrand Picard; Claude Saint-Ruf; Françoise Norel; Thierry Balliau; Michel Zivy; Hervé Le Nagard; Stéphane Cruveiller; Stéphane Cruvellier; Béatrice Chane-Woon-Ming; Susanna Nilsson; Ivana Gudelj; Katherine Phan; Thomas Ferenci; Olivier Tenaillon; Erick Denamur
Journal:  PLoS Pathog       Date:  2010-09-30       Impact factor: 6.823

5.  How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling.

Authors:  Sylvain Lespinats; Delphine Grando; Eric Maréchal; Mohamed-Ali Hakimi; Olivier Tenaillon; Olivier Bastien
Journal:  Evol Bioinform Online       Date:  2011-06-07       Impact factor: 1.625

6.  ColorPhylo: A Color Code to Accurately Display Taxonomic Classifications.

Authors:  Sylvain Lespinats; Bernard Fertil
Journal:  Evol Bioinform Online       Date:  2011-11-13       Impact factor: 1.625

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.