Literature DB >> 22034367

Improved similarity trees and their application to visual data classification.

Jose Gustavo S Paiva1, Laura Florian-Cruz, Helio Pedrini, Guilherme P Telles, Rosane Minghim.   

Abstract

An alternative form to multidimensional projections for the visual analysis of data represented in multidimensional spaces is the deployment of similarity trees, such as Neighbor Joining trees. They organize data objects on the visual plane emphasizing their levels of similarity with high capability of detecting and separating groups and subgroups of objects. Besides this similarity-based hierarchical data organization, some of their advantages include the ability to decrease point clutter; high precision; and a consistent view of the data set during focusing, offering a very intuitive way to view the general structure of the data set as well as to drill down to groups and subgroups of interest. Disadvantages of similarity trees based on neighbor joining strategies include their computational cost and the presence of virtual nodes that utilize too much of the visual space. This paper presents a highly improved version of the similarity tree technique. The improvements in the technique are given by two procedures. The first is a strategy that replaces virtual nodes by promoting real leaf nodes to their place, saving large portions of space in the display and maintaining the expressiveness and precision of the technique. The second improvement is an implementation that significantly accelerates the algorithm, impacting its use for larger data sets. We also illustrate the applicability of the technique in visual data mining, showing its advantages to support visual classification of data sets, with special attention to the case of image classification. We demonstrate the capabilities of the tree for analysis and iterative manipulation and employ those capabilities to support evolving to a satisfactory data organization and classification.
© 2011 IEEE

Entities:  

Year:  2011        PMID: 22034367     DOI: 10.1109/TVCG.2011.212

Source DB:  PubMed          Journal:  IEEE Trans Vis Comput Graph        ISSN: 1077-2626            Impact factor:   4.579


  3 in total

1.  Interactive Machine Learning by Visualization: A Small Data Solution.

Authors:  Huang Li; Shiaofen Fang; Snehasis Mukhopadhyay; Andrew J Saykin; Li Shen
Journal:  Proc IEEE Int Conf Big Data       Date:  2019-01-24

2.  Integrative analysis to select cancer candidate biomarkers to targeted validation.

Authors:  Rebeca Kawahara; Gabriela V Meirelles; Henry Heberle; Romênia R Domingues; Daniela C Granato; Sami Yokoo; Rafael R Canevarolo; Flavia V Winck; Ana Carolina P Ribeiro; Thaís Bianca Brandão; Paulo R Filgueiras; Karen S P Cruz; José Alexandre Barbuto; Ronei J Poppi; Rosane Minghim; Guilherme P Telles; Felipe Paiva Fonseca; Jay W Fox; Alan R Santos-Silva; Ricardo D Coletta; Nicholas E Sherman; Adriana F Paes Leme
Journal:  Oncotarget       Date:  2015-12-22

3.  Live neighbor-joining.

Authors:  Guilherme P Telles; Graziela S Araújo; Maria E M T Walter; Marcelo M Brigido; Nalvo F Almeida
Journal:  BMC Bioinformatics       Date:  2018-05-16       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.