Literature DB >> 26357159

Comparing Clusterings Using Bertin's Idea.

A Pilhofer1, A Gribov, A Unwin.   

Abstract

Classifying a set of objects into clusters can be done in numerous ways, producing different results. They can be visually compared using contingency tables, mosaicplots, fluctuation diagrams, tableplots, (modified) parallel coordinates plots, Parallel Sets plots or circos diagrams. Unfortunately the interpretability of all these graphical displays decreases rapidly with the numbers of categories and clusterings. In his famous book A Semiology of Graphics Bertin writes "the discovery of an ordered concept appears as the ultimate point in logical simplification since it permits reducing to a single instant the assimilation of series which previously required many instants of study". Or in more everyday language, if you use good orderings you can see results immediately that with other orderings might take a lot of effort. This is also related to the idea of effect ordering, that data should be organised to reflect the effect you want to observe. This paper presents an efficient algorithm based on Bertin's idea and concepts related to Kendall's t, which finds informative joint orders for two or more nominal classification variables. We also show how these orderings improve the various displays and how groups of corresponding categories can be detected using a top-down partitioning algorithm. Different clusterings based on data on the environmental performance of cars sold in Germany are used for illustration. All presented methods are available in the R package extracat which is used to compute the optimized orderings for the example dataset.

Year:  2012        PMID: 26357159     DOI: 10.1109/TVCG.2012.207

Source DB:  PubMed          Journal:  IEEE Trans Vis Comput Graph        ISSN: 1077-2626            Impact factor:   4.579


  2 in total

1.  Domino: Extracting, Comparing, and Manipulating Subsets Across Multiple Tabular Datasets.

Authors:  Samuel Gratzl; Nils Gehlenborg; Alexander Lex; Hanspeter Pfister; Marc Streit
Journal:  IEEE Trans Vis Comput Graph       Date:  2014-12       Impact factor: 4.579

2.  XCluSim: a visual analytics tool for interactively comparing multiple clustering results of bioinformatics data.

Authors:  Sehi L'Yi; Bongkyung Ko; DongHwa Shin; Young-Joon Cho; Jaeyong Lee; Bohyoung Kim; Jinwook Seo
Journal:  BMC Bioinformatics       Date:  2015-08-13       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.