Literature DB >> 33816801

MCLEAN: Multilevel Clustering Exploration As Network.

Daniel Alcaide1,2, Jan Aerts1,2.   

Abstract

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs to be analyzed from different perspectives. Indeed, highly variable patterns may mask underlying trends in the dataset. Dendrograms are graphical representations resulting from agglomerative hierarchical clustering and provide a framework for viewing the clustering at different levels of detail. However, dendrograms become cluttered when the dataset gets large, and the single cut of the dendrogram to demarcate different clusters can be insufficient in heterogeneous datasets. In this work, we propose a visual analytics methodology called MCLEAN that offers a general approach for guiding the user through the exploration and detection of clusters. Powered by a graph-based transformation of the relational data, it supports a scalable environment for representation of heterogeneous datasets by changing the spatialization. We thereby combine multilevel representations of the clustered dataset with community finding algorithms. Our approach entails displaying the results of the heuristics to users, providing a setting from which to start the exploration and data analysis. To evaluate our proposed approach, we conduct a qualitative user study, where participants are asked to explore a heterogeneous dataset, comparing the results obtained by MCLEAN with the dendrogram. These qualitative results reveal that MCLEAN is an effective way of aiding users in the detection of clusters in heterogeneous datasets. The proposed methodology is implemented in an R package available at https://bitbucket.org/vda-lab/mclean. ©2018 Alcaide and Aerts.

Entities:  

Keywords:  Exploratory data analysis; Graph and network visualization; Hierarchical clustering; Visual analytics

Year:  2018        PMID: 33816801      PMCID: PMC7924466          DOI: 10.7717/peerj-cs.145

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  10 in total

1.  Graph visualization techniques for web clustering engines.

Authors:  Emilio Di Giacomo; Walter Didimo; Luca Grilli; Giuseppe Liotta
Journal:  IEEE Trans Vis Comput Graph       Date:  2007 Mar-Apr       Impact factor: 4.579

2.  ASK-GraphView: A large scale graph visualization system.

Authors:  James Abello; Frank van Ham; Neeraj Krishnan
Journal:  IEEE Trans Vis Comput Graph       Date:  2006 Sep-Oct       Impact factor: 4.579

3.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R.

Authors:  Peter Langfelder; Bin Zhang; Steve Horvath
Journal:  Bioinformatics       Date:  2007-11-16       Impact factor: 6.937

Review 4.  Maps of random walks on complex networks reveal community structure.

Authors:  Martin Rosvall; Carl T Bergstrom
Journal:  Proc Natl Acad Sci U S A       Date:  2008-01-23       Impact factor: 11.205

5.  GrouseFlocks: steerable exploration of graph hierarchy space.

Authors:  Daniel Archambault; Tamara Munzner; David Auber
Journal:  IEEE Trans Vis Comput Graph       Date:  2008 Jul-Aug       Impact factor: 4.579

6.  Constructing overview+detail dendrogram-matrix views.

Authors:  Jin Chen; Alan M MacEachren; Donna J Peuquet
Journal:  IEEE Trans Vis Comput Graph       Date:  2009 Nov-Dec       Impact factor: 4.579

7.  Spark: a navigational paradigm for genomic data exploration.

Authors:  Cydney B Nielsen; Hamid Younesy; Henriette O'Geen; Xiaoqin Xu; Andrew R Jackson; Aleksandar Milosavljevic; Ting Wang; Joseph F Costello; Martin Hirst; Peggy J Farnham; Steven J M Jones
Journal:  Genome Res       Date:  2012-09-07       Impact factor: 9.043

8.  Topological data analysis of biological aggregation models.

Authors:  Chad M Topaz; Lori Ziegelmeier; Tom Halverson
Journal:  PLoS One       Date:  2015-05-13       Impact factor: 3.240

9.  dendsort: modular leaf ordering methods for dendrogram representations in R.

Authors:  Ryo Sakai; Raf Winand; Toni Verbeiren; Andrew Vande Moere; Jan Aerts
Journal:  F1000Res       Date:  2014-07-30

10.  Semi-supervised adaptive-height snipping of the hierarchical clustering tree.

Authors:  Askar Obulkasim; Gerrit A Meijer; Mark A van de Wiel
Journal:  BMC Bioinformatics       Date:  2015-01-16       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.