Interactive entity resolution in relational data: a visual analytic tool and its evaluation.

Hyunmo Kang, Lise Getoor, Ben Shneiderman, Mustafa Bilgic, Louis Licamele.

Abstract

Databases often contain uncertain and imprecise references to real-world entities. Entity resolution, the process of reconciling multiple references to underlying real-world entities, is an important data cleaning process required before accurate visualization or analysis of the data is possible. In many cases, in addition to noisy data describing entities, there is data describing the relationships among the entities. This relational data is important during the entity resolution process; it is useful both for the algorithms which determine likely database references to be resolved and for visual analytic tools which support the entity resolution process. In this paper, we introduce a novel user interface, D-Dupe, for interactive entity resolution in relational data. D-Dupe effectively combines relational entity resolution algorithms with a novel network visualization that enables users to make use of an entity's relational context for making resolution decisions. Since resolution decisions often are interdependent, D-Dupe facilitates understanding this complex process through animations which highlight combined inferences and a history mechanism which allows users to inspect chains of resolution decisions. An empirical study with 12 users confirmed the benefits of the relational context visualization on the performance of entity resolution tasks in relational data in terms of time as well as users' confidence and satisfaction.

Year: 2008    PMID: 18599913    DOI: 10.1109/TVCG.2008.55

Source DB: PubMed    Journal: IEEE Trans Vis Comput Graph    ISSN: 1077-2626    Impact factor: 4.579


  2 in total

1.  Privacy preserving interactive record linkage (PPIRL). (Review)

Authors:  Hye-Chung Kum; Ashok Krishnamurthy; Ashwin Machanavajjhala; Michael K Reiter; Stanley Ahalt
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

2.  StratomeX: Visual Analysis of Large-Scale Heterogeneous Genomics Data for Cancer Subtype Characterization.

Authors:  A Lex; M Streit; H-J Schulz; C Partl; D Schmalstieg; P J Park; N Gehlenborg
Journal:  Comput Graph Forum       Date:  2012-06-25       Impact factor: 2.078

