Literature DB >> 20703684

Evaluating cluster preservation in frequent itemset integration for distributed databases.

Sumeet Dua1, Michael P Dessauer, Prerna Sethi.   

Abstract

Medical sciences are rapidly emerging as a data rich discipline where the amount of databases and their dimensionality increases exponentially with time. Data integration algorithms often rely upon discovering embedded, useful, and novel relationships between feature attributes that describe the data. Such algorithms require data integration prior to knowledge discovery, which can lack the timeliness, scalability, robustness, and reliability of discovered knowledge. Knowledge integration algorithms offer pattern discovery on segmented and distributed databases but require sophisticated methods for pattern merging and evaluating integration quality. We propose a unique computational framework for discovering and integrating frequent sets of features from distributed databases and then exploiting them for unsupervised learning from the integrated space. Assorted indices of cluster quality are used to assess the accuracy of knowledge merging. The approach preserves significant cluster quality under various cluster distributions and noise conditions. Exhaustive experimentation is performed to further evaluate the scalability and robustness of the proposed methodology.

Mesh:

Year:  2010        PMID: 20703684     DOI: 10.1007/s10916-010-9512-1

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  1 in total

1.  Associative Classification of Mammograms using Weighted Rules.

Authors:  Sumeet Dua; Harpreet Singh; H W Thompson
Journal:  Expert Syst Appl       Date:  2009-07-01       Impact factor: 6.954

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.