
A New-Fangled FES-k-Means Clustering Algorithm for Disease Discovery and Visual Analytics.

Tonny J Oyana

Abstract

This study further evaluates the performance of the Fast, Efficient, and Scalable k-means algorithm (FES-k-means), a new algorithm designed to increase the overall efficiency of the original k-means clustering technique. The FES-k-means algorithm uses a hybrid approach that combines a k-d tree data structure, which speeds up nearest-neighbor queries, with the original k-means algorithm and an adaptation rate proposed by Mashor. The algorithm was tested on two real datasets and one synthetic dataset. To evaluate its competence, it was run twice on each of the three datasets: once on data trained by the innovative MIL-SOM method and once on the actual untrained data. This two-step approach of training data prior to clustering provides a solid foundation for knowledge discovery and data mining that clustering methods alone do not offer. The method produces clusters similar to those of the original k-means method at a much faster rate, as shown by runtime comparison data, and it provides efficient analysis of large geospatial data with implications for disease mechanism discovery. From a disease mechanism discovery perspective, it is hypothesized that the linear-like pattern of elevated blood lead levels discovered in the city of Chicago may be spatially linked to the city's water service lines.
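
The hybrid idea described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation. It assumes a k-d tree is rebuilt over the current centroids in each iteration to answer nearest-centroid queries, and it uses a fixed `rate` parameter as a placeholder for Mashor's adaptation rate, whose formula the abstract does not give. All function and parameter names (`build_kdtree`, `nearest`, `kdtree_kmeans`, `rate`) are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def build_kdtree(items, depth=0):
    """Build a k-d tree from (index, point) pairs; a node is a tuple or None."""
    if not items:
        return None
    axis = depth % len(items[0][1])
    items = sorted(items, key=lambda it: it[1][axis])
    mid = len(items) // 2
    return (items[mid], axis,
            build_kdtree(items[:mid], depth + 1),
            build_kdtree(items[mid + 1:], depth + 1))

def nearest(node, target, best=None):
    """Return (squared distance, index) of the stored point closest to target."""
    if node is None:
        return best
    (idx, point), axis, left, right = node
    d = sq_dist(point, target)
    if best is None or d < best[0]:
        best = (d, idx)
    diff = target[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, target, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if diff * diff < best[0]:
        best = nearest(far, target, best)
    return best

def kdtree_kmeans(data, k, iters=20, rate=0.8, seed=0):
    """Batch k-means with k-d tree assignment and a damped centroid update.

    `rate` is an assumed stand-in for an adaptation rate, not Mashor's
    formula; rate=1.0 gives the standard k-means centroid update.
    """
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(data, k)]
    labels = [0] * len(data)
    for _ in range(iters):
        tree = build_kdtree(list(enumerate(centroids)))
        labels = [nearest(tree, p)[1] for p in data]
        for j in range(k):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if not members:
                continue  # keep the centroid of an empty cluster unchanged
            mean = [sum(c) / len(members) for c in zip(*members)]
            centroids[j] = [c + rate * (m - c)
                            for c, m in zip(centroids[j], mean)]
    return centroids, labels

# Toy usage on two well-separated groups of 2-D points.
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 10), (10, 11), (11, 10), (11, 11)]
centroids, labels = kdtree_kmeans(data, k=2)
print(labels)
```

Building the tree over the centroids pays off only when the number of clusters is large; for small k a linear scan over the centroids would likely be just as fast. The sketch is meant to show the structure of the approach, not to reproduce the reported runtimes.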

Year:  2010        PMID: 20689710      PMCID: PMC3171363          DOI: 10.1155/2010/746021

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145


