Literature DB >> 27260783

Information bottleneck based incremental fuzzy clustering for large biomedical data.

Yongli Liu1, Xing Wan2.   

Abstract

Incremental fuzzy clustering combines advantages of fuzzy clustering and incremental clustering, and therefore is important in classifying large biomedical literature. Conventional algorithms, suffering from data sparsity and high-dimensionality, often fail to produce reasonable results and may even assign all the objects to a single cluster. In this paper, we propose two incremental algorithms based on information bottleneck, Single-Pass fuzzy c-means (spFCM-IB) and Online fuzzy c-means (oFCM-IB). These two algorithms modify conventional algorithms by considering different weights for each centroid and object and scoring mutual information loss to measure the distance between centroids and objects. spFCM-IB and oFCM-IB are used to group a collection of biomedical text abstracts from Medline database. Experimental results show that clustering performances of our approaches are better than such prominent counterparts as spFCM, spHFCM, oFCM and oHFCM, in terms of accuracy.
Copyright © 2016 Elsevier Inc. All rights reserved.

Keywords:  Fuzzy clustering; Incremental clustering; Information bottleneck

Mesh:

Year:  2016        PMID: 27260783     DOI: 10.1016/j.jbi.2016.05.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  2 in total

1.  A fuzzy co-clustering algorithm for biomedical data.

Authors:  Yongli Liu; Shuai Wu; Zhizhong Liu; Hao Chao
Journal:  PLoS One       Date:  2017-04-26       Impact factor: 3.240

2.  Incremental fuzzy C medoids clustering of time series data using dynamic time warping distance.

Authors:  Yongli Liu; Jingli Chen; Shuai Wu; Zhizhong Liu; Hao Chao
Journal:  PLoS One       Date:  2018-05-24       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.