Literature DB >> 17155095

Subdimension-based similarity measure for DNA microarray data clustering.

Benson S Y Lam1, Hong Yan.   

Abstract

Microarray data analysis is useful for understanding biological processes. A number of clustering algorithms have been used to achieve this task. However, the performance of these methods can be significantly degraded due to the presence of nonsignificant conditions. In this paper, we propose a robust clustering algorithm based on a similarity measure. The key concept of the proposed similarity measure is to measure the similarity between two data points by their subdimensions. For example, assume that x1, x2, and x3 are ten-dimensional data vectors. The data point x3 is said to be closer to x1 than x2 if more than half of the dimensions of x1 and x3 are closer to x1 than x2. Thus, if two patterns are very similar except for a small amount of features, this measure will preserve the similarity. We have performed eight experiments to test the robustness of the proposed method, including three synthetic data sets, three real world data sets, and two microarray data sets. We also have compared the proposed method with four different clustering algorithms. Experimental results show that the proposed method yields better results than existing clustering algorithms.

Mesh:

Year:  2006        PMID: 17155095     DOI: 10.1103/PhysRevE.74.041906

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  2 in total

1.  Entropy based sub-dimensional evaluation and selection method for DNA microarray data classification.

Authors:  Yi Wang; Hong Yan
Journal:  Bioinformation       Date:  2008-11-03

2.  Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces.

Authors:  Hongya Zhao; Debby D Wang; Long Chen; Xinyu Liu; Hong Yan
Journal:  PLoS One       Date:  2016-09-06       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.