Literature DB >> 27714008

Importance of proximity measures in clustering of cancer and miRNA datasets: proposal of an automated framework.

Sudipta Acharya1, Sriparna Saha1.   

Abstract

Distance plays an important role in the clustering process for allocating data points to different clusters. Several distance or proximity measures have been developed and reported in the literature to determine dissimilarities between two given points. The choice of distance measure depends on a particular domain as well as different data sets of the same domain. It is important to automatically determine the appropriate distance measure which acts best for a particular data set. In this study we have developed an automatic clustering technique using the search capability of multiobjective optimization which can automatically determine the relevant distance measure and the corresponding partitioning from a given data set. Our proposed automated framework is generic in nature i.e., any number of different distance measures can be incorporated into it. In our work we have used four existing widely used distance measures, i.e., Euclidean, line symmetry, point symmetry and city block distance to be explored for each data set. In order to measure the richness of an obtained partitioning using a particular distance, four cluster validity indices, the Silhouette index, the DB index, the adjusted rand index and classification accuracy are used. A new encoding strategy which can encode the set of cluster centers and the particular distance function is used to represent the problem. The appropriate distance function and the corresponding partitioning are determined using the search capability of a multiobjective optimization based technique. The efficiency of the proposed technique is shown on clustering three microRNA and three microarray gene expression data sets having varying complexities. The results show the usefulness of the proposed automated approach.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27714008     DOI: 10.1039/c6mb00609d

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  1 in total

1.  Circular RNA Complement Factor H (CFH) Promotes Glioma Progression by Sponging miR-149 and Regulating AKT1.

Authors:  Aimiao Bian; Yanping Wang; Ji Liu; Xiaodong Wang; Dai Liu; Jian Jiang; Lianshu Ding; Xiaobo Hui
Journal:  Med Sci Monit       Date:  2018-08-16
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.