Literature DB >> 26208367

Use of Semisupervised Clustering and Feature-Selection Techniques for Identification of Co-expressed Genes.

Sriparna Saha, Abhay Kumar Alok, Asif Ekbal.   

Abstract

Studying the patterns hidden in gene-expression data helps to understand the functionality of genes. In general, clustering techniques are widely used for the identification of natural partitionings from the gene expression data. In order to put constraints on dimensionality, feature selection is the key issue because not all features are important from clustering point of view. Moreover some limited amount of supervised information can help to fine tune the obtained clustering solution. In this paper, the problem of simultaneous feature selection and semisupervised clustering is formulated as a multiobjective optimization (MOO) task. A modern simulated annealing-based MOO technique namely AMOSA is utilized as the background optimization methodology. Here, features and cluster centers are represented in the form of a string and the assignment of genes to different clusters is done using a point symmetry-based distance. Six optimization criteria based on several internal and external cluster validity indices are utilized. In order to generate the supervised information, a popular clustering technique, Fuzzy C-mean, is utilized. Appropriate subset of features, proper number of clusters and the proper partitioning are determined using the search capability of AMOSA. The effectiveness of this proposed semisupervised clustering technique, Semi-FeaClustMOO, is demonstrated on five publicly available benchmark gene-expression datasets. Comparison results with the existing techniques for gene-expression data clustering again reveal the superiority of the proposed technique. Statistical and biological significance tests have also been carried out.

Mesh:

Year:  2015        PMID: 26208367     DOI: 10.1109/JBHI.2015.2451735

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  3 in total

1.  Unsupervised gene selection using biological knowledge : application in sample clustering.

Authors:  Sudipta Acharya; Sriparna Saha; N Nikhil
Journal:  BMC Bioinformatics       Date:  2017-11-22       Impact factor: 3.169

2.  Ringed Seal Search for Global Optimization via a Sensitive Search Model.

Authors:  Younes Saadi; Iwan Tri Riyadi Yanto; Tutut Herawan; Vimala Balakrishnan; Haruna Chiroma; Anhar Risnumawan
Journal:  PLoS One       Date:  2016-01-20       Impact factor: 3.240

3.  A multi-objective gene clustering algorithm guided by apriori biological knowledge with intensification and diversification strategies.

Authors:  Jorge Parraga-Alava; Marcio Dorn; Mario Inostroza-Ponta
Journal:  BioData Min       Date:  2018-08-07       Impact factor: 2.522

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.