Literature DB >> 24079964

A new measure for gene expression biclustering based on non-parametric correlation.

Jose L Flores1, Iñaki Inza, Pedro Larrañaga, Borja Calvo.   

Abstract

BACKGROUND: One of the emerging techniques for performing the analysis of the DNA microarray data known as biclustering is the search of subsets of genes and conditions which are coherently expressed. These subgroups provide clues about the main biological processes. Until now, different approaches to this problem have been proposed. Most of them use the mean squared residue as quality measure but relevant and interesting patterns can not be detected such as shifting, or scaling patterns. Furthermore, recent papers show that there exist new coherence patterns involved in different kinds of cancer and tumors such as inverse relationships between genes which can not be captured.
RESULTS: The proposed measure is called Spearman's biclustering measure (SBM) which performs an estimation of the quality of a bicluster based on the non-linear correlation among genes and conditions simultaneously. The search of biclusters is performed by using a evolutionary technique called estimation of distribution algorithms which uses the SBM measure as fitness function. This approach has been examined from different points of view by using artificial and real microarrays. The assessment process has involved the use of quality indexes, a set of bicluster patterns of reference including new patterns and a set of statistical tests. It has been also examined the performance using real microarrays and comparing to different algorithmic approaches such as Bimax, CC, OPSM, Plaid and xMotifs.
CONCLUSIONS: SBM shows several advantages such as the ability to recognize more complex coherence patterns such as shifting, scaling and inversion and the capability to selectively marginalize genes and conditions depending on the statistical significance.
Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

Entities:  

Keywords:  Artificial intelligence; Biclustering; Biomedicine; Machine learning

Mesh:

Year:  2013        PMID: 24079964     DOI: 10.1016/j.cmpb.2013.07.025

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  5 in total

1.  A New Approach for Mining Order-Preserving Submatrices Based on All Common Subsequences.

Authors:  Yun Xue; Zhengling Liao; Meihang Li; Jie Luo; Qiuhua Kuang; Xiaohui Hu; Tiechen Li
Journal:  Comput Math Methods Med       Date:  2015-05-28       Impact factor: 2.238

2.  biDCG: a new method for discovering global features of DNA microarray data via an iterative re-clustering procedure.

Authors:  Chia-Pei Chen; Hsieh Fushing; Rob Atwill; Patrice Koehl
Journal:  PLoS One       Date:  2014-07-21       Impact factor: 3.240

3.  MCbiclust: a novel algorithm to discover large-scale functionally related gene sets from massive transcriptomics data collections.

Authors:  Robert B Bentham; Kevin Bryson; Gyorgy Szabadkai
Journal:  Nucleic Acids Res       Date:  2017-09-06       Impact factor: 16.971

Review 4.  Quality measures for gene expression biclusters.

Authors:  Beatriz Pontes; Ral Girldez; Jess S Aguilar-Ruiz
Journal:  PLoS One       Date:  2015-03-12       Impact factor: 3.240

5.  Pairwise gene GO-based measures for biclustering of high-dimensional expression data.

Authors:  Juan A Nepomuceno; Alicia Troncoso; Isabel A Nepomuceno-Chamorro; Jesús S Aguilar-Ruiz
Journal:  BioData Min       Date:  2018-03-27       Impact factor: 2.522

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.