Literature DB >> 11238068

A hierarchical unsupervised growing neural network for clustering gene expression patterns.

J Herrero1, A Valencia, J Dopazo.   

Abstract

MOTIVATION: We describe a new approach to the analysis of gene expression data coming from DNA array experiments, using an unsupervised neural network. DNA array technologies allow monitoring thousands of genes rapidly and efficiently. One of the interests of these studies is the search for correlated gene expression patterns, and this is usually achieved by clustering them. The Self-Organising Tree Algorithm, (SOTA) (Dopazo,J. and Carazo,J.M. (1997) J. Mol. Evol., 44, 226-233), is a neural network that grows adopting the topology of a binary tree. The result of the algorithm is a hierarchical cluster obtained with the accuracy and robustness of a neural network.
RESULTS: SOTA clustering confers several advantages over classical hierarchical clustering methods. SOTA is a divisive method: the clustering process is performed from top to bottom, i.e. the highest hierarchical levels are resolved before going to the details of the lowest levels. The growing can be stopped at the desired hierarchical level. Moreover, a criterion to stop the growing of the tree, based on the approximate distribution of probability obtained by randomisation of the original data set, is provided. By means of this criterion, a statistical support for the definition of clusters is proposed. In addition, obtaining average gene expression patterns is a built-in feature of the algorithm. Different neurons defining the different hierarchical levels represent the averages of the gene expression patterns contained in the clusters. Since SOTA runtimes are approximately linear with the number of items to be classified, it is especially suitable for dealing with huge amounts of data. The method proposed is very general and applies to any data providing that they can be coded as a series of numbers and that a computable measure of similarity between data items can be used. AVAILABILITY: A server running the program can be found at: http://bioinfo.cnio.es/sotarray.

Mesh:

Year:  2001        PMID: 11238068     DOI: 10.1093/bioinformatics/17.2.126

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  105 in total

1.  Systematic learning of gene functional classes from DNA array expression data by using multilayer perceptrons.

Authors:  Alvaro Mateos; Joaquín Dopazo; Ronald Jansen; Yuhai Tu; Mark Gerstein; Gustavo Stolovitzky
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

2.  GEPAS: A web-based resource for microarray gene expression data analysis.

Authors:  Javier Herrero; Fátima Al-Shahrour; Ramón Díaz-Uriarte; Alvaro Mateos; Juan M Vaquerizas; Javier Santoyo; Joaquín Dopazo
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Comparative analysis of the Arabidopsis pollen transcriptome.

Authors:  David Honys; David Twell
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

4.  The functional genomic distribution of protein divergence in two animal phyla: coevolution, genomic conflict, and constraint.

Authors:  Cristian I Castillo-Davis; Fyodor A Kondrashov; Daniel L Hartl; Rob J Kulathinal
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

5.  Transcriptomic and phenotypic analyses identify coregulated, overlapping regulons among PrfA, CtsR, HrcA, and the alternative sigma factors sigmaB, sigmaC, sigmaH, and sigmaL in Listeria monocytogenes.

Authors:  Soraya Chaturongakul; Sarita Raengpradub; M Elizabeth Palmer; Teresa M Bergholz; Renato H Orsi; Yuewei Hu; Juliane Ollinger; Martin Wiedmann; Kathryn J Boor
Journal:  Appl Environ Microbiol       Date:  2010-10-29       Impact factor: 4.792

6.  Applying pattern recognition methods to analyze the molecular properties of a homologous series of nitrogen mustard agents.

Authors:  Ronald Bartzatt; Laura Donigan
Journal:  AAPS PharmSciTech       Date:  2006-04-14       Impact factor: 3.246

Review 7.  Bioinformatics and cancer: an essential alliance.

Authors:  Joaquín Dopazo
Journal:  Clin Transl Oncol       Date:  2006-06       Impact factor: 3.405

8.  Hierarchical Bayesian neural network for gene expression temporal patterns.

Authors:  Yulan Liang; Arpad G Kelemen
Journal:  Stat Appl Genet Mol Biol       Date:  2004-09-03

9.  Towards answering biological questions with experimental evidence: automatically identifying text that summarize image content in full-text articles.

Authors:  Hong Yu
Journal:  AMIA Annu Symp Proc       Date:  2006

10.  Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data.

Authors:  Céline Becquet; Sylvain Blachon; Baptiste Jeudy; Jean-Francois Boulicaut; Olivier Gandrillon
Journal:  Genome Biol       Date:  2002-11-21       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.