Clustering gene expression patterns.

A Ben-Dor, R Shamir, Z Yakhini.

Abstract

Recent advances in biotechnology allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. Analysis of data produced by such experiments offers potential insight into gene function and regulatory mechanisms. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. The corresponding algorithmic problem is to cluster multicondition gene expression patterns. In this paper we describe a novel clustering algorithm that was developed for analysis of gene expression data. We define an appropriate stochastic error model on the input, and prove that under the conditions of the model, the algorithm recovers the cluster structure with high probability. The running time of the algorithm on an n-gene dataset is O(n^2 [log(n)]^c). We also present a practical heuristic based on the same algorithmic ideas. The heuristic was implemented and its performance is demonstrated on simulated data and on real gene expression data, with very promising results.
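
To make the clustering task concrete, the following is a minimal Python sketch of grouping genes by the similarity of their expression patterns. It is not the algorithm or the heuristic described in the paper, whose details the abstract does not give. It is a simple greedy grouping chosen for illustration, assuming Pearson correlation as the similarity measure. The function names, the threshold value, and the toy expression profiles are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # flat profile: treat as uncorrelated
    return cov / (sx * sy)


def greedy_cluster(profiles, threshold=0.8):
    """Group genes whose mean similarity to a growing cluster is >= threshold.

    Hypothetical helper for illustration only; NOT the paper's algorithm.
    """
    names = list(profiles)
    # Pairwise similarity matrix: n^2 entries for n genes.
    sim = {(g, h): pearson(profiles[g], profiles[h])
           for g in names for h in names}
    unassigned = list(names)
    clusters = []
    while unassigned:
        cluster = [unassigned.pop(0)]  # seed a new cluster
        changed = True
        while changed:  # repeat until no more genes can join
            changed = False
            for g in list(unassigned):
                affinity = sum(sim[g, m] for m in cluster) / len(cluster)
                if affinity >= threshold:
                    cluster.append(g)
                    unassigned.remove(g)
                    changed = True
        clusters.append(cluster)
    return clusters


# Toy data (made up): expression levels of four genes across four conditions.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [1.1, 2.1, 2.9, 4.2],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [3.9, 3.1, 2.2, 0.8],
}
clusters = greedy_cluster(profiles)
print(clusters)
```

On this toy input, the two rising profiles end up in one group and the two falling profiles in another. Computing the full pairwise similarity matrix alone takes a number of comparisons quadratic in the number of genes.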

Year:  1999        PMID: 10582567     DOI: 10.1089/106652799318274

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  136 in total

1.  Assessing clusters and motifs from gene expression data.

Authors:  L M Jakt; L Cao; K S Cheah; D K Smith
Journal:  Genome Res       Date:  2001-01       Impact factor: 9.043

2.  Finding genes in the C2C12 osteogenic pathway by k-nearest-neighbor classification of expression data.

Authors:  Joachim Theilhaber; Timothy Connolly; Sergio Roman-Roman; Steven Bushnell; Amanda Jackson; Kathy Call; Teresa Garcia; Roland Baron
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

3.  Structure and evolution of the hAT transposon superfamily.

Authors:  E Rubin; G Lithwick; A A Levy
Journal:  Genetics       Date:  2001-07       Impact factor: 4.562

4.  Systematic learning of gene functional classes from DNA array expression data by using multilayer perceptrons.

Authors:  Alvaro Mateos; Joaquín Dopazo; Ronald Jansen; Yuhai Tu; Mark Gerstein; Gustavo Stolovitzky
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

5.  Identification of the binding sites of regulatory proteins in bacterial genomes.

Authors:  Hao Li; Virgil Rhodius; Carol Gross; Eric D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-14       Impact factor: 11.205

6.  ESPD: a pattern detection model underlying gene expression profiles.

Authors:  Chun Tang; Aidong Zhang; Murali Ramanathan
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

7.  Integration of genomic datasets to predict protein complexes in yeast.

Authors:  Ronald Jansen; Ning Lan; Jiang Qian; Mark Gerstein
Journal:  J Struct Funct Genomics       Date:  2002

8.  Algorithms for optimization of the transport system in living and artificial cells.

Authors:  A V Melkikh; M I Sutormina
Journal:  Syst Synth Biol       Date:  2011-06-17

9.  Translational bioinformatics and healthcare informatics: computational and ethical challenges.

Authors:  Prerna Sethi; Kimberly Theodos
Journal:  Perspect Health Inf Manag       Date:  2009-09-16

10.  A structured approach to predictive modeling of a two-class problem using multidimensional data sets. (Review)

Authors:  Heidi Spratt; Hyunsu Ju; Allan R Brasier
Journal:  Methods       Date:  2013-01-12       Impact factor: 3.608
