Literature DB >> 11156620

Assessing clusters and motifs from gene expression data.

L M Jakt1, L Cao, K S Cheah, D K Smith.   

Abstract

Large-scale gene expression studies and genomic sequencing projects are providing vast amounts of information that can be used to identify or predict cellular regulatory processes. Genes can be clustered on the basis of the similarity of their expression profiles or function and these clusters are likely to contain genes that are regulated by the same transcription factors. Searches for cis-regulatory elements can then be undertaken in the noncoding regions of the clustered genes. However, it is necessary to assess the efficiency of both the gene clustering and the postulated regulatory motifs, as there are many difficulties associated with clustering and determining the functional relevance of matches to sequence motifs. We have developed a method to assess the potential functional significance of clusters and motifs based on the probability of finding a certain number of matches to a motif in all of the gene clusters. To avoid problems with threshold scores for a match, the top matches to a motif are taken in several sample sizes. Genes from a sample are then counted by the cluster in which they appear. The probability of observing these counts by chance is calculated using the hypergeometric distribution. Because of the multiple sample sizes, strong and weak matching motifs can be detected and refined and significant matches to motifs across cluster boundaries are observed as all clusters are considered. By applying this method to many motifs and to a cluster set of yeast genes, we detected a similarity between Swi Five Factor and forkhead proteins and suggest that the currently unidentified Swi Five Factor is one of the yeast forkhead proteins.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11156620      PMCID: PMC311053          DOI: 10.1101/gr.148301

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  54 in total

Review 1.  Regulatory elements and expression profiles.

Authors:  P Bucher
Journal:  Curr Opin Struct Biol       Date:  1999-06       Impact factor: 6.809

Review 2.  Exploring expression data: identification and analysis of coexpressed genes.

Authors:  L J Heyer; S Kruglyak; S Yooseph
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

3.  ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons.

Authors:  F Corpet; F Servant; J Gouzy; D Kahn
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  TRANSFAC: an integrated system for gene expression regulation.

Authors:  E Wingender; X Chen; R Hehl; H Karas; I Liebich; V Matys; T Meinhardt; M Prüss; I Reuter; F Schacherer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

5.  Data analysis and integration: of steps and arrows.

Authors:  M Bittner; P Meltzer; J Trent
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

6.  Knowledge-based analysis of microarray gene expression data by using support vector machines.

Authors:  M P Brown; W N Grundy; D Lin; N Cristianini; C W Sugnet; T S Furey; M Ares; D Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2000-01-04       Impact factor: 11.205

7.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.

Authors:  P Tamayo; D Slonim; J Mesirov; Q Zhu; S Kitareewan; E Dmitrovsky; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

8.  The yeast proteome database (YPD) and Caenorhabditis elegans proteome database (WormPD): comprehensive resources for the organization and comparison of model organism protein information.

Authors:  M C Costanzo; J D Hogan; M E Cusick; B P Davis; A M Fancher; P E Hodges; P Kondu; C Lengieza; J E Lew-Smith; C Lingner; K J Roberg-Perez; M Tillberg; J E Brooks; J I Garrels
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

9.  DNA-binding requirements of the yeast protein Rap1p as selected in silico from ribosomal protein gene promoter sequences.

Authors:  R F Lascaris; W H Mager; R J Planta
Journal:  Bioinformatics       Date:  1999-04       Impact factor: 6.937

10.  The fork head transcription factor Hcm1p participates in the regulation of SPC110, which encodes the calmodulin-binding protein in the yeast spindle pole body.

Authors:  G Zhu; T N Davis
Journal:  Biochim Biophys Acta       Date:  1998-12-10
View more
  17 in total

1.  Expression profiling of human tumors: the end of surgical pathology?

Authors:  M Ladanyi; W C Chan; T J Triche; W L Gerald
Journal:  J Mol Diagn       Date:  2001-08       Impact factor: 5.568

2.  EZ-Retrieve: a web-server for batch retrieval of coordinate-specified human DNA sequences and underscoring putative transcription factor-binding sites.

Authors:  Haibo Zhang; Y Ramanathan; Patricia Soteropoulos; Michael L Recce; Peter P Tolias
Journal:  Nucleic Acids Res       Date:  2002-11-01       Impact factor: 16.971

3.  The p44/wdr77-dependent cellular proliferation process during lung development is reactivated in lung cancer.

Authors:  Z Gu; F Zhang; Z-Q Wang; W Ma; R E Davis; Z Wang
Journal:  Oncogene       Date:  2012-06-04       Impact factor: 9.867

4.  An approach to identify over-represented cis-elements in related sequences.

Authors:  Jiashun Zheng; Jiajin Wu; Zhirong Sun
Journal:  Nucleic Acids Res       Date:  2003-04-01       Impact factor: 16.971

5.  Judging the quality of gene expression-based clustering methods using gene annotation.

Authors:  Francis D Gibbons; Frederick P Roth
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

6.  Using the gene ontology for microarray data mining: a comparison of methods and application to age effects in human prefrontal cortex.

Authors:  Paul Pavlidis; Jie Qin; Victoria Arango; John J Mann; Etienne Sibille
Journal:  Neurochem Res       Date:  2004-06       Impact factor: 3.996

7.  Transcriptional regulation: a genomic overview.

Authors:  José Luis Riechmann
Journal:  Arabidopsis Book       Date:  2002-04-04

8.  Gene set-based module discovery decodes cis-regulatory codes governing diverse gene expression across human multiple tissues.

Authors:  Atsushi Niida; Seiya Imoto; Rui Yamaguchi; Masao Nagasaki; Satoru Miyano
Journal:  PLoS One       Date:  2010-06-09       Impact factor: 3.240

9.  The identification of functional motifs in temporal gene expression analysis.

Authors:  Jiuzhou Song; Jaime Bjarnason; Michael G Surette
Journal:  Evol Bioinform Online       Date:  2007-02-27       Impact factor: 1.625

10.  DISCLOSE : DISsection of CLusters Obtained by SEries of transcriptome data using functional annotations and putative transcription factor binding sites.

Authors:  Evert-Jan Blom; Sacha A F T van Hijum; Klaas J Hofstede; Remko Silvis; Jos B T M Roerdink; Oscar P Kuipers
Journal:  BMC Bioinformatics       Date:  2008-12-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.