Literature DB >> 15865625

Ab initio identification of putative human transcription factor binding sites by comparative genomics.

D Corà1, C Herrmann, C Dieterich, F Di Cunto, P Provero, M Caselle.   

Abstract

BACKGROUND: Understanding transcriptional regulation of gene expression is one of the greatest challenges of modern molecular biology. A central role in this mechanism is played by transcription factors, which typically bind to specific, short DNA sequence motifs usually located in the upstream region of the regulated genes. We discuss here a simple and powerful approach for the ab initio identification of these cis-regulatory motifs. The method we present integrates several elements: human-mouse comparison, statistical analysis of genomic sequences and the concept of coregulation. We apply it to a complete scan of the human genome.
RESULTS: By using the catalogue of conserved upstream sequences collected in the CORG database we construct sets of genes sharing the same overrepresented motif (short DNA sequence) in their upstream regions both in human and in mouse. We perform this construction for all possible motifs from 5 to 8 nucleotides in length and then filter the resulting sets looking for two types of evidence of coregulation: first, we analyze the Gene Ontology annotation of the genes in the set, searching for statistically significant common annotations; second, we analyze the expression profiles of the genes in the set as measured by microarray experiments, searching for evidence of coexpression. The sets which pass one or both filters are conjectured to contain a significant fraction of coregulated genes, and the upstream motifs characterizing the sets are thus good candidates to be the binding sites of the TF's involved in such regulation. In this way we find various known motifs and also some new candidate binding sites.
CONCLUSION: We have discussed a new integrated algorithm for the "ab initio" identification of transcription factor binding sites in the human genome. The method is based on three ingredients: comparative genomics, overrepresentation, different types of coregulation. The method is applied to a full-scan of the human genome, giving satisfactory results.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15865625      PMCID: PMC1097714          DOI: 10.1186/1471-2105-6-110

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  38 in total

1.  Analysis of vertebrate SCL loci identifies conserved enhancers.

Authors:  B Göttgens; L M Barton; J G Gilbert; A J Bench; M J Sanchez; S Bahn; S Mistry; D Grafham; A McMurray; M Vaudin; E Amaya; D R Bentley; A R Green; A M Sinclair
Journal:  Nat Biotechnol       Date:  2000-02       Impact factor: 54.908

2.  An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences.

Authors:  C E Lawrence; A A Reilly
Journal:  Proteins       Date:  1990

3.  Identifying protein-binding sites from unaligned DNA fragments.

Authors:  G D Stormo; G W Hartzell
Journal:  Proc Natl Acad Sci U S A       Date:  1989-02       Impact factor: 11.205

4.  Regulation of the cyclin E gene by transcription factor E2F1.

Authors:  K Ohtani; J DeGregori; J R Nevins
Journal:  Proc Natl Acad Sci U S A       Date:  1995-12-19       Impact factor: 11.205

5.  Strong conservation of non-coding sequences during vertebrates evolution: potential involvement in post-transcriptional regulation of gene expression.

Authors:  L Duret; F Dorkeld; C Gautier
Journal:  Nucleic Acids Res       Date:  1993-05-25       Impact factor: 16.971

6.  Structure of a cluster of mouse histone genes.

Authors:  D B Sittman; R A Graves; W F Marzluff
Journal:  Nucleic Acids Res       Date:  1983-10-11       Impact factor: 16.971

7.  Candidate regulatory sequence elements for cell cycle-dependent transcription in Saccharomyces cerevisiae.

Authors:  T G Wolfsberg; A E Gabrielian; M J Campbell; R J Cho; J L Spouge; D Landsman
Journal:  Genome Res       Date:  1999-08       Impact factor: 9.043

8.  Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies.

Authors:  J van Helden; B André; J Collado-Vides
Journal:  J Mol Biol       Date:  1998-09-04       Impact factor: 5.469

9.  Paired MyoD-binding sites regulate myosin light chain gene expression.

Authors:  B M Wentworth; M Donoghue; J C Engert; E B Berglund; N Rosenthal
Journal:  Proc Natl Acad Sci U S A       Date:  1991-02-15       Impact factor: 11.205

Review 10.  Of mice and men: phylogenetic footprinting aids the discovery of regulatory elements.

Authors:  Zhaolei Zhang; Mark Gerstein
Journal:  J Biol       Date:  2003-06-06
View more
  17 in total

1.  Identification of functional TFAP2A and SP1 binding sites in new TFAP2A-modulated genes.

Authors:  Francesca Orso; Davide Corà; Benedetta Ubezio; Paolo Provero; Michele Caselle; Daniela Taverna
Journal:  BMC Genomics       Date:  2010-06-03       Impact factor: 3.969

2.  CircuitsDB: a database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse.

Authors:  Olivier Friard; Angela Re; Daniela Taverna; Michele De Bortoli; Davide Corá
Journal:  BMC Bioinformatics       Date:  2010-08-23       Impact factor: 3.169

3.  Inflammatory gene regulatory networks in amnion cells following cytokine stimulation: translational systems approach to modeling human parturition.

Authors:  Ruth Li; William E Ackerman; Taryn L Summerfield; Lianbo Yu; Parul Gulati; Jie Zhang; Kun Huang; Roberto Romero; Douglas A Kniss
Journal:  PLoS One       Date:  2011-06-02       Impact factor: 3.240

4.  Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns.

Authors:  Jérémy Gruel; Michel LeBorgne; Nolwenn LeMeur; Nathalie Théret
Journal:  BMC Bioinformatics       Date:  2011-09-12       Impact factor: 3.169

5.  A survey of motif discovery methods in an integrated framework.

Authors:  Geir Kjetil Sandve; Finn Drabløs
Journal:  Biol Direct       Date:  2006-04-06       Impact factor: 4.540

6.  The identification of functional motifs in temporal gene expression analysis.

Authors:  Jiuzhou Song; Jaime Bjarnason; Michael G Surette
Journal:  Evol Bioinform Online       Date:  2007-02-27       Impact factor: 1.625

7.  c-REDUCE: incorporating sequence conservation to detect motifs that correlate with expression.

Authors:  Katerina Kechris; Hao Li
Journal:  BMC Bioinformatics       Date:  2008-11-28       Impact factor: 3.169

8.  Short Promoters in Viral Vectors Drive Selective Expression in Mammalian Inhibitory Neurons, but do not Restrict Activity to Specific Inhibitory Cell-Types.

Authors:  Jason L Nathanson; Roberto Jappelli; Eric D Scheeff; Gerard Manning; Kunihiko Obata; Sydney Brenner; Edward M Callaway
Journal:  Front Neural Circuits       Date:  2009-11-09       Impact factor: 3.492

9.  Genome-wide survey of microRNA-transcription factor feed-forward regulatory circuits in human.

Authors:  Angela Re; Davide Corá; Daniela Taverna; Michele Caselle
Journal:  Mol Biosyst       Date:  2009-06-19

10.  Positional distribution of human transcription factor binding sites.

Authors:  Mark Koudritsky; Eytan Domany
Journal:  Nucleic Acids Res       Date:  2008-10-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.