Literature DB >> 21775309

coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq data.

Mengyuan Xu1, Clarice R Weinberg, David M Umbach, Leping Li.   

Abstract

MOTIVATION: ChIP-seq data are enriched in binding sites for the protein immunoprecipitated. Some sequences may also contain binding sites for a coregulator. Biologists are interested in knowing which coregulatory factor motifs may be present in the sequences bound by the protein ChIP'ed.
RESULTS: We present a finite mixture framework with an expectation-maximization algorithm that considers two motifs jointly and simultaneously determines which sequences contain both motifs, either one or neither of them. Tested on 10 simulated ChIP-seq datasets, our method performed better than repeated application of MEME in predicting sequences containing both motifs. When applied to a mouse liver Foxa2 ChIP-seq dataset involving ~ 12 000 400-bp sequences, coMOTIF identified co-occurrence of Foxa2 with Hnf4a, Cebpa, E-box, Ap1/Maf or Sp1 motifs in ~6-33% of these sequences. These motifs are either known as liver-specific transcription factors or have an important role in liver function. AVAILABILITY: Freely available at http://www.niehs.nih.gov/research/resources/software/comotif/. CONTACT: li3@niehs.nih.gov SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21775309      PMCID: PMC3179653          DOI: 10.1093/bioinformatics/btr397

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  De novo cis-regulatory module elicitation for eukaryotic genomes.

Authors:  Mayetri Gupta; Jun S Liu
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-09       Impact factor: 11.205

2.  fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control.

Authors:  Leping Li; Robert L Bass; Yu Liang
Journal:  Bioinformatics       Date:  2008-02-22       Impact factor: 6.937

3.  The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation.

Authors:  Edgar Wingender
Journal:  Brief Bioinform       Date:  2008-04-24       Impact factor: 11.622

4.  An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences.

Authors:  C E Lawrence; A A Reilly
Journal:  Proteins       Date:  1990

5.  Methods for calculating the probabilities of finding patterns in sequences.

Authors:  R Staden
Journal:  Comput Appl Biosci       Date:  1989-04

6.  Fitting a mixture model by expectation maximization to discover motifs in biopolymers.

Authors:  T L Bailey; C Elkan
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1994

7.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation.

Authors:  F P Roth; J D Hughes; P W Estep; G M Church
Journal:  Nat Biotechnol       Date:  1998-10       Impact factor: 54.908

8.  Assessing computational tools for the discovery of transcription factor binding sites.

Authors:  Martin Tompa; Nan Li; Timothy L Bailey; George M Church; Bart De Moor; Eleazar Eskin; Alexander V Favorov; Martin C Frith; Yutao Fu; W James Kent; Vsevolod J Makeev; Andrei A Mironov; William Stafford Noble; Giulio Pavesi; Graziano Pesole; Mireille Régnier; Nicolas Simonis; Saurabh Sinha; Gert Thijs; Jacques van Helden; Mathias Vandenbogaert; Zhiping Weng; Christopher Workman; Chun Ye; Zhou Zhu
Journal:  Nat Biotechnol       Date:  2005-01       Impact factor: 54.908

9.  NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence.

Authors:  Thomas A Down; Tim J P Hubbard
Journal:  Nucleic Acids Res       Date:  2005-03-10       Impact factor: 16.971

10.  The Gibbs Centroid Sampler.

Authors:  William A Thompson; Lee A Newberg; Sean Conlan; Lee Ann McCue; Charles E Lawrence
Journal:  Nucleic Acids Res       Date:  2007-05-05       Impact factor: 16.971

View more
  5 in total

1.  Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements.

Authors:  Harendra Guturu; Andrew C Doxey; Aaron M Wenger; Gill Bejerano
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2013-11-11       Impact factor: 6.237

2.  Modular discovery of monomeric and dimeric transcription factor binding motifs for large data sets.

Authors:  Jarkko Toivonen; Teemu Kivioja; Arttu Jolma; Yimeng Yin; Jussi Taipale; Esko Ukkonen
Journal:  Nucleic Acids Res       Date:  2018-05-04       Impact factor: 16.971

3.  YY1TargetDB: an integral information resource for Yin Yang 1 target loci.

Authors:  Andy M Guo; Kun Sun; Xiaoxi Su; Huating Wang; Hao Sun
Journal:  Database (Oxford)       Date:  2013-02-14       Impact factor: 3.451

4.  MODER2: first-order Markov modeling and discovery of monomeric and dimeric binding motifs.

Authors:  Jarkko Toivonen; Pratyush K Das; Jussi Taipale; Esko Ukkonen
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

5.  MatrixCatch--a novel tool for the recognition of composite regulatory elements in promoters.

Authors:  Igor V Deyneko; Alexander E Kel; Olga V Kel-Margoulis; Elena V Deineko; Edgar Wingender; Siegfried Weiss
Journal:  BMC Bioinformatics       Date:  2013-08-08       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.