Literature DB >> 16873465

MotifCut: regulatory motifs finding with maximum density subgraphs.

Eugene Fratkin1, Brian T Naughton, Douglas L Brutlag, Serafim Batzoglou.   

Abstract

MOTIVATION: DNA motif finding is one of the core problems in computational biology, for which several probabilistic and discrete approaches have been developed. Most existing methods formulate motif finding as an intractable optimization problem and rely either on expectation maximization (EM) or on local heuristic searches. Another challenge is the choice of motif model: simpler models such as the position-specific scoring matrix (PSSM) impose biologically unrealistic assumptions such as independence of the motif positions, while more involved models are harder to parametrize and learn.
RESULTS: We present MotifCut, a graph-theoretic approach to motif finding leading to a convex optimization problem with a polynomial time solution. We build a graph where the vertices represent all k-mers in the input sequences, and edges represent pairwise k-mer similarity. In this graph, we search for a motif as the maximum density subgraph, which is a set of k-mers that exhibit a large number of pairwise similarities. Our formulation does not make strong assumptions regarding the structure of the motif and in practice both motifs that fit well the PSSM model, and those that exhibit strong dependencies between position pairs are found as dense subgraphs. We benchmark MotifCut on both synthetic and real yeast motifs, and find that it compares favorably to existing popular methods. The ability of MotifCut to detect motifs appears to scale well with increasing input size. Moreover, the motifs we discover are different from those discovered by the other methods. AVAILABILITY: MotifCut server and other materials can be found at motifcut.stanford.edu.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16873465     DOI: 10.1093/bioinformatics/btl243

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

Authors:  Can Alkan; Maria Francesca Cardone; Claudia Rita Catacchio; Francesca Antonacci; Stephen J O'Brien; Oliver A Ryder; Stefania Purgato; Monica Zoli; Giuliano Della Valle; Evan E Eichler; Mario Ventura
Journal:  Genome Res       Date:  2010-11-16       Impact factor: 9.043

2.  Circulating microRNA trafficking and regulation: computational principles and practice.

Authors:  Juan Cui; Jiang Shu
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

3.  Motif discovery and transcription factor binding sites before and after the next-generation sequencing era.

Authors:  Federico Zambelli; Graziano Pesole; Giulio Pavesi
Journal:  Brief Bioinform       Date:  2012-04-19       Impact factor: 11.622

4.  RecMotif: a novel fast algorithm for weak motif discovery.

Authors:  He Quan Sun; Malcolm Yoke Hean Low; Wen Jing Hsu; Jagath C Rajapakse
Journal:  BMC Bioinformatics       Date:  2010-12-14       Impact factor: 3.169

5.  MotifClick: prediction of cis-regulatory binding sites via merging cliques.

Authors:  Shaoqiang Zhang; Shan Li; Meng Niu; Phuc T Pham; Zhengchang Su
Journal:  BMC Bioinformatics       Date:  2011-06-16       Impact factor: 3.169

6.  A highly efficient and effective motif discovery method for ChIP-seq/ChIP-chip data using positional information.

Authors:  Xiaotu Ma; Ashwinikumar Kulkarni; Zhihua Zhang; Zhenyu Xuan; Robert Serfling; Michael Q Zhang
Journal:  Nucleic Acids Res       Date:  2012-01-06       Impact factor: 16.971

7.  Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis.

Authors:  Je-Keun Rhee; Je-Gun Joung; Jeong-Ho Chang; Zhangjun Fei; Byoung-Tak Zhang
Journal:  BMC Genomics       Date:  2009-12-03       Impact factor: 3.969

8.  PairMotif: A new pattern-driven algorithm for planted (l, d) DNA motif search.

Authors:  Qiang Yu; Hongwei Huo; Yipu Zhang; Hongzhi Guo
Journal:  PLoS One       Date:  2012-10-31       Impact factor: 3.240

9.  PairMotif+: a fast and effective algorithm for de novo motif discovery in DNA sequences.

Authors:  Qiang Yu; Hongwei Huo; Yipu Zhang; Hongzhi Guo; Haitao Guo
Journal:  Int J Biol Sci       Date:  2013-04-29       Impact factor: 6.580

10.  Efficient exact motif discovery.

Authors:  Tobias Marschall; Sven Rahmann
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.