Literature DB >> 23859273

A heuristic cluster-based EM algorithm for the planted (l, d) problem.

Yipu Zhang1, Hongwei Huo, Qiang Yu.   

Abstract

The planted motif search problem arises from locating the transcription factor binding sites (TFBSs) which are crucial for understanding the gene regulatory relationship. Many attempts in using expectation maximization for TFBSs discovery are successful in past. However, identifying highly degenerate motifs and reducing the effect of local optima are still an arduous task. To alleviate the vulnerability of EM to local optima trapping, we present a heuristic cluster-based EM algorithm, CEM, which refines the cluster subsets in EM method to explore the best local optimal solution. Based on experiments using both synthetic and real datasets, our algorithm demonstrates significant improvements in identifying the motif instances and performs better than current widely used algorithms. CEM is a novel planted motif finding algorithm, which is able to solve the challenging instances and easy to parallel since the process of solving each cluster subset is independent.

Mesh:

Substances:

Year:  2013        PMID: 23859273     DOI: 10.1142/S0219720013500091

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  2 in total

1.  An Entropy-Based Position Projection Algorithm for Motif Discovery.

Authors:  Yipu Zhang; Ping Wang; Maode Yan
Journal:  Biomed Res Int       Date:  2016-11-02       Impact factor: 3.411

2.  A Fast Cluster Motif Finding Algorithm for ChIP-Seq Data Sets.

Authors:  Yipu Zhang; Ping Wang
Journal:  Biomed Res Int       Date:  2015-07-05       Impact factor: 3.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.