Literature DB >> 21030733

A cluster refinement algorithm for motif discovery.

Gang Li1, Tak-Ming Chan, Kwong-Sak Leung, Kin-Hong Lee.   

Abstract

Finding Transcription Factor Binding Sites, i.e., motif discovery, is crucial for understanding the gene regulatory relationship. Motifs are weakly conserved and motif discovery is an NP-hard problem. We propose a new approach called Cluster Refinement Algorithm for Motif Discovery (CRMD). CRMD employs a flexible statistical motif model allowing a variable number of motifs and motif instances. CRMD first uses a novel entropy-based clustering to find complete and good starting candidate motifs from the DNA sequences. CRMD then employs an effective greedy refinement to search for optimal motifs from the candidate motifs. The refinement is fast, and it changes the number of motif instances based on the adaptive thresholds. The performance of CRMD is further enhanced if the problem has one occurrence of motif instance per sequence. Using an appropriate similarity test of motifs, CRMD is also able to find multiple motifs. CRMD has been tested extensively on synthetic and real data sets. The experimental results verify that CRMD usually outperforms four other state-of-the-art algorithms in terms of the qualities of the solutions with competitive computing time. It finds a good balance between finding true motif instances and screening false motif instances, and is robust on problems of various levels of difficulty.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21030733     DOI: 10.1109/TCBB.2009.25

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  5 in total

1.  Sequence homology in eukaryotes (SHOE): interactive visual tool for promoter analysis.

Authors:  Natalia Polouliakh; Paul Horton; Kazuhiro Shibanai; Kodai Takata; Vanessa Ludwig; Samik Ghosh; Hiroaki Kitano
Journal:  BMC Genomics       Date:  2018-09-27       Impact factor: 3.969

2.  A fast weak motif-finding algorithm based on community detection in graphs.

Authors:  Caiyan Jia; Matthew B Carson; Jian Yu
Journal:  BMC Bioinformatics       Date:  2013-07-17       Impact factor: 3.169

3.  An Affinity Propagation-Based DNA Motif Discovery Algorithm.

Authors:  Chunxiao Sun; Hongwei Huo; Qiang Yu; Haitao Guo; Zhigang Sun
Journal:  Biomed Res Int       Date:  2015-08-10       Impact factor: 3.411

4.  A new exhaustive method and strategy for finding motifs in ChIP-enriched regions.

Authors:  Caiyan Jia; Matthew B Carson; Yang Wang; Youfang Lin; Hui Lu
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

5.  Discovery of survival factor for primitive chronic myeloid leukemia cells using induced pluripotent stem cells.

Authors:  Kran Suknuntha; Yuki Ishii; Lihong Tao; Kejin Hu; Brian E McIntosh; David Yang; Scott Swanson; Ron Stewart; Jean Y J Wang; James Thomson; Igor Slukvin
Journal:  Stem Cell Res       Date:  2015-10-31       Impact factor: 2.020

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.