Literature DB >> 29116820

GSMC: Combining Parallel Gibbs Sampling with Maximal Cliques for Hunting DNA Motif.

Chao Pei1, Shu-Lin Wang1, Jianwen Fang2, Wei Zhang1.   

Abstract

Regulatory elements are responsible for regulating gene transcription. Therefore, identification of these elements is a tremendous challenge in the field of gene expression. Transcription factors (TFs) play a key role in gene regulation by binding to target promoter sequences. A set of conserved sequence patterns with a highly similar structure that is bound by a TF is called a motif. Motif discovery has been a difficult problem over the past decades. Meanwhile, it is a foundation stone in meeting this challenge. Recent advances in obtaining genomic sequences and high-throughput gene expression analysis techniques have enabled the rapid development of computational methods for motif discovery. As a result, a large number of motif-finding algorithms aiming at various motif models have sprung up in the past few years. However, most of them are not suitable for analysis of the large data sets generated by next-generation sequencing. To better handle large-scale ChIP-Seq data and achieve better performance in computational time and motif detection accuracy, we propose an excellent motif-finding algorithm known as GSMC (Combining Parallel Gibbs Sampling with Maximal Cliques for hunting DNA Motif). The GSMC algorithm consists of two steps. First, we employ the commonly used Gibbs sampling to generating initial motifs. Second, we utilize maximal cliques to cluster motifs according to Similarity with Position Information Contents (SPIC). Consequently, we raise the detection accuracy in a great degree, in the meantime holding comparative computation efficiency. In addition, we can find much more credible cofactor interacting motifs.

Entities:  

Keywords:  DNA motif; Gibbs sampling; SPIC; cofactor motif; maximal cliques

Mesh:

Substances:

Year:  2017        PMID: 29116820      PMCID: PMC5749607          DOI: 10.1089/cmb.2017.0100

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  23 in total

1.  TRANSFAC: an integrated system for gene expression regulation.

Authors:  E Wingender; X Chen; R Hehl; H Karas; I Liebich; V Matys; T Meinhardt; M Prüss; I Reuter; F Schacherer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae.

Authors:  J D Hughes; P W Estep; S Tavazoie; G M Church
Journal:  J Mol Biol       Date:  2000-03-10       Impact factor: 5.469

3.  ANN-Spec: a method for discovering transcription factor binding sites with improved specificity.

Authors:  C T Workman; G D Stormo
Journal:  Pac Symp Biocomput       Date:  2000

4.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Authors:  Albin Sandelin; Wynand Alkema; Pär Engström; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

5.  Mining ChIP-chip data for transcription factor and cofactor binding sites.

Authors:  Andrew D Smith; Pavel Sumazin; Debopriya Das; Michael Q Zhang
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

6.  An Efficient Algorithm for Discovering Motifs in Large DNA Data Sets.

Authors:  Qiang Yu; Hongwei Huo; Xiaoyang Chen; Haitao Guo; Jeffrey Scott Vitter; Jun Huan
Journal:  IEEE Trans Nanobioscience       Date:  2015-04-09       Impact factor: 2.935

7.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.

Authors:  C E Lawrence; S F Altschul; M S Boguski; J S Liu; A F Neuwald; J C Wootton
Journal:  Science       Date:  1993-10-08       Impact factor: 47.728

8.  SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents.

Authors:  Shaoqiang Zhang; Xiguo Zhou; Chuanbin Du; Zhengchang Su
Journal:  BMC Syst Biol       Date:  2013-12-17

9.  An integrated encyclopedia of DNA elements in the human genome.

Authors: 
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

Review 10.  A survey of DNA motif finding algorithms.

Authors:  Modan K Das; Ho-Kwok Dai
Journal:  BMC Bioinformatics       Date:  2007-11-01       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.