Literature DB >> 15388800

Bipartite pattern discovery by entropy minimization-based multiple local alignment.

Chengpeng Bi1, Peter K Rogan.   

Abstract

Many multimeric transcription factors recognize DNA sequence patterns by cooperatively binding to bipartite elements composed of half sites separated by a flexible spacer. We developed a novel bipartite algorithm, bipartite pattern discovery (Bipad), which produces a mathematical model based on information maximization or Shannon's entropy minimization principle, for discovery of bipartite sequence patterns. Bipad is a C++ program that applies greedy methods to search the bipartite alignment space and examines the upstream or downstream regions of co-regulated genes, looking for cis-regulatory bipartite patterns. An input sequence file with zero or one site per locus is required, and the left and right motif widths and a range of possible gap lengths must be specified. Bipad can run in either single-block or bipartite pattern search modes, and it is capable of comprehensively searching all four orientations of half-site patterns. Simulation studies showed that the accuracy of this motif discovery algorithm depends on sample size and motif conservation level, but results were independent of background composition. Bipad performed equivalent with or better than other pattern search algorithms in correctly identifying Escherichia coli cyclic AMP receptor protein and Bacillus subtilis sigma factor binding site sequences based on experimentally defined benchmarks. Finally, a new bipartite information weight matrix for vitamin D3 receptor/retinoid X receptor alpha (VDR/RXRalpha) binding sites was derived that comprehensively models the natural variability inherent in these sequence elements.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15388800      PMCID: PMC521645          DOI: 10.1093/nar/gkh825

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  35 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  Identifying target sites for cooperatively binding factors.

Authors:  D GuhaThakurta; G D Stormo
Journal:  Bioinformatics       Date:  2001-07       Impact factor: 6.937

3.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Authors:  Albin Sandelin; Wynand Alkema; Pär Engström; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  BioOptimizer: a Bayesian scoring function approach to motif discovery.

Authors:  Shane T Jensen; Jun S Liu
Journal:  Bioinformatics       Date:  2004-02-12       Impact factor: 6.937

5.  Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments.

Authors:  L R Cardon; G D Stormo
Journal:  J Mol Biol       Date:  1992-01-05       Impact factor: 5.469

6.  Evidence for regulation of amelogenin gene expression by 1,25-dihydroxyvitamin D(3) in vivo.

Authors:  P Papagerakis; D Hotton; F Lezot; S Brookes; W Bonass; C Robinson; N Forest; A Berdal
Journal:  J Cell Biochem       Date:  1999-12       Impact factor: 4.429

7.  Methods for calculating the probabilities of finding patterns in sequences.

Authors:  R Staden
Journal:  Comput Appl Biosci       Date:  1989-04

8.  Using Dirichlet mixture priors to derive hidden Markov models for protein families.

Authors:  M Brown; R Hughey; A Krogh; I S Mian; K Sjölander; D Haussler
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1993

9.  Information content of individual genetic sequences.

Authors:  T D Schneider
Journal:  J Theor Biol       Date:  1997-12-21       Impact factor: 2.691

10.  All natural DR3-type vitamin D response elements show a similar functionality in vitro.

Authors:  A Toell; P Polly; C Carlberg
Journal:  Biochem J       Date:  2000-12-01       Impact factor: 3.857

View more
  17 in total

1.  Discovering gapped binding sites of yeast transcription factors.

Authors:  Chien-Yu Chen; Huai-Kuang Tsai; Chen-Ming Hsu; Mei-Ju May Chen; Hao-Geng Hung; Grace Tzu-Wei Huang; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2008-02-13       Impact factor: 11.205

2.  Discovery and validation of information theory-based transcription factor and cofactor binding site motifs.

Authors:  Ruipeng Lu; Eliseos J Mucaki; Peter K Rogan
Journal:  Nucleic Acids Res       Date:  2017-03-17       Impact factor: 16.971

3.  FANCM c.5791C>T nonsense mutation (rs144567652) induces exon skipping, affects DNA repair activity and is a familial breast cancer risk factor.

Authors:  Paolo Peterlongo; Irene Catucci; Mara Colombo; Laura Caleca; Eliseos Mucaki; Massimo Bogliolo; Maria Marin; Francesca Damiola; Loris Bernard; Valeria Pensotti; Sara Volorio; Valentina Dall'Olio; Alfons Meindl; Claus Bartram; Christian Sutter; Harald Surowy; Valérie Sornin; Marie-Gabrielle Dondon; Séverine Eon-Marchais; Dominique Stoppa-Lyonnet; Nadine Andrieu; Olga M Sinilnikova; Gillian Mitchell; Paul A James; Ella Thompson; Marina Marchetti; Cristina Verzeroli; Carmen Tartari; Gabriele Lorenzo Capone; Anna Laura Putignano; Maurizio Genuardi; Veronica Medici; Isabella Marchi; Massimo Federico; Silvia Tognazzo; Laura Matricardi; Simona Agata; Riccardo Dolcetti; Lara Della Puppa; Giulia Cini; Viviana Gismondi; Valeria Viassolo; Chiara Perfumo; Maria Antonietta Mencarelli; Margherita Baldassarri; Bernard Peissel; Gaia Roversi; Valentina Silvestri; Piera Rizzolo; Francesca Spina; Caterina Vivanet; Maria Grazia Tibiletti; Maria Adelaide Caligo; Gaetana Gambino; Stefania Tommasi; Brunella Pilato; Carlo Tondini; Chiara Corna; Bernardo Bonanni; Monica Barile; Ana Osorio; Javier Benitez; Luisa Balestrino; Laura Ottini; Siranoush Manoukian; Marco A Pierotti; Alessandra Renieri; Liliana Varesco; Fergus J Couch; Xianshu Wang; Peter Devilee; Florentine S Hilbers; Christi J van Asperen; Alessandra Viel; Marco Montagna; Laura Cortesi; Orland Diez; Judith Balmaña; Jan Hauke; Rita K Schmutzler; Laura Papi; Miguel Angel Pujana; Conxi Lázaro; Anna Falanga; Kenneth Offit; Joseph Vijai; Ian Campbell; Barbara Burwinkel; Anders Kvist; Hans Ehrencrona; Sylvie Mazoyer; Sara Pizzamiglio; Paolo Verderio; Jordi Surralles; Peter K Rogan; Paolo Radice
Journal:  Hum Mol Genet       Date:  2015-06-30       Impact factor: 6.150

4.  BML: a versatile web server for bipartite motif discovery.

Authors:  Mohammad Vahed; Majid Vahed; Lana X Garmire
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  Modular discovery of monomeric and dimeric transcription factor binding motifs for large data sets.

Authors:  Jarkko Toivonen; Teemu Kivioja; Arttu Jolma; Yimeng Yin; Jussi Taipale; Esko Ukkonen
Journal:  Nucleic Acids Res       Date:  2018-05-04       Impact factor: 16.971

6.  Finding subtypes of transcription factor motif pairs with distinct regulatory roles.

Authors:  Abha Singh Bais; Naftali Kaminski; Panayiotis V Benos
Journal:  Nucleic Acids Res       Date:  2011-04-12       Impact factor: 16.971

7.  AMD, an automated motif discovery tool using stepwise refinement of gapped consensuses.

Authors:  Jiantao Shi; Wentao Yang; Mingjie Chen; Yanzhi Du; Ji Zhang; Kankan Wang
Journal:  PLoS One       Date:  2011-09-12       Impact factor: 3.240

8.  BIPAD: a web server for modeling bipartite sequence elements.

Authors:  Chengpeng Bi; Peter K Rogan
Journal:  BMC Bioinformatics       Date:  2006-02-17       Impact factor: 3.169

9.  Practical strategies for discovering regulatory DNA sequence motifs.

Authors:  Kenzie D MacIsaac; Ernest Fraenkel
Journal:  PLoS Comput Biol       Date:  2006-04       Impact factor: 4.475

10.  Identification of thyroid hormone receptor binding sites in developing mouse cerebellum.

Authors:  Remi Gagne; James R Green; Hongyan Dong; Mike G Wade; Carole L Yauk
Journal:  BMC Genomics       Date:  2013-05-23       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.