Syntactic recognition of regulatory regions in Escherichia coli.

D A Rosenblueth, D Thieffry, A M Huerta, H Salgado, J Collado-Vides.

Abstract

MOTIVATION: Weight matrices are one of the most common methods for identifying cis-regulatory sites in DNA regulatory regions, as several articles in this issue attest. An alternative way to strengthen computational predictions in regulatory regions is to develop methods that incorporate more of the biological properties present in such DNA regions. The grammatical implementation presented in this paper provides a concrete example in this direction.
RESULTS: On the basis of the analysis of an exhaustive collection of regulatory regions in Escherichia coli, a grammatical model for the regulatory regions of sigma 70 promoters has been developed. The terminal symbols of the grammar represent individual sites for the binding of activator and repressor proteins, and include the precise position of sites in relation to transcription initiation. Combining these symbols, the grammar generates a large number of different sentences, each of which can be searched for matching against a collection of regulatory regions by means of weight matrices specific for each set of sites for individual proteins. On the basis of this grammatical model, a Prolog syntactic recognizer is presented here. Specific subgrammars for ArgR, LexA and TyrR were implemented. When parsing a collection of 128 sigma 70 promoter regions, the syntactic recognizer produces a much lower number of false-positive sites than the standard search using weight matrices.
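
As a rough illustration of why positional information can reduce false positives, the sketch below scans a sequence with a log-odds weight matrix and then keeps only the hits that fall inside an allowed window relative to transcription initiation. It is not the authors' Prolog recognizer and does not model the grammar's combination of sites into sentences. The function names, the toy binding sites, the threshold and the allowed window are all assumptions made for this example.

```python
import math

def log_odds_matrix(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds weight matrix from equal-length aligned sites."""
    width = len(sites[0])
    total = len(sites) + 4 * pseudocount
    matrix = []
    for i in range(width):
        column = [site[i] for site in sites]
        matrix.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return matrix

def scan(sequence, matrix, threshold):
    """Return (start, score) for every window scoring at or above threshold."""
    width = len(matrix)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(column[base] for column, base in zip(matrix, window))
        if score >= threshold:
            hits.append((start, score))
    return hits

def filter_by_position(hits, tss_index, allowed):
    """Keep hits whose start, relative to transcription initiation, is in the allowed range."""
    low, high = allowed
    return [
        (start - tss_index, score)
        for start, score in hits
        if low <= start - tss_index <= high
    ]

# Toy data (hypothetical, not taken from the paper).
toy_sites = ["CTGTAT", "CTGTAT", "CTGTAC", "CTGTAT"]
sequence = "CTGTAT" + "G" * 10 + "CTGTAT" + "G" * 4
tss_index = 24           # assumed index of the transcription start in `sequence`
allowed = (-15, 0)       # assumed positional window for this kind of site

matrix = log_odds_matrix(toy_sites)
hits = scan(sequence, matrix, threshold=6.0)
kept = filter_by_position(hits, tss_index, allowed)

print("matrix-only hits (start index):", [start for start, _ in hits])
print("hits kept after positional filter (relative to start):", [pos for pos, _ in kept])
```

In this toy case the matrix alone reports two matches, and the positional constraint discards the one that lies too far upstream. A grammar of the kind described in the abstract goes further by constraining which sites may occur together.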

Year:  1996        PMID: 8996790     DOI: 10.1093/bioinformatics/12.5.415

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  4 in total

1.  A motif co-occurrence approach for genome-wide prediction of transcription-factor-binding sites in Escherichia coli.

Authors:  Martha L Bulyk; Abigail M McGuire; Nobuhisa Masuda; George M Church
Journal:  Genome Res       Date:  2004-02       Impact factor: 9.043

2.  Operons in Escherichia coli: genomic analyses and predictions.

Authors:  H Salgado; G Moreno-Hagelsieb; T F Smith; J Collado-Vides
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-06       Impact factor: 11.205

3.  Data Compression Concepts and Algorithms and their Applications to Bioinformatics.

Authors:  O U Nalbantoğlu; D J Russell; K Sayood
Journal:  Entropy (Basel)       Date:  2010-01-01       Impact factor: 2.524

4.  Bioinformatics in Latin America and SoIBio impact, a tale of spin-off and expansion around genomes and protein structures.

Authors:  Javier De Las Rivas; Cesar Bonavides-Martínez; Francisco Jose Campos-Laborie
Journal:  Brief Bioinform       Date:  2019-03-22       Impact factor: 11.622
