Literature DB >> 11294788

Basic Gene Grammars and DNA-ChartParser for language processing of Escherichia coli promoter DNA sequences.

S Leung 1, C Mellish, D Robertson.   

Abstract

MOTIVATION: The field of 'DNA linguistics' has emerged from pioneering work in computational linguistics and molecular biology. Most formal grammars in this field are expressed using Definite Clause Grammars but these have computational limitations which must be overcome. The present study provides a new DNA parsing system, comprising a logic grammar formalism called Basic Gene Grammars and a bidirectional chart parser DNA-ChartParser.
RESULTS: The use of Basic Gene Grammars is demonstrated in representing many formulations of the knowledge of Escherichia coli promoters, including knowledge acquired from human experts, consensus sequences, statistics (weight matrices), symbolic learning, and neural network learning. The DNA-ChartParser provides bidirectional parsing facilities for BGGs in handling overlapping categories, gap categories, approximate pattern matching, and constraints. Basic Gene Grammars and the DNA-ChartParser allowed different sources of knowledge for recognizing E.coli promoters to be combined to achieve better accuracy as assessed by parsing these DNA sequences in real-world data sets.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11294788     DOI: 10.1093/bioinformatics/17.3.226

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Data Compression Concepts and Algorithms and their Applications to Bioinformatics.

Authors:  O U Nalbantog̃lu; D J Russell; K Sayood
Journal:  Entropy (Basel)       Date:  2010-01-01       Impact factor: 2.524

2.  A composite method based on formal grammar and DNA structural features in detecting human polymerase II promoter region.

Authors:  Sutapa Datta; Subhasis Mukhopadhyay
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

Review 3.  Dichotomy in the definition of prescriptive information suggests both prescribed data and prescribed algorithms: biosemiotics applications in genomic systems.

Authors:  David J D'Onofrio; David L Abel; Donald E Johnson
Journal:  Theor Biol Med Model       Date:  2012-03-14       Impact factor: 2.432

4.  A grammar inference approach for predicting kinase specific phosphorylation sites.

Authors:  Sutapa Datta; Subhasis Mukhopadhyay
Journal:  PLoS One       Date:  2015-04-17       Impact factor: 3.240

5.  Context-driven discovery of gene cassettes in mobile integrons using a computational grammar.

Authors:  Guy Tsafnat; Enrico Coiera; Sally R Partridge; Jaron Schaeffer; Jon R Iredell
Journal:  BMC Bioinformatics       Date:  2009-09-08       Impact factor: 3.169

6.  Triad pattern algorithm for predicting strong promoter candidates in bacterial genomes.

Authors:  Michael Dekhtyar; Amelie Morin; Vehary Sakanyan
Journal:  BMC Bioinformatics       Date:  2008-05-09       Impact factor: 3.169

7.  Gains and unexpected lessons from genome-scale promoter mapping.

Authors:  K S Shavkunov; I S Masulis; M N Tutukina; A A Deev; O N Ozoline
Journal:  Nucleic Acids Res       Date:  2009-06-15       Impact factor: 16.971

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.