S Dong, D B Searls.
Abstract
The higher-order structure of genes and other features of biological sequences can be described by means of formal grammars. These grammars can then be used by general-purpose parsers to detect and to assemble such structures by means of syntactic pattern recognition. We describe a grammar and parser for eukaryotic protein-encoding genes, which by some measures is as effective as current connectionist and combinatorial algorithms in predicting gene structures for sequence database entries. Parameters of the grammar rules are optimized for several different species, and mixing experiments are performed to determine the degree of species specificity and the relative importance of compositional, signal-based, and syntactic components in gene prediction.
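As a rough illustration of what describing gene structure with a grammar can look like, the sketch below enumerates parses under a toy rule set: gene -> exon (intron exon)*, intron -> "GT" ... "AG", with the spliced exons required to form an open reading frame. This is not the authors' grammar or parser. The rules, the function names (`introns`, `is_orf`, `parses`), and the `min_len` parameter are all assumptions made for illustration, and the exhaustive enumeration is only practical for very short sequences.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def introns(seq, min_len=4):
    """Yield (start, end) spans for the toy rule: intron -> 'GT' ... 'AG'."""
    for i in range(len(seq) - 1):
        if seq[i:i + 2] == "GT":
            for j in range(i + min_len, len(seq) + 1):
                if seq[j - 2:j] == "AG":
                    yield i, j


def is_orf(cds):
    """Toy rule: cds -> 'ATG' codon* stop, with no earlier in-frame stop."""
    if len(cds) < 6 or len(cds) % 3 or not cds.startswith("ATG"):
        return False
    codons = [cds[k:k + 3] for k in range(0, len(cds), 3)]
    return codons[-1] in STOP_CODONS and not any(
        c in STOP_CODONS for c in codons[:-1]
    )


def parses(seq):
    """Return every exon layout (list of spans) whose spliced CDS is an ORF."""
    spans = list(introns(seq))
    results = []

    def extend(pos, chosen):
        # Derive exon spans from the introns chosen so far.
        exons, prev = [], 0
        for s, e in chosen:
            exons.append((prev, s))
            prev = e
        exons.append((prev, len(seq)))
        cds = "".join(seq[a:b] for a, b in exons)
        # Accept only layouts with non-empty exons and a valid ORF.
        if all(b > a for a, b in exons) and is_orf(cds):
            results.append(exons)
        # Try adding one more intron to the right of the last one.
        for s, e in spans:
            if s >= pos:
                extend(e, chosen + [(s, e)])

    extend(0, [])
    return results


if __name__ == "__main__":
    # Hypothetical sequence: exon ATGGCC, intron GTATGAAG, exon TTTTAA.
    print(parses("ATGGCCGTATGAAGTTTTAA"))
```

In this sketch the syntactic constraint (exons and introns must alternate and splice into one reading frame) is what rejects the unspliced reading of the example sequence. A realistic system would also score signals and composition rather than accept every match.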
Year: 1994 PMID: 7851881 DOI: 10.1006/geno.1994.1541
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736