Literature DB >> 7851881

Gene structure prediction by linguistic methods.

S Dong1, D B Searls.   

Abstract

The higher-order structure of genes and other features of biological sequences can be described by means of formal grammars. These grammars can then be used by general-purpose parsers to detect and to assemble such structures by means of syntactic pattern recognition. We describe a grammar and parser for eukaryotic protein-encoding genes, which by some measures is as effective as current connectionist and combinatorial algorithms in predicting gene structures for sequence database entries. Parameters of the grammar rules are optimized for several different species, and mixing experiments are performed to determine the degree of species specificity and the relative importance of compositional, signal-based, and syntactic components in gene prediction.

Mesh:

Year:  1994        PMID: 7851881     DOI: 10.1006/geno.1994.1541

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  17 in total

1.  Evaluation of gene-finding programs on mammalian sequences.

Authors:  S Rogic; A K Mackworth; F B Ouellette
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

Review 2.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

3.  Gene prediction by spectral rotation measure: a new method for identifying protein-coding regions.

Authors:  Daniel Kotlar; Yizhar Lavner
Journal:  Genome Res       Date:  2003-07-17       Impact factor: 9.043

4.  EpoDB: a database of genes expressed during vertebrate erythropoiesis.

Authors:  F Salas; J Haas; B Brunk; C J Stoeckert; G C Overton
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

5.  Logitlinear models for the prediction of splice sites in plant pre-mRNA sequences.

Authors:  J Kleffe; K Hermann; W Vahrson; B Wittig; V Brendel
Journal:  Nucleic Acids Res       Date:  1996-12-01       Impact factor: 16.971

6.  Mathematical model to predict regions of chromatin attachment to the nuclear matrix.

Authors:  G B Singh; J A Kramer; S A Krawetz
Journal:  Nucleic Acids Res       Date:  1997-04-01       Impact factor: 16.971

Review 7.  Pragmatic turn in biology: From biological molecules to genetic content operators.

Authors:  Guenther Witzany
Journal:  World J Biol Chem       Date:  2014-08-26

8.  Gene recognition via spliced sequence alignment.

Authors:  M S Gelfand; A A Mironov; P A Pevzner
Journal:  Proc Natl Acad Sci U S A       Date:  1996-08-20       Impact factor: 11.205

9.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

10.  Modeling structure-function relationships in synthetic DNA sequences using attribute grammars.

Authors:  Yizhi Cai; Matthew W Lux; Laura Adam; Jean Peccoud
Journal:  PLoS Comput Biol       Date:  2009-10-09       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.