
Prediction of gene structure.

R Guigó, S Knudsen, N Drake, T Smith.

Abstract

We have developed a hierarchical rule base system for identifying genes in DNA sequences. Atomic sites (such as initiation codons, stop codons, acceptor sites and donor sites) are identified by a number of different methods and evaluated by a set of filters and rules chosen to maximize sensitivity; these are combined into higher-order gene elements (such as exons), evaluated, filtered and combined as equivalence classes into probable genes, which are evaluated and ranked. The system has been tested on an extensive collection of vertebrate genes smaller than 15,000 bases. Results obtained show that, on average, 88% of the predicted coding region for a transcription unit is actually coding, and 80% of the actual coding is correctly predicted. This will, in most applications, be sufficient for a search against protein sequence databases for the identification of probable gene function. In addition, the system provides a general test platform for both gene atomic site identification and the rules for their evaluation and assembly.
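The two percentages quoted above can be read as base-level overlap measures between predicted and annotated coding regions. The abstract does not say exactly how they were computed, so the following is only a minimal sketch under that assumption. The function names, the half-open (start, end) interval convention, and the toy coordinates are all hypothetical and do not come from the paper.

```python
def coding_positions(exons):
    """Expand half-open (start, end) intervals into a set of base positions."""
    positions = set()
    for start, end in exons:
        positions.update(range(start, end))
    return positions


def coding_accuracy(predicted_exons, actual_exons):
    """Return two base-level overlap fractions.

    The first is the fraction of predicted coding bases that are actually
    coding (the "88%" style figure); the second is the fraction of actual
    coding bases that were predicted (the "80%" style figure).
    """
    predicted = coding_positions(predicted_exons)
    actual = coding_positions(actual_exons)
    shared = len(predicted & actual)
    predicted_correct = shared / len(predicted) if predicted else 0.0
    actual_recovered = shared / len(actual) if actual else 0.0
    return predicted_correct, actual_recovered


# Toy coordinates, invented for illustration only.
predicted = [(100, 200), (300, 400)]   # 200 predicted coding bases
actual = [(120, 200), (300, 450)]      # 230 annotated coding bases
predicted_correct, actual_recovered = coding_accuracy(predicted, actual)
print(f"predicted coding that is truly coding: {predicted_correct:.2f}")
print(f"actual coding that was predicted:      {actual_recovered:.2f}")
```

In this toy case 180 bases are shared, so 90% of the predicted coding region is truly coding while about 78% of the annotated coding region is recovered. The two figures differ whenever the prediction trims or extends exon boundaries.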

Year:  1992        PMID: 1619647     DOI: 10.1016/0022-2836(92)90130-c

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  71 in total

1.  Positional characterisation of false positives from computational prediction of human splice sites.

Authors:  T A Thanaraj
Journal:  Nucleic Acids Res       Date:  2000-02-01       Impact factor: 16.971

2.  In silico identification of novel selenoproteins in the Drosophila melanogaster genome.

Authors:  S Castellano; N Morozova; M Morey; M J Berry; F Serras; M Corominas; R Guigó
Journal:  EMBO Rep       Date:  2001-08       Impact factor: 8.807

3.  Evaluation of gene-finding programs on mammalian sequences.

Authors:  S Rogic; A K Mackworth; F B Ouellette
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

4.  The transcriptional activity of human Chromosome 22.

Authors:  John L Rinn; Ghia Euskirchen; Paul Bertone; Rebecca Martone; Nicholas M Luscombe; Stephen Hartman; Paul M Harrison; F Kenneth Nelson; Perry Miller; Mark Gerstein; Sherman Weissman; Michael Snyder
Journal:  Genes Dev       Date:  2003-02-15       Impact factor: 11.361

5.  Reconsidering the evolution of eukaryotic selenoproteins: a novel nonmammalian family with scattered phylogenetic distribution.

Authors:  Sergi Castellano; Sergey V Novoselov; Gregory V Kryukov; Alain Lescure; Enrique Blanco; Alain Krol; Vadim N Gladyshev; Roderic Guigó
Journal:  EMBO Rep       Date:  2004-01       Impact factor: 8.807

6.  Comparative gene prediction in human and mouse.

Authors:  Genís Parra; Pankaj Agarwal; Josep F Abril; Thomas Wiehe; James W Fickett; Roderic Guigó
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

Review 7.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

8.  GAZE: a generic framework for the integration of gene-prediction data by dynamic programming.

Authors:  Kevin L Howe; Tom Chothia; Richard Durbin
Journal:  Genome Res       Date:  2002-09       Impact factor: 9.043

Review 9.  Assessment of protein coding measures.

Authors:  J W Fickett; C S Tung
Journal:  Nucleic Acids Res       Date:  1992-12-25       Impact factor: 16.971

Review 10.  A beginner's guide to eukaryotic genome annotation.

Authors:  Mark Yandell; Daniel Ence
Journal:  Nat Rev Genet       Date:  2012-04-18       Impact factor: 53.242

