Current methods of gene prediction, their strengths and weaknesses.

Catherine Mathé, Marie-France Sagot, Thomas Schiex, Pierre Rouzé.

Abstract

While the genomes of many organisms have been sequenced over the last few years, transforming such raw sequence data into knowledge remains a hard task. A great number of prediction programs have been developed that try to address one part of this problem, which consists of locating the genes along a genome. This paper reviews the existing approaches to predicting genes in eukaryotic genomes and underlines their intrinsic advantages and limitations. The main mathematical models and computational algorithms adopted are also briefly described and the resulting software classified according to both the method and the type of evidence used. Finally, the several difficulties and pitfalls encountered by the programs are detailed, showing that improvements are needed and that new directions must be considered.

Year: 2002    PMID: 12364589    PMCID: PMC140543    DOI: 10.1093/nar/gkf543

Source DB: PubMed    Journal: Nucleic Acids Res    ISSN: 0305-1048    Impact factor: 16.971


