Literature DB >> 11544202

SGP-1: prediction and validation of homologous genes based on sequence alignments.

T Wiehe1, S Gebauer-Jung, T Mitchell-Olds, R Guigó.   

Abstract

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of depends little on species-specific properties such as codon usage or the nucleotide distribution. may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11544202      PMCID: PMC311140          DOI: 10.1101/gr.177401

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  21 in total

1.  An assessment of gene prediction accuracy in large DNA sequences.

Authors:  R Guigó; P Agarwal; J F Abril; M Burset; J W Fickett
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

2.  Extensive duplication and reshuffling in the Arabidopsis genome.

Authors:  G Blanc; A Barakat; R Guyot; R Cooke; M Delseny
Journal:  Plant Cell       Date:  2000-07       Impact factor: 11.277

3.  Genome sequence comparisons: hurdles in the fast lane to functional genomics.

Authors:  T Wiehe; R Guigó; W Miller
Journal:  Brief Bioinform       Date:  2000-11       Impact factor: 11.622

4.  Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence.

Authors:  H Roest Crollius; O Jaillon; A Bernot; C Dasilva; L Bouneau; C Fischer; C Fizames; P Wincker; P Brottier; F Quétier; W Saurin; J Weissenbach
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

5.  Rates of DNA sequence evolution differ between taxonomic groups.

Authors:  R J Britten
Journal:  Science       Date:  1986-03-21       Impact factor: 47.728

6.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

7.  An evaluation of the molecular clock hypothesis using mammalian DNA sequences.

Authors:  W H Li; M Tanimura; P M Sharp
Journal:  J Mol Evol       Date:  1987       Impact factor: 2.395

8.  Human and mouse gene structure: comparative analysis and application to exon prediction.

Authors:  S Batzoglou; L Pachter; J P Mesirov; B Berger; E S Lander
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

9.  The origins of genomic duplications in Arabidopsis.

Authors:  T J Vision; D G Brown; S D Tanksley
Journal:  Science       Date:  2000-12-15       Impact factor: 47.728

10.  Alignment of whole genomes.

Authors:  A L Delcher; S Kasif; R D Fleischmann; J Peterson; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-06-01       Impact factor: 16.971

View more
  35 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  Gene structure conservation aids similarity based gene prediction.

Authors:  Irmtraud M Meyer; Richard Durbin
Journal:  Nucleic Acids Res       Date:  2004-02-04       Impact factor: 16.971

3.  Comparative gene prediction in human and mouse.

Authors:  Genís Parra; Pankaj Agarwal; Josep F Abril; Thomas Wiehe; James W Fickett; Roderic Guigó
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

4.  CORG: a database for COmparative Regulatory Genomics.

Authors:  C Dieterich; H Wang; K Rateitschak; H Luz; M Vingron
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

5.  EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.

Authors:  Sylvain Foissac; Philippe Bardou; Annick Moisan; Marie-Josée Cros; Thomas Schiex
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 6.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

7.  The CHAOS/DIALIGN WWW server for multiple alignment of genomic sequences.

Authors:  Michael Brudno; Rasmus Steinkamp; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

8.  AGenDA: gene prediction by cross-species sequence comparison.

Authors:  Leila Taher; Oliver Rinner; Saurabh Garg; Alexander Sczyrba; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  AUGUSTUS: a web server for gene finding in eukaryotes.

Authors:  Mario Stanke; Rasmus Steinkamp; Stephan Waack; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

10.  Testing the coding potential of conserved short genomic sequences.

Authors:  Jing Wu
Journal:  Adv Bioinformatics       Date:  2010-03-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.