Optimal spliced alignment of homologous cDNA to a genomic DNA template.

J Usuka, W Zhu, V Brendel.

Abstract

MOTIVATION: Supplementary cDNA or EST evidence is often decisive for discriminating between alternative gene predictions derived from computational sequence inspection by any of a number of requisite programs. Without additional experimental effort, this approach must rely on the occurrence of cognate ESTs for the gene under consideration in available, generally incomplete, EST collections for the given species. In some cases, particular exon assignments can be supported by sequence matching even if the cDNA or EST is produced from non-cognate genomic DNA, including different loci of a gene family or homologous loci from different species. However, marginally significant sequence matching alone can also be misleading. We sought to develop an algorithm that would simultaneously score for predicted intrinsic splice site strength and sequence matching between the genomic DNA template and a related cDNA or EST. In this case, weakly predicted splice sites may be chosen for the optimal scoring spliced alignment on the basis of surrounding sequence matching. Strongly predicted splice sites will enter the optimal spliced alignment even without strong sequence matching.
RESULTS: We designed a novel algorithm that produces the optimal spliced alignment of a genomic DNA with a cDNA or EST based on scoring for both sequence matching and intrinsic splice site strength. By example, we demonstrate that this combined approach appears to improve gene prediction accuracy compared with current methods that rely only on either search by content and signal or on sequence similarity.
AVAILABILITY: The algorithm is available as a C subroutine and is implemented in the SplicePredictor and GeneSeqer programs. The source code is available via anonymous ftp from ftp.zmdb.iastate.edu. Both programs are also implemented as a Web service at http://gremlin1.zool.iastate.edu/cgi-bin/sp.cgi and http://gremlin1.zool.iastate.edu/cgi-bin/gs.cgi, respectively.
CONTACT: vbrendel@iastate.edu
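
The combined scoring idea in the abstract can be illustrated with a small dynamic program. The sketch below is not the authors' C subroutine and does not reproduce SplicePredictor or GeneSeqer scoring; it is a minimal two-state (exon/intron) alignment written under stated assumptions. The function names `toy_site_scores` and `spliced_alignment_score`, the match/mismatch/gap values, and the log-probability-like site scores (canonical GT/AG close to 0, everything else strongly negative) are all hypothetical choices made for illustration. Minimum intron length and traceback are omitted.

```python
NEG = float("-inf")

def toy_site_scores(genomic, canonical=-0.1, other=-10.0):
    """Hypothetical stand-in for a real splice site model (assumption):
    GT donors and AG acceptors get a mild penalty, all else a strong one."""
    n = len(genomic)
    # donor[i]: score for an intron whose first base is genomic[i]
    donor = [canonical if genomic[i:i + 2] == "GT" else other for i in range(n)]
    # acceptor[i]: score for an intron whose last base is genomic[i]
    acceptor = [canonical if i >= 1 and genomic[i - 1:i + 1] == "AG" else other
                for i in range(n)]
    return donor, acceptor

def spliced_alignment_score(genomic, cdna, donor, acceptor,
                            match=1.0, mismatch=-2.0, gap=-3.0):
    """Best score for aligning all of cdna to genomic, allowing introns.
    Unaligned genomic flanks before and after the cDNA are free."""
    n, m = len(genomic), len(cdna)
    # exon[i][j]: best score using genomic[:i] and cdna[:j], exon state
    # intron[i][j]: same prefixes, but genomic[i-1] lies inside an intron
    exon = [[NEG] * (m + 1) for _ in range(n + 1)]
    intron = [[NEG] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        exon[i][0] = 0.0                      # free leading genomic flank
    for j in range(1, m + 1):
        exon[0][j] = exon[0][j - 1] + gap     # cDNA bases with no genomic partner

    def ready(i, j):
        # Best score at (i, j) from which exon alignment may continue:
        # either already in an exon, or leaving an intron through an acceptor.
        if i == 0:
            return exon[i][j]
        return max(exon[i][j], intron[i][j] + acceptor[i - 1])

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if j < m:  # introns must be flanked by aligned cDNA on both sides
                intron[i][j] = max(intron[i - 1][j],                 # extend intron
                                   exon[i - 1][j] + donor[i - 1])    # open at donor
            sub = match if genomic[i - 1] == cdna[j - 1] else mismatch
            exon[i][j] = max(ready(i - 1, j - 1) + sub,   # aligned pair
                             ready(i - 1, j) + gap,       # genomic base unmatched
                             ready(i, j - 1) + gap)       # cDNA base unmatched
    return max(exon[i][m] for i in range(n + 1))          # free trailing flank

if __name__ == "__main__":
    exon1, intr, exon2 = "ACGGATTC", "GTAAGTCCCCTTTCAG", "TGCAAGCT"
    g = exon1 + intr + exon2
    d, a = toy_site_scores(g)
    print(spliced_alignment_score(g, exon1 + exon2, d, a))
```

In this toy case the best alignment matches all 16 cDNA bases and pays the two canonical site penalties, giving a score of roughly 15.8. Because the site scores and the sequence scores are summed in one recurrence, a weakly scored site can still be chosen when the flanking sequence matches well, and a strongly scored site can be chosen despite weaker flanking similarity, which is the trade-off the abstract describes.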

Year:  2000        PMID: 10869013     DOI: 10.1093/bioinformatics/16.3.203

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  58 in total

1.  Gene2EST: a BLAST2 server for searching expressed sequence tag (EST) databases with eukaryotic gene-sized queries.

Authors:  C Gemünd; C Ramu; B Altenberg-Greulich; T J Gibson
Journal:  Nucleic Acids Res       Date:  2001-03-15       Impact factor: 16.971

2.  Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes.

Authors:  Ritu Pandey; Andreas Müller; Carolyn A Napoli; David A Selinger; Craig S Pikaard; Eric J Richards; Judith Bender; David W Mount; Richard A Jorgensen
Journal:  Nucleic Acids Res       Date:  2002-12-01       Impact factor: 16.971

3.  The maize genome contains a helitron insertion.

Authors:  Shailesh K Lal; Michael J Giroux; Volker Brendel; C Eduardo Vallejos; L Curtis Hannah
Journal:  Plant Cell       Date:  2003-02       Impact factor: 11.277

4.  GeneSeqer@PlantGDB: Gene structure prediction in plant genomes.

Authors:  Shannon D Schlueter; Qunfeng Dong; Volker Brendel
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping.

Authors:  Wei Zhu; Shannon D Schlueter; Volker Brendel
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

6.  Current methods of gene prediction, their strengths and weaknesses. (Review)

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

7.  Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.

Authors:  Brian J Haas; Arthur L Delcher; Stephen M Mount; Jennifer R Wortman; Roger K Smith; Linda I Hannick; Rama Maiti; Catherine M Ronning; Douglas B Rusch; Christopher D Town; Steven L Salzberg; Owen White
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

8.  e2g: an interactive web-based server for efficiently mapping large EST and cDNA sets to genomic sequences.

Authors:  Jan Krüger; Alexander Sczyrba; Stefan Kurtz; Robert Giegerich
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  A beginner's guide to eukaryotic genome annotation. (Review)

Authors:  Mark Yandell; Daniel Ence
Journal:  Nat Rev Genet       Date:  2012-04-18       Impact factor: 53.242

10.  MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics.

Authors:  Heiko Schoof; Rebecca Ernst; Vladimir Nazarov; Lukas Pfeifer; Hans-Werner Mewes; Klaus F X Mayer
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971
