Literature DB >> 12176826

Improving gene recognition accuracy by combining predictions from two gene-finding programs.

Sanja Rogic1, B F Francis Ouellette, Alan K Mackworth.   

Abstract

MOTIVATION: Despite constant improvements in prediction accuracy, gene-finding programs are still unable to provide automatic gene discovery with desired correctness. The current programs can identify up to 75% of exons correctly and less than 50% of predicted gene structures correspond to actual genes. New approaches to computational gene-finding are clearly needed.
RESULTS: In this paper we have explored the benefits of combining predictions from already existing gene prediction programs. We have introduced three novel methods for combining predictions from programs Genscan and HMMgene. The methods primarily aim to improve exon level accuracy of gene-finding by identifying more probable exon boundaries and by eliminating false positive exon predictions. This approach results in improved accuracy at both the nucleotide and exon level, especially the latter, where the average improvement on the newly assembled dataset is 7.9% compared to the best result obtained by Genscan and HMMgene. When tested on a long genomic multi-gene sequence, our method that maintains reading frame consistency improved nucleotide level specificity by 21.0% and exon level specificity by 32.5% compared to the best result obtained by either of the two programs individually. AVAILABILITY: The scripts implementing our methods are available from http://www.cs.ubc.ca/labs/beta/genefinding/

Mesh:

Substances:

Year:  2002        PMID: 12176826     DOI: 10.1093/bioinformatics/18.8.1034

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Computational gene prediction using multiple sources of evidence.

Authors:  Jonathan E Allen; Mihaela Pertea; Steven L Salzberg
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

2.  EGPred: prediction of eukaryotic genes using ab initio methods after combining with sequence similarity approaches.

Authors:  Biju Issac; Gajendra Pal Singh Raghava
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

3.  Accurate identification of novel human genes through simultaneous gene prediction in human, mouse, and rat.

Authors:  Colin Dewey; Jia Qian Wu; Simon Cawley; Marina Alexandersson; Richard Gibbs; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

4.  Fugu ESTs: new resources for transcription analysis and genome annotation.

Authors:  Melody S Clark; Yvonne J K Edwards; Dan Peterson; Sandra W Clifton; Amanda J Thompson; Masahide Sasaki; Yutaka Suzuki; Kiyoshi Kikuchi; Shugo Watabe; Koichi Kawakami; Sumio Sugano; Greg Elgar; Stephen L Johnson
Journal:  Genome Res       Date:  2003-11-12       Impact factor: 9.043

5.  Gene discovery at the human T-cell receptor alpha/delta locus.

Authors:  Marsha R Haynes; Gillian E Wu
Journal:  Immunogenetics       Date:  2006-12-13       Impact factor: 2.846

6.  A method for construction, cloning and expression of intron-less gene from unannotated genomic DNA.

Authors:  Vineet Agrawal; Bharti Gupta; Uttam Chand Banerjee; Nilanjan Roy
Journal:  Mol Biotechnol       Date:  2008-06-10       Impact factor: 2.695

7.  Anopheles gambiae genome reannotation through synthesis of ab initio and comparative gene prediction algorithms.

Authors:  Jun Li; Michelle M Riehle; Yan Zhang; Jiannong Xu; Frederick Oduol; Shawn M Gomez; Karin Eiglmeier; Beatrix M Ueberheide; Jeffrey Shabanowitz; Donald F Hunt; José M C Ribeiro; Kenneth D Vernick
Journal:  Genome Biol       Date:  2006-03-27       Impact factor: 13.583

8.  Strategies and tools for whole-genome alignments.

Authors:  Olivier Couronne; Alexander Poliakov; Nicolas Bray; Tigran Ishkhanov; Dmitriy Ryaboy; Edward Rubin; Lior Pachter; Inna Dubchak
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

9.  The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics.

Authors:  Lincoln D Stein; Zhirong Bao; Darin Blasiar; Thomas Blumenthal; Michael R Brent; Nansheng Chen; Asif Chinwalla; Laura Clarke; Chris Clee; Avril Coghlan; Alan Coulson; Peter D'Eustachio; David H A Fitch; Lucinda A Fulton; Robert E Fulton; Sam Griffiths-Jones; Todd W Harris; LaDeana W Hillier; Ravi Kamath; Patricia E Kuwabara; Elaine R Mardis; Marco A Marra; Tracie L Miner; Patrick Minx; James C Mullikin; Robert W Plumb; Jane Rogers; Jacqueline E Schein; Marc Sohrmann; John Spieth; Jason E Stajich; C Wei; David Willey; Richard K Wilson; Richard Durbin; Robert H Waterston
Journal:  PLoS Biol       Date:  2003-11-17       Impact factor: 8.029

10.  Marine organism cell biology and regulatory sequence discoveryin comparative functional genomics.

Authors:  David W Barnes; Carolyn J Mattingly; Angela Parton; Lori M Dowell; Christopher J Bayne; John N Forrest
Journal:  Cytotechnology       Date:  2005-11-30       Impact factor: 2.058

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.