Literature DB >> 15961499

ExonHunter: a comprehensive approach to gene finding.

Brona Brejová1, Daniel G Brown, Ming Li, Tomás Vinar.   

Abstract

MOTIVATION: We present ExonHunter, a new and comprehensive gene finding system that outperforms existing systems and features several new ideas and approaches. Our system combines numerous sources of information (genomic sequences, expressed sequence tags and protein databases of related species) into a gene finder based on a hidden Markov model in a novel and systematic way. In our framework, various sources of information are expressed as partial probabilistic statements about positions in the sequence and their annotation. We then combine these into the final prediction via a quadratic programming method, which we show to be an extension of existing methods. Allowing only partial statements is key to our transparent handling of missing information and coping with the heterogeneous character of individual sources of information. In addition, we give a new method for modeling the length distribution of intergenic regions in hidden Markov models.
RESULTS: On a commonly used test set, ExonHunter performs significantly better than the existing gene finders ROSETTA, SLAM and TWINSCAN, with more than two-thirds of genes predicted completely correctly. AVAILABILITY: Supplementary material available at http://www.bioinformatics.uwaterloo.ca/supplements/05eh/

Entities:  

Mesh:

Year:  2005        PMID: 15961499     DOI: 10.1093/bioinformatics/bti1040

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Prediction of small, noncoding RNAs in bacteria using heterogeneous data.

Authors:  Brian Tjaden
Journal:  J Math Biol       Date:  2007-03-13       Impact factor: 2.259

2.  Conrad: gene prediction using conditional random fields.

Authors:  David DeCaprio; Jade P Vinson; Matthew D Pearson; Philip Montgomery; Matthew Doherty; James E Galagan
Journal:  Genome Res       Date:  2007-08-09       Impact factor: 9.043

3.  Approaches to Fungal Genome Annotation.

Authors:  Brian J Haas; Qiandong Zeng; Matthew D Pearson; Christina A Cuomo; Jennifer R Wortman
Journal:  Mycology       Date:  2011-10-03

4.  Reference based annotation with GeneMapper.

Authors:  Sourav Chatterji; Lior Pachter
Journal:  Genome Biol       Date:  2006-04-05       Impact factor: 13.583

5.  Evidence-based gene predictions in plant genomes.

Authors:  Chengzhi Liang; Long Mao; Doreen Ware; Lincoln Stein
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

6.  Assessment and improvement of the Plasmodium yoelii yoelii genome annotation through comparative analysis.

Authors:  Ashley Vaughan; Sum-Ying Chiu; Gowthaman Ramasamy; Ling Li; Malcolm J Gardner; Alice S Tarun; Stefan H I Kappe; Xinxia Peng
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

7.  Reranking candidate gene models with cross-species comparison for improved gene prediction.

Authors:  Qian Liu; Koby Crammer; Fernando C N Pereira; David S Roos
Journal:  BMC Bioinformatics       Date:  2008-10-14       Impact factor: 3.169

8.  nGASP--the nematode genome annotation assessment project.

Authors:  Avril Coghlan; Tristan J Fiedler; Sheldon J McKay; Paul Flicek; Todd W Harris; Darin Blasiar; Lincoln D Stein
Journal:  BMC Bioinformatics       Date:  2008-12-19       Impact factor: 3.169

9.  Finding genes in Schistosoma japonicum: annotating novel genomes with help of extrinsic evidence.

Authors:  Brona Brejová; Tomás Vinar; Yangyi Chen; Shengyue Wang; Guoping Zhao; Daniel G Brown; Ming Li; Yan Zhou
Journal:  Nucleic Acids Res       Date:  2009-03-05       Impact factor: 16.971

10.  Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources.

Authors:  Mario Stanke; Oliver Schöffmann; Burkhard Morgenstern; Stephan Waack
Journal:  BMC Bioinformatics       Date:  2006-02-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.