Literature DB >> 15691859

Efficient implementation of a generalized pair hidden Markov model for comparative gene finding.

W H Majoros1, M Pertea, S L Salzberg.   

Abstract

MOTIVATION: The increased availability of genome sequences of closely related organisms has generated much interest in utilizing homology to improve the accuracy of gene prediction programs. Generalized pair hidden Markov models (GPHMMs) have been proposed as one means to address this need. However, all GPHMM implementations currently available are either closed-source or the details of their operation are not fully described in the literature, leaving a significant hurdle for others wishing to advance the state of the art in GPHMM design.
RESULTS: We have developed an open-source GPHMM gene finder, TWAIN, which performs very well on two related Aspergillus species, A.fumigatus and A.nidulans, finding 89% of the exons and predicting 74% of the gene models exactly correctly in a test set of 147 conserved gene pairs. We describe the implementation of this GPHMM and we explicitly address the assumptions and limitations of the system. We suggest possible ways of relaxing those assumptions to improve the utility of the system without sacrificing efficiency beyond what is practical. AVAILABILITY: Available at http://www.tigr.org/software/pirate/twain/twain.html under the open-source Artistic License.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15691859     DOI: 10.1093/bioinformatics/bti297

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Multiple whole-genome alignments without a reference organism.

Authors:  Inna Dubchak; Alexander Poliakov; Andrey Kislyuk; Michael Brudno
Journal:  Genome Res       Date:  2009-01-28       Impact factor: 9.043

2.  Approaches to Fungal Genome Annotation.

Authors:  Brian J Haas; Qiandong Zeng; Matthew D Pearson; Christina A Cuomo; Jennifer R Wortman
Journal:  Mycology       Date:  2011-10-03

3.  Predicting gene structure changes resulting from genetic variants via exon definition features.

Authors:  William H Majoros; Carson Holt; Michael S Campbell; Doreen Ware; Mark Yandell; Timothy E Reddy
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

4.  Hidden Markov Models and their Applications in Biological Sequence Analysis.

Authors:  Byung-Jun Yoon
Journal:  Curr Genomics       Date:  2009-09       Impact factor: 2.236

5.  Detecting overlapping coding sequences in virus genomes.

Authors:  Andrew E Firth; Chris M Brown
Journal:  BMC Bioinformatics       Date:  2006-02-16       Impact factor: 3.169

6.  Efficient decoding algorithms for generalized hidden Markov model gene finders.

Authors:  William H Majoros; Mihaela Pertea; Arthur L Delcher; Steven L Salzberg
Journal:  BMC Bioinformatics       Date:  2005-01-24       Impact factor: 3.169

7.  Improving model construction of profile HMMs for remote homology detection through structural alignment.

Authors:  Juliana S Bernardes; Alberto M R Dávila; Vítor S Costa; Gerson Zaverucha
Journal:  BMC Bioinformatics       Date:  2007-11-09       Impact factor: 3.169

8.  Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments.

Authors:  Brian J Haas; Steven L Salzberg; Wei Zhu; Mihaela Pertea; Jonathan E Allen; Joshua Orvis; Owen White; C Robin Buell; Jennifer R Wortman
Journal:  Genome Biol       Date:  2008-01-11       Impact factor: 13.583

9.  Novel insights into the unfolded protein response using Pichia pastoris specific DNA microarrays.

Authors:  Alexandra Graf; Brigitte Gasser; Martin Dragosits; Michael Sauer; Germán G Leparc; Thomas Tüchler; David P Kreil; Diethard Mattanovich
Journal:  BMC Genomics       Date:  2008-08-19       Impact factor: 3.969

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.