Literature DB >> 9322033

Two methods for improving performance of an HMM and their application for gene finding.

A Krogh1.   

Abstract

A hidden Markov model for gene finding consists of submodels for coding regions, splice sites, introns, intergenic regions and possibly more. It is described how to estimate the model as a whole from labeled sequences instead of estimating the individual parts independently from subsequences. It is argued that the standard maximum likelihood estimation criterion is not optimal for training such a model. Instead of maximizing the probability of the DNA sequence, one should maximize the probability of the correct prediction. Such a criterion, called conditional maximum likelihood, is used for the gene finder 'HMM-gene'. A new (approximative) algorithm is described, which finds the most probable prediction summed over all paths yielding the same prediction. We show that these methods contribute significantly to the high performance of HMMgene.

Mesh:

Substances:

Year:  1997        PMID: 9322033

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  53 in total

1.  An assessment of gene prediction accuracy in large DNA sequences.

Authors:  R Guigó; P Agarwal; J F Abril; M Burset; J W Fickett
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

2.  GAZE: a generic framework for the integration of gene-prediction data by dynamic programming.

Authors:  Kevin L Howe; Tom Chothia; Richard Durbin
Journal:  Genome Res       Date:  2002-09       Impact factor: 9.043

3.  eShadow: a tool for comparing closely related sequences.

Authors:  Ivan Ovcharenko; Dario Boffelli; Gabriela G Loots
Journal:  Genome Res       Date:  2004-06       Impact factor: 9.043

4.  Evaluating bacterial gene-finding HMM structures as probabilistic logic programs.

Authors:  Søren Mørk; Ian Holmes
Journal:  Bioinformatics       Date:  2012-01-03       Impact factor: 6.937

5.  Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models.

Authors:  Stephan Waack; Oliver Keller; Roman Asper; Thomas Brodag; Carsten Damm; Wolfgang Florian Fricke; Katharina Surovcik; Peter Meinicke; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2006-03-16       Impact factor: 3.169

6.  Uncertainty in homology inferences: assessing and improving genomic sequence alignment.

Authors:  Gerton Lunter; Andrea Rocco; Naila Mimouni; Andreas Heger; Alexandre Caldeira; Jotun Hein
Journal:  Genome Res       Date:  2007-12-11       Impact factor: 9.043

7.  Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Authors:  Vardges Ter-Hovhannisyan; Alexandre Lomsadze; Yury O Chernoff; Mark Borodovsky
Journal:  Genome Res       Date:  2008-08-29       Impact factor: 9.043

8.  Genome annotation assessment in Drosophila melanogaster.

Authors:  M G Reese; G Hartzell; N L Harris; U Ohler; J F Abril; S E Lewis
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

9.  Using GeneWise in the Drosophila annotation experiment.

Authors:  E Birney; R Durbin
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

10.  Identification and distribution of rRNH1, a gene upregulated after spinal cord primary neuron injury.

Authors:  Chenfu Zhao; Yanli Ni; Yong Liu; Tao Liu; Shuguang Yang; Zhaoqing Liu; Shaojun Liu
Journal:  In Vitro Cell Dev Biol Anim       Date:  2013-11-28       Impact factor: 2.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.