Literature DB >> 15072695

transition priors for protein hidden Markov models: an empirical study towards maximum discrimination.

Markus Wistrand1, Erik L L Sonnhammer.   

Abstract

Insertions and deletions in a profile hidden Markov model (HMM) are modeled by transition probabilities between insert, delete and match states. These are estimated by combining observed data and prior probabilities. The transition prior probabilities can be defined either ad hoc or by maximum likelihood (ML) estimation. We show that the choice of transition prior greatly affects the HMM's ability to discriminate between true and false hits. HMM discrimination was measured using the HMMER 2.2 package applied to 373 families from Pfam. We measured the discrimination between true members and noise sequences employing various ML transition priors and also systematically scanned the parameter space of ad hoc transition priors. Our results indicate that ML priors produce far from optimal discrimination, and we present an empirically derived prior that considerably decreases the number of misclassifications compared to ML. Most of the difference stems from the probabilities for exiting a delete state. The ML prior, which is unaware of noise sequences, estimates a delete-to-delete probability that is relatively high and does not penalize noise sequences enough for optimal discrimination.

Mesh:

Year:  2004        PMID: 15072695     DOI: 10.1089/106652704773416957

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes.

Authors:  Anne-Kathrin Schultz; Ming Zhang; Thomas Leitner; Carla Kuiken; Bette Korber; Burkhard Morgenstern; Mario Stanke
Journal:  BMC Bioinformatics       Date:  2006-05-22       Impact factor: 3.169

2.  Improved profile HMM performance by assessment of critical algorithmic features in SAM and HMMER.

Authors:  Markus Wistrand; Erik L L Sonnhammer
Journal:  BMC Bioinformatics       Date:  2005-04-15       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.