Literature DB >> 8744772

Hidden Markov models for sequence analysis: extension and analysis of the basic method.

R Hughey1, A Krogh.   

Abstract

Hidden Markov models (HMMs) are a highly effective means of modeling a family of unaligned sequences or a common motif within a set of unaligned sequences. The trained HMM can then be used for discrimination or multiple alignment. The basic mathematical description of an HMM and its expectation-maximization training procedure is relatively straightforward. In this paper, we review the mathematical extensions and heuristics that move the method from the theoretical to the practical. We then experimentally analyze the effectiveness of model regularization, dynamic model modification and optimization strategies. Finally it is demonstrated on the SH2 domain how a domain can be found from unaligned sequences using a special model type. The experimental work was completed with the aid of the Sequence Alignment and Modeling software suite.

Mesh:

Year:  1996        PMID: 8744772     DOI: 10.1093/bioinformatics/12.2.95

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  106 in total

1.  Evolution and horizontal transfer of dUTPase-encoding genes in viruses and their hosts.

Authors:  A M Baldo; M A McClure
Journal:  J Virol       Date:  1999-09       Impact factor: 5.103

2.  Common fold in helix-hairpin-helix proteins.

Authors:  X Shao; N V Grishin
Journal:  Nucleic Acids Res       Date:  2000-07-15       Impact factor: 16.971

3.  A comparison of position-specific score matrices based on sequence and structure alignments.

Authors:  Anna R Panchenko; Stephen H Bryant
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

4.  The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

Authors:  Parag Mallick; Robert Weiss; David Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-02       Impact factor: 11.205

5.  A branch point consensus from Arabidopsis found by non-circular analysis allows for better prediction of acceptor sites.

Authors:  N Tolstrup; P Rouzé; S Brunak
Journal:  Nucleic Acids Res       Date:  1997-08-01       Impact factor: 16.971

6.  Using structural motif templates to identify proteins with DNA binding function.

Authors:  Susan Jones; Jonathan A Barker; Irene Nobeli; Janet M Thornton
Journal:  Nucleic Acids Res       Date:  2003-06-01       Impact factor: 16.971

7.  Finding weak similarities between proteins by sequence profile comparison.

Authors:  Anna R Panchenko
Journal:  Nucleic Acids Res       Date:  2003-01-15       Impact factor: 16.971

8.  A comparison of profile hidden Markov model procedures for remote homology detection.

Authors:  Martin Madera; Julian Gough
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

9.  PANTHER: a library of protein families and subfamilies indexed by function.

Authors:  Paul D Thomas; Michael J Campbell; Anish Kejariwal; Huaiyu Mi; Brian Karlak; Robin Daverman; Karen Diemer; Anushya Muruganujan; Apurva Narechania
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

10.  Best alpha-helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information.

Authors:  Håkan Viklund; Arne Elofsson
Journal:  Protein Sci       Date:  2004-07       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.