
Using Dirichlet mixture priors to derive hidden Markov models for protein families.

M Brown, R Hughey, A Krogh, I S Mian, K Sjölander, D Haussler.

Abstract

A Bayesian method for estimating the amino acid distributions in the states of a hidden Markov model (HMM) for a protein family or the columns of a multiple alignment of that family is introduced. This method uses Dirichlet mixture densities as priors over amino acid distributions. These mixture densities are determined from examination of previously constructed HMMs or multiple alignments. It is shown that this Bayesian method can improve the quality of HMMs produced from small training sets. Specific experiments on the EF-hand motif are reported, for which these priors are shown to produce HMMs with higher likelihood on unseen data, and fewer false positives and false negatives in a database search task.
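The abstract does not state the estimator itself. As a rough illustration only, the sketch below uses the standard posterior-mean formula for a Dirichlet mixture prior: each component j is weighted by its posterior probability given the observed counts n, proportional to q_j * B(alpha_j + n) / B(alpha_j), and the per-component Dirichlet posterior means are then averaged. The function names, the three-letter alphabet, and all parameter values are hypothetical choices made for brevity, not values from the paper, and a real application would use 20 amino acids.

```python
from math import exp, lgamma, log


def log_beta(alpha):
    """Log of the multivariate beta function B(alpha)."""
    return sum(lgamma(a) for a in alpha) - lgamma(sum(alpha))


def posterior_mean(counts, weights, alphas):
    """Posterior mean distribution under a Dirichlet mixture prior.

    counts  : observed residue counts in one column or HMM state
    weights : mixture coefficients q_j (assumed positive, summing to 1)
    alphas  : one Dirichlet parameter vector alpha_j per component
    Returns (estimated distribution, component posteriors).
    """
    # Log of q_j * B(alpha_j + n) / B(alpha_j) for each component j.
    log_post = []
    for q, alpha in zip(weights, alphas):
        shifted = [a + n for a, n in zip(alpha, counts)]
        log_post.append(log(q) + log_beta(shifted) - log_beta(alpha))

    # Normalize in log space to avoid underflow.
    peak = max(log_post)
    unnorm = [exp(lp - peak) for lp in log_post]
    z = sum(unnorm)
    resp = [u / z for u in unnorm]

    # Average the per-component Dirichlet posterior means.
    total = sum(counts)
    est = [0.0] * len(counts)
    for r, alpha in zip(resp, alphas):
        denom = total + sum(alpha)
        for i, (a, n) in enumerate(zip(alpha, counts)):
            est[i] += r * (a + n) / denom
    return est, resp


# Hypothetical toy prior: two components over a 3-letter alphabet.
toy_weights = [0.5, 0.5]
toy_alphas = [[5.0, 1.0, 1.0], [1.0, 1.0, 5.0]]
est, resp = posterior_mean([3, 0, 0], toy_weights, toy_alphas)
print(est, resp)
```

With only three observations, the estimate still assigns nonzero probability to the unseen letters, which is the kind of regularization that matters when the training set is small.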


Year:  1993        PMID: 7584370

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  38 in total

1.  Finding functional sequence elements by multiple local alignment.

Authors:  Martin C Frith; Ulla Hansen; John L Spouge; Zhiping Weng
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

2.  Bipartite pattern discovery by entropy minimization-based multiple local alignment.

Authors:  Chengpeng Bi; Peter K Rogan
Journal:  Nucleic Acids Res       Date:  2004-09-23       Impact factor: 16.971

3.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

4.  Compositional adjustment of Dirichlet mixture priors.

Authors:  Xugang Ye; Yi-Kuo Yu; Stephen F Altschul
Journal:  J Comput Biol       Date:  2010-12       Impact factor: 1.479

5.  A hidden Markov model that finds genes in E. coli DNA.

Authors:  A Krogh; I S Mian; D Haussler
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

6.  On the inference of dirichlet mixture priors for protein sequence comparison.

Authors:  Xugang Ye; Yi-Kuo Yu; Stephen F Altschul
Journal:  J Comput Biol       Date:  2011-06-24       Impact factor: 1.479

7.  The complexity of the dirichlet model for multiple alignment data.

Authors:  Yi-Kuo Yu; Stephen F Altschul
Journal:  J Comput Biol       Date:  2011-06-24       Impact factor: 1.479

8.  Statistical modeling and analysis of the LAGLIDADG family of site-specific endonucleases and identification of an intein that encodes a site-specific endonuclease of the HNH family.

Authors:  J Z Dalgaard; A J Klar; M J Moser; W R Holley; A Chatterjee; I S Mian
Journal:  Nucleic Acids Res       Date:  1997-11-15       Impact factor: 16.971

9.  Assessing the impact of secondary structure and solvent accessibility on protein evolution.

Authors:  N Goldman; J L Thorne; D T Jones
Journal:  Genetics       Date:  1998-05       Impact factor: 4.562

10.  Eukaryotic translation elongation factor 1 gamma contains a glutathione transferase domain--study of a diverse, ancient protein superfamily using motif search and structural modeling.

Authors:  E V Koonin; A R Mushegian; R L Tatusov; S F Altschul; S H Bryant; P Bork; A Valencia
Journal:  Protein Sci       Date:  1994-11       Impact factor: 6.725

