Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology.

K Sjölander, K Karplus, M Brown, R Hughey, A Krogh, I S Mian, D Haussler.

Abstract

We present a method for condensing the information in multiple alignments of proteins into a mixture of Dirichlet densities over amino acid distributions. Dirichlet mixture densities are designed to be combined with observed amino acid frequencies to form estimates of expected amino acid probabilities at each position in a profile, hidden Markov model or other statistical model. These estimates give a statistical model greater generalization capacity, so that remotely related family members can be more reliably recognized by the model. This paper corrects the previously published formula for estimating these expected probabilities, and contains complete derivations of the Dirichlet mixture formulas, methods for optimizing the mixtures to match particular databases, and suggestions for efficient implementation.
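As a rough illustration of how a Dirichlet mixture can be combined with observed counts to estimate probabilities at one position, the sketch below assumes the posterior-mean form usually associated with Dirichlet mixture priors: each component is weighted by how well it explains the counts, and the per-component pseudocount estimates are averaged with those weights. The function names, the three-letter toy alphabet and all parameter values are hypothetical and are not taken from the paper.

```python
import math


def log_beta(alpha):
    """Log of the multivariate Beta function, B(alpha)."""
    return sum(math.lgamma(a) for a in alpha) - math.lgamma(sum(alpha))


def posterior_mean(counts, weights, alphas):
    """Estimate symbol probabilities from counts under a Dirichlet mixture.

    counts  -- observed count per symbol at one position
    weights -- mixture coefficients q_j (assumed to sum to 1)
    alphas  -- one Dirichlet parameter vector per component
    """
    # log of q_j * B(alpha_j + counts) / B(alpha_j) for each component j;
    # the multinomial coefficient is the same for every j and cancels.
    log_post = [
        math.log(q)
        + log_beta([a + n for a, n in zip(alpha, counts)])
        - log_beta(alpha)
        for q, alpha in zip(weights, alphas)
    ]
    # Normalize in log space to avoid underflow.
    top = max(log_post)
    unnorm = [math.exp(lp - top) for lp in log_post]
    z = sum(unnorm)
    resp = [u / z for u in unnorm]

    total = sum(counts)
    return [
        sum(
            r * (n_i + alpha[i]) / (total + sum(alpha))
            for r, alpha in zip(resp, alphas)
        )
        for i, n_i in enumerate(counts)
    ]


# Toy example: two hypothetical components over a three-letter alphabet.
toy_weights = [0.5, 0.5]
toy_alphas = [[1.0, 1.0, 2.0], [2.0, 1.0, 1.0]]
toy_probs = posterior_mean([3, 0, 1], toy_weights, toy_alphas)
```

Under this sketch, a symbol with zero observed count still receives a nonzero probability, which is the property that lets a model generalize to remotely related sequences. With no observations at all, the estimate reduces to the weighted mean of the component means.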

Entities: Chemical

Substances: Proteins

Year: 1996 PMID: 8902360 DOI: 10.1093/bioinformatics/12.4.327

Source DB: PubMed Journal: Comput Appl Biosci ISSN: 0266-7061

Cited

100 in total

1. Predicting deleterious amino acid substitutions.

Authors: P C Ng; S Henikoff
Journal: Genome Res Date: 2001-05 Impact factor: 9.043

2. Testing computational prediction of missense mutation phenotypes: functional characterization of 204 mutations of human cystathionine beta synthase.

Authors: Qiong Wei; Liqun Wang; Qiang Wang; Warren D Kruger; Roland L Dunbrack
Journal: Proteins Date: 2010-07

3. Recent improvements to the PROSITE database.

Authors: Nicolas Hulo; Christian J A Sigrist; Virginie Le Saux; Petra S Langendijk-Genevaux; Lorenza Bordoli; Alexandre Gattiker; Edouard De Castro; Philipp Bucher; Amos Bairoch
Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971

4. PANTHER: a library of protein families and subfamilies indexed by function.

Authors: Paul D Thomas; Michael J Campbell; Anish Kejariwal; Huaiyu Mi; Brian Karlak; Robin Daverman; Karen Diemer; Anushya Muruganujan; Apurva Narechania
Journal: Genome Res Date: 2003-09 Impact factor: 9.043

5. Scoring profile-to-profile sequence alignments.

Authors: Guoli Wang; Roland L Dunbrack
Journal: Protein Sci Date: 2004-06 Impact factor: 6.725

6. Best alpha-helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information.

Authors: Håkan Viklund; Arne Elofsson
Journal: Protein Sci Date: 2004-07 Impact factor: 6.725

7. MotifPrototyper: a Bayesian profile model for motif families.

8. MUSCLE: multiple sequence alignment with high accuracy and high throughput.

9. Functional classification of proteins and protein variants.

10. An assessment of substitution scores for protein profile-profile comparison.