Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The complexity of the dirichlet model for multiple alignment data.

Literature DB >> 21702692

The complexity of the dirichlet model for multiple alignment data.

Abstract

A model is a set of possible theories for describing a set of data. When the data are used to select a maximum-likelihood theory, an important question is how many effectively independent theories the model contains; the log of this number is called the model's complexity. The Dirichlet model is the set of all Dirichlet distributions, which are probability densities over the space of multinomials. A Dirichlet distribution may be used to describe multiple-alignment data, consisting of n columns of letters, with c letters in each column. We here derive, in the limit of large n and c, a closed-form expression for the complexity of the Dirichlet model applied to such data. For small c, we derive as well a minor correction to this formula, which is easily calculated by Monte Carlo simulation. Although our results are confined to the Dirichlet model, they may cast light as well on the complexity of Dirichlet mixture models, which have been applied fruitfully to the study of protein multiple sequence alignments.

Mesh：

Substances：
Proteins

Year: 2011 PMID： 21702692 PMCID： PMC3145953 DOI： 10.1089/cmb.2011.0039

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

5 in total

1. On the inference of dirichlet mixture priors for protein sequence comparison.

Authors: Xugang Ye; Yi-Kuo Yu; Stephen F Altschul
Journal: J Comput Biol Date: 2011-06-24 Impact factor: 1.479

2. Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology.

Authors: K Sjölander; K Karplus; M Brown; R Hughey; A Krogh; I S Mian; D Haussler
Journal: Comput Appl Biosci Date: 1996-08

3. Using Dirichlet mixture priors to derive hidden Markov models for protein families.

Authors: M Brown; R Hughey; A Krogh; I S Mian; K Sjölander; D Haussler
Journal: Proc Int Conf Intell Syst Mol Biol Date: 1993

4. The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors: Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal: PLoS Comput Biol Date: 2010-07-15 Impact factor: 4.475

5. PSI-BLAST pseudocounts and the minimum description length principle.

Authors: Stephen F Altschul; E Michael Gertz; Richa Agarwala; Alejandro A Schäffer; Yi-Kuo Yu
Journal: Nucleic Acids Res Date: 2008-12-16 Impact factor: 16.971

5 in total

2 in total

1. On the inference of dirichlet mixture priors for protein sequence comparison.

Authors: Xugang Ye; Yi-Kuo Yu; Stephen F Altschul
Journal: J Comput Biol Date: 2011-06-24 Impact factor: 1.479

2. Dirichlet mixtures, the Dirichlet process, and the structure of protein space.

Authors: Viet-An Nguyen; Jordan Boyd-Graber; Stephen F Altschul
Journal: J Comput Biol Date: 2013-01 Impact factor: 1.479

2 in total