Complexity reduction in context-dependent DNA substitution models.

Abstract

MOTIVATION: The modeling of conservation patterns in genomic DNA has become increasingly popular for a number of bioinformatic applications. While several systems developed to date incorporate context-dependence in their substitution models, the impact on computational complexity and generalization ability of the resulting higher order models invites the question of whether simpler approaches to context modeling might permit appreciable reductions in model complexity and computational cost, without sacrificing prediction accuracy.
RESULTS: We formulate several alternative methods for context modeling based on windowed Bayesian networks, and compare their effects on both accuracy and computational complexity for the task of discriminating functionally distinct segments in vertebrate DNA. Our results show that substantial reductions in the complexity of both the model and the associated inference algorithm can be achieved without reducing predictive accuracy.

Substances: DNA

Year: 2008 PMID: 19017657 PMCID: PMC2732293 DOI: 10.1093/bioinformatics/btn598

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
