Literature DB >> 19017657

Complexity reduction in context-dependent DNA substitution models.

William H Majoros1, Uwe Ohler.   

Abstract

MOTIVATION: The modeling of conservation patterns in genomic DNA has become increasingly popular for a number of bioinformatic applications. While several systems developed to date incorporate context-dependence in their substitution models, the impact on computational complexity and generalization ability of the resulting higher order models invites the question of whether simpler approaches to context modeling might permit appreciable reductions in model complexity and computational cost, without sacrificing prediction accuracy.
RESULTS: We formulate several alternative methods for context modeling based on windowed Bayesian networks, and compare their effects on both accuracy and computational complexity for the task of discriminating functionally distinct segments in vertebrate DNA. Our results show that substantial reductions in the complexity of both the model and the associated inference algorithm can be achieved without reducing predictive accuracy.

Mesh:

Substances:

Year:  2008        PMID: 19017657      PMCID: PMC2732293          DOI: 10.1093/bioinformatics/btn598

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Evidence for a high frequency of simultaneous double-nucleotide substitutions.

Authors:  M Averof; A Rokas; K H Wolfe; P M Sharp
Journal:  Science       Date:  2000-02-18       Impact factor: 47.728

2.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

3.  Phylogenetic estimation of context-dependent substitution rates by maximum likelihood.

Authors:  Adam Siepel; David Haussler
Journal:  Mol Biol Evol       Date:  2003-12-05       Impact factor: 16.240

Review 4.  Inferring cellular networks using probabilistic graphical models.

Authors:  Nir Friedman
Journal:  Science       Date:  2004-02-06       Impact factor: 47.728

5.  Gene finding with a hidden Markov model of genome structure and evolution.

Authors:  Jakob Skou Pedersen; Jotun Hein
Journal:  Bioinformatics       Date:  2003-01-22       Impact factor: 6.937

6.  A low rate of simultaneous double-nucleotide mutations in primates.

Authors:  Nick G C Smith; Matthew T Webster; Hans Ellegren
Journal:  Mol Biol Evol       Date:  2003-01       Impact factor: 16.240

7.  Codon and rate variation models in molecular phylogeny.

Authors:  Eric Schadt; Kenneth Lange
Journal:  Mol Biol Evol       Date:  2002-09       Impact factor: 16.240

8.  MAVID: constrained ancestral alignment of multiple sequences.

Authors:  Nicolas Bray; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

9.  Phylogenetic motif detection by expectation-maximization on evolutionary mixtures.

Authors:  A M Moses; D Y Chiang; M B Eisen
Journal:  Pac Symp Biocomput       Date:  2004

10.  ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements.

Authors:  James Taylor; Svitlana Tyekucheva; David C King; Ross C Hardison; Webb Miller; Francesca Chiaromonte
Journal:  Genome Res       Date:  2006-10-19       Impact factor: 9.043

View more
  2 in total

1.  Modeling the evolution of regulatory elements by simultaneous detection and alignment with phylogenetic pair HMMs.

Authors:  William H Majoros; Uwe Ohler
Journal:  PLoS Comput Biol       Date:  2010-12-16       Impact factor: 4.475

2.  COMIT: identification of noncoding motifs under selection in coding sequences.

Authors:  Deniz Kural; Yang Ding; Jiantao Wu; Alicia M Korpi; Jeffrey H Chuang
Journal:  Genome Biol       Date:  2009-11-20       Impact factor: 13.583

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.