Literature DB >> 18853356

A model-based approach to study nearest-neighbor influences reveals complex substitution patterns in non-coding sequences.

Guy Baele1, Yves Van de Peer, Stijn Vansteelandt.   

Abstract

In this article, we present a likelihood-based framework for modeling site dependencies. Our approach builds upon standard evolutionary models but incorporates site dependencies across the entire tree by letting the evolutionary parameters in these models depend upon the ancestral states at the neighboring sites. It thus avoids the need for introducing new and high-dimensional evolutionary models for site-dependent evolution. We propose a Markov chain Monte Carlo approach with data augmentation to infer the evolutionary parameters under our model. Although our approach allows for wide-ranging site dependencies, we illustrate its use, in two non-coding datasets, in the case of nearest-neighbor dependencies (i.e., evolution directly depending only upon the immediate flanking sites). The results reveal that the general time-reversible model with nearest-neighbor dependencies substantially improves the fit to the data as compared to the corresponding model with site independence. Using the parameter estimates from our model, we elaborate on the importance of the 5-methylcytosine deamination process (i.e., the CpG effect) and show that this process also depends upon the 5' neighboring base identity. We hint at the possibility of a so-called TpA effect and show that the observed substitution behavior is very complex in the light of dinucleotide estimates. We also discuss the presence of CpG effects in a nuclear small subunit dataset and find significant evidence that evolutionary models incorporating context-dependent effects perform substantially better than independent-site models and in some cases even outperform models that incorporate varying rates across sites.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18853356     DOI: 10.1080/10635150802422324

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  11 in total

1.  Using non-reversible context-dependent evolutionary models to study substitution patterns in primate non-coding sequences.

Authors:  Guy Baele; Yves Van de Peer; Stijn Vansteelandt
Journal:  J Mol Evol       Date:  2010-07-11       Impact factor: 2.395

2.  Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

Authors:  Nicolas Rodrigue; Hervé Philippe; Nicolas Lartillot
Journal:  Proc Natl Acad Sci U S A       Date:  2010-02-22       Impact factor: 11.205

3.  Context dependent substitution biases vary within the human genome.

Authors:  P Andrew Nevarez; Christopher M DeBoever; Benjamin J Freeland; Marissa A Quitt; Eliot C Bush
Journal:  BMC Bioinformatics       Date:  2010-09-15       Impact factor: 3.169

4.  Sigma-2: Multiple sequence alignment of non-coding DNA via an evolutionary model.

Authors:  Gayathri Jayaraman; Rahul Siddharthan
Journal:  BMC Bioinformatics       Date:  2010-09-16       Impact factor: 3.169

5.  Modelling the ancestral sequence distribution and model frequencies in context-dependent models for primate non-coding sequences.

Authors:  Guy Baele; Yves Van de Peer; Stijn Vansteelandt
Journal:  BMC Evol Biol       Date:  2010-08-10       Impact factor: 3.260

6.  Coordinated genome-wide modifications within proximal promoter cis-regulatory elements during vertebrate evolution.

Authors:  Ken Daigoro Yokoyama; Jeffrey L Thorne; Gregory A Wray
Journal:  Genome Biol Evol       Date:  2010-11-30       Impact factor: 3.416

7.  Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes.

Authors:  Guy Baele; Yves Van de Peer; Stijn Vansteelandt
Journal:  BMC Evol Biol       Date:  2011-05-27       Impact factor: 3.260

8.  Efficient context-dependent model building based on clustering posterior distributions for non-coding sequences.

Authors:  Guy Baele; Yves Van de Peer; Stijn Vansteelandt
Journal:  BMC Evol Biol       Date:  2009-04-30       Impact factor: 3.260

9.  Make the most of your samples: Bayes factor estimators for high-dimensional models of sequence evolution.

Authors:  Guy Baele; Philippe Lemey; Stijn Vansteelandt
Journal:  BMC Bioinformatics       Date:  2013-03-06       Impact factor: 3.169

10.  COMIT: identification of noncoding motifs under selection in coding sequences.

Authors:  Deniz Kural; Yang Ding; Jiantao Wu; Alicia M Korpi; Jeffrey H Chuang
Journal:  Genome Biol       Date:  2009-11-20       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.