Literature DB >> 21037244

Bayesian sampling of genomic rearrangement scenarios via double cut and join.

István Miklós1, Eric Tannier.   

Abstract

MOTIVATION: When comparing the organization of two genomes, it is important not to draw conclusions on their modes of evolution from a single most parsimonious scenario explaining their differences. Better estimations can be obtained by sampling many different genomic rearrangement scenarios. For this problem, the Double Cut and Join (DCJ) model, while less relevant, is computationally easier than the Hannenhalli-Pevzner (HP) model. Indeed, in some special cases, the total number of DCJ sorting scenarios can be analytically calculated, and uniformly distributed random DCJ scenarios can be drawn in polynomial running time, while the complexity of counting the number of HP scenarios and sampling from the uniform distribution of their space is unknown, and conjectured to be #P-complete. Statistical methods, like Markov chain Monte Carlo (MCMC) for sampling from the uniform distribution of the most parsimonious or the Bayesian distribution of all possible HP scenarios are required.
RESULTS: We use the computational facilities of the DCJ model to draw a sampling of HP scenarios. It is based on a parallel MCMC method that cools down DCJ scenarios to HP scenarios. We introduce two theorems underlying the theoretical mixing properties of this parallel MCMC method. The method was tested on yeast and mammalian genomic data, and allowed us to provide estimates of the different modes of evolution in diverse lineages. AVAILABILITY: The program implemented in Java 1.5 programming language is available from http://www.renyi.hu/~miklosi/DCJ2HP/.

Entities:  

Mesh:

Year:  2010        PMID: 21037244     DOI: 10.1093/bioinformatics/btq574

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Sampling solution traces for the problem of sorting permutations by signed reversals.

Authors:  Christian Baudet; Zanoni Dias; Marie-France Sagot
Journal:  Algorithms Mol Biol       Date:  2012-06-15       Impact factor: 1.405

2.  Linearization of ancestral multichromosomal genomes.

Authors:  Ján Maňuch; Murray Patterson; Roland Wittler; Cedric Chauve; Eric Tannier
Journal:  BMC Bioinformatics       Date:  2012-12-19       Impact factor: 3.169

Review 3.  The inference of gene trees with species trees.

Authors:  Gergely J Szöllősi; Eric Tannier; Vincent Daubin; Bastien Boussau
Journal:  Syst Biol       Date:  2014-07-28       Impact factor: 15.683

4.  Sampling and counting genome rearrangement scenarios.

Authors:  István Miklós; Heather Smith
Journal:  BMC Bioinformatics       Date:  2015-10-02       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.