Literature DB >> 18192698

Simulating DNA coding sequence evolution with EvolveAGene 3.

Barry G Hall1.   

Abstract

Phylogenetic reconstruction based upon multiple alignments of molecular sequences is important to most branches of modern biology and is central to molecular evolution. Understanding the historical relationships among macromolecules depends upon computer programs that implement a variety of analytical methods. Because it is impossible to know those historical relationships with certainty, assessment of the accuracy of methods and the programs that implement them requires the use of programs that realistically simulate the evolution of DNA sequences. EvolveAGene 3 is a realistic coding sequence simulation program that separates mutation from selection and allows the user to set selection conditions, including variable regions of selection intensity within the sequence and variation in intensity of selection over branches. Variation includes base substitutions, insertions, and deletions. To the best of my knowledge, it is the only program available that simulates the evolution of intact coding sequences. Output includes the true tree and true alignments of the resulting coding sequence and corresponding protein sequences. A log file reports the frequencies of each kind of base substitution, the ratio of transition to transversion substitutions, the ratio of indel to base substitution mutations, and the numbers of silent and amino acid replacement mutations. The realism of the data sets has been assessed by comparing the d(N)/d(S) ratio, the ratio of transition to transversion substitutions, and the ratio of indel to base substitution mutations of the simulated data sets with those parameters of real data sets from the "gold standard" BaliBase collection of structural alignments. Results show that the data sets produced by EvolveAGene 3 are very similar to real data sets, and EvolveAGene 3 is therefore a realistic simulation program that can be used to evaluate a variety of programs and methods in molecular evolution.

Entities:  

Mesh:

Year:  2008        PMID: 18192698     DOI: 10.1093/molbev/msn008

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  23 in total

1.  Molecular epidemiology of Mycobacterium leprae as determined by structure-neighbor clustering.

Authors:  Barry G Hall; Stephen J Salipante
Journal:  J Clin Microbiol       Date:  2010-03-29       Impact factor: 5.948

2.  Translational selection of genes coding for perfectly conserved proteins among three mosquito vectors.

Authors:  Olaf Rodriguez; Brajendra K Singh; David W Severson; Susanta K Behura
Journal:  Infect Genet Evol       Date:  2012-06-15       Impact factor: 3.342

3.  High frequency of hotspot mutations in core genes of Escherichia coli due to short-term positive selection.

Authors:  Sujay Chattopadhyay; Scott J Weissman; Vladimir N Minin; Thomas A Russo; Daniel E Dykhuizen; Evgeni V Sokurenko
Journal:  Proc Natl Acad Sci U S A       Date:  2009-07-15       Impact factor: 11.205

4.  Convergent molecular evolution of genomic cores in Salmonella enterica and Escherichia coli.

Authors:  Sujay Chattopadhyay; Sandip Paul; Dagmara I Kisiela; Elena V Linardopoulou; Evgeni V Sokurenko
Journal:  J Bacteriol       Date:  2012-07-13       Impact factor: 3.490

5.  Simulation of molecular data under diverse evolutionary scenarios.

Authors:  Miguel Arenas
Journal:  PLoS Comput Biol       Date:  2012-05-31       Impact factor: 4.475

6.  SuiteMSA: visual tools for multiple sequence alignment comparison and molecular sequence simulation.

Authors:  Catherine L Anderson; Cory L Strope; Etsuko N Moriyama
Journal:  BMC Bioinformatics       Date:  2011-05-21       Impact factor: 3.169

7.  HGT-Gen: a tool for generating a phylogenetic tree with horizontal gene transfer.

Authors:  Tokumasa Horiike; Daisuke Miyata; Yoshio Tateno; Ryoichi Minai
Journal:  Bioinformation       Date:  2011-10-31

8.  INDELible: a flexible simulator of biological sequence evolution.

Authors:  William Fletcher; Ziheng Yang
Journal:  Mol Biol Evol       Date:  2009-05-07       Impact factor: 16.240

9.  Biological sequence simulation for testing complex evolutionary hypotheses: indel-Seq-Gen version 2.0.

Authors:  Cory L Strope; Kevin Abel; Stephen D Scott; Etsuko N Moriyama
Journal:  Mol Biol Evol       Date:  2009-08-03       Impact factor: 16.240

10.  Tools for simulating evolution of aligned genomic regions with integrated parameter estimation.

Authors:  Avinash Varadarajan; Robert K Bradley; Ian H Holmes
Journal:  Genome Biol       Date:  2008-10-08       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.