Literature DB >> 17158778

indel-Seq-Gen: a new protein family simulator incorporating domains, motifs, and indels.

Cory L Strope1, Stephen D Scott, Etsuko N Moriyama.   

Abstract

Reconstructing the evolutionary history of protein sequences will provide a better understanding of divergence mechanisms of protein superfamilies and their functions. Long-term protein evolution often includes dynamic changes such as insertion, deletion, and domain shuffling. Such dynamic changes make reconstructing protein sequence evolution difficult and affect the accuracy of molecular evolutionary methods, such as multiple alignments and phylogenetic methods. Unfortunately, currently available simulation methods are not sufficiently flexible and do not allow biologically realistic dynamic protein sequence evolution. We introduce a new method, indel-Seq-Gen (iSG), that can simulate realistic evolutionary processes of protein sequences with insertions and deletions (indels). Unlike other simulation methods, iSG allows the user to simulate multiple subsequences according to different evolutionary parameters, which is necessary for generating realistic protein families with multiple domains. iSG tracks all evolutionary events including indels and outputs the "true" multiple alignment of the simulated sequences. iSG can also generate a larger sequence space by allowing the use of multiple related root sequences. With all these functions, iSG can be used to test the accuracy of, for example, multiple alignment methods, phylogenetic methods, evolutionary hypotheses, ancestral protein reconstruction methods, and protein family classification methods. We empirically evaluated the performance of iSG against currently available methods by simulating the evolution of the G protein-coupled receptor and lipocalin protein families. We examined their true multiple alignments, reconstruction of the transmembrane regions and beta-strands, and the results of similarity search against a protein database using the simulated sequences. We also presented an example of using iSG for examining how phylogenetic reconstruction is affected by high indel rates.

Mesh:

Substances:

Year:  2006        PMID: 17158778     DOI: 10.1093/molbev/msl195

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  17 in total

1.  SuiteMSA: visual tools for multiple sequence alignment comparison and molecular sequence simulation.

Authors:  Catherine L Anderson; Cory L Strope; Etsuko N Moriyama
Journal:  BMC Bioinformatics       Date:  2011-05-21       Impact factor: 3.169

2.  HGT-Gen: a tool for generating a phylogenetic tree with horizontal gene transfer.

Authors:  Tokumasa Horiike; Daisuke Miyata; Yoshio Tateno; Ryoichi Minai
Journal:  Bioinformation       Date:  2011-10-31

3.  ALF--a simulation framework for genome evolution.

Authors:  Daniel A Dalquen; Maria Anisimova; Gaston H Gonnet; Christophe Dessimoz
Journal:  Mol Biol Evol       Date:  2011-12-08       Impact factor: 16.240

4.  Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  PLoS One       Date:  2015-09-23       Impact factor: 3.240

5.  Ortholog-Finder: A Tool for Constructing an Ortholog Data Set.

Authors:  Tokumasa Horiike; Ryoichi Minai; Daisuke Miyata; Yoji Nakamura; Yoshio Tateno
Journal:  Genome Biol Evol       Date:  2016-01-18       Impact factor: 3.416

6.  INDELible: a flexible simulator of biological sequence evolution.

Authors:  William Fletcher; Ziheng Yang
Journal:  Mol Biol Evol       Date:  2009-05-07       Impact factor: 16.240

7.  Biological sequence simulation for testing complex evolutionary hypotheses: indel-Seq-Gen version 2.0.

Authors:  Cory L Strope; Kevin Abel; Stephen D Scott; Etsuko N Moriyama
Journal:  Mol Biol Evol       Date:  2009-08-03       Impact factor: 16.240

8.  Tools for simulating evolution of aligned genomic regions with integrated parameter estimation.

Authors:  Avinash Varadarajan; Robert K Bradley; Ian H Holmes
Journal:  Genome Biol       Date:  2008-10-08       Impact factor: 13.583

9.  Coev-web: a web platform designed to simulate and evaluate coevolving positions along a phylogenetic tree.

Authors:  Linda Dib; Xavier Meyer; Panu Artimo; Vassilios Ioannidis; Heinz Stockinger; Nicolas Salamin
Journal:  BMC Bioinformatics       Date:  2015-11-23       Impact factor: 3.169

10.  How Do Genomes Create Novel Phenotypes? Insights from the Loss of the Worker Caste in Ant Social Parasites.

Authors:  Chris R Smith; Sara Helms Cahan; Carsten Kemena; Seán G Brady; Wei Yang; Erich Bornberg-Bauer; Ti Eriksson; Juergen Gadau; Martin Helmkampf; Dietrich Gotzek; Misato Okamoto Miyakawa; Andrew V Suarez; Alexander Mikheyev
Journal:  Mol Biol Evol       Date:  2015-07-29       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.