Tree Pruner: An efficient tool for selecting data from a biased genetic database.

Mohan Krishnamoorthy, Pragneshkumar Patel, Mira Dimitrijevic, Jonathan Dietrich, Margaret Green, Catherine Macken.

Abstract

BACKGROUND: Large databases of genetic data are often biased in their representation. Thus, selection of genetic data with desired properties, such as evolutionary representation or shared genotypes, is problematic. Selection on the basis of epidemiological variables may not achieve the desired properties. Available automated approaches to the selection of influenza genetic data trade speed and simplicity against control over the quality and contents of the dataset. A poorly chosen dataset may be detrimental to subsequent analyses.
RESULTS: We developed a tool, Tree Pruner, for obtaining a dataset with desired evolutionary properties from a large, biased genetic database. Tree Pruner provides the user with an interactive phylogenetic tree as a means of editing the initial dataset from which the tree was inferred. The tree visualization changes dynamically, using colors and shading, reflecting Tree Pruner actions. At the end of a Tree Pruner session, the editing actions are implemented in the dataset. Currently, Tree Pruner is implemented on the Influenza Research Database (IRD). The data management capabilities of the IRD allow the user to store a pruned dataset for additional pruning or for subsequent analysis. Tree Pruner can be easily adapted for use with other organisms.
CONCLUSIONS: Tree Pruner is an efficient, manual tool for selecting a high-quality dataset with desired evolutionary properties from a biased database of genetic sequences. It offers an important alternative to automated approaches to the same goal, by providing the user with a dynamic, visual guide to the ongoing selection process and ultimate control over the contents (and therefore quality) of the dataset.
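
The workflow described in the abstract (mark parts of a tree during a session, then apply the accumulated edits to the underlying dataset at the end) can be illustrated with a minimal sketch. This is not Tree Pruner's implementation, which the abstract does not detail; the `Node` and `PruningSession` classes, their methods, and the toy sequences are all hypothetical names chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    """A node in a rooted tree; a leaf's name is a sequence identifier."""
    name: str
    children: list[Node] = field(default_factory=list)

    def leaf_names(self) -> list[str]:
        """Return the names of all leaves at or below this node."""
        if not self.children:
            return [self.name]
        names: list[str] = []
        for child in self.children:
            names.extend(child.leaf_names())
        return names


@dataclass
class PruningSession:
    """Hypothetical session: collects marks, leaves the dataset untouched
    until apply() is called."""
    marked: set[str] = field(default_factory=set)

    def mark_clade(self, node: Node) -> None:
        """Mark every leaf under `node` for removal."""
        self.marked.update(node.leaf_names())

    def unmark_clade(self, node: Node) -> None:
        """Undo a mark for every leaf under `node`."""
        self.marked.difference_update(node.leaf_names())

    def apply(self, dataset: dict[str, str]) -> dict[str, str]:
        """Return a new dataset without the marked sequences."""
        return {k: v for k, v in dataset.items() if k not in self.marked}


# Toy example (made-up identifiers and sequences).
clade = Node("clade_A", [Node("seq1"), Node("seq2")])
root = Node("root", [clade, Node("seq3")])
dataset = {"seq1": "ATGC", "seq2": "ATGA", "seq3": "TTGC"}

session = PruningSession()
session.mark_clade(clade)          # prune an over-represented clade
pruned = session.apply(dataset)    # edits reach the dataset only here
```

Deferring the dataset change to a single `apply` step mirrors the abstract's statement that editing actions are implemented in the dataset at the end of a session; how marks are rendered (colors, shading) is outside the scope of this sketch.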

Year:  2011        PMID: 21306634      PMCID: PMC3045304          DOI: 10.1186/1471-2105-12-51

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.307


  4 in total

1.  TreeTuner: A pipeline for minimizing redundancy and complexity in large phylogenetic datasets.

Authors:  Xi Zhang; Yining Hu; Laura Eme; Shinichiro Maruyama; Robert J M Eveleigh; Bruce A Curtis; Shannon J Sibbald; Julia F Hopkins; Gina V Filloramo; Klaas J van Wijk; John M Archibald
Journal:  STAR Protoc       Date:  2022-02-15

2.  Influenza research database: an integrated bioinformatics resource for influenza research and surveillance.

Authors:  R Burke Squires; Jyothi Noronha; Victoria Hunt; Adolfo García-Sastre; Catherine Macken; Nicole Baumgarth; David Suarez; Brett E Pickett; Yun Zhang; Christopher N Larsen; Alvin Ramsey; Liwei Zhou; Sam Zaremba; Sanjeev Kumar; Jon Deitrich; Edward Klem; Richard H Scheuermann
Journal:  Influenza Other Respir Viruses       Date:  2012-01-20       Impact factor: 4.380

3.  Treetrimmer: a method for phylogenetic dataset size reduction.

Authors:  Shinichiro Maruyama; Robert J M Eveleigh; John M Archibald
Journal:  BMC Res Notes       Date:  2013-04-12

4.  Treemmer: a tool to reduce large phylogenetic datasets with minimal loss of diversity.

Authors:  Fabrizio Menardo; Chloé Loiseau; Daniela Brites; Mireia Coscolla; Sebastian M Gygli; Liliana K Rutaihwa; Andrej Trauner; Christian Beisel; Sonia Borrell; Sebastien Gagneux
Journal:  BMC Bioinformatics       Date:  2018-05-02       Impact factor: 3.169

