Luna L Sánchez-Reyes1, Martha Kandziora2,3, Emily Jane McTavish2. 1. School of Natural Sciences, University of California, Merced, USA. lsanchezreyes2@ucmerced.edu. 2. School of Natural Sciences, University of California, Merced, USA. 3. Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic.
Abstract
BACKGROUND: Phylogenies are a key part of research in many areas of biology. Tools that automate parts of the process of phylogenetic reconstruction, mainly molecular character matrix assembly, have been developed for the benefit of both specialists in phylogenetics and non-specialists. However, interpreting results, comparing them with previously available phylogenetic hypotheses, and selecting one phylogeny for downstream analyses and discussion still pose difficulties for anyone who is not a specialist in either phylogenetic methods or a particular study group.

RESULTS: Physcraper is a command-line Python program that automates the update of published phylogenies by adding public DNA sequences to the alignments underlying those phylogenies. It also provides a framework for straightforward comparison of published phylogenies with their updated versions, leveraging tools from the Open Tree of Life project to link taxonomic information across databases. Non-specialists can use the program to generate phylogenetic hypotheses based on publicly available expert phylogenetic knowledge. Phylogeneticists and taxonomic group specialists will find it useful for gathering molecular datasets and comparing alternative phylogenetic hypotheses (topologies).

CONCLUSION: The Physcraper workflow showcases the benefits of open science for phylogenetics and encourages researchers to strive for better scientific sharing practices. Physcraper can be used on any operating system and is released under an open-source license. Detailed instructions for installation and usage are available at https://physcraper.readthedocs.io.
Keywords:
DNA alignment; Gene phylogeny; Gene tree; Interoperability; Multilocus; Open Tree of Life; Open science; Otol; Public database; Reproducibility