Literature DB >> 19812729

PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics.

Robert C Thomson1.   

Abstract

PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.

Entities:  

Keywords:  linux; operating system; phylogenetics; phyloinformatics

Year:  2009        PMID: 19812729      PMCID: PMC2747129          DOI: 10.4137/ebo.s3169

Source DB:  PubMed          Journal:  Evol Bioinform Online        ISSN: 1176-9343            Impact factor:   1.625


Introduction

Phylogenetic methods are playing a growing role in nearly all fields of biology. As researchers accumulate data, the size of phylogenetic analyses and the scope of inferences for which they are employed have both increased dramatically.1 Increasingly, researchers employ a diverse array of methods that have been developed by a large and talented group of biologists and programmers. While the availability of these methods is an obvious boon for biology as a whole, the sheer number of software packages that are now regularly employed in phylogenetic research forces biologists and system administrators to spend growing amounts of time installing, configuring and maintaining software rather than focusing on research. The availability of cheap multi-core processors exacerbates this issue, and now many even-moderately sized labs routinely build small clusters or groups of several phylogenetic workstations. Phyloinformatic research, in particular, depends largely on processing power and highly parallel analyses that are spread across many computers and processors. As phyloinformatic pipelines become more complex and sophisticated, careful standardization of operating systems across computers also becomes more complex. While this growing computational power gives researchers the ability to be more creative in attempting complex and time-consuming analyses, the process of compiling, installing, and configuring software for these machines becomes increasingly repetitive, error-prone, and time-consuming. This problem is, in principle, easily solved. What is needed is a simple, stable platform that is specifically geared toward performing phylogenetic analyses. Perhaps the simplest (from a user perspective) solution to this need is a lightweight Linux-based operating system, geared specifically toward phylogenetic and phyloinformatic research. Existing Linux distributions such as Bio-Linux2 and SciBuntu3 represent useful steps in this direction. However, these distributions are aimed at a more general usership, and thus do not incorporate many of the packages that are now standard tools for phylogenetic analysis. Moreover, because phylogenetic methods are currently experiencing rapid development, there is a need for a distribution specifically focusing on this area. PhyLIS aims to fill this need.

Implementation

PhyLIS v1.0 is a free GNU/Linux distribution based on the popular and user-friendly Ubuntu Linux distribution. The name PhyLIS is an acronym for Phylogenetic Linux for Informatics and Systematics. The distribution comes with most commonly used phylogenetic software pre-compiled, installed, and configured, which allows this software to be executed by simply typing the appropriate command (Table 1). PhyLIS also contains popular scripting languages (and appropriate phyloinformatic packages) including Perl (with BioPerl), Python (with BioPython), and R (with several packages). It implements parallel versions of several particularly processor-intensive programs using MPI (including BEST,4 MrBayes,5 and raxML).6 PhyLIS aims to present a complete phylogenetic workbench for all steps of analysis from sequence data manipulation to alignment and tree search, including visualization (for alignments and trees), model selection, divergence time estimation, macroevolutionary analyses and tools for automation and batch analysis.
Table 1

Bundled phylogenetic software packages and commands used to call them.

Software packageCommand
Ade47(R package, see documentation)
Ape8(R package, see documentation)
Aptreeshape9(R package, see documentation)
BEAST10beast
BEAUTi10beauti
BEST (including a parallel version)4mbbest, mbbest_mpi
Bioperl11(Perl package, see documentation)
Biopython12(Python package, see documentation)
Blast2 package13blast2, blastclust, blastall, megablast, etc.
Bucky14bucky
Clustalw15clustalw
Dialign216dialign2–2
FigTree17figtree
Garli18garli
Geiger19(R package, see documentation)
Hmmer20(See documentation)
jModelTest21jmodeltest
LogAnalyser17loganalyser
LogCombiner17logcombiner
MAFFT22mafft
Mesquite (graphical and headless versions)23run_mesquite.sh, run_headless_mesquite.sh
MrBayes (including a parallel version)5mb, mb_MpI
Muscle24muscle
Ouch25(R package, see documentation)
Paloverde26paloverde
PhYLIp27phylip
Phylocom28phylocom
PhyML29phyml
Physim30(R package, see documentation)
Phyutility31phyutility
PoA32poa
RR
r8s33r8s
RaxML (including a parallel version)6raxmlHPC, raxmlHPC_MpI
Readseq34readseq
Seaview35seaview
Seq-gen36seq-gen
T-coffee37t_coffee
Tracer17tracer
TreeAnnotator17treeannotator
TreeLogAnalyser17treeloganalyser
The distribution intentionally re-uses most of the system maintenance packages from Ubuntu, making the actual use of the operating system very similar to Ubuntu (and Debian, upon which Ubuntu is based). Overall, the non-phylogenetic aspects of PhyLIS (e.g. installation, updating software, file system structure) have been kept as close as possible to Ubuntu in order to preserve the large amount of development that the Ubuntu team has put into ensuring an easy-to-use operating system. Because of this, navigating, updating and using the operating system is largely intuitive for users that are already familiar with more widely used operating systems. The bundled phylogenetic tools (Table 1) are available via the command line interface, allowing for straightforward batch analyses and scripting. Software that employs a graphical user interface can be run using graphical launchers on the desktop, in addition to the command line.

Installation

PhyLIS is distributed as a live CD and can be used in two ways. First, it can be booted from the CD without making changes to the underlying operating system. This is useful, for example, for temporarily employing computers (which may not be configured for phylogenetics, or may be configured for other purposes) to run phylogenetic analyses when not in use for their primary purpose. At the completion of the analysis, the results can be transferred to permanent storage and the machine rebooted, restoring it to its previous configuration and operating system. The live CD mode is also useful for testing PhyLIS with little time commitment before deciding whether to install it. Second, PhyLIS can be directly installed from the live CD using a simple graphical installer that allows for a new, complete installation (erasing the previous operating system), or a partitioned installation (allowing for dual boot systems). The distribution has been tested and can be installed on most 32- and 64-bit PCs (including Apple computers that use Intel processors).

Conclusions

PhyLIS aims to simplify the process of carrying out complex phylogenetic analyses and has utility both for individual researchers and for teaching environments. The operating system presents a large suite of tools in a stable platform and should be useful for system administrators performing many installations. However, it is also simple enough to use that individual researchers with little previous Linux experience can employ it effectively. PhyLIS is under active development and undergoes periodic updates every six months to incorporate new versions of software and minor bug fixes. Users are encouraged to request additional software or features that would enhance the utility of the operating system; these will be incorporated into future releases of PhyLIS. The latest release is freely available at http://www.eve.ucdavis.edu/rcthomson/phylis.
  25 in total

1.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

2.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

3.  Multiple sequence alignment with the Clustal series of programs.

Authors:  Ramu Chenna; Hideaki Sugawara; Tadashi Koike; Rodrigo Lopez; Toby J Gibson; Desmond G Higgins; Julie D Thompson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

5.  apTreeshape: statistical analysis of phylogenetic tree shape.

Authors:  Nicolas Bortolussi; Eric Durand; Michael Blum; Olivier François
Journal:  Bioinformatics       Date:  2005-12-01       Impact factor: 6.937

6.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

7.  Bayesian estimation of concordance among gene trees.

Authors:  Cécile Ané; Bret Larget; David A Baum; Stacey D Smith; Antonis Rokas
Journal:  Mol Biol Evol       Date:  2006-11-09       Impact factor: 16.240

8.  Phyutility: a phyloinformatics tool for trees, alignments and molecular data.

Authors:  Stephen A Smith; Casey W Dunn
Journal:  Bioinformatics       Date:  2008-01-28       Impact factor: 6.937

9.  GEIGER: investigating evolutionary radiations.

Authors:  Luke J Harmon; Jason T Weir; Chad D Brock; Richard E Glor; Wendell Challenger
Journal:  Bioinformatics       Date:  2007-11-15       Impact factor: 6.937

10.  jModelTest: phylogenetic model averaging.

Authors:  David Posada
Journal:  Mol Biol Evol       Date:  2008-04-08       Impact factor: 16.240

View more
  3 in total

1.  BMPOS: a Flexible and User-Friendly Tool Sets for Microbiome Studies.

Authors:  Victor S Pylro; Daniel K Morais; Francislon S de Oliveira; Fausto G Dos Santos; Leandro N Lemos; Guilherme Oliveira; Luiz F W Roesch
Journal:  Microb Ecol       Date:  2016-05-24       Impact factor: 4.552

2.  Rapid progress on the vertebrate tree of life.

Authors:  Robert C Thomson; H Bradley Shaffer
Journal:  BMC Biol       Date:  2010-03-08       Impact factor: 7.431

3.  LXtoo: an integrated live Linux distribution for the bioinformatics community.

Authors:  Guangchuang Yu; Li-Gen Wang; Xiao-Hua Meng; Qing-Yu He
Journal:  BMC Res Notes       Date:  2012-07-19
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.