Literature DB >> 21593132

Java bioinformatics analysis web services for multiple sequence alignment--JABAWS:MSA.

Peter V Troshin1, James B Procter, Geoffrey J Barton.   

Abstract

SUMMARY: JABAWS is a web services framework that simplifies the deployment of web services for bioinformatics. JABAWS:MSA provides services for five multiple sequence alignment (MSA) methods (Probcons, T-coffee, Muscle, Mafft and ClustalW), and is the system employed by the Jalview multiple sequence analysis workbench since version 2.6. A fully functional, easy to set up server is provided as a Virtual Appliance (VA), which can be run on most operating systems that support a virtualization environment such as VMware or Oracle VirtualBox. JABAWS is also distributed as a Web Application aRchive (WAR) and can be configured to run on a single computer and/or a cluster managed by Grid Engine, LSF or other queuing systems that support DRMAA. JABAWS:MSA provides clients full access to each application's parameters, allows administrators to specify named parameter preset combinations and execution limits for each application through simple configuration files. The JABAWS command-line client allows integration of JABAWS services into conventional scripts.
AVAILABILITY AND IMPLEMENTATION: JABAWS is made freely available under the Apache 2 license and can be obtained from: http://www.compbio.dundee.ac.uk/jabaws.

Entities:  

Mesh:

Year:  2011        PMID: 21593132      PMCID: PMC3129525          DOI: 10.1093/bioinformatics/btr304

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

The explosion in biological data volume, diversity and complexity, has been paralleled by the development of computational methods to aid data analysis. Locally installed copies of programs give full access to their capabilities, and the ability to run long CPU or memory intensive jobs. However, configuring software to install easily on a range of operating systems is challenging. Instead, many developers make their techniques available through a custom web page, or increasingly as a web service, which provides a flexible, programmable and scriptable (Hull ) middle ground between installing software locally and accessing via a web browser. There is now a rich ecosystem of public web services (Bhagat ), but they do not always meet the needs of users. Their interfaces may have limited functionality and even the biggest service providers, such as EBI and NCBI, limit the computational resources (memory/CPU) available to an individual so that total demand can be met. Web services at the University of Dundee were created in 2005 as part of the Jalview project (Waterhouse ) to provide its users access to ClustalW (Larkin ), Mafft (Katoh and Toh, 2008) and Muscle (Edgar, 2004) multiple alignment programs and the JPred secondary structure predictor (Cole ). Although convenient, these suffer from the same public service limitations, and many of the > 20 000 users of Jalview requested the ability to run alignments locally on their workstation or to deploy the same services on their institution's cluster. Although locally installable web service systems such as Opal 2 (Krishnan ) and SoapLab 2 (Senger ) are available, they require significant investment in time or expertise to install. In contrast, the JABAWS:MSA system introduced here, is an easy to install web services solution for multiple alignment that can run on a desktop or laptop computer, local server or computer cluster. JABAWS:MSA's rich interface lets the user track execution progress, retrieve results as a data structure instead of file(s) and provides SOAP services conforming to WS-I Basic Profile 1.2/2.0. While developed primarily for the Jalview Desktop, these can be scripted against, and a command-line client for this purpose is distributed with the system.

2 THE SERVER

The JABAWS:MSA server and command-line client are both implemented in Java, but the server exploits native programs to provide ClustalW (Larkin ), Muscle (Edgar, 2004), Probcons (Do ), T-coffee (Notredame ) and Mafft (Katoh and Toh, 2008) alignment services. These programs can be difficult to compile or incompatible with some operating systems, so a range of JABAWS:MSA distributions have been created to suit most environments. The basic JABAWS:MSA distribution is a pre-configured Java Web Application aRchive (WAR) conforming to the Servlet 2.4 specification, optionally bundled with the source and prebuilt binaries of its dependent bioinformatics programs. This option is most appropriate for experienced users wishing to set up multiple alignment services for their group or institution, or those working on Linux or Mac operating systems. For most other users, a ‘Virtual Appliance’ (VA) (http://en.wikipedia.org/wiki/Virtual_appliance) is supplied that is built on TurnKey Linux and contains a fully operational JABAWS:MSA server. This can be run on a freely available virtualization product such as VMware player (http://www.vmware.com/products/player) or Oracle VirtualBox (http://www.virtualbox.org).

3 JABAWS:MSA CONFIGURATION OPTIONS

A native JABAWS:MSA installation can be tightly integrated with the local computing infrastructure. It allows fine-grained control over the calculations to be run on the server through three user-editable configuration files: ‘Parameters’, ‘Limits’ and ‘Presets’, for each alignment method. The Parameters file defines command-line parameters that are supported by the method, its valid ranges and which parameters are mandatory, while the Presets file allows combinations of parameter settings to be given a name and brief description to explain their use. For example, Mafft has six alternative presets, enabling a range of accuracy or speed-oriented options. The Limits file specifies the maximum length and number of sequences allowed for a method, and separate limits may be set for calculations to be run on the same server as Tomcat, or via a cluster queuing system. Further configuration files centralize the definition of application-specific directories, and any command-line flags needed to enable JABAWS to interact correctly with a queuing system. Together, these permit JABAWS to operate efficiently in nearly any compute environment, by executing jobs on the local machine, on a cluster through a queuing system or on whichever resource is most appropriate according to a job's size and the availability of local CPUs.

4 THE CLIENTS

Web services provided by a JABAWS:MSA installation can be accessed from any programming language, providing libraries are available for the consumption of SOAP web services. The service interface allows a client to discover the execution limits, command line parameters and named presets for a program, and once a calculation is submitted, check on its progress by retrieving any text normally output to the console by the program. Importantly, data structures and formats for all services are standardized, so the same client can submit data to, and handle the results from any of the alignment programs provided by any JABAWS:MSA server. A lightweight, ready to use JABAWS:MSA command-line client is provided which allows access to JABAWS alignment services. The client is Java based, and is also a useful guide for those who wish to develop or adapt their own clients or test new JABAWS:MSA installations. For non-programmers, a graphical client is available within Jalview 2.6. This client connects to a JABAWS:MSA server to discover which services are available, their parameters and presets, and then populates the user interface with these options.

5 CONCLUSION

A collection of portable web services, initially for multiple sequence alignment, has been developed. It is designed to interact with other software or services and be deployed easily on a variety of computing infrastructures, either directly or through the use of virtualization. JABAWS:MSA provides an in-house web services solution that is limited only by locally available computing resources. Furthermore, its client provides fault tolerance by allowing access to multiple server instances, while the service interface gives a high level of control over individual tasks. Comprehensive metadata provided by each service enables their integration with Jalview, to allow novices and experts alike to utilize their local compute resources efficiently. Finally, although JABAWS:MSA is focused on multiple alignment, the JABAWS framework is flexible, and will soon be extended to include analytical methods such as secondary structure and disorder prediction.
  9 in total

1.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

2.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

3.  ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors:  Chuong B Do; Mahathi S P Mahabhashyam; Michael Brudno; Serafim Batzoglou
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

4.  Recent developments in the MAFFT multiple sequence alignment program.

Authors:  Kazutaka Katoh; Hiroyuki Toh
Journal:  Brief Bioinform       Date:  2008-03-27       Impact factor: 11.622

5.  Clustal W and Clustal X version 2.0.

Authors:  M A Larkin; G Blackshields; N P Brown; R Chenna; P A McGettigan; H McWilliam; F Valentin; I M Wallace; A Wilm; R Lopez; J D Thompson; T J Gibson; D G Higgins
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

6.  Jalview Version 2--a multiple sequence alignment editor and analysis workbench.

Authors:  Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton
Journal:  Bioinformatics       Date:  2009-01-16       Impact factor: 6.937

7.  BioCatalogue: a universal catalogue of web services for the life sciences.

Authors:  Jiten Bhagat; Franck Tanoh; Eric Nzuobontane; Thomas Laurent; Jerzy Orlowski; Marco Roos; Katy Wolstencroft; Sergejs Aleksejevs; Robert Stevens; Steve Pettifer; Rodrigo Lopez; Carole A Goble
Journal:  Nucleic Acids Res       Date:  2010-05-19       Impact factor: 16.971

8.  Taverna: a tool for building and running workflows of services.

Authors:  Duncan Hull; Katy Wolstencroft; Robert Stevens; Carole Goble; Mathew R Pocock; Peter Li; Tom Oinn
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

9.  The Jpred 3 secondary structure prediction server.

Authors:  Christian Cole; Jonathan D Barber; Geoffrey J Barton
Journal:  Nucleic Acids Res       Date:  2008-05-07       Impact factor: 16.971

  9 in total
  47 in total

1.  Initial Characterization of the Two ClpP Paralogs of Chlamydia trachomatis Suggests Unique Functionality for Each.

Authors:  Nicholas A Wood; Krystal Y Chung; Amanda M Blocker; Nathalia Rodrigues de Almeida; Martin Conda-Sheridan; Derek J Fisher; Scot P Ouellette
Journal:  J Bacteriol       Date:  2018-12-20       Impact factor: 3.490

2.  The Tribbles 2 (TRB2) pseudokinase binds to ATP and autophosphorylates in a metal-independent manner.

Authors:  Fiona P Bailey; Dominic P Byrne; Krishnadev Oruganty; Claire E Eyers; Christopher J Novotny; Kevan M Shokat; Natarajan Kannan; Patrick A Eyers
Journal:  Biochem J       Date:  2015-04-01       Impact factor: 3.857

3.  Foldability of a Natural De Novo Evolved Protein.

Authors:  Dixie Bungard; Jacob S Copple; Jing Yan; Jimmy J Chhun; Vlad K Kumirov; Scott G Foy; Joanna Masel; Vicki H Wysocki; Matthew H J Cordes
Journal:  Structure       Date:  2017-10-12       Impact factor: 5.006

4.  Crystal structure of lipoate-bound lipoate ligase 1, LipL1, from Plasmodium falciparum.

Authors:  Alfredo J Guerra; Gustavo A Afanador; Sean T Prigge
Journal:  Proteins       Date:  2017-06-07

5.  Databases, Knowledgebases, and Software Tools for Virus Informatics.

Authors:  Yuxin Lin; Yulan Qian; Xin Qi; Bairong Shen
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 2.622

6.  RAV transcription factor regulatory function in response to salt stress in two Iranian wheat landraces.

Authors:  Mohamad Karami; Narjes Fatahi; Tahmineh Lohrasebi; Khadijeh Razavi
Journal:  J Plant Res       Date:  2021-12-01       Impact factor: 2.629

7.  Characterization of a novel water pocket inside the human Cx26 hemichannel structure.

Authors:  Raul Araya-Secchi; Tomas Perez-Acle; Seung-Gu Kang; Tien Huynh; Alejandro Bernardin; Yerko Escalona; Jose-Antonio Garate; Agustin D Martínez; Isaac E García; Juan C Sáez; Ruhong Zhou
Journal:  Biophys J       Date:  2014-08-05       Impact factor: 4.033

8.  Formiminoglutamase from Trypanosoma cruzi is an arginase-like manganese metalloenzyme.

Authors:  Yang Hai; Reilly Jane Dugery; David Healy; David W Christianson
Journal:  Biochemistry       Date:  2013-11-21       Impact factor: 3.162

9.  Structural analyses of the Haemophilus influenzae peptidoglycan synthase activator LpoA suggest multiple conformations in solution.

Authors:  Karthik Sathiyamoorthy; J Vijayalakshmi; Bhramara Tirupati; Lixin Fan; Mark A Saper
Journal:  J Biol Chem       Date:  2017-09-08       Impact factor: 5.157

10.  Alignment of Biological Sequences with Jalview.

Authors:  James B Procter; G Mungo Carstairs; Ben Soares; Kira Mourão; T Charles Ofoegbu; Daniel Barton; Lauren Lui; Anne Menard; Natasha Sherstnev; David Roldan-Martinez; Suzanne Duce; David M A Martin; Geoffrey J Barton
Journal:  Methods Mol Biol       Date:  2021
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.