Literature DB >> 20849574

ClustalXeed: a GUI-based grid computation version for high performance and terabyte size multiple sequence alignment.

Taeho Kim1, Hyun Joo.   

Abstract

BACKGROUND: There is an increasing demand to assemble and align large-scale biological sequence data sets. The commonly used multiple sequence alignment programs are still limited in their ability to handle very large amounts of sequences because the system lacks a scalable high-performance computing (HPC) environment with a greatly extended data storage capacity.
RESULTS: We designed ClustalXeed, a software system for multiple sequence alignment with incremental improvements over previous versions of the ClustalX and ClustalW-MPI software. The primary advantage of ClustalXeed over other multiple sequence alignment software is its ability to align a large family of protein or nucleic acid sequences. To solve the conventional memory-dependency problem, ClustalXeed uses both physical random access memory (RAM) and a distributed file-allocation system for distance matrix construction and pair-align computation. The computation efficiency of disk-storage system was markedly improved by implementing an efficient load-balancing algorithm, called "idle node-seeking task algorithm" (INSTA). The new editing option and the graphical user interface (GUI) provide ready access to a parallel-computing environment for users who seek fast and easy alignment of large DNA and protein sequence sets.
CONCLUSIONS: ClustalXeed can now compute a large volume of biological sequence data sets, which were not tractable in any other parallel or single MSA program. The main developments include: 1) the ability to tackle larger sequence alignment problems than possible with previous systems through markedly improved storage-handling capabilities. 2) Implementing an efficient task load-balancing algorithm, INSTA, which improves overall processing times for multiple sequence alignment with input sequences of non-uniform length. 3) Support for both single PC and distributed cluster systems.

Entities:  

Mesh:

Year:  2010        PMID: 20849574      PMCID: PMC2949895          DOI: 10.1186/1471-2105-11-467

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  13 in total

1.  On the parallelisation of bioinformatics applications.

Authors:  O Trelles
Journal:  Brief Bioinform       Date:  2001-05       Impact factor: 11.622

2.  Multiple sequence alignment with the Clustal series of programs.

Authors:  Ramu Chenna; Hideaki Sugawara; Tadashi Koike; Rodrigo Lopez; Toby J Gibson; Desmond G Higgins; Julie D Thompson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  ClustalW-MPI: ClustalW analysis using distributed and parallel computing.

Authors:  Kuo-Bin Li
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  DIALIGN P: fast pair-wise and multiple sequence alignment using parallel processors.

Authors:  Martin Schmollinger; Kay Nieselt; Michael Kaufmann; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2004-09-09       Impact factor: 3.169

5.  GOR method for predicting protein secondary structure from amino acid sequence.

Authors:  J Garnier; J F Gibrat; B Robson
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

6.  PHD: predicting one-dimensional protein structure by profile-based neural networks.

Authors:  B Rost
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

7.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.

Authors:  J D Thompson; T J Gibson; F Plewniak; F Jeanmougin; D G Higgins
Journal:  Nucleic Acids Res       Date:  1997-12-15       Impact factor: 16.971

8.  TreeView: an application to display phylogenetic trees on personal computers.

Authors:  R D Page
Journal:  Comput Appl Biosci       Date:  1996-08

9.  CLUSTAL: a package for performing multiple sequence alignment on a microcomputer.

Authors:  D G Higgins; P M Sharp
Journal:  Gene       Date:  1988-12-15       Impact factor: 3.688

10.  A comparison of MSA tools.

Authors:  Nadia Essoussi; Khaddouja Boujenfa; Mohamed Limam
Journal:  Bioinformation       Date:  2008-07-31
View more
  5 in total

1.  GmRAV confers ecological adaptation through photoperiod control of flowering time and maturity in soybean.

Authors:  Yuhe Wang; Chongjing Xu; Jiafan Sun; Lidong Dong; Minmin Li; Ying Liu; Jianhui Wang; Xiaoming Zhang; Dongmei Li; Jingzhe Sun; Yuntong Zhang; Jinming Shan; Wenbin Li; Lin Zhao
Journal:  Plant Physiol       Date:  2021-09-04       Impact factor: 8.005

2.  Microscale thermophoresis as a powerful tool for screening glycosyltransferases involved in cell wall biosynthesis.

Authors:  Wanchen Shao; Rita Sharma; Mads H Clausen; Henrik V Scheller
Journal:  Plant Methods       Date:  2020-07-28       Impact factor: 4.993

3.  Meta-alignment with crumble and prune: partitioning very large alignment problems for performance and parallelization.

Authors:  Krishna M Roskin; Benedict Paten; David Haussler
Journal:  BMC Bioinformatics       Date:  2011-05-10       Impact factor: 3.307

4.  Systematic analysis of differentially expressed ZmMYB genes related to drought stress in maize.

Authors:  Peng-Yu Zhang; Xiao Qiu; Jia-Xu Fu; Guo-Rui Wang; Li Wei; Tong-Chao Wang
Journal:  Physiol Mol Biol Plants       Date:  2021-05-29

5.  An improved distance matrix computation algorithm for multicore clusters.

Authors:  Mohammed W Al-Neama; Naglaa M Reda; Fayed F M Ghaleb
Journal:  Biomed Res Int       Date:  2014-06-12       Impact factor: 3.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.