Jason S Papadopoulos (1), Richa Agarwala. (1) National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD 20894, USA.
Abstract
MOTIVATION: A tool that simultaneously aligns multiple protein sequences, automatically utilizes information about protein domains, and offers a good compromise between speed and accuracy will have practical advantages over current tools.
RESULTS: We describe COBALT, a constraint-based alignment tool that implements a general framework for multiple alignment of protein sequences. COBALT finds a collection of pairwise constraints derived from database searches, sequence similarity and user input, combines these pairwise constraints, and then incorporates them into a progressive multiple alignment. We show that using constraints derived from the conserved domain database (CDD) and the PROSITE protein-motif database improves COBALT's alignment quality. We also show that COBALT has reasonable runtime performance and alignment accuracy comparable to or exceeding that of other tools for a broad range of problems.
AVAILABILITY: COBALT is included in the NCBI C++ toolkit. A Linux executable for COBALT and the CDD and PROSITE data used are available at: ftp://ftp.ncbi.nlm.nih.gov/pub/agarwala/cobalt
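To illustrate what "combining pairwise constraints" can involve, here is a minimal Python sketch. It is not COBALT's implementation, and the abstract does not specify the procedure. The sketch assumes each constraint is a pair of matched segments (one per sequence) with a score, and it selects the highest-scoring subset that is mutually consistent, meaning the segments appear in the same order in both sequences without overlapping. The `Constraint` type and the `consistent_subset` function are hypothetical names introduced for this example.

```python
from typing import List, NamedTuple


class Constraint(NamedTuple):
    """Hypothetical pairwise constraint: segment [a_start, a_end] of sequence A
    should align with segment [b_start, b_end] of sequence B."""
    a_start: int
    a_end: int
    b_start: int
    b_end: int
    score: float


def consistent_subset(constraints: List[Constraint]) -> List[Constraint]:
    """Return a maximum-score chain of constraints that are collinear
    (same order in both sequences) and non-overlapping.

    Assumption: this simple O(n^2) chaining stands in for whatever
    combination step a real tool uses."""
    ordered = sorted(constraints, key=lambda c: (c.a_start, c.b_start))
    if not ordered:
        return []
    n = len(ordered)
    best = [c.score for c in ordered]  # best chain score ending at i
    prev = [-1] * n                    # back-pointer to the previous chain member
    for i in range(n):
        for j in range(i):
            cj, ci = ordered[j], ordered[i]
            # cj may precede ci only if it ends before ci starts in both sequences
            if cj.a_end < ci.a_start and cj.b_end < ci.b_start:
                candidate = best[j] + ci.score
                if candidate > best[i]:
                    best[i] = candidate
                    prev[i] = j
    # Trace back from the best-scoring chain end
    i = max(range(n), key=best.__getitem__)
    chain = []
    while i != -1:
        chain.append(ordered[i])
        i = prev[i]
    return chain[::-1]
```

In this sketch a constraint that crosses a higher-scoring chain is simply dropped; the surviving constraints could then be passed to an alignment step as regions that must be aligned to each other.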
Authors: Alexis I Cocozaki; Nancy F Ramia; Yaming Shao; Caryn R Hale; Rebecca M Terns; Michael P Terns; Hong Li Journal: Structure Date: 2012-03-07 Impact factor: 5.006
Authors: David Hoogewijs; Bettina Ebner; Francesca Germani; Federico G Hoffmann; Andrej Fabrizius; Luc Moens; Thorsten Burmester; Sylvia Dewilde; Jay F Storz; Serge N Vinogradov; Thomas Hankeln Journal: Mol Biol Evol Date: 2011-11-24 Impact factor: 16.240
Authors: Manuel Maestre-Reyna; Rike Diderrich; Maik Stefan Veelders; Georg Eulenburg; Vitali Kalugin; Stefan Brückner; Petra Keller; Steffen Rupp; Hans-Ulrich Mösch; Lars-Oliver Essen Journal: Proc Natl Acad Sci U S A Date: 2012-10-03 Impact factor: 11.205
Authors: Lars Paßvogel; Patricia Trübe; Franziska Schuster; Barbara G Klupp; Thomas C Mettenleiter Journal: J Virol Date: 2013-02-06 Impact factor: 5.103