3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Orla O'Sullivan, Karsten Suhre, Chantal Abergel, Desmond G Higgins, Cédric Notredame.

Abstract

Most bioinformatics analyses require the assembly of a multiple sequence alignment. It has long been suspected that structural information can help to improve the quality of these alignments, yet the effect of combining sequences and structures has not been evaluated systematically. We developed 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-quality multiple sequence alignments. 3DCoffee is based on TCoffee version 2.00, and uses a mixture of pairwise sequence alignments and pairwise structure comparison methods to generate multiple sequence alignments. We benchmarked 3DCoffee using a subset of HOMSTRAD, the collection of reference structural alignments. We found that combining TCoffee with the threading program Fugue improves alignment accuracy on our HOMSTRAD dataset by four percentage points when only one structure is used per dataset. Using two structures yields an improvement of ten percentage points. Measurements carried out on HOM39, a HOMSTRAD subset composed of distantly related sequences, show a linear correlation between multiple sequence alignment accuracy and the ratio of the number of provided structures to the total number of sequences. Our results suggest that in the case of distantly related sequences, a single structure may not be enough for computing an accurate multiple sequence alignment.
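One way to picture the "mixture of pairwise sequence alignments and pairwise structure comparison methods" is as a pooled list of weighted residue pairs. The Python sketch below is illustrative only and is not taken from the paper: the function names, toy sequences and weights are hypothetical, and summing the weights of a pair supported by both sources is an assumption about how such evidence might be combined.

```python
from collections import defaultdict


def pairs_from_alignment(name_a, row_a, name_b, row_b, weight):
    """Yield ((seq, index), (seq, index), weight) for each aligned residue pair.

    row_a and row_b are gapped rows of one pairwise alignment ('-' is a gap).
    Indices count residues only, starting at 0.
    """
    i = j = 0
    for ca, cb in zip(row_a, row_b):
        if ca != "-" and cb != "-":
            yield (name_a, i), (name_b, j), weight
        if ca != "-":
            i += 1
        if cb != "-":
            j += 1


def merge_libraries(*libraries):
    """Pool residue-pair constraints; weights of duplicated pairs are summed.

    Summation is an assumption made for this sketch.
    """
    merged = defaultdict(float)
    for lib in libraries:
        for res_a, res_b, w in lib:
            key = tuple(sorted((res_a, res_b)))
            merged[key] += w
    return dict(merged)


# Hypothetical pairwise alignments of the same two toy sequences:
# one from a sequence-based aligner, one from a structure-based method.
seq_lib = list(pairs_from_alignment("A", "MK-LV", "B", "MKALV", weight=60.0))
str_lib = list(pairs_from_alignment("A", "MKL-V", "B", "MKALV", weight=90.0))

library = merge_libraries(seq_lib, str_lib)
for (res_a, res_b), w in sorted(library.items()):
    print(res_a, res_b, w)
```

In this toy case the two sources agree on three residue pairs, which end up with the combined weight, and disagree on where the third residue of A belongs, so both alternatives remain in the pool with their individual weights for a later alignment step to arbitrate.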

Year:  2004        PMID: 15201059     DOI: 10.1016/j.jmb.2004.04.058

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  107 in total

1.  PyMod: sequence similarity searches, multiple sequence-structure alignments, and homology modeling within PyMOL.

Authors:  Emanuele Bramucci; Alessandro Paiardini; Francesco Bossa; Stefano Pascarella
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

2.  Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures.

Authors:  Jean-Francois Taly; Cedrik Magis; Giovanni Bussotti; Jia-Ming Chang; Paolo Di Tommaso; Ionas Erb; Jose Espinosa-Carrasco; Carsten Kemena; Cedric Notredame
Journal:  Nat Protoc       Date:  2011-11       Impact factor: 13.491

3.  Tracing determinants of dual substrate specificity in glycoside hydrolase family 5.

Authors:  Zhiwei Chen; Gregory D Friedland; Jose H Pereira; Sonia A Reveco; Rosa Chan; Joshua I Park; Michael P Thelen; Paul D Adams; Adam P Arkin; Jay D Keasling; Harvey W Blanch; Blake A Simmons; Kenneth L Sale; Dylan Chivian; Swapnil R Chhabra
Journal:  J Biol Chem       Date:  2012-05-29       Impact factor: 5.157

4.  The role of a topologically conserved isoleucine in glutathione transferase structure, stability and function.

Authors:  Ikechukwu Achilonu; Samantha Gildenhuys; Loren Fisher; Jonathan Burke; Sylvia Fanucchi; B Trevor Sewell; Manuel Fernandes; Heini W Dirr
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2010-06-23

5.  Gene and genome duplication in Acanthamoeba polyphaga Mimivirus.

Authors:  Karsten Suhre
Journal:  J Virol       Date:  2005-11       Impact factor: 5.103

6.  New methods for inferring population dynamics from microbial sequences. (Review)

Authors:  Marcos Pérez-Losada; Megan L Porter; Loubna Tazi; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2006-04-19       Impact factor: 3.342

7.  Residue centrality, functionally important residues, and active site shape: analysis of enzyme and non-enzyme families.

Authors:  Antonio del Sol; Hirotomo Fujihashi; Dolors Amoros; Ruth Nussinov
Journal:  Protein Sci       Date:  2006-08-01       Impact factor: 6.725

8.  The structure of the CRISPR-associated protein Csa3 provides insight into the regulation of the CRISPR/Cas system.

Authors:  Nathanael G Lintner; Kenneth A Frankel; Susan E Tsutakawa; Donald L Alsbury; Valérie Copié; Mark J Young; John A Tainer; C Martin Lawrence
Journal:  J Mol Biol       Date:  2010-11-18       Impact factor: 5.469

9.  Structural basis for promiscuity and specificity during Candida glabrata invasion of host epithelia.

Authors:  Manuel Maestre-Reyna; Rike Diderrich; Maik Stefan Veelders; Georg Eulenburg; Vitali Kalugin; Stefan Brückner; Petra Keller; Steffen Rupp; Hans-Ulrich Mösch; Lars-Oliver Essen
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-03       Impact factor: 11.205

10.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240
