Rapid similarity search of proteins using alignments of domain arrangements.

Nicolas Terrapon, January Weiner, Sonja Grath, Andrew D Moore, Erich Bornberg-Bauer.

Abstract

MOTIVATION: Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. To determine sequence similarity in proteins, the most widely used methods rely on models of sequence evolution and compare amino-acid strings in search of conserved linear stretches. Probabilistic models or sequence profiles capture the position-specific variation in an alignment of homologous sequences and can identify conserved motifs or domains. While profile-based search methods are generally more accurate than simple sequence comparison methods, they tend to be computationally more demanding. In recent years, several methods have emerged that perform protein similarity searches based on domain composition. However, few methods have considered the linear arrangement of domains when conducting similarity searches, despite strong evidence that domain order can harbour considerable functional and evolutionary signal.
RESULTS: Here, we introduce an alignment scheme that uses a classical dynamic programming approach to the global alignment of domains. We illustrate that representing proteins as strings of domains (domain arrangements) and comparing these strings globally allows for a homology search that is both fast and sensitive. Further, we demonstrate that the presented methods complement existing methods by finding similar proteins missed by popular amino-acid-based comparison methods.
AVAILABILITY: An implementation of the presented algorithms, a web-based interface and a command-line program for batch searching against the UniProt database can be found at http://rads.uni-muenster.de. Furthermore, we provide a Java API for programmatic access to domain-string-based search methods.
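
The global alignment of domain arrangements mentioned in RESULTS can be illustrated with a small sketch. The code below is a generic Needleman-Wunsch alignment in which each symbol is a domain identifier rather than an amino acid. It is not the authors' implementation: the function name `align_domains`, the flat match/mismatch/gap scores and the example domain names are all assumptions chosen for illustration, and the published method may score domain pairs and gaps differently.

```python
from typing import List, Tuple

GAP = "-"  # placeholder symbol for an unaligned position (assumption)


def align_domains(
    a: List[str],
    b: List[str],
    match: int = 2,      # hypothetical score for identical domains
    mismatch: int = -1,  # hypothetical score for different domains
    gap: int = -1,       # hypothetical linear gap penalty
) -> Tuple[int, List[str], List[str]]:
    """Globally align two domain arrangements (lists of domain identifiers)."""
    n, m = len(a), len(b)

    def sub(i: int, j: int) -> int:
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align two domains
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = n, m
    out_a: List[str] = []
    out_b: List[str] = []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append(GAP)
            i -= 1
        else:
            out_a.append(GAP)
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], out_a[::-1], out_b[::-1]


# Illustrative arrangements; the domain names are examples only.
query = ["SH3", "SH2", "Pkinase"]
target = ["SH2", "Pkinase"]
total, aligned_query, aligned_target = align_domains(query, target)
print(total, aligned_query, aligned_target)
```

Because a domain arrangement typically has far fewer symbols than the corresponding amino-acid sequence, the quadratic dynamic programming table stays small, which is consistent with the speed the abstract reports for this kind of comparison.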

Year:  2013        PMID: 23828785     DOI: 10.1093/bioinformatics/btt379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

Review 1.  From local structure to a global framework: recognition of protein folds.

Authors:  Agnel Praveen Joseph; Alexandre G de Brevern
Journal:  J R Soc Interface       Date:  2014-04-16       Impact factor: 4.118

Review 2.  The language of the protein universe.

Authors:  Andrea Scaiewicz; Michael Levitt
Journal:  Curr Opin Genet Dev       Date:  2015-11-03       Impact factor: 5.578

3.  Domain similarity based orthology detection.

Authors:  Tristan Bitard-Feildel; Carsten Kemena; Jenny M Greenwood; Erich Bornberg-Bauer
Journal:  BMC Bioinformatics       Date:  2015-05-13       Impact factor: 3.169

4.  Glycan complexity dictates microbial resource allocation in the large intestine.

Authors:  Artur Rogowski; Jonathon A Briggs; Jennifer C Mortimer; Theodora Tryfona; Nicolas Terrapon; Elisabeth C Lowe; Arnaud Baslé; Carl Morland; Alison M Day; Hongjun Zheng; Theresa E Rogers; Paul Thompson; Alastair R Hawkins; Madhav P Yadav; Bernard Henrissat; Eric C Martens; Paul Dupree; Harry J Gilbert; David N Bolam
Journal:  Nat Commun       Date:  2015-06-26       Impact factor: 14.919

5.  SIMAP--the database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage.

Authors:  Roland Arnold; Florian Goldenberg; Hans-Werner Mewes; Thomas Rattei
Journal:  Nucleic Acids Res       Date:  2013-10-27       Impact factor: 16.971

6.  MDAT- Aligning multiple domain arrangements.

Authors:  Carsten Kemena; Tristan Bitard-Feildel; Erich Bornberg-Bauer
Journal:  BMC Bioinformatics       Date:  2015-01-28       Impact factor: 3.169

7.  UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB.

Authors:  Tunca Doğan; Alistair MacDougall; Rabie Saidi; Diego Poggioli; Alex Bateman; Claire O'Donovan; Maria J Martin
Journal:  Bioinformatics       Date:  2016-03-07       Impact factor: 6.937

8.  PULDB: the expanded database of Polysaccharide Utilization Loci.

Authors:  Nicolas Terrapon; Vincent Lombard; Élodie Drula; Pascal Lapébie; Saad Al-Masaudi; Harry J Gilbert; Bernard Henrissat
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  How members of the human gut microbiota overcome the sulfation problem posed by glycosaminoglycans.

Authors:  Alan Cartmell; Elisabeth C Lowe; Arnaud Baslé; Susan J Firbank; Didier A Ndeh; Heath Murray; Nicolas Terrapon; Vincent Lombard; Bernard Henrissat; Jeremy E Turnbull; Mirjam Czjzek; Harry J Gilbert; David N Bolam
Journal:  Proc Natl Acad Sci U S A       Date:  2017-06-19       Impact factor: 11.205

Review 10.  Alignment-free sequence comparison: benefits, applications, and tools.

Authors:  Andrzej Zielezinski; Susana Vinga; Jonas Almeida; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2017-10-03       Impact factor: 13.583
