Literature DB >> 15700405

Accurate detection of very sparse sequence motifs.

Andreas Heger1, Michael Lappe, Liisa Holm.   

Abstract

Protein sequence alignments are more reliable the shorter the evolutionary distance. Here, we align distantly related proteins using many closely spaced intermediate sequences as stepping stones. Such transitive alignments can be generated between any two proteins in a connected set, whether they are direct or indirect sequence neighbors in the underlying library of pairwise alignments. We have implemented a greedy algorithm, MaxFlow, using a novel consistency score to estimate the relative likelihood of alternative paths of transitive alignment. In contrast to traditional profile models of amino acid preferences, MaxFlow models the probability that two positions are structurally equivalent and retains high information content across large distances in sequence space. Thus, MaxFlow is able to identify sparse and narrow active-site sequence signatures which are embedded in high-entropy sequence segments in the structure based multiple alignment of large diverse enzyme superfamilies. In a challenging benchmark based on the urease superfamily, MaxFlow yields better reliability and double coverage compared to available sequence alignment software. This promises to increase information returns from functional and structural genomics, where reliable sequence alignment is a bottleneck to transferring the functional or structural characterization of model proteins to entire protein superfamilies.

Mesh:

Substances:

Year:  2004        PMID: 15700405     DOI: 10.1089/cmb.2004.11.843

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Comprehensive comparison of graph based multiple protein sequence alignment strategies.

Authors:  Ilya Plyusnin; Liisa Holm
Journal:  BMC Bioinformatics       Date:  2012-04-29       Impact factor: 3.169

2.  Distance matrix-based approach to protein structure prediction.

Authors:  Andrzej Kloczkowski; Robert L Jernigan; Zhijun Wu; Guang Song; Lei Yang; Andrzej Kolinski; Piotr Pokarowski
Journal:  J Struct Funct Genomics       Date:  2009-02-18

3.  Sequence comparison by sequence harmony identifies subtype-specific functional sites.

Authors:  Walter Pirovano; K Anton Feenstra; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2006-11-27       Impact factor: 16.971

4.  PairsDB atlas of protein sequence space.

Authors:  Andreas Heger; Eija Korpelainen; Taavi Hupponen; Kimmo Mattila; Vesa Ollikainen; Liisa Holm
Journal:  Nucleic Acids Res       Date:  2007-11-05       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.