Literature DB >> 19361329

Geometric aspects of biological sequence comparison.

Aleksandar Stojmirović1, Yi-Kuo Yu.   

Abstract

We introduce a geometric framework suitable for studying the relationships among biological sequences. In contrast to previous works, our formulation allows asymmetric distances (quasi-metrics), originating from uneven weighting of strings, which may induce non-trivial partial orders on sets of biosequences. The distances considered are more general than traditional generalized string edit distances. In particular, our framework enables non-trivial conversion between sequence similarities, both local and global, and distances. Our constructions apply to a wide class of scoring schemes and require much less restrictive gap penalties than the ones regularly used. Numerous examples are provided to illustrate the concepts introduced and their potential applications.

Entities:  

Mesh:

Year:  2009        PMID: 19361329      PMCID: PMC2801405          DOI: 10.1089/cmb.2008.0100

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  51 in total

1.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

2.  Optimization of a new score function for the detection of remote homologs.

Authors:  M Kann; B Qian; R A Goldstein
Journal:  Proteins       Date:  2000-12-01

3.  PHAT: a transmembrane-specific substitution matrix. Predicted hydrophobic and transmembrane.

Authors:  P C Ng; J G Henikoff; S Henikoff
Journal:  Bioinformatics       Date:  2000-09       Impact factor: 6.937

4.  Non-symmetric score matrices and the detection of homologous transmembrane proteins.

Authors:  T Müller; S Rahmann; M Rehmsmeier
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

5.  Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method.

Authors:  Tobias Müller; Rainer Spang; Martin Vingron
Journal:  Mol Biol Evol       Date:  2002-01       Impact factor: 16.240

6.  Efficient large-scale sequence comparison by locality-sensitive hashing.

Authors:  J Buhler
Journal:  Bioinformatics       Date:  2001-05       Impact factor: 6.937

7.  Structure-derived substitution matrices for alignment of distantly related sequences.

Authors:  A Prlić; F S Domingues; M J Sippl
Journal:  Protein Eng       Date:  2000-08

8.  A local alignment metric for accelerating biosequence database search.

Authors:  Peter A Spiro; Natasa Macura
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

Review 9.  Protein database searches using compositionally adjusted substitution matrices.

Authors:  Stephen F Altschul; John C Wootton; E Michael Gertz; Richa Agarwala; Aleksandr Morgulis; Alejandro A Schäffer; Yi-Kuo Yu
Journal:  FEBS J       Date:  2005-10       Impact factor: 5.542

10.  Pairwise sequence alignment below the twilight zone.

Authors:  J D Blake; F E Cohen
Journal:  J Mol Biol       Date:  2001-03-23       Impact factor: 5.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.