A structure-based method for protein sequence alignment.

Maricel G Kann, Paul A Thiessen, Anna R Panchenko, Alejandro A Schäffer, Stephen F Altschul, Stephen H Bryant.

Abstract

MOTIVATION: With the continuing rapid growth of protein sequence data, protein sequence comparison methods have become the most widely used tools of bioinformatics. Among these methods are those that use position-specific scoring matrices (PSSMs) to describe protein families. PSSMs can capture information about conserved patterns within families, which can be used to increase the sensitivity of searches for related sequences. Certain types of structural information, however, are not generally captured by PSSM search methods. Here we introduce a program, Structure-based ALignment TOol (SALTO), that aligns protein query sequences to PSSMs using rules for placing and scoring gaps that are consistent with the conserved regions of domain alignments from NCBI's Conserved Domain Database.
RESULTS: In most cases, the alignment scores obtained using the local alignment version follow an extreme value distribution. SALTO's performance in finding related sequences and producing accurate alignments is similar to or better than that of IMPALA; one advantage of SALTO is that it imposes an explicit gapping model on each protein family.
AVAILABILITY: A stand-alone version of the program that can generate global or local alignments is available by ftp distribution (ftp://ftp.ncbi.nih.gov/pub/SALTO/), and has been incorporated into the Cn3D structure/alignment viewer.
CONTACT: bryant@ncbi.nlm.nih.gov
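
ILLUSTRATION: The abstract does not spell out SALTO's algorithm, so the following is only a minimal sketch of the general idea of a block-based gapping model, not the published method. It assumes that a family's PSSM is split into conserved blocks, that no gaps are allowed inside a block, and that unscored loops of any length may fall between consecutive blocks. The names `block_score` and `align_blocks`, the `missing` score, and the toy matrices are all hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical representation: a block is a list of PSSM columns,
# each column mapping a residue letter to its score.
Block = List[Dict[str, float]]


def block_score(block: Block, query: str, start: int, missing: float = -1.0) -> float:
    """Ungapped score of one block placed at query[start:start + len(block)]."""
    return sum(col.get(query[start + i], missing) for i, col in enumerate(block))


def align_blocks(query: str, blocks: Sequence[Block]) -> Tuple[float, List[int]]:
    """Place every block, in order and without overlap, to maximize the total score.

    Assumption: gaps occur only between blocks and carry no penalty.
    Returns the best total score and the start position of each block.
    """
    if not blocks:
        raise ValueError("at least one block is required")
    n = len(query)
    neg_inf = float("-inf")
    score: List[List[float]] = []  # score[k][p]: best total with block k starting at p
    back: List[List[int]] = []     # back[k][p]: chosen start of block k - 1

    for k, block in enumerate(blocks):
        row = [neg_inf] * (n + 1)
        ptr = [-1] * (n + 1)
        # Running maximum over all admissible placements of the previous block.
        best_prev, best_prev_start = (0.0, -1) if k == 0 else (neg_inf, -1)
        for p in range(0, n - len(block) + 1):
            if k > 0:
                q = p - len(blocks[k - 1])  # latest start that ends before p
                if q >= 0 and score[k - 1][q] > best_prev:
                    best_prev, best_prev_start = score[k - 1][q], q
            if best_prev > neg_inf:
                row[p] = best_prev + block_score(block, query, p)
                ptr[p] = best_prev_start
        score.append(row)
        back.append(ptr)

    last = score[-1]
    end = max(range(n + 1), key=lambda p: last[p])
    if last[end] == neg_inf:
        raise ValueError("query is too short to hold all blocks")

    # Trace back the start position of each block.
    starts = [end]
    for k in range(len(blocks) - 1, 0, -1):
        starts.append(back[k][starts[-1]])
    starts.reverse()
    return last[end], starts


# Toy example: two 2-column blocks with made-up scores.
toy_blocks = [
    [{"W": 5.0}, {"C": 5.0}],
    [{"H": 5.0}, {"K": 5.0}],
]
best, positions = align_blocks("AAWCAAHKAA", toy_blocks)
```

In this toy run the first block lands on `WC` and the second on `HK`, with the intervening residues treated as an unscored loop. A fuller model would attach family-specific penalties or length limits to each loop and would need a local variant to score partial matches.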

Year:  2004        PMID: 15613392     DOI: 10.1093/bioinformatics/bti233

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Packing defects as selectivity switches for drug-based protein inhibitors.

Authors:  Ariel Fernández; Ridgway Scott; R Stephen Berry
Journal:  Proc Natl Acad Sci U S A       Date:  2005-12-30       Impact factor: 11.205

2.  Refining multiple sequence alignments with conserved core regions.

Authors:  Saikat Chakrabarti; Christopher J Lanczycki; Anna R Panchenko; Teresa M Przytycka; Paul A Thiessen; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2006-05-17       Impact factor: 16.971

3.  Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches.

Authors:  Yi-Kuo Yu; E Michael Gertz; Richa Agarwala; Alejandro A Schäffer; Stephen F Altschul
Journal:  Nucleic Acids Res       Date:  2006-10-26       Impact factor: 16.971

4.  CORAL: aligning conserved core regions across domain families.

Authors:  Jessica H Fong; Aron Marchler-Bauer
Journal:  Bioinformatics       Date:  2009-05-26       Impact factor: 6.937

5.  The identification of complete domains within protein sequences using accurate E-values for semi-global alignment.

Authors:  Maricel G Kann; Sergey L Sheetlin; Yonil Park; Stephen H Bryant; John L Spouge
Journal:  Nucleic Acids Res       Date:  2007-06-27       Impact factor: 16.971
