Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A structure-based method for protein sequence alignment.

Literature DB >> 15613392

A structure-based method for protein sequence alignment.

Maricel G Kann¹, Paul A Thiessen, Anna R Panchenko, Alejandro A Schäffer, Stephen F Altschul, Stephen H Bryant.

Abstract

MOTIVATION: With the continuing rapid growth of protein sequence data, protein sequence comparison methods have become the most widely used tools of bioinformatics. Among these methods are those that use position-specific scoring matrices (PSSMs) to describe protein families. PSSMs can capture information about conserved patterns within families, which can be used to increase the sensitivity of searches for related sequences. Certain types of structural information, however, are not generally captured by PSSM search methods. Here we introduce a program, Structure-based ALignment TOol (SALTO), that aligns protein query sequences to PSSMs using rules for placing and scoring gaps that are consistent with the conserved regions of domain alignments from NCBI's Conserved Domain Database.
RESULTS: In most cases, the alignment scores obtained using the local alignment version follow an extreme value distribution. SALTO's performance in finding related sequences and producing accurate alignments is similar to or better than that of IMPALA; one advantage of SALTO is that it imposes an explicit gapping model on each protein family. AVAILABILITY: A stand-alone version of the program that can generate global or local alignments is available by ftp distribution (ftp://ftp.ncbi.nih.gov/pub/SALTO/), and has been incorporated to Cn3D structure/alignment viewer. CONTACT: bryant@ncbi.nlm.nih.gov.

Entities: Chemical Species

Mesh：

Substances：
Proteins

Year: 2004 PMID： 15613392 DOI： 10.1093/bioinformatics/bti233

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

5 in total

A structure-based method for protein sequence alignment.

1. Packing defects as selectivity switches for drug-based protein inhibitors.

2. Refining multiple sequence alignments with conserved core regions.

3. Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches.

4. CORAL: aligning conserved core regions across domain families.

5. The identification of complete domains within protein sequences using accurate E-values for semi-global alignment.