Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Multiple sequence alignment.

Literature DB >> 3806669

Multiple sequence alignment.

Abstract

A method has been developed for aligning segments of several sequences at once. The number of search steps depends only polynomially on the number of sequences, instead of exponentially, because most alignments are rejected without being evaluated explicitly. A data structure herein called the "heap" facilitates this process. For a set of n sequence segments, the overall similarity is taken to be the sum of all the constituent segment pair similarities, which are in turn sums of corresponding residue similarity scores from a Table. The statistical models that test alignments for significance make it possible to group sequences objectively, even when most or all of the interrelationships are weak. These tests are very sensitive, while remaining quite conservative, and discourage the addition of "misfit" sequences to an existing set. The new techniques are applied to a set of five DNA-binding proteins, to a group of three enzymes that employ the coenzyme FAD, and to a control set. The alignment previously proposed for the DNA-binding proteins on the basis of structural comparisons and inspection of sequences is supported quite dramatically, and a highly significant alignment is found for the FAD-binding proteins.

Entities: Chemical

Mesh：

Substances：

Year: 1986 PMID： 3806669 DOI： 10.1016/0022-2836(86)90252-4

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Keyword Cloud
Cited

27 in total

Multiple sequence alignment.

1. TBC: a clustering algorithm based on prokaryotic taxonomy.

2. An assessment of substitution scores for protein profile-profile comparison.

3. Compositional adjustment of Dirichlet mixture priors.

4. A survey of multiple sequence comparison methods.

5. A comparison of several similarity indices used in the classification of protein sequences: a multivariate analysis.

6. An efficient algorithm for identifying matches with errors in multiple long molecular sequences.

7. Modelling of peptide and protein structures.

8. Efficient methods for multiple sequence alignment with guaranteed error bounds.

9. A multiple sequence comparison method.

10. Self-organizing hierarchic networks for pattern recognition in protein sequence.