A reliable sequence alignment method based on probabilities of residue correspondences.

Abstract

Probabilities of all possible correspondences of residues in aligning two proteins are evaluated by assuming that the statistical weight of each alignment is proportional to the exponent of its total similarity score. Based on such probabilities, a probability alignment that includes the most probable correspondences is proposed. In the case of highly similar sequence pairs, the probability alignments agree with the maximum similarity alignments that correspond to the alignments with the maximum similarity score. Significant correspondences in the probability alignments are those whose probabilities are > 0.5. The probability alignment method is applied to a few protein pairs, and results indicate that such highly probable correspondences in the probability alignments are probably correct correspondences that agree with the structural alignments and that incorrect correspondences in the maximum similarity alignments are usually insignificant correspondences in the probability alignments. The root mean square deviations in superimposition of corresponding residues tend to be smaller for significant correspondences in the probability alignments than for all correspondences in the maximum similarity alignments, indicating that incorrect correspondences in the maximum similarity alignments tend to be insignificant correspondences in probability alignments. This fact is also confirmed in 109 protein pairs that are similar to each other with sequence identities between 90 and 35%. In addition, the probability alignment method may better predict correct correspondences than the maximum similarity alignment method. Probability alignments do, of course, depend on a scoring scheme but are less sensitive to the value of parameters such as gap penalties. The present probability alignment method is useful for constructing reliable alignments based on the probabilities of correspondences and can be used with any scoring scheme.
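The idea in the abstract, weighting every alignment by the exponential of its score and then summing the weights of all alignments that pair residue i with residue j, can be illustrated with a forward/backward dynamic program. The following is a minimal sketch and not the paper's exact recursion: it assumes global alignment, a linear gap penalty, and a toy match/mismatch score. The function names, the `temperature` parameter, and all numeric values are hypothetical choices made for illustration. Alignments that differ only in the order of an adjacent insertion and deletion are counted separately here, which is a simplification.

```python
import math


def match_probabilities(a, b, match=2.0, mismatch=-1.0, gap=2.0, temperature=1.0):
    """Return (probs, z): probs[i][j] is the probability that a[i] pairs with b[j].

    Assumed model (illustrative only): each global alignment has statistical
    weight exp(score / temperature), with a linear gap penalty `gap`.
    """
    n, m = len(a), len(b)
    w_gap = math.exp(-gap / temperature)

    def w_pair(i, j):
        # Weight of pairing the i-th residue of a with the j-th of b (1-based).
        s = match if a[i - 1] == b[j - 1] else mismatch
        return math.exp(s / temperature)

    # fwd[i][j]: summed weight of all alignments of a[:i] with b[:j].
    fwd = [[0.0] * (m + 1) for _ in range(n + 1)]
    fwd[0][0] = 1.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            total = 0.0
            if i > 0 and j > 0:
                total += fwd[i - 1][j - 1] * w_pair(i, j)
            if i > 0:
                total += fwd[i - 1][j] * w_gap
            if j > 0:
                total += fwd[i][j - 1] * w_gap
            fwd[i][j] = total

    # bwd[i][j]: summed weight of all alignments of a[i:] with b[j:].
    bwd = [[0.0] * (m + 1) for _ in range(n + 1)]
    bwd[n][m] = 1.0
    for i in range(n, -1, -1):
        for j in range(m, -1, -1):
            if i == n and j == m:
                continue
            total = 0.0
            if i < n and j < m:
                total += w_pair(i + 1, j + 1) * bwd[i + 1][j + 1]
            if i < n:
                total += w_gap * bwd[i + 1][j]
            if j < m:
                total += w_gap * bwd[i][j + 1]
            bwd[i][j] = total

    z = fwd[n][m]  # partition function: total weight of all alignments
    probs = [
        [fwd[i - 1][j - 1] * w_pair(i, j) * bwd[i][j] / z for j in range(1, m + 1)]
        for i in range(1, n + 1)
    ]
    return probs, z


def significant_pairs(probs, threshold=0.5):
    """List (i, j, p) for correspondences whose probability exceeds the threshold."""
    return [
        (i, j, p)
        for i, row in enumerate(probs)
        for j, p in enumerate(row)
        if p > threshold
    ]


probs, z = match_probabilities("ACDEFGHIKL", "ACDFGHIKL")
for i, j, p in significant_pairs(probs):
    print(i, j, round(p, 3))
```

Because each residue pairs with at most one partner in any single alignment, the probabilities in each row and each column sum to at most 1. Any two correspondences with probability above 0.5 must therefore occur together in at least one alignment, so the set of significant correspondences is always mutually consistent.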

Year: 1995; PMID: 8771180; DOI: 10.1093/protein/8.10.999

Source DB: PubMed; Journal: Protein Eng; ISSN: 0269-2139

Cited: 31 in total

1. A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.
   Authors: Elena Rivas; Raymond Lang; Sean R Eddy
   Journal: RNA; Date: 2011-12-22; Impact factor: 4.942

2. (Review) A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA).

3. ProbCons: Probabilistic consistency-based multiple sequence alignment.

4. Centroid estimation in discrete high-dimensional spaces with applications in biology.

5. Effect of using suboptimal alignments in template-based protein structure prediction.

6. Parameters for accurate genome alignment.

7. Improving pairwise sequence alignment accuracy using near-optimal protein sequence alignments.

8. Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments.

9. Exact calculation of distributions on integers, with application to sequence alignment.

10. Genome-wide searching with base-pairing kernel functions for noncoding RNAs: computational and expression analysis of snoRNA families in Caenorhabditis elegans.
    Authors: Kensuke Morita; Yutaka Saito; Kengo Sato; Kotaro Oka; Kohji Hotta; Yasubumi Sakakibara
    Journal: Nucleic Acids Res; Date: 2009-01-07; Impact factor: 16.971