Literature DB >> 18369412

Splitting the BLOSUM score into numbers of biological significance.

Francesco Fabris1, Andrea Sgarro, Alessandro Tossi.   

Abstract

Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences to the protein model implicit in the BLOCKS database). This treatment sharpens the protein sequence comparison, providing a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate if a compositionally adjusted matrix could perform better.

Entities:  

Year:  2007        PMID: 18369412      PMCID: PMC3171334          DOI: 10.1155/2007/31450

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145


  20 in total

Review 1.  Molecular diversity in gene-encoded, cationic antimicrobial polypeptides.

Authors:  A Tossi; L Sandri
Journal:  Curr Pharm Des       Date:  2002       Impact factor: 3.116

2.  The compositional adjustment of amino acid substitution matrices.

Authors:  Yi-Kuo Yu; John C Wootton; Stephen F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-08       Impact factor: 11.205

3.  From analysis of protein structural alignments toward a novel approach to align protein sequences.

Authors:  Shamil R Sunyaev; Gennady A Bogopolsky; Natalia V Oleynikova; Peter K Vlasov; Alexei V Finkelstein; Mikhail A Roytberg
Journal:  Proteins       Date:  2004-02-15

4.  On the significance of sequence alignments when using multiple scoring matrices.

Authors:  Florian Frommlet; Andreas Futschik; Malgorzata Bogdan
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

5.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

6.  A generalized affine gap model significantly improves protein sequence alignment accuracy.

Authors:  Marcus A Zachariah; Gavin E Crooks; Stephen R Holbrook; Steven E Brenner
Journal:  Proteins       Date:  2005-02-01

Review 7.  Protein database searches using compositionally adjusted substitution matrices.

Authors:  Stephen F Altschul; John C Wootton; E Michael Gertz; Richa Agarwala; Aleksandr Morgulis; Alejandro A Schäffer; Yi-Kuo Yu
Journal:  FEBS J       Date:  2005-10       Impact factor: 5.542

8.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

9.  Matching sequences under deletion-insertion constraints.

Authors:  D Sankoff
Journal:  Proc Natl Acad Sci U S A       Date:  1972-01       Impact factor: 11.205

10.  A protein alignment scoring system sensitive at all evolutionary distances.

Authors:  S F Altschul
Journal:  J Mol Evol       Date:  1993-03       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.