Literature DB >> 21128854

Substitution matrices of residue triplets derived from protein blocks.

Xin Liu1, Ya-Pu Zhao.   

Abstract

In protein sequence alignment, residue similarity is usually evaluated by substitution matrix, which scores all possible exchanges of one amino acid with another. Several matrices are widely used in sequence alignment, including PAM matrices derived from homologous sequence and BLOSUM matrices derived from aligned segments of BLOCKS. However, most matrices have not addressed the high-order residue-residue interactions that are vital to the bio-properties of protein. With consideration for the inherent correlation in residue triplet, we present a new scoring scheme for sequence alignment. Protein sequence is treated as overlapping and successive 3-residue segments. Two edge residues of a triplet are clustered into hydrophobic or polar categories, respectively. Protein sequence is then rewritten into triplet sequence with 2 x 20 x 2 = 80 alphabets. Using a traditional approach, we construct a new scoring scheme named TLESUM(hp) (TripLEt SUbstitution Matrices with hydrophobic and polar information) for pairwise substitution of triplets, which characterizes the similarity of residue triplets. The applications of this matrix led to marked improvements in multiple sequence alignment and in searching structurally alike residue segments. The reason for the occurrence of the "twilight zone," i.e., structure explosion of low identity sequences, is also discussed.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21128854     DOI: 10.1089/cmb.2008.0035

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  New amino acid substitution matrix brings sequence alignments into agreement with structure matches.

Authors:  Kejue Jia; Robert L Jernigan
Journal:  Proteins       Date:  2021-02-02

2.  Revisiting amino acid substitution matrices for identifying distantly related proteins.

Authors:  Kazunori Yamada; Kentaro Tomii
Journal:  Bioinformatics       Date:  2013-11-26       Impact factor: 6.937

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.