Literature DB >> 17609897

Grouping of amino acids and recognition of protein structurally conserved regions by reduced alphabets of amino acids.

Jing Li1, Wei Wang.   

Abstract

Sequence alignment is a common method for finding protein structurally conserved/similar regions. However, sequence alignment is often not accurate if sequence identities between to-be-aligned sequences are less than 30%. This is because that for these sequences, different residues may play similar structural roles and they are incorrectly aligned during the sequence alignment using substitution matrix consisting of 20 types of residues. Based on the similarity of physicochemical features, residues can be clustered into a few groups. Using such simplified alphabets, the complexity of protein sequences is reduced and at the same time the key information encoded in the sequences remains. As a result, the accuracy of sequence alignment might be improved if the residues are properly clustered. Here, by using a database of aligned protein structures (DAPS), a new clustering method based on the substitution scores is proposed for the grouping of residues, and substitution matrices of residues at different levels of simplification are constructed. The validity of the reduced alphabets is confirmed by relative entropy analysis. The reduced alphabets are applied to recognition of protein structurally conserved/similar regions by sequence alignment. The results indicate that the accuracy or efficiency of sequence alignment can be improved with the optimal reduced alphabet with N around 9.

Mesh:

Substances:

Year:  2007        PMID: 17609897     DOI: 10.1007/s11427-007-0023-3

Source DB:  PubMed          Journal:  Sci China C Life Sci        ISSN: 1006-9305


  6 in total

1.  Prediction of ketoacyl synthase family using reduced amino acid alphabets.

Authors:  Wei Chen; Pengmian Feng; Hao Lin
Journal:  J Ind Microbiol Biotechnol       Date:  2011-10-26       Impact factor: 3.346

2.  Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment.

Authors:  Eric L Peterson; Jané Kondev; Julie A Theriot; Rob Phillips
Journal:  Bioinformatics       Date:  2009-04-07       Impact factor: 6.937

Review 3.  Research progress of reduced amino acid alphabets in protein analysis and prediction.

Authors:  Yuchao Liang; Siqi Yang; Lei Zheng; Hao Wang; Jian Zhou; Shenghui Huang; Lei Yang; Yongchun Zuo
Journal:  Comput Struct Biotechnol J       Date:  2022-07-04       Impact factor: 6.155

4.  Prediction of O-glycosylation Sites Using Random Forest and GA-Tuned PSO Technique.

Authors:  Hebatallah Hassan; Amr Badr; M B Abdelhalim
Journal:  Bioinform Biol Insights       Date:  2015-07-05

5.  dagLogo: An R/Bioconductor package for identifying and visualizing differential amino acid group usage in proteomics data.

Authors:  Jianhong Ou; Haibo Liu; Niraj K Nirala; Alexey Stukalov; Usha Acharya; Michael R Green; Lihua Julie Zhu
Journal:  PLoS One       Date:  2020-11-06       Impact factor: 3.240

6.  Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy.

Authors:  Jian Zhou; Suling Bo; Hao Wang; Lei Zheng; Pengfei Liang; Yongchun Zuo
Journal:  Front Cell Dev Biol       Date:  2021-07-16
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.