Literature DB >> 18533108

Predicting N-terminal acetylation based on feature selection method.

Yu-Dong Cai1, Lin Lu.   

Abstract

Methionine aminopeptidase and N-terminal acetyltransferase are two enzymes that contribute most to the N-terminal acetylation, which has long been recognized as a frequent and important kind of co-translational modifications [R.A. Bradshaw, W.W. Brickey, K.W. Walker, N-terminal processing: the methionine aminopeptidase and N alpha-acetyl transferase families, Trends Biochem. Sci. 23 (1998) 263-267]. The combined action of these two enzymes leads to two types of N-terminal acetylated proteins that are with/without the initiator methionine after the N-terminal acetylation. To accurately predict these two types of N-terminal acetylation, a new method based on feature selection has been developed. 1047 N-terminal acetylated and non-acetylated decapeptides retrieved from Swiss-Prot database (http://cn.expasy.org) are encoded into feature vectors by amino acid properties collected in Amino Acid Index database (http://www.genome.jp/aaindex). The Maximum Relevance Minimum Redundancy method (mRMR) combining with Incremental Feature Selection (IFS) and Feature Forward Selection (FFS) is then applied to extract informative features. Nearest Neighbor Algorithm (NNA) is used to build prediction models. Tested by Jackknife Cross-Validation, the correct rate of predictors reach 91.34% and 75.49% for each type, which are both better than that of 84.41% and 62.99% acquired by using motif methods [S. Huang, R.C. Elliott, P.S. Liu, R.K. Koduri, J.L. Weickmann, J.H. Lee, L.C. Blair, P. Ghosh-Dastidar, R.A. Bradshaw, K.M. Bryan, et al., Specificity of cotranslational amino-terminal processing of proteins in yeast, Biochemistry 26 (1987) 8242-8246; R. Yamada, R.A. Bradshaw, Rat liver polysome N alpha-acetyltransferase: substrate specificity, Biochemistry 30 (1991) 1017-1021]. Furthermore, the analysis of the informative features indicates that at least six downstream residues might have effect on the rules that guide the N-terminal acetylation, besides the penultimate residue. The software is available upon request.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18533108     DOI: 10.1016/j.bbrc.2008.05.143

Source DB:  PubMed          Journal:  Biochem Biophys Res Commun        ISSN: 0006-291X            Impact factor:   3.575


  10 in total

1.  Prediction of interactiveness of proteins and nucleic acids based on feature selections.

Authors:  YouLang Yuan; XiaoHe Shi; XinLei Li; WenCong Lu; YuDong Cai; Lei Gu; Liang Liu; MinJie Li; XiangYin Kong; Meng Xing
Journal:  Mol Divers       Date:  2009-10-09       Impact factor: 2.943

Review 2.  Protein acetylation in archaea, bacteria, and eukaryotes.

Authors:  Jörg Soppa
Journal:  Archaea       Date:  2010-09-16       Impact factor: 3.273

3.  Prediction and analysis of protein hydroxyproline and hydroxylysine.

Authors:  Le-Le Hu; Shen Niu; Tao Huang; Kai Wang; Xiao-He Shi; Yu-Dong Cai
Journal:  PLoS One       Date:  2010-12-31       Impact factor: 3.240

4.  Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property.

Authors:  Tao Huang; Lei Chen; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-09-28       Impact factor: 3.240

5.  Prediction of RNA-binding proteins by voting systems.

Authors:  C R Peng; L Liu; B Niu; Y L Lv; M J Li; Y L Yuan; Y B Zhu; W C Lu; Y D Cai
Journal:  J Biomed Biotechnol       Date:  2011-07-26

6.  PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.

Authors:  Huilin Wang; Mingjun Wang; Hao Tan; Yuan Li; Ziding Zhang; Jiangning Song
Journal:  PLoS One       Date:  2014-08-22       Impact factor: 3.240

7.  Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features.

Authors:  Yuan Li; Mingjun Wang; Huilin Wang; Hao Tan; Ziding Zhang; Geoffrey I Webb; Jiangning Song
Journal:  Sci Rep       Date:  2014-07-21       Impact factor: 4.379

8.  Prediction of neddylation sites from protein sequences and sequence-derived properties.

Authors:  Ahmet Sinan Yavuz; Namık Berk Sözer; Osman Uğur Sezerman
Journal:  BMC Bioinformatics       Date:  2015-12-09       Impact factor: 3.169

9.  Position-specific analysis and prediction for protein lysine acetylation based on multiple features.

Authors:  Sheng-Bao Suo; Jian-Ding Qiu; Shao-Ping Shi; Xing-Yu Sun; Shu-Yun Huang; Xiang Chen; Ru-Ping Liang
Journal:  PLoS One       Date:  2012-11-16       Impact factor: 3.240

10.  Discriminating between lysine sumoylation and lysine acetylation using mRMR feature selection and analysis.

Authors:  Ning Zhang; You Zhou; Tao Huang; Yu-Chao Zhang; Bi-Qing Li; Lei Chen; Yu-Dong Cai
Journal:  PLoS One       Date:  2014-09-15       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.