Literature DB >> 28292250

Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method.

JianZhao Gao1, Xue-Wen Tao2, Jia Zhao3, Yuan-Ming Feng2, Yu-Dong Cai4, Ning Zhang2.   

Abstract

AIM AND
OBJECTIVE: Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction.
MATERIAL AND METHODS: Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed.
RESULTS: Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites.
CONCLUSION: We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

Entities:  

Keywords:  Acetylation; dagging; epsilon lysine acetylation site; incrementalzzm321990feature selection; maximum relevance minimum redundancy; post-translational modification

Mesh:

Substances:

Year:  2017        PMID: 28292250     DOI: 10.2174/1386207320666170314093216

Source DB:  PubMed          Journal:  Comb Chem High Throughput Screen        ISSN: 1386-2073            Impact factor:   1.339


  1 in total

1.  SeqSVM: A Sequence-Based Support Vector Machine Method for Identifying Antioxidant Proteins.

Authors:  Lei Xu; Guangmin Liang; Shuhua Shi; Changrui Liao
Journal:  Int J Mol Sci       Date:  2018-06-15       Impact factor: 5.923

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.