Literature DB >> 27411307

A machine-learning approach for predicting palmitoylation sites from integrated sequence-based features.

Liqi Li1, Qifa Luo1, Weidong Xiao1, Jinhui Li1, Shiwen Zhou2, Yongsheng Li3, Xiaoqi Zheng4, Hua Yang1.   

Abstract

Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to the protein transportation, organelle localization, and functions, therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein-protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is in urgent need. In this study, a support vector machine (SVM)-based method was proposed through integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and Matthews Correlation Coefficient of 0.9773 for a benchmark dataset. The result indicates the efficiency and accuracy of our method in prediction of palmitoylation sites based on protein sequences.

Entities:  

Keywords:  Amino acid physicochemical properties; K-mer amino acid composition; palmitoylation; position-specific score matrix; support vector machine-recursive feature elimination

Mesh:

Substances:

Year:  2016        PMID: 27411307     DOI: 10.1142/S0219720016500256

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  1 in total

1.  Dimensionality reduction reveals fine-scale structure in the Japanese population with consequences for polygenic risk prediction.

Authors:  Saori Sakaue; Jun Hirata; Masahiro Kanai; Ken Suzuki; Masato Akiyama; Chun Lai Too; Thurayya Arayssi; Mohammed Hammoudeh; Samar Al Emadi; Basel K Masri; Hussein Halabi; Humeira Badsha; Imad W Uthman; Richa Saxena; Leonid Padyukov; Makoto Hirata; Koichi Matsuda; Yoshinori Murakami; Yoichiro Kamatani; Yukinori Okada
Journal:  Nat Commun       Date:  2020-03-26       Impact factor: 14.919

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.