Literature DB >> 33340540

Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.

Sheraz Naseer1, Waqar Hussain2, Yaser Daanial Khan3, Nouman Rasool4.   

Abstract

Deep representations can be used to replace human-engineered representations, as such features are constrained by certain limitations. For the prediction of protein post-translation modifications (PTMs) sites, research community uses different feature extraction techniques applied on Pseudo amino acid compositions (PseAAC). Serine phosphorylation is one of the most important PTM as it is the most occurring, and is important for various biological functions. Creating efficient representations from large protein sequences, to predict PTM sites, is a time and resource intensive task. In this study we propose, implement and evaluate use of Deep learning to learn effective protein data representations from PseAAC to develop data driven PTM detection systems and compare the same with two human representations.. The comparisons are performed by training an xgboost based classifier using each representation. The best scores were achieved by RNN-LSTM based deep representation and CNN based representation with an accuracy score of 81.1% and 78.3% respectively. Human engineered representations scored 77.3% and 74.9% respectively. Based on these results, it is concluded that the deep features are promising feature engineering replacement to identify PhosS sites in a very efficient and accurate manner which can help scientists understand the mechanism of this modification in proteins.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Deep features; Phosphorylation; Phosphoserine; Position relevancy; Statistical moments

Mesh:

Substances:

Year:  2020        PMID: 33340540     DOI: 10.1016/j.ab.2020.114069

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  5 in total

1.  MPMABP: A CNN and Bi-LSTM-Based Method for Predicting Multi-Activities of Bioactive Peptides.

Authors:  You Li; Xueyong Li; Yuewu Liu; Yuhua Yao; Guohua Huang
Journal:  Pharmaceuticals (Basel)       Date:  2022-06-03

2.  Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.

Authors:  Sheraz Naseer; Rao Faizan Ali; Suliman Mohamed Fati; Amgad Muneer
Journal:  Sci Rep       Date:  2022-01-07       Impact factor: 4.379

3.  Identification of stress response proteins through fusion of machine learning models and statistical paradigms.

Authors:  Ebraheem Alzahrani; Wajdi Alghamdi; Malik Zaka Ullah; Yaser Daanial Khan
Journal:  Sci Rep       Date:  2021-11-05       Impact factor: 4.379

4.  Deep Transfer Network with Multi-Space Dynamic Distribution Adaptation for Bearing Fault Diagnosis.

Authors:  Xiaorong Zheng; Zhaojian Gu; Caiming Liu; Jiahao Jiang; Zhiwei He; Mingyu Gao
Journal:  Entropy (Basel)       Date:  2022-08-15       Impact factor: 2.738

5.  Deep Learning Approaches for Detection of Breast Adenocarcinoma Causing Carcinogenic Mutations.

Authors:  Asghar Ali Shah; Fahad Alturise; Tamim Alkhalifah; Yaser Daanial Khan
Journal:  Int J Mol Sci       Date:  2022-09-29       Impact factor: 6.208

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.