Literature DB >> 32613242

Using deep neural networks and biological subwords to detect protein S-sulfenylation sites.

Duyen Thi Do1, Thanh Quynh Trang Le2, Nguyen Quoc Khanh Le3.   

Abstract

Protein S-sulfenylation is one kind of crucial post-translational modifications (PTMs) in which the hydroxyl group covalently binds to the thiol of cysteine. Some recent studies have shown that this modification plays an important role in signaling transduction, transcriptional regulation and apoptosis. To date, the dynamic of sulfenic acids in proteins remains unclear because of its fleeting nature. Identifying S-sulfenylation sites, therefore, could be the key to decipher its mysterious structures and functions, which are important in cell biology and diseases. However, due to the lack of effective methods, scientists in this field tend to be limited in merely a handful of some wet lab techniques that are time-consuming and not cost-effective. Thus, this motivated us to develop an in silico model for detecting S-sulfenylation sites only from protein sequence information. In this study, protein sequences served as natural language sentences comprising biological subwords. The deep neural network was consequentially employed to perform classification. The performance statistics within the independent dataset including sensitivity, specificity, accuracy, Matthews correlation coefficient and area under the curve rates achieved 85.71%, 69.47%, 77.09%, 0.5554 and 0.833, respectively. Our results suggested that the proposed method (fastSulf-DNN) achieved excellent performance in predicting S-sulfenylation sites compared to other well-known tools on a benchmark dataset.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; post-translational modification; protein function prediction; sulfenylation reaction; word embedding

Year:  2021        PMID: 32613242     DOI: 10.1093/bib/bbaa128

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  18 in total

1.  Improving language model of human genome for DNA-protein binding prediction based on task-specific pre-training.

Authors:  Hanyu Luo; Wenyu Shan; Cheng Chen; Pingjian Ding; Lingyun Luo
Journal:  Interdiscip Sci       Date:  2022-09-22       Impact factor: 3.492

2.  Fusion of text and graph information for machine learning problems on networks.

Authors:  Ilya Makarov; Mikhail Makarov; Dmitrii Kiselev
Journal:  PeerJ Comput Sci       Date:  2021-05-11

3.  Correction of out-of-focus microscopic images by deep learning.

Authors:  Chi Zhang; Hao Jiang; Weihuang Liu; Junyi Li; Shiming Tang; Mario Juhas; Yang Zhang
Journal:  Comput Struct Biotechnol J       Date:  2022-04-20       Impact factor: 6.155

4.  XGBoost Improves Classification of MGMT Promoter Methylation Status in IDH1 Wildtype Glioblastoma.

Authors:  Nguyen Quoc Khanh Le; Duyen Thi Do; Fang-Ying Chiu; Edward Kien Yee Yapp; Hui-Yuan Yeh; Cheng-Yu Chen
Journal:  J Pers Med       Date:  2020-09-15

5.  SSnet: A Deep Learning Approach for Protein-Ligand Interaction Prediction.

Authors:  Niraj Verma; Xingming Qu; Francesco Trozzi; Mohamed Elsaied; Nischal Karki; Yunwen Tao; Brian Zoltowski; Eric C Larson; Elfi Kraka
Journal:  Int J Mol Sci       Date:  2021-01-30       Impact factor: 5.923

6.  A deep learning framework combined with word embedding to identify DNA replication origins.

Authors:  Feng Wu; Runtao Yang; Chengjin Zhang; Lina Zhang
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

7.  Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis.

Authors:  Jianwei Xiao; Rongsheng Wang; Xu Cai; Zhizhong Ye
Journal:  Front Genet       Date:  2021-02-11       Impact factor: 4.599

8.  A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification.

Authors:  Nguyen Quoc Khanh Le; Duyen Thi Do; Truong Nguyen Khanh Hung; Luu Ho Thanh Lam; Tuan-Tu Huynh; Ngan Thi Kim Nguyen
Journal:  Int J Mol Sci       Date:  2020-11-28       Impact factor: 5.923

9.  Identification of potential gene signatures associated with osteosarcoma by integrated bioinformatics analysis.

Authors:  Yutao Jia; Yang Liu; Zhihua Han; Rong Tian
Journal:  PeerJ       Date:  2021-05-27       Impact factor: 2.984

10.  Improving classification based on physical surface tension-neural net for the prediction of psychosocial-risk level in public school teachers.

Authors:  Rodolfo Mosquera Navarro; Omar Danilo Castrillón; Liliana Parra Osorio; Tiago Oliveira; Paulo Novais; José Fernando Valencia
Journal:  PeerJ Comput Sci       Date:  2021-05-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.