Literature DB >> 35066812

Multiple Protein Subcellular Locations Prediction Based on Deep Convolutional Neural Networks with Self-Attention Mechanism.

Hanhan Cong1,2, Hong Liu3,4, Yi Cao5,6, Yuehui Chen5,6, Cheng Liang1.   

Abstract

As an important research field in bioinformatics, protein subcellular location prediction is critical to reveal the protein functions and provide insightful information for disease diagnosis and drug development. Predicting protein subcellular locations remains a challenging task due to the difficulty of finding representative features and robust classifiers. Many feature fusion methods have been widely applied to tackle the above issues. However, they still suffer from accuracy loss due to feature redundancy. Furthermore, multiple protein subcellular locations prediction is more complicated since it is fundamentally a multi-label classification problem. The traditional binary classifiers or even multi-class classifiers cannot achieve satisfactory results. This paper proposes a novel method for protein subcellular location prediction with both single and multiple sites based on deep convolutional neural networks. Specifically, we first obtain the integrated features by simultaneously considering the pseudo amino acid, amino acid index distribution, and physicochemical property. We then adopt deep convolutional neural networks to extract high-dimensional features from the fused feature, removing the redundant preliminary features and gaining better representations of the raw sequences. Moreover, we use the self-attention mechanism and a customized loss function to ensure that the model is more inclined to positive data. In addition, we use random k-label sets to reduce the number of prediction labels. Meanwhile, we employ a hybrid strategy of over-sampling and under-sampling to tackle the data imbalance problem. We compare our model with three representative classification alternatives. The experiment results show that our model achieves the best performance in terms of accuracy, demonstrating the efficacy of the proposed model.
© 2021. International Association of Scientists in the Interdisciplinary Areas.

Entities:  

Keywords:  Attention mechanism; Deep convolutional neural networks; Multi-label classification; Protein subcellular localization; Random k-label sets

Mesh:

Substances:

Year:  2022        PMID: 35066812     DOI: 10.1007/s12539-021-00496-7

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   2.233


  33 in total

1.  DeepGOPlus: improved protein function prediction from sequence.

Authors:  Maxat Kulmanov; Robert Hoehndorf
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

2.  Genome-wide analysis of RNA and protein localization and local translation in mESC-derived neurons.

Authors:  Katarzyna A Ludwik; Nicolai von Kuegelgen; Marina Chekulaeva
Journal:  Methods       Date:  2019-02-08       Impact factor: 3.608

3.  Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.

Authors:  Hang Zhou; Yang Yang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

4.  Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC.

Authors:  Yinan Shen; Jijun Tang; Fei Guo
Journal:  J Theor Biol       Date:  2018-11-16       Impact factor: 2.691

5.  Proteomic analysis and prediction of human phosphorylation sites in subcellular level reveal subcellular specificity.

Authors:  Xiang Chen; Shao-Ping Shi; Sheng-Bao Suo; Hao-Dong Xu; Jian-Ding Qiu
Journal:  Bioinformatics       Date:  2014-09-17       Impact factor: 6.937

6.  Subcellular localization prediction of apoptosis proteins based on evolutionary information and support vector machine.

Authors:  Qilin Xiang; Bo Liao; Xianhong Li; Huimin Xu; Jing Chen; Zhuoxing Shi; Qi Dai; Yuhua Yao
Journal:  Artif Intell Med       Date:  2017-05-24       Impact factor: 5.326

7.  Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.

Authors:  Shengli Zhang; Xin Duan
Journal:  J Theor Biol       Date:  2017-10-31       Impact factor: 2.691

Review 8.  How the Local Environment of Functional Sites Regulates Protein Function.

Authors:  Karine Mazmanian; Karen Sargsyan; Carmay Lim
Journal:  J Am Chem Soc       Date:  2020-05-19       Impact factor: 15.419

9.  A subcellular map of the human proteome.

Authors:  Peter J Thul; Lovisa Åkesson; Mikaela Wiking; Diana Mahdessian; Aikaterini Geladaki; Hammou Ait Blal; Tove Alm; Anna Asplund; Lars Björk; Lisa M Breckels; Anna Bäckström; Frida Danielsson; Linn Fagerberg; Jenny Fall; Laurent Gatto; Christian Gnann; Sophia Hober; Martin Hjelmare; Fredric Johansson; Sunjae Lee; Cecilia Lindskog; Jan Mulder; Claire M Mulvey; Peter Nilsson; Per Oksvold; Johan Rockberg; Rutger Schutten; Jochen M Schwenk; Åsa Sivertsson; Evelina Sjöstedt; Marie Skogs; Charlotte Stadler; Devin P Sullivan; Hanna Tegel; Casper Winsnes; Cheng Zhang; Martin Zwahlen; Adil Mardinoglu; Fredrik Pontén; Kalle von Feilitzen; Kathryn S Lilley; Mathias Uhlén; Emma Lundberg
Journal:  Science       Date:  2017-05-11       Impact factor: 47.728

10.  A robust method for protein depletion based on gene editing.

Authors:  Yanhui Yang; Wenyi Wu; Tong Liu; Lijun Dong; Hetian Lei
Journal:  Methods       Date:  2021-03-08       Impact factor: 3.608

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.