Literature DB >> 31472418

Efficient utilization on PSSM combining with recurrent neural network for membrane protein types prediction.

Shunfang Wang1, Mingyuan Li2, Lei Guo2, Zicheng Cao3, Yu Fei4.   

Abstract

Position-Specific Scoring Matrix (PSSM) is an excellent feature extraction method that was proposed early in protein classifying prediction, but within the restriction of feature shape in PSSM, researchers make a lot attempts to process it so that PSSM can be input to the traditional machine learning algorithms. These processes drop information provided by PSSM in a way thus the feature representation is limited. Moreover, the high-dimensional feature representation of PSSM makes it incompatible with other feature extraction methods. We use the PSSM as the input of Recurrent Neural Network without any post-processing, the amino acids in protein sequences are regarded as time step in RNN. This way takes full advantage of the information that PSSM provides. In this study, the PSSM is input to the model directly and the internal information of PSSM is fully utilized, we propose an end-to-end solution and achieve state-of-the-art performance. Ultimately, the exploration of how to combine PSSM with traditional feature extraction methods is carried out and achieve slightly improved performance. Our network architecture is implemented in Python and is available at https://github.com/YellowcardD/RNN-for-membrane-protein-types-prediction.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Deep learning; Long short-term memory; Membrane protein types prediction; Position-Specific scoring matrix

Mesh:

Substances:

Year:  2019        PMID: 31472418     DOI: 10.1016/j.compbiolchem.2019.107094

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  4 in total

1.  Gene prediction of aging-related diseases based on DNN and Mashup.

Authors:  Junhua Ye; Shunfang Wang; Xin Yang; Xianjun Tang
Journal:  BMC Bioinformatics       Date:  2021-12-17       Impact factor: 3.169

2.  BERT-PPII: The Polyproline Type II Helix Structure Prediction Model Based on BERT and Multichannel CNN.

Authors:  Chuang Feng; Zhen Wang; Guokun Li; Xiaohan Yang; Nannan Wu; Lei Wang
Journal:  Biomed Res Int       Date:  2022-08-24       Impact factor: 3.246

3.  Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.

Authors:  Shunfang Wang; Lin Deng; Xinnan Xia; Zicheng Cao; Yu Fei
Journal:  BMC Bioinformatics       Date:  2021-06-23       Impact factor: 3.169

4.  ACEP: improving antimicrobial peptides recognition through automatic feature fusion and amino acid embedding.

Authors:  Haoyi Fu; Zicheng Cao; Mingyuan Li; Shunfang Wang
Journal:  BMC Genomics       Date:  2020-08-28       Impact factor: 3.969

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.