Literature DB >> 31787076

DeepEP: a deep learning framework for identifying essential proteins.

Min Zeng1, Min Li2, Fang-Xiang Wu3, Yaohang Li4, Yi Pan5.   

Abstract

BACKGROUND: Essential proteins are crucial for cellular life and thus, identification of essential proteins is an important topic and a challenging problem for researchers. Recently lots of computational approaches have been proposed to handle this problem. However, traditional centrality methods cannot fully represent the topological features of biological networks. In addition, identifying essential proteins is an imbalanced learning problem; but few current shallow machine learning-based methods are designed to handle the imbalanced characteristics.
RESULTS: We develop DeepEP based on a deep learning framework that uses the node2vec technique, multi-scale convolutional neural networks and a sampling technique to identify essential proteins. In DeepEP, the node2vec technique is applied to automatically learn topological and semantic features for each protein in protein-protein interaction (PPI) network. Gene expression profiles are treated as images and multi-scale convolutional neural networks are applied to extract their patterns. In addition, DeepEP uses a sampling method to alleviate the imbalanced characteristics. The sampling method samples the same number of the majority and minority samples in a training epoch, which is not biased to any class in training process. The experimental results show that DeepEP outperforms traditional centrality methods. Moreover, DeepEP is better than shallow machine learning-based methods. Detailed analyses show that the dense vectors which are generated by node2vec technique contribute a lot to the improved performance. It is clear that the node2vec technique effectively captures the topological and semantic properties of PPI network. The sampling method also improves the performance of identifying essential proteins.
CONCLUSION: We demonstrate that DeepEP improves the prediction performance by integrating multiple deep learning techniques and a sampling method. DeepEP is more effective than existing methods.

Entities:  

Keywords:  Deep learning; Identifying essential proteins; Imbalanced learning; Multi-scale convolutional neural networks; Protein-protein interaction network; node2vec

Year:  2019        PMID: 31787076     DOI: 10.1186/s12859-019-3076-y

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  4 in total

1.  Deep Learning-Powered Prediction of Human-Virus Protein-Protein Interactions.

Authors:  Xiaodi Yang; Shiping Yang; Panyu Ren; Stefan Wuchty; Ziding Zhang
Journal:  Front Microbiol       Date:  2022-04-15       Impact factor: 6.064

2.  A deep learning framework for identifying essential proteins based on multiple biological information.

Authors:  Yi Yue; Chen Ye; Pei-Yun Peng; Hui-Xin Zhai; Iftikhar Ahmad; Chuan Xia; Yun-Zhi Wu; You-Hua Zhang
Journal:  BMC Bioinformatics       Date:  2022-08-04       Impact factor: 3.307

3.  Inference of pan-cancer related genes by orthologs matching based on enhanced LSTM model.

Authors:  Chao Wang; Houwang Zhang; Haishu Ma; Yawen Wang; Ke Cai; Tingrui Guo; Yuanhang Yang; Zhen Li; Yuan Zhu
Journal:  Front Microbiol       Date:  2022-10-04       Impact factor: 6.064

4.  DeepHE: Accurately predicting human essential genes based on deep learning.

Authors:  Xue Zhang; Wangxin Xiao; Weijia Xiao
Journal:  PLoS Comput Biol       Date:  2020-09-16       Impact factor: 4.475

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.