Literature DB >> 30736002

A Deep Learning Framework for Identifying Essential Proteins by Integrating Multiple Types of Biological Information.

Min Zeng, Min Li, Zhihui Fei, Fang-Xiang Wu, Yaohang Li, Yi Pan, Jianxin Wang.   

Abstract

Computational methods including centrality and machine learning-based methods have been proposed to identify essential proteins for understanding the minimum requirements of the survival and evolution of a cell. In centrality methods, researchers are required to design a score function which is based on prior knowledge, yet is usually not sufficient to capture the complexity of biological information. In machine learning-based methods, some selected biological features cannot represent the complete properties of biological information as they lack a computational framework to automatically select features. To tackle these problems, we propose a deep learning framework to automatically learn biological features without prior knowledge. We use node2vec technique to automatically learn a richer representation of protein-protein interaction (PPI) network topologies than a score function. Bidirectional long short term memory cells are applied to capture non-local relationships in gene expression data. For subcellular localization information, we exploit a high dimensional indicator vector to characterize their feature. To evaluate the performance of our method, we tested it on PPI network of S. cerevisiae. Our experimental results demonstrate that the performance of our method is better than traditional centrality methods and is superior to existing machine learning-based methods. To explore which of the three types of biological information is the most vital element, we conduct an ablation study by removing each component in turn. Our results show that the PPI network embedding contributes most to the improvement. In addition, gene expression profiles and subcellular localization information are also helpful to improve the performance in identification of essential proteins.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 30736002     DOI: 10.1109/TCBB.2019.2897679

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  10 in total

1.  Multi-view feature selection for identifying gene markers: a diversified biological data driven approach.

Authors:  Sudipta Acharya; Laizhong Cui; Yi Pan
Journal:  BMC Bioinformatics       Date:  2020-12-30       Impact factor: 3.169

2.  ProB-Site: Protein Binding Site Prediction Using Local Features.

Authors:  Sharzil Haris Khan; Hilal Tayara; Kil To Chong
Journal:  Cells       Date:  2022-07-05       Impact factor: 7.666

3.  Predicting Microbe-Disease Association by Learning Graph Representations and Rule-Based Inference on the Heterogeneous Network.

Authors:  Xiujuan Lei; Yueyue Wang
Journal:  Front Microbiol       Date:  2020-04-15       Impact factor: 5.640

4.  A novel essential protein identification method based on PPI networks and gene expression data.

Authors:  Jiancheng Zhong; Chao Tang; Wei Peng; Minzhu Xie; Yusui Sun; Qiang Tang; Qiu Xiao; Jiahong Yang
Journal:  BMC Bioinformatics       Date:  2021-05-13       Impact factor: 3.169

5.  gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network.

Authors:  Li Wang; Cheng Zhong
Journal:  BMC Bioinformatics       Date:  2022-01-04       Impact factor: 3.169

6.  A deep learning framework for identifying essential proteins based on multiple biological information.

Authors:  Yi Yue; Chen Ye; Pei-Yun Peng; Hui-Xin Zhai; Iftikhar Ahmad; Chuan Xia; Yun-Zhi Wu; You-Hua Zhang
Journal:  BMC Bioinformatics       Date:  2022-08-04       Impact factor: 3.307

Review 7.  Bacterial genome reductions: Tools, applications, and challenges.

Authors:  Nicole LeBlanc; Trevor C Charles
Journal:  Front Genome Ed       Date:  2022-08-31

8.  A consensus multi-view multi-objective gene selection approach for improved sample classification.

Authors:  Sudipta Acharya; Laizhong Cui; Yi Pan
Journal:  BMC Bioinformatics       Date:  2020-09-17       Impact factor: 3.169

9.  DeepHE: Accurately predicting human essential genes based on deep learning.

Authors:  Xue Zhang; Wangxin Xiao; Weijia Xiao
Journal:  PLoS Comput Biol       Date:  2020-09-16       Impact factor: 4.475

10.  Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis.

Authors:  Dongfang Jia; Cheng Chen; Chen Chen; Fangfang Chen; Ningrui Zhang; Ziwei Yan; Xiaoyi Lv
Journal:  Front Genet       Date:  2021-05-17       Impact factor: 4.599

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.