Literature DB >> 32911276

AE-LGBM: Sequence-based novel approach to detect interacting protein pairs via ensemble of autoencoder and LightGBM.

Abhibhav Sharma1, Buddha Singh2.   

Abstract

Protein-protein interactions (PPIs) play a crucial role in biological processes of living organisms. Correct prediction of PPI can prove to be extremely valuable in probing protein functions which can aid in the development of new and powerful therapies for disease prevention. Many experimental studies have been previously performed to investigate PPIs. However, in-vitro techniques to investigate PPIs are resource-extensive and time-consuming. Although various in-silico approaches to predict PPI have been developed in recent years, they could be fallible in terms of accuracy and false-positive rate. To overcome these shortcomings, we propose a novel approach, AE-LGBM to predict the PPIs more accurately. It employs LightGBM classifier and utilizes the Autoencoder, which is an artificial neural network, to efficiently produce lower-dimensional, discriminative, and noise-free features. We incorporate conjoint triad (CT) and Composition-Transition-Distribution (CTD) features into the AE-LGBM framework. On performing ten-fold cross-validation, the prediction accuracies obtained by AE-LGBM for Human and Yeast datasets are 98.7% and 95.4% respectively. AE-LGBM was further evaluated on independent datasets and has achieved excellent accuracies of 100%, 100%, 99.9%, 99.3%, 99.2% on E. coli, M. musculus, C. elegans, H. pylori and H. sapiens respectively. AE-LGBM has also obtained the best accuracy when tested over three important PPI networks namely single-core network (CD9), the multiple-core network (The Ras/Raf/MEK/ERK pathway) and the cross-connection network (Wnt Network). The outstanding generalization ability of AE-LGBM makes it a versatile, efficient, and robust PPIs prediction model.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Accuracy; Autoencoder; LightGBM; Machine learning; Neural network; PPI; Prediction model; Protein; Protein-protein interaction

Mesh:

Year:  2020        PMID: 32911276     DOI: 10.1016/j.compbiomed.2020.103964

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  4 in total

1.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

2.  SDNN-PPI: self-attention with deep neural network effect on protein-protein interaction prediction.

Authors:  Xue Li; Peifu Han; Gan Wang; Wenqi Chen; Shuang Wang; Tao Song
Journal:  BMC Genomics       Date:  2022-06-27       Impact factor: 4.547

3.  ResSUMO: A Deep Learning Architecture Based on Residual Structure for Prediction of Lysine SUMOylation Sites.

Authors:  Yafei Zhu; Yuhai Liu; Yu Chen; Lei Li
Journal:  Cells       Date:  2022-08-25       Impact factor: 7.666

4.  AutoPPI: An Ensemble of Deep Autoencoders for Protein-Protein Interaction Prediction.

Authors:  Gabriela Czibula; Alexandra-Ioana Albu; Maria Iuliana Bocicor; Camelia Chira
Journal:  Entropy (Basel)       Date:  2021-05-21       Impact factor: 2.524

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.