Literature DB >> 28088356

Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier.

Lei Wang1, Zhu-Hong You2, Shi-Xiong Xia3, Feng Liu4, Xing Chen5, Xin Yan6, Yong Zhou7.   

Abstract

Protein-Protein Interactions (PPIs) are essential to most biological processes and play a critical role in most cellular functions. With the development of high-throughput biological techniques and in silico methods, a large number of PPI data have been generated for various organisms, but many problems remain unsolved. These factors promoted the development of the in silico methods based on machine learning to predict PPIs. In this study, we propose a novel method by combining ensemble Rotation Forest (RF) classifier and Discrete Cosine Transform (DCT) algorithm to predict the interactions among proteins. Specifically, the protein amino acids sequence is transformed into Position-Specific Scoring Matrix (PSSM) containing biological evolution information, and then the feature vector is extracted to present protein evolutionary information using DCT algorithm; finally, the ensemble rotation forest model is used to predict whether a given protein pair is interacting or not. When performed on Yeast and H. pylori data sets, the proposed method achieved excellent results with an average accuracy of 98.54% and 88.27%. In addition, we achieved good prediction accuracy of 98.08%, 92.75%, 98.87% and 98.72% on independent data sets (C.elegans, E.coli, H.sapiens and M.musculus). In order to further evaluate the performance of our method, we compare it with the state-of-the-art Support Vector Machine (SVM) classifier and get good results. As a web server, the source code and Yeast data sets used in this article are freely available at http://202.119.201.126:8888/DCTRF/.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Cancer; Multiple sequences alignments; Position-specific scoring matrix; Rotation forest

Mesh:

Substances:

Year:  2017        PMID: 28088356     DOI: 10.1016/j.jtbi.2017.01.003

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  13 in total

1.  Prediction of Protein-Protein Interaction Sites Using Convolutional Neural Network and Improved Data Sets.

Authors:  Zengyan Xie; Xiaoya Deng; Kunxian Shu
Journal:  Int J Mol Sci       Date:  2020-01-11       Impact factor: 5.923

2.  PPInS: a repository of protein-protein interaction sitesbase.

Authors:  Vicky Kumar; Suchismita Mahato; Anjana Munshi; Mahesh Kulharia
Journal:  Sci Rep       Date:  2018-08-20       Impact factor: 4.379

3.  Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions.

Authors:  Lei Wang; Zhu-Hong You; Xin Yan; Shi-Xiong Xia; Feng Liu; Li-Ping Li; Wei Zhang; Yong Zhou
Journal:  Sci Rep       Date:  2018-08-27       Impact factor: 4.379

4.  Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest.

Authors:  Lei Wang; Hai-Feng Wang; San-Rong Liu; Xin Yan; Ke-Jian Song
Journal:  Sci Rep       Date:  2019-07-08       Impact factor: 4.379

5.  Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions.

Authors:  Lei Wang; Zhu-Hong You; Li-Ping Li; Xin Yan; Wei Zhang
Journal:  Sci Rep       Date:  2020-04-20       Impact factor: 4.379

6.  Benchmark Evaluation of Protein-Protein Interaction Prediction Algorithms.

Authors:  Brandan Dunham; Madhavi K Ganapathiraju
Journal:  Molecules       Date:  2021-12-22       Impact factor: 4.927

7.  PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles.

Authors:  Alireza Mohammadi; Javad Zahiri; Saber Mohammadi; Mohsen Khodarahmi; Seyed Shahriar Arab
Journal:  Biol Methods Protoc       Date:  2022-03-30

8.  An Iterative Method for Predicting Essential Proteins Based on Multifeature Fusion and Linear Neighborhood Similarity.

Authors:  Xianyou Zhu; Yaocan Zhu; Yihong Tan; Zhiping Chen; Lei Wang
Journal:  Front Aging Neurosci       Date:  2022-01-24       Impact factor: 5.750

9.  Amalgamation of 3D structure and sequence information for protein-protein interaction prediction.

Authors:  Kanchan Jha; Sriparna Saha
Journal:  Sci Rep       Date:  2020-11-05       Impact factor: 4.379

10.  Study on the differentially expressed genes and signaling pathways in dermatomyositis using integrated bioinformatics method.

Authors:  Wei Liu; Wen-Jia Zhao; Yuan-Hao Wu
Journal:  Medicine (Baltimore)       Date:  2020-08-21       Impact factor: 1.817

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.