Literature DB >> 24469380

Ensemble learning prediction of protein-protein interactions using proteins functional annotations.

Indrajit Saha1, Julian Zubek, Tomas Klingström, Simon Forsberg, Johan Wikander, Marcin Kierczak, Ujjwal Maulik, Dariusz Plewczynski.   

Abstract

Protein-protein interactions are important for the majority of biological processes. A significant number of computational methods have been developed to predict protein-protein interactions using protein sequence, structural and genomic data. Vast experimental data is publicly available on the Internet, but it is scattered across numerous databases. This fact motivated us to create and evaluate new high-throughput datasets of interacting proteins. We extracted interaction data from DIP, MINT, BioGRID and IntAct databases. Then we constructed descriptive features for machine learning purposes based on data from Gene Ontology and DOMINE. Thereafter, four well-established machine learning methods: Support Vector Machine, Random Forest, Decision Tree and Naïve Bayes, were used on these datasets to build an Ensemble Learning method based on majority voting. In cross-validation experiment, sensitivity exceeded 80% and classification/prediction accuracy reached 90% for the Ensemble Learning method. We extended the experiment to a bigger and more realistic dataset maintaining sensitivity over 70%. These results confirmed that our datasets are suitable for performing PPI prediction and Ensemble Learning method is well suited for this task. Both the processed PPI datasets and the software are available at .

Entities:  

Mesh:

Year:  2014        PMID: 24469380     DOI: 10.1039/c3mb70486f

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  10 in total

Review 1.  Kernel methods for large-scale genomic data analysis.

Authors:  Xuefeng Wang; Eric P Xing; Daniel J Schaid
Journal:  Brief Bioinform       Date:  2014-07-22       Impact factor: 11.622

2.  Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae.

Authors:  Julian Zubek; Marcin Tatjewski; Adam Boniecki; Maciej Mnich; Subhadip Basu; Dariusz Plewczynski
Journal:  PeerJ       Date:  2015-07-02       Impact factor: 2.984

3.  A novel feature extraction scheme with ensemble coding for protein-protein interaction prediction.

Authors:  Xiuquan Du; Jiaxing Cheng; Tingting Zheng; Zheng Duan; Fulan Qian
Journal:  Int J Mol Sci       Date:  2014-07-18       Impact factor: 5.923

4.  A new ensemble coevolution system for detecting HIV-1 protein coevolution.

Authors:  Guangdi Li; Kristof Theys; Jens Verheyen; Andrea-Clemencia Pineda-Peña; Ricardo Khouri; Supinya Piampongsant; Mónica Eusébio; Jan Ramon; Anne-Mieke Vandamme
Journal:  Biol Direct       Date:  2015-01-07       Impact factor: 4.540

5.  An ensemble approach for large-scale identification of protein- protein interactions using the alignments of multiple sequences.

Authors:  Lei Wang; Zhu-Hong You; Xing Chen; Jian-Qiang Li; Xin Yan; Wei Zhang; Yu-An Huang
Journal:  Oncotarget       Date:  2017-01-17

6.  A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences.

Authors:  Xue Wang; Yuejin Wu; Rujing Wang; Yuanyuan Wei; Yuanmiao Gui
Journal:  PLoS One       Date:  2019-06-07       Impact factor: 3.240

7.  Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest.

Authors:  Lei Wang; Hai-Feng Wang; San-Rong Liu; Xin Yan; Ke-Jian Song
Journal:  Sci Rep       Date:  2019-07-08       Impact factor: 4.379

Review 8.  Protein-protein interaction prediction with deep learning: A comprehensive review.

Authors:  Farzan Soleymani; Eric Paquet; Herna Viktor; Wojtek Michalowski; Davide Spinello
Journal:  Comput Struct Biotechnol J       Date:  2022-09-19       Impact factor: 6.155

9.  Quo vadis computational analysis of PPI data or why the future isn't here yet.

Authors:  Konstantinos A Theofilatos; Spiros Likothanassis; Seferina Mavroudi
Journal:  Front Genet       Date:  2015-09-15       Impact factor: 4.599

10.  Detecting Protein Communities in Native Cell Extracts by Machine Learning: A Structural Biologist's Perspective.

Authors:  Fotis L Kyrilis; Jaydeep Belapure; Panagiotis L Kastritis
Journal:  Front Mol Biosci       Date:  2021-04-15
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.