Literature DB >> 30553999

A new sequence based encoding for prediction of host-pathogen protein interactions.

İrfan Kösesoy1, Murat Gök2, Cemil Öz3.   

Abstract

Pathogen-host interactions are very important to figure out the infection process at the molecular level, where pathogen proteins physically bind to human proteins to manipulate critical biological processes in the host cell. Data scarcity and data unavailability are two major problems for computational approaches in the prediction of pathogen-host interactions. Developing a computational method to predict pathogen-host interactions with high accuracy, based on protein sequences alone, is of great importance because it can eliminate these problems. In this study, we propose a novel and robust sequence based feature extraction method, named Location Based Encoding, to predict pathogen-host interactions with machine learning based algorithms. In this context, we use Bacillus Anthracis and Yersinia Pestis data sets as the pathogen organisms and human proteins as the host model to compare our method with sequence based protein encoding methods, which are widely used in the literature, namely amino acid composition, amino acid pair, and conjoint triad. We use these encoding methods with decision trees (Random Forest, j48), statistical (Bayesian Networks, Naive Bayes), and instance based (kNN) classifiers to predict pathogen-host interactions. We conduct different experiments to evaluate the effectiveness of our method. We obtain the best results among all the experiments with RF classifier in terms of F1, accuracy, MCC, and AUC.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Host–pathogen interactions; Infectious diseases; Machine learning; Protein networks; Protein–protein interactions

Mesh:

Substances:

Year:  2018        PMID: 30553999     DOI: 10.1016/j.compbiolchem.2018.12.001

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  5 in total

Review 1.  Computational Tools and Strategies to Develop Peptide-Based Inhibitors of Protein-Protein Interactions.

Authors:  Maxence Delaunay; Tâp Ha-Duong
Journal:  Methods Mol Biol       Date:  2022

2.  Prediction of host-pathogen protein interactions by extended network model.

Authors:  İrfan Kösesoy; Murat Gök; Tamer Kahveci
Journal:  Turk J Biol       Date:  2021-04-20

3.  SDNN-PPI: self-attention with deep neural network effect on protein-protein interaction prediction.

Authors:  Xue Li; Peifu Han; Gan Wang; Wenqi Chen; Shuang Wang; Tao Song
Journal:  BMC Genomics       Date:  2022-06-27       Impact factor: 4.547

4.  In silico unravelling pathogen-host signaling cross-talks via pathogen mimicry and human protein-protein interaction networks.

Authors:  Suyu Mei; Kun Zhang
Journal:  Comput Struct Biotechnol J       Date:  2019-12-27       Impact factor: 7.271

Review 5.  Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions.

Authors:  Padhmanand Sudhakar; Kathleen Machiels; Bram Verstockt; Tamas Korcsmaros; Séverine Vermeire
Journal:  Front Microbiol       Date:  2021-05-11       Impact factor: 5.640

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.