Literature DB >> 26733452

Path2PPI: an R package to predict protein-protein interaction networks for a set of proteins.

Oliver Philipp1, Heinz D Osiewacz2, Ina Koch3.   

Abstract

UNLABELLED: : We introduce Path2PPI, a new R package to identify protein-protein interaction (PPI) networks for fully sequenced organisms for which nearly none PPI are known. Path2PPI predicts PPI networks based on sets of proteins from well-established model organisms, providing an intuitive visualization and usability. It can be used to combine and transfer information of a certain pathway or biological process from several reference organisms to one target organism.
AVAILABILITY AND IMPLEMENTATION: Path2PPI is an open-source tool implemented in R. It can be obtained from the Bioconductor project: http://bioconductor.org/packages/Path2PPI/ CONTACT: : ina.koch@bioinformatik.uni-frankfurt.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26733452      PMCID: PMC4848400          DOI: 10.1093/bioinformatics/btv765

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Plenty of databases exist which contain protein–protein interaction (PPI) data for various organisms (e.g. Chatr-aryamontri ; Franceschini ). For some well-established model organisms, species-specific data repositories are available (e.g. Güldener ; Prasad ) providing also PPI data. In contrast, for the majority of organisms such a comprehensive amount of PPI data is not available. Therefore, different approaches have been developed to predict PPIs. Some of these approaches aim to deduce new interactions from known PPIs by means of homology-based mapping based on sequence similarity. Other methods apply supervised learning to filter and score the predicted interactions, using additional biological data, e.g. functional annotation, co-expression and / or text-mining data (Yu ; for a recent review see Rao ). Approaches that predict PPIs based on sequence data or network topology information often provide only precomputed data (Franceschini ; Pesch and Zimmer, 2013) or predict interactions only for a set of predefined organisms (Deng ; Wiles ). Furthermore, most of the methods do not supply information about the underlying reference interactions. In the majority, only the scores are provided to validate a predicted interaction. Often it is necessary to easily access the entire underlying information about the predicted interactions since the interpretation and experimental validation is one of the most important steps after the prediction. As we were interested in aging processes and interaction networks of age-related pathways in the fungal model organism Podospora anserina (Osiewacz ; Philipp ), we found only a few data repositories for interaction data. For example, the KEGG database (Kanehisa ) provides small subnetworks of some selected, mainly metabolic, pathways. Recently, also the STRING database (e.g. Franceschini ) involves some predicted interactions for P.anserina. Nevertheless, there was no satisfactory solution and no easy and fast way to directly gain knowledge about proteins and their interactions of certain biological processes, which are well established in some model organisms, but nearly unknown in the target organism. Homology-based tools which theoretically enable to predict or transfer interactions between species are mostly implemented for a set of predefined organisms or require pairs of proteins in the target organism to decide whether they may interact (Chen ; Murakami and Mizuguchi, 2014). Here, we report the implementation of Path2PPI which helps finding proteins and interactions of certain pathways or biological processes in each fully sequenced organism without the need for predefinition of putative proteins and interactions.

2 Features

Using Path2PPI, the user can choose up to seven of the most established model organisms (human, mouse, rat, yeast, E.coli, C.elegans and D.melanogaster). Based on sets of proteins from these reference species Path2PPI uses the interaction repository iRefIndex (Razick ) to find the corresponding relevant interactions. We implemented a more flexible and comfortable search engine than provided by the iRefR package (Mora and Donaldson, 2011). Additionally, Path2PPI requires results of NCBI BLAST+ (Camacho ) searches of all reference species against the target species. Based on these data, Path2PPI computes new interactions in the target species and scores them. The score is based on the degree of homology and the number of reference species which show the corresponding interaction. A major advantage of Path2PPI is the easy access to the underlying reference interactions, i.e. all information provided by iRefIndex, e.g. source database, interaction type and reference publication. Based on the igraph package (Csardi and Nepusz, 2006) the computed PPI can directly be visualized in R (see Fig. 1).
Fig. 1.

The predicted PPI network of autophagy induction in P.anserina based on the corresponding PPIs in human and yeast. (a) The predicted PPI network (normal view). The edge thickness corresponds to the scores and the number of reference species showing the interaction. (b) Detailed view of the PPI network. Each edge is specifically colored (see legend), indicating in which reference species this interaction occurs. Multiple edges represent multiple findings of an interaction. That means, that e.g. the interaction of the proteins ‘B2AT71’ and ‘B2AE79’ was found six times in yeast (blue edges) and two times in human (green edges). (c) Hybrid network representation of the predicted PPI: The relevant parts of the PPI networks of the reference species are included together with the predicted PPI network of the target species. Interactions are depicted as solid lines in the respective color (see legend). Homologous relations of the reference proteins to those of the target species are drawn as dotted edges in the respective color of the target node

The predicted PPI network of autophagy induction in P.anserina based on the corresponding PPIs in human and yeast. (a) The predicted PPI network (normal view). The edge thickness corresponds to the scores and the number of reference species showing the interaction. (b) Detailed view of the PPI network. Each edge is specifically colored (see legend), indicating in which reference species this interaction occurs. Multiple edges represent multiple findings of an interaction. That means, that e.g. the interaction of the proteins ‘B2AT71’ and ‘B2AE79’ was found six times in yeast (blue edges) and two times in human (green edges). (c) Hybrid network representation of the predicted PPI: The relevant parts of the PPI networks of the reference species are included together with the predicted PPI network of the target species. Interactions are depicted as solid lines in the respective color (see legend). Homologous relations of the reference proteins to those of the target species are drawn as dotted edges in the respective color of the target node

3 Implementation

Path2PPI can be obtained from the Bioconductor project (Huber ). It contains a comprehensive tutorial and for the case study, data files necessary to predict interactions of the induction step of autophagy in P.anserina by means of the corresponding PPIs in human and yeast. There are three types of visualization methods available, the normal, detailed and hybrid (Fig 1a–c). Additionally, detailed information about each interaction can be obtained. Results are provided as data frame or as igraph objects, enabling for subsequent analyses in R or in advanced analysis tools like Cytoscape (Cline ). Through the S4 class architecture Path2PPI can be easily extended by further prediction and validation algorithms. The example depicted in Figure 1, the prediction algorithm and all features of Path2PPI are described in detail in the tutorial, see the supplement and the corresponding Bioconductor web site.

4 Conclusion

We introduced a new R package to predict PPI networks based on sets of proteins which may belong to a specific biological pathway, providing an intuitive visualization and usability. We implemented Path2PPI to reveal putative proteins and interactions for a pathway or a biological process in organisms for which nearly none PPI information is available. The results can serve as starting points for further network modeling studies and experimental validations.
  19 in total

1.  Simple sequence-based kernels do not predict protein-protein interactions.

Authors:  Jiantao Yu; Maozu Guo; Chris J Needham; Yangchao Huang; Lu Cai; David R Westhead
Journal:  Bioinformatics       Date:  2010-08-27       Impact factor: 6.937

2.  Integration of biological networks and gene expression data using Cytoscape.

Authors:  Melissa S Cline; Michael Smoot; Ethan Cerami; Allan Kuchinsky; Nerius Landys; Chris Workman; Rowan Christmas; Iliana Avila-Campilo; Michael Creech; Benjamin Gross; Kristina Hanspers; Ruth Isserlin; Ryan Kelley; Sarah Killcoyne; Samad Lotia; Steven Maere; John Morris; Keiichiro Ono; Vuk Pavlovic; Alexander R Pico; Aditya Vailaya; Peng-Liang Wang; Annette Adler; Bruce R Conklin; Leroy Hood; Martin Kuiper; Chris Sander; Ilya Schmulevich; Benno Schwikowski; Guy J Warner; Trey Ideker; Gary D Bader
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

3.  iRefR: an R package to manipulate the iRefIndex consolidated protein interaction database.

Authors:  Antonio Mora; Ian M Donaldson
Journal:  BMC Bioinformatics       Date:  2011-11-24       Impact factor: 3.169

4.  Assessing organismal aging in the filamentous fungus Podospora anserina.

Authors:  Heinz D Osiewacz; Andrea Hamann; Sandra Zintel
Journal:  Methods Mol Biol       Date:  2013

5.  Building and analyzing protein interactome networks by cross-species comparisons.

Authors:  Amy M Wiles; Mark Doderer; Jianhua Ruan; Ting-Ting Gu; Dashnamoorthy Ravi; Barron Blackman; Alexander J R Bishop
Journal:  BMC Syst Biol       Date:  2010-03-30

6.  The BioGRID interaction database: 2015 update.

Authors:  Andrew Chatr-Aryamontri; Bobby-Joe Breitkreutz; Rose Oughtred; Lorrie Boucher; Sven Heinicke; Daici Chen; Chris Stark; Ashton Breitkreutz; Nadine Kolas; Lara O'Donnell; Teresa Reguly; Julie Nixon; Lindsay Ramage; Andrew Winter; Adnane Sellam; Christie Chang; Jodi Hirschman; Chandra Theesfeld; Jennifer Rust; Michael S Livstone; Kara Dolinski; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

7.  ppiPre: predicting protein-protein interactions by combining heterogeneous features.

Authors:  Yue Deng; Lin Gao; Bingbo Wang
Journal:  BMC Syst Biol       Date:  2013-10-14

8.  Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators.

Authors:  Yoichi Murakami; Kenji Mizuguchi
Journal:  BMC Bioinformatics       Date:  2014-06-23       Impact factor: 3.169

9.  iRefIndex: a consolidated protein interaction database with provenance.

Authors:  Sabry Razick; George Magklaras; Ian M Donaldson
Journal:  BMC Bioinformatics       Date:  2008-09-30       Impact factor: 3.169

Review 10.  Protein-protein interaction detection: methods and analysis.

Authors:  V Srinivasa Rao; K Srinivas; G N Sujini; G N Sunand Kumar
Journal:  Int J Proteomics       Date:  2014-02-17
View more
  2 in total

1.  The autophagy interaction network of the aging model Podospora anserina.

Authors:  Oliver Philipp; Andrea Hamann; Heinz D Osiewacz; Ina Koch
Journal:  BMC Bioinformatics       Date:  2017-03-27       Impact factor: 3.169

2.  Multifaceted protein-protein interaction prediction based on Siamese residual RCNN.

Authors:  Muhao Chen; Chelsea J-T Ju; Guangyu Zhou; Xuelu Chen; Tianran Zhang; Kai-Wei Chang; Carlo Zaniolo; Wei Wang
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.