| Literature DB >> 29746660 |
Daehong Kwon1, Daehwan Lee1, Juyeon Kim1, Jongin Lee1, Mikang Sim1, Jaebum Kim1.
Abstract
Proteins perform biological functions through cascading interactions with each other by forming protein complexes. As a result, interactions among proteins, called protein-protein interactions (PPIs) are not completely free from selection constraint during evolution. Therefore, the identification and analysis of PPI changes during evolution can give us new insight into the evolution of functions. Although many algorithms, databases and websites have been developed to help the study of PPIs, most of them are limited to visualize the structure and features of PPIs in a chosen single species with limited functions in the visualization perspective. This leads to difficulties in the identification of different patterns of PPIs in different species and their functional consequences. To resolve these issues, we developed a web application, called INTER-Species Protein Interaction Analysis (INTERSPIA). Given a set of proteins of user's interest, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins and searches for different patterns of PPIs in multiple species through a server-side pipeline, and second visualizes the dynamics of PPIs in multiple species using an easy-to-use web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29746660 PMCID: PMC6031021 DOI: 10.1093/nar/gky378
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.INTERSPIA workflow for inter-species protein interaction analysis. As user input, INTERSPIA takes a target species (orange-colored species in the right side of the top panel) and a set of proteins of the target species of user's interest (the left side of the top panel) with additional species to be compared (black-colored species in the right side of the top panel). In the server-side step (middle panel), additional proteins interacting with the input target proteins are discovered by the random walk with restart algorithm using the PPIs of the target species in the STRING database. The extended proteins are next used to identify their interactions in the other chosen species using the STRING and orthoDB database. Functions related with the extended proteins are identified by the GO enrichment test. In the client-side step (bottom panel), the dynamics of the protein-protein interactions among the selected species are visualized in an easy-to-use web interface with various features.
Figure 2.Examples of the client-side web interface of INTERSPIA showing protein interactions observed in different sets of species. (A) Protein interactions observed in cow or pig. (B) Protein interactions commonly present in cow and pig but absent in horse with the use of the ‘Only species-specific interactions’ option. Phylogenetic trees on the left side represent the evolutionary relationship of used species and chosen species with thick branches to display the observed protein interactions. Edge color indicates a species having the interaction. If an edge is observed in multiple species, multiple colors are used to draw the edge.
Figure 3.Examples of the client-side web interface of INTERSPIA showing various features. (A) Diverse protein identifiers, a gene symbol, function and orthologous protein information of a chosen protein shown by a mouse right click on a node in the network. (B) Information of two interactions and species containing (green color) or not containing (grey color) the interactions shown by a mouse left click on an edge. The numbers next to the species names represents the STRING confidence score of the PPI in the case of a target species, or the average of all STRING confidence scores of all orthologous protein pairs in the case of other species. (C) Results of the GO enrichment test for the proteins shown in a current network. GO terms can be filtered by q-value and sorted by q-value, GO ID, and GO name. User can highlight proteins with a specific function by selecting a specific GO term. In the figure, proteins associated with the SMAD protein complex assembly (GO:0007183) function are highlighted with a black and magnified label.