| Literature DB >> 31485137 |
Soudabeh Sabetian1,2, Mohd Shahir Shamsir1.
Abstract
Proteins can interact in various ways, ranging from direct physical relationships to indirect interactions in a formation of protein-protein interaction network. Diagnosis of the protein connections is critical to identify various cellular pathways. Today constructing and analyzing the protein interaction network is being developed as a powerful approach to create network pharmacology toward detecting unknown genes and proteins associated with diseases. Discovery drug targets regarding therapeutic decisions are exciting outcomes of studying disease networks. Protein connections may be identified by experimental and recent new computational approaches. Due to difficulties in analyzing in-vivo proteins interactions, many researchers have encouraged improving computational methods to design protein interaction network. In this review, the experimental and computational approaches and also advantages and disadvantages of these methods regarding the identification of new interactions in a molecular mechanism have been reviewed. Systematic analysis of complex biological systems including network pharmacology and disease network has also been discussed in this review.Entities:
Keywords: Protein interaction network; computational method; disease network
Year: 2019 PMID: 31485137 PMCID: PMC6704336 DOI: 10.6026/97320630015513
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Graphical abstract for the study
Protein interaction databases
| Databases | Descriptions | References and URL |
| The Database of Interacting Proteins (DIP) | DIP is a database that records experimentally determined protein-protein interactions. It provides the scientific community with an integrated set of tools for browsing and extracting information about protein interaction networks. Tools have been developed that allow users to analyse, visualize and integrate their own experimental data with the information about protein-protein interactions available in the DIP database. | [ |
| IntAct | IntAct is an open source database suite for storing and analysing protein-protein interaction data. The available data emanates from published literature and is manually interpreted by expert biologists to a high confidence of detail, comprising experimental methods, conditions and interacting domains. The experimental methods include yeast-2-hybrid, mass spectrometry, fluorescence microscopy, co immune precipitation, pull down and others. PPI network data can be import from IntAct directly using IntAct Web Service Client, a plugin of Cytoscape. | [ |
| The Biomolecular Interaction Network Database (BIND)/(BOND) | BIND (/BOND) is a database designed to store full descriptions of interactions, molecular complexes and pathways. It can be used to study networks of interactions, to map pathways across taxonomic branches and to generate information for kinetic simulations. | [ |
| A Molecular Interaction database (MINT) | MINT is a database designed to store functional interactions data such as enzymatic modifications of one of the partners. MINT includes of extracted data from the published literature by expert curators and software that assemble abstracts comprising information from interaction and demonstrated them in a user-friendly format. The interaction data can be easily mined and observed graphically through 'MINT Viewer'. | [ |
| Reactome | Reactome is a peer-reviewed resource of human biological pathways. The complete set of possible reactions organizes its reactome by enter the genetic profile of an organism. Reaction is the basic module of the Reactome database then the reactions are grouped into causal strings to procedure pathways. The involving applications have been developed to enter custom data and interpretation by expert biologists, and to allow visualisation to construct an interactive pathway network. | [ |
| Human Protein Reference Database (HPRD) | HPRD is an open source based on technologies for protein features in various aspects of human proteins comprising post-translational modifications enzyme-substrate links, disease associations and PPI. The details were derived manual accurate reading of the scientific literature by expert biologists and also protein sequence analyses by bioinformatics approaches. | [ |
| MIPS | The MIPS or mammalian protein-protein interaction database (MPPI) is a new high-quality resource which stores experimental protein interaction data in mammals. The data is based on published experimental studies that has been analysed by human expert curators. It provides a flexible and powerful web interface with full dataset for download toward various scientific targets. | [ |
| SMART (a Simple Modular Architecture Research Tool) | SMARTanalysis the domain structures of proteins and provides the documentation and annotation of genetically the domains. The domains which are grouped in more than 500 domain families are widely annotated in order to phyletic distributions, tertiary structures, functional class and functionally important residues. Each domain found in a non-redundant protein database as well as search features are collected in an associated database system. | [ |
| Kyoto Encyclopaedia of Genes and Genomes (KEGG) | KEGG is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes, information about genes and proteins generated by genome projects and information about biochemical compounds and reactions. | [ |
| InterPro | InterPro is a resource that provides functional analysis of protein sequences by classifying them into families and predicting the existence of domains and main sites. To organize proteins in this way, InterPro procedures predictive models, represented by several different databases. The databases that make up the InterPro Consortium contain PROSITE, HAMAP, Pfam, PRINTS, ProDom, SMART, TIGRAFAMs, PRISF, SUPERFAMILY, CATH, Gene30 and PANTHER. | |
| Gene Ontology (GO) | The GO Consortium is the set of model organism and protein databases and biological research communities actively involved in the development and application of the Gene Ontology. | [ |
| BioGRID | BioGRID is an online interaction repository with data compiled through comprehensive curation efforts. This database contains protein and genetic interaction from major model organism species. All interaction data are freely provided via search index and available by downloading in a wide variety of standardized format. | [ |
| Pathway Commons | Pathway Commons collects publicly available pathway information from various organisms. It allows convenient access to a comprehensive store of biological pathways from multiple resources presented in a common language for gene and metabolic pathway analysis. | [ |
| BioCyc | The BioCyc is a collection of genomes and metabolic pathways which are represented by multiple pathways. The included data is generated by software that predict the metabolic pathways of completely sequenced organisms. BioCyc also integrates protein feature and Gene Ontology information from other bioinformatics databases, such as from UniProt. | [ |
| Pfam | The Pfam database provides a large collection of protein families, each output results by multiple sequence alignments and hidden Markov models (HMMs). | [ |
| GEO (Gene Expression Omnibus) | GEO is an international public source that stores freely microarray, next-generation sequencing, and other forms of high-throughput functional genomics data submitted by the research community. GEO fallows three main aims: archive high-throughput functional genomic data; collect and well-annotated data from the research community; provide to researchers to query, review and download gene expression profile of interest. | [ |
Common tools for visualization protein interaction network
| Visualization tools | URL |
| Cytoscape [ | http://www.cytoscape.org/ |
| NAViGaTOR [ | http://ophid.utoronto.ca/navigator/ |
| Biolayout [ | http://www.biolayout.org/ |
| Medusa [ | https://sites.google.com/site/medusa3visualization/ |
| Arena3D [ | http://www.arena3d.org/ |
Comparisons of Visualization Tools in Several Features Including Database Integration, Data Input Format, Export File Format, and Layout Algorithm
| Features | Arena3D | Biolayout | Cytoscape | NAViGaTOR | Pajek | Gephi | Medusa |
| Plug-ins | x | x | √ | x | x | √ | x |
| Auto merge numbers of networks (no modification on original input file) | x | x | √ | √ | x | x | x |
| Mark nodes according to GO annotations | x | x | √ (plug-ins) | x | x | x | x |
| GO annotations | x | x | √ (plug-ins) | √ | x | x | x |
| Database Integration | |||||||
| IntAct | x | x | √ (plug-ins) | x | x | x | x |
| NCBI | x | x | √ (plug-ins) | x | x | x | x |
| APID2NET | x | x | √ (plug-ins) | x | x | x | x |
| cPath | x | x | √ (plug-ins) | √ | x | x | x |
| GO | x | x | √ (plug-ins) | √ | x | x | x |
| Data Input Format | |||||||
| Text delimited data | x | √ | √ | √ | x | x | √ |
| GML | x | x | √ | √ | x | √ | x |
| PSI-MI XML | x | x | √ | √ | x | x | x |
| SIF | x | √ | √ | x | x | x | x |
| BIOPAX | x | x | √ | √ | x | x | x |
| XGMML | x | x | √ | x | x | x | x |
| Text | √ | √ | √ | √ | x | x | √ |
| Export File Format | |||||||
| Image file | √ | √ | √ | √ | √ | √ | √ |
| xml | x | x | √ | x | x | x | |
| Pajek | √ | x | x | x | √ | x | √ |
| gml | x | x | √ | √ | x | x | x |
| PSI-MI | x | x | √ | √ | x | x | x |
| Arena3D | √ | x | x | x | x | x | √ |
| text | √ | x | x | √ | x | x | x |
| sif | x | √ | √ | x | x | x | √ |
| svg | x | x | √ | √ | √ | √ | x |
| tiff | x | x | x | √ | x | x | x |
| x | x | √ | √ | x | √ | x | |
| GraphViz | x | x | x | x | x | x | √ |
| Medusa format | √ | x | x | x | x | x | √ |
| VRML | √ | x | x | x | √ | x | x |
| Layout Algorithm | |||||||
| Multi-threaded | x | x | x | √ | x | x | x |
| grid-variant layout algorithm | |||||||
| Spring Embedded algorithms | √ | √ | √ | x | √ | x | √ |
| Fruchterman-Rheingold layout algorithm | √ | √ | x | x | √ | x | √ |
| Lanczos algorithm | x | x | x | x | √ | x | x |
| Force Atlas algorithm | x | x | x | x | x | √ | x |
| Distance Geometry layout | √ | x | x | x | x | x | √ |
| Simulated Annealing Algorithm | √ | √ | √ | x | x | x | x |
| Circular layout | √ | x | √ | √ | √ | √ | √ |
| Hierarchiral layout | √ | x | √ | x | x | x | √ |
| Yifan's Hu Multilevel layout | x | x | x | x | x | √ | x |