| Literature DB >> 20334628 |
Abstract
Protein-protein interactions form the basis for a vast majority of cellular events, including signal transduction and transcriptional regulation. It is now understood that the study of interactions between cellular macromolecules is fundamental to the understanding of biological systems. Interactions between proteins have been studied through a number of high-throughput experiments and have also been predicted through an array of computational methods that leverage the vast amount of sequence data generated in the last decade. In this review, I discuss some of the important computational methods for the prediction of functional linkages between proteins. I then give a brief overview of some of the databases and tools that are useful for a study of protein-protein interactions. I also present an introduction to network theory, followed by a discussion of the parameters commonly used in analysing networks, important network topologies, as well as methods to identify important network components, based on perturbations.Entities:
Year: 2010 PMID: 20334628 PMCID: PMC2834675 DOI: 10.1186/1759-4499-2-2
Source DB: PubMed Journal: Autom Exp ISSN: 1759-4499
Figure 1Prediction of functional linkages between proteins, based on different methods. (A) Method of domain fusion. The figure shows proteins predicted to interact by the Rosetta stone method (domain fusion). Each protein is shown schematically with boxes representing domains. Proteins P2 and P3 in Genomes 2 and 3 are predicted to interact because their homologues are fused in the first genome. (B) Gene neighbourhood. The figure shows four hypothetical genomes, containing one or more of the genes A, B and C. Since the genes A and B are co-localised in multiple genomes (1–4), they are likely to be functionally linked with one another. (C) Phylogenetic profiles. The figure shows five hypothetical genomes, each containing one or more of the proteins A, B, C and D. The presence or absence of each protein is indicated by 1 or 0, respectively, in the phylogenetic profiles given on the right. Identical profiles are highlighted — proteins A and B are functionally linked (dotted line), whereas proteins C and D, which have different phylogenetic profiles (shown in grey) are not likely to be functionally linked. (D) Correlated mutations. The alignments of two protein families are shown; conserved residues in either alignment are shown in the same colour (blue and green). Correlated mutations in either alignment (coloured red) are indicated by arrow marks. Common sub-trees of the phylogenetic trees are highlighted in yellow. The presence of correlated mutations in each family suggests that the corresponding sites may be involved in mediating interactions between the proteins from each family.
Databases and resources useful for researching PPIs.
| Database | URL | Resources | Refs. |
|---|---|---|---|
| BIND | Peer-reviewed bio-molecular interaction database containing published interactions and complexes | [ | |
| BioGRID | Protein and genetic interactions from major model organism species | [ | |
| COGs | Orthology data and phylogenetic profiles | [ | |
| DIP | Experimentally determined interactions between proteins | [ | |
| HPRD | Human protein functions, PPIs, post-translational modifications, enzyme–substrate relationships and disease associations | [ | |
| IntAct | Interaction data abstracted from literature or from direct data depositions by expert curators | [ | |
| iPFAM | Physical interactions between those Pfam domains that have a representative structure in the Protein DataBank (PDB) | [ | |
| MINT | Experimentally verified PPI mined from the scientific literature by expert curators | [ | |
| Predictome | Experimentally derived and computationally predicted functional linkages | [ | |
| ProLinks | Protein functional linkages | [ | |
| SCOPPI | Domain–domain interactions and their interfaces derived from PDB structure files and SCOP domain definitions | [ | |
| STRING | Protein functional linkages from experimental data and computational predicttions | [ |
Examples of tools useful for the visualisation of networks and PPIs.
| Tool | URL | Features | Refs. |
|---|---|---|---|
| BioLayout Express 3D | Facilitates microarray data analysis | [ | |
| Cytoscape | Versatile; implements many visualisation algorithms; many plug-ins available | [ | |
| Large Graph Layout (LGL) | Especially useful for dynamic visualisation of large graphs (105 nodes, 106 edges); force-directed layout algorithm | [ | |
| Osprey | Provides network filters, connectivity filters, many layouts and facilitates dataset superimposing | [ | |
| Pajek | Especially useful for the analysis of very large networks | [ | |
| Visant | Especially facilitates analysis of gene ontologies | [ | |
| Yed | General purpose graph editor | - |