| Literature DB >> 24688699 |
Abstract
Small world network concepts provide many new opportunities to investigate the complex three dimensional structures of protein molecules. This mini-review explores the published literature on using small-world network approaches to study protein structure, with emphasis on the different combinations of descriptors that have been tested, on studies involving ligand binding in protein-ligand complexes, and on protein-protein complexes. The benefits and success of small world network approaches, which change the focus from specific interactions to the local environment, even to non-local phenomenon, are described. The purpose is to show the different ways that small world network concepts have been used for building new computational models for studying protein structure and function, and for extending and improving existing modelling approaches.Entities:
Keywords: Small world network; protein structure; protein-ligand binding; protein-protein binding
Year: 2013 PMID: 24688699 PMCID: PMC3962176 DOI: 10.5936/csbj.201302006
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Figure 1Different representations of 3D protein structure provide different types of understanding. (a) The electron density map for ligand and surrounding binding site residues shows a good quality, rigid model. (b) Typical computational chemistry view, showing that protein-ligand binding involves many short hydrogen bonds. (c) A network view of all favourable polar interactions in the binding site region shows how the protein-ligand hydrogen bonds are parts of highly connected local environments, which also involve numerous bound water molecules. (d) Secondary structure view, suited for structural bioinformatics analysis. Protein is Neuraminidase in complex with Zanamivir. Images generated using PyMol (www.pymol.org), PDB Ids 2cml and 1nnc.
Figure 2Simple chemical graph showing the three most widely used network descriptors for nodes, with the top five values for each. Node 28 has the highest degree, that is, the most connections (and is therefore a hub). Node 10 has the highest betweenness, which is a function of the fraction of shortest paths through a node (removing this node creates the two largest disconnected fragments). Node 1 has the highest closeness, that is, the inverse of the average of the shortest paths to all other nodes. This particular graph has a low clustering coefficient - no pair of connected nodes is connected to the same third node (no triangles). The characteristic path length is low (near 5) though this is not particularly meaningful as it is only a very small network.
Overview of publications using network methods to model the three dimensional conformations of protein domains. The summaries focus on the different cheminformatics and bioinformatics methods employed, and the properties investigated.
| Publication | Topic of Inquiry | Methods Used |
|---|---|---|
| Dokholyan et al. 2002 [ | Studied folding by creating networks based on conformations generated from Monte Carlo simulations | Showed that post-transition state protein conformations are more small-world like, that is, globally tighter, than pre-transition state conformations. Characteristic path length was found to be the best performing global descriptor examined |
| Vendruscolo et al. 2002 [ | Studied folding by creating a weighted graph from transition state ensembles of protein structures generated by Monte Carlo sampling | Edge weights obtained by dividing the number of structures with a specific edge by the total number of structures in the ensemble. Betweenness values were highest for key residues involved in folding |
| Greene et al. 2003 [ | Studied protein conformation | Used a network with weighting factors derived from the presence of multiple links between residues |
| Brinda et al. 2005 [ | Studied protein structure and stability | Used edge weighting based on the number of close contacts between a pair of residues divided by normalisation factors (which take into account residue size and the propensity to make a large number of contacts) for each residue, and allowing for varying interaction strengths |
| Paszkiewicz et al. 2006 [ | Used network methods to identify a predictor of suitable regions of circular permutation | Closeness was shown to useful descriptor; exploration of relative side chain area and sequence conservation measures also undertaken |
| del Sol et al. 2006 [ | Study of communication aspects of the aminoacid residue network | Used a characteristic path length metric, corresponding to the change in path length resulting from removal of a node |
| Muppirala et al. 2006 [ | Aim was to distinguish correctly folded from incorrectly folded proteins | Used more stringent distance constraints for hydrogen bonding (and also angle cutoffs), and similarly for hydrophobic and ionic contacts, and disulphide bonds, and conserved hydrophobic aminoacids were mapped onto the network to examine their roles |
| Aftabuddin et al. 2007 [ | Protein structure analysis | Separated amino acid sidechains into different types, hydrophobic (F, M, W, I, V, L, P, A), hydrophilic (N, C, Q, G, S, T, Y), and charged (R, D, E, H, K), and the weighted (according to the number of close-contacts between a pair of residues) and unweighted networks separately analysed |
| Gaci et al. 2008 [ | Examined subgraphs involving residues participating in secondary structure elements, alpha helices and beta sheets | Explored distribution of characteristic path length and clustering coefficient as a function protein size |
| Li et al. 2008 [ | Examined protein folding and communication between residues | Found that key residues in the folding process -found to be global centrals rather than local centrals -could be identified using solvent accessibility and network terms together |
| Milenkovic et al. 2009 [ | Sought to determine the best null model for discriminating network motifs, including secondary structure motifs, in aminoacid graphs | Examined different choices for nodes -just using side chain atoms, just using backbone atoms, using all residue atoms -and different wiring models explored, including 3D geometric, random, scale free, and stickiness-index based networks |
| Vijayabaskar et al. 2010 [ | Investigated protein structure integrity and communication pathways using energy weighted networks | Protein energy networks (PENs) were created with edges weighted according to calculated interaction energies, obtained from sampling structures from a molecular dynamics simulation, and fluctuations from the mean |
| Petersen et al. 2012 [ | Protein structure analysis, seeking patterns in the packing of amino acid pairs | Created an eight dimensional descriptor space encompassing residue proximity, solvent accessibility, sequence distance, secondary structure, and sequence length, and uncovered a scale free organization in aminoacid pair interactions |