| Literature DB >> 26287166 |
Jennifer D Atkins1, Samuel Y Boateng2, Thomas Sorensen3, Liam J McGuffin4.
Abstract
The role and function of a given protein is dependent on its structure. In recent years, however, numerous studies have highlighted the importance of unstructured, or disordered regions in governing a protein's function. Disordered proteins have been found to play important roles in pivotal cellular functions, such as DNA binding and signalling cascades. Studying proteins with extended disordered regions is often problematic as they can be challenging to express, purify and crystallise. This means that interpretable experimental data on protein disorder is hard to generate. As a result, predictive computational tools have been developed with the aim of predicting the level and location of disorder within a protein. Currently, over 60 prediction servers exist, utilizing different methods for classifying disorder and different training sets. Here we review several good performing, publicly available prediction methods, comparing their application and discussing how disorder prediction servers can be used to aid the experimental solution of protein structure. The use of disorder prediction methods allows us to adopt a more targeted approach to experimental studies by accurately identifying the boundaries of ordered protein domains so that they may be investigated separately, thereby increasing the likelihood of their successful experimental solution.Entities:
Keywords: disorder prediction methods; intrinsic disorder; structural bioinformatics; types of disorder
Mesh:
Substances:
Year: 2015 PMID: 26287166 PMCID: PMC4581285 DOI: 10.3390/ijms160819040
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Number of publications relating to intrinsic disorder/unfolded proteins on PubMed since 1990. The early 2000’s saw a dramatic increase in research on these proteins. This figure has been updated from [1] using the same search terms within PubMed; intrinsically disordered, intrinsically unstructured, natively unfolded, intrinsically unfolded and intrinsically flexible.
A selection of current protein disorder prediction servers.
| Disorder Prediction Server | URL | Description | Publication Date | CASP Rank | Standalone Method Available? |
|---|---|---|---|---|---|
| MobiDB [ | 10 servers; Espritz (all 3 flavours) [ | 2014 | - | No | |
| Metadisorder [ | 13 servers; output weighted by accuracy score (Sw). Uses DisEMBL (3 versions) [ | 2012 | CASP10: 22 CASP9: 14 CASP8: 21 | No | |
| Spine-D [ | Ab-initio predictor with an initial three-state state prediction. Generates a consensus prediction based upon 5 independent predictors. | 2012 | CASP9: 4 | Yes | |
| MFDp [ | 3 servers; DISOclust [ | 2010 | CASP10: 3/4 | No | |
| PreDisorder [ | Ab-initio predictor based upon a recursive neural network using a PSI-BLAST profile combined with secondary structure predictions and solvent accessibility. | 2009 | CASP8: 8 | Yes | |
| DISOclust [ | Utilizes outputs from the ModFOLD method to calculate per residue variation in 3D models from IntFOLD. | 2008 | CASP10: 19 CASP9: 9 CASP8: 3 | Yes | |
| metaPrDOS [ | 8 servers; prediction scores of each converted into an input vector which feeds into an SVM. Uses PrDos [ | 2008 | CASP10: 5 CASP8: 13 | No | |
| PrDOS [ | Combines two predictors; one based upon amino acid composition and one on template proteins. | 2007 | CASP10: 1 CASP9: 1 | No | |
| POODLE [ | Integrated system using 3 predictors; POODLE-L, POODLE-S and POODLE-W. | 2007 | CASP10: 6 | Yes | |
| DisPro [ | All disordered X-ray crystal structures from the PDB were filtered to obtain a dataset with only >30 residues. The final data set contained 215, 612 residues; only 6.2% disordered. | 2005 | - | Yes | |
| IUPred [ | Based upon a quadratic equation of amino acid composition determining energies; chemical type, sequential environment and interaction partners. | 2005 | - | Yes | |
| DISOPRED 2+3 [ | Web based ab-initio prediction server. Trained on 750 non-redundant disordered high resolution X-ray Crystal structures. | 2004 | CASP10: 2 CASP9: 2 CASP8: 19 | Yes | |
| PONDR [ | Default predictor VL-XT; uses VL1 trained on 8 disordered regions from X-ray crystallographic data and 7 characterized by NMR with >30residues. 10 attributes were used as inputs into a feedforward neural network [ | 1999 | - | No |
Figure 2IntFOLD server model of Cardiac MLP. The central and terminal regions are both thought to contain disorder, as found within the other members of the CRP family. The ordered domains are predicted to contain zinc binding sites; likely locations of zinc atoms are indicated by grey spheres. The image is rendered using PyMOL [56].
Comparison of disordered region prediction for Muscle LIM Protein (MLP) from a variety of servers which utilize different methodologies, including the top five ranked servers from the past three CASP (Critical assessment of disorder prediction servers) experiments. N.B. POODLE not tested as unavailable at the time of writing.
| Server/Prediction Method | CASP Rank (AUC (ROC) Score) | N-Terminus Disordered Residues | Central Disordered Residues | C-Terminus Disordered Residues | ||
|---|---|---|---|---|---|---|
| 10 | 9 | 8 | ||||
| PrDOS (5% False Positive; default) | 1 | 1 | 1–6 | - | 187–194 | |
| PrDOS (15% False Positive) | 1 | 5 | 184–194 | |||
| DISOPRED3 | 2 | 2 | - | 1–4 | - | 185–194 |
| MFDp | 3 & 4 | 7 & 8 | 19 | 1–8 | 87–112 | 184–194 |
| MFDp2 | - | 93–108 | 186–194 | |||
| metaPrDOS | 5 | - | 13 | 1–8 | 82–86, 89–116 | 187–194 |
| PreDisorder | 7 | 3 | 8 | 1–12 | 41–54, 72–122 | 151–163, 187–194 |
| Spine-D | 9 | 4 | - | 1–15 | 68–119 | 181–194 |
| DISOclust (From IntFOLD) | 19 | 9 | 3 | 1–5 | 78–123 | 182–194 |
| GSMetaDisorder | 22 | 14 | 21 | 1–5 | 88–114 | 182–194 |
| GSMetaDisorderMD | 15 | 10 | - | 1–5 | 91–114 | 185–194 |
| DISOPRED2 | - | - | 2 | 1–2 | 95–114 | 186–194 |
| GSMetaDisorderMD2 | - | 1–5 | 85–115 | 182–194 | ||
| MobiDB (consensus) | - | 1–6 | 91–107, 110, 113–118 | 189–194 | ||
| PredictProtein: MD | - | 1–15 | 91–119 | 152, 154–158, 178–194 | ||
| PredictProtein: UCON | - | 49 | 93–117, 119–121, 126, 133, 136 | 155–161, 163–164, 187–194 | ||
| PredictProtein: PROFbval | - | 1–16, 18–20, 22, 24, 26–29, 32, 41–48, 50–56 | 60–131, 136–139 | 149, 151–165, 170, 173–182, 184–194 | ||