| Literature DB >> 31045208 |
Liam J McGuffin1, Recep Adiyaman1, Ali H A Maghrabi1, Ahmad N Shuid1,2, Danielle A Brackenridge1, John O Nealon1, Limcy S Philomina1.
Abstract
The IntFOLD server provides a unified resource for the automated prediction of: protein tertiary structures with built-in estimates of model accuracy (EMA), protein structural domain boundaries, natively unstructured or disordered regions in proteins, and protein-ligand interactions. The component methods have been independently evaluated via the successive blind CASP experiments and the continual CAMEO benchmarking project. The IntFOLD server has established its ranking as one of the best performing publicly available servers, based on independent official evaluation metrics. Here, we describe significant updates to the server back end, where we have focused on performance improvements in tertiary structure predictions, in terms of global 3D model quality and accuracy self-estimates (ASE), which we achieve using our newly improved ModFOLD7_rank algorithm. We also report on various upgrades to the front end including: a streamlined submission process, enhanced visualization of models, new confidence scores for ranking, and links for accessing all annotated model data. Furthermore, we now include an option for users to submit selected models for further refinement via convenient push buttons. The IntFOLD server is freely available at: http://www.reading.ac.uk/bioinf/IntFOLD/.Entities:
Year: 2019 PMID: 31045208 PMCID: PMC6602432 DOI: 10.1093/nar/gkz322
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The IntFOLD5 server results pages for CASP13 target T0971. (A) Graphical output from the main results page showing (from top to bottom): 1. The table with the top 5 selected 3D models and scores (table truncated here to fit); 2. The prediction of natively unstructured/disordered regions; 3. The predicted structural domain boundaries; 4. The ligand binding site prediction; 5. The full model quality rankings for all generated models (table truncated here to fit). The arrows point to additional pages that are linked to when users click on images/buttons on the main page. (B) Clicking the button titled ‘View model in 3D and download’ leads to dynamically generated pages showing interactive views of the model, and structural superpositions of the model with relevant template/s, which can be manipulated in 3D using the JSmol/HTML5 framework (http://www.jmol.org/) and/or downloaded for local viewing. (C) Clicking the button titled ‘Refine model using ReFOLD’ submits the 3D model to the ReFOLD service (21) for refinement guided by accurate quality estimates. (D) Clicking on the image of the ligand binding site prediction links to a dynamically generated page that provides numerous options for interactively viewing the likely protein–ligand interactions in 3D with JSmol.
Figure 2.The IntFOLD5 server predictions for CASP13 target T0971 – comparison of models with the native crystal structure (PDB ID: 6d34). All images were rendered using PyMOL (http://www.pymol.org/). (A) The IntFOLD5 3D model coloured by accuracy self-estimate of local quality using the temperature coloured scheme from blue (indicating residues in the model predicted to be close to the native structure) to red (indicating residues in the model that are far from the native or unstructured). (B) The IntFOLD5 3D model with the main cluster of predicted ligands (red spheres) indicating the predicted location of binding site. (C) The crystal structure of T0971/6d34 with ligand (blue spheres). Note: the disordered domain predicted in the model is absent in the X-ray data. (D) Superposition of the IntFOLD5 model and the native structure.
Independent benchmarking of tertiary structure predictions with CAMEO 3D data. Performance results for 3 months of data (26 October 2018 to 19 January 2019) are shown for all (250) targets and all (17) public methods. Data are sorted by average lDDT score for all targets. The scores for the IntFOLD-TS methods are indicated in bold. Data are taken from the CAMEO 3D front page http://www.cameo3d.org/ on 19 January 2019.
| Average lDDT | Average lDDT-BS | |||
|---|---|---|---|---|
| Server name | All targets | Modelled targets | All targets | Modelled targets |
|
|
|
|
|
|
| RaptorX | 67.38 | 67.38 | 68.45 | 68.45 |
| Robetta | 65.51 | 69.1 | 63.24 | 66.11 |
| HHpredB | 64.06 | 64.06 | 68.59 | 68.59 |
| SWISS-MODEL | 62.22 | 62.97 | 64.85 | 65.56 |
|
|
|
|
|
|
| SPARKS-X | 54.63 | 60.7 | 58.07 | 66.78 |
| M4T-SMOTIF-TF | 54.45 | 60.77 | 62.92 | 65.78 |
|
|
|
|
|
|
| PRIMO | 51.74 | 57.48 | 58.32 | 64.65 |
| PRIMO_BST_CL | 51.71 | 57.45 | 58.32 | 64.65 |
| NaiveBLAST | 50.34 | 55.69 | 60.08 | 62.11 |
| PRIMO_BST_3D | 49.83 | 55.86 | 57.99 | 63.51 |
| PRIMO_HHS_3D | 48.27 | 55.87 | 56.49 | 62.62 |
| PRIMO_HHS_CL | 46.73 | 56.43 | 55.55 | 61.58 |
| Princeton_TEMPLATE | 24.46 | 54.61 | 25.63 | 58.95 |
| Phyre2 | 24.06 | 52.77 | 29.27 | 67.31 |
Independent benchmarking of IntFOLD versions with CAMEO 3D data showing the sequential improvement in server performance since the last webserver paper describing IntFOLD3. Performance results for 1 year of data (26 January 2018 to 19 January 2019) are shown for a common subset of 581 targets. The reference method is IntFOLD5-TS and the table is sorted by average lDDT. Data are downloaded from http://www.cameo3d.org/
| Avg. lDDT | Avg. CAD-score | Avg. lDDT-BS | ||||
|---|---|---|---|---|---|---|
| Server Name | Dif. | Ref. | Dif. | Ref. | Dif. | Ref. |
| IntFOLD5-TS | 0 | 67.72 | 0 | 0.67 | 0 | 71.86 |
| IntFOLD4-TS | 0.53 | 67.18 | 0 | 0.66 | 0.23 | 71.62 |
| IntFOLD3-TS | 2.11 | 65.61 | 0.02 | 0.65 | 1.9 | 69.96 |