| Literature DB >> 29788355 |
Andrew Waterhouse1,2, Martino Bertoni1,2, Stefan Bienert1,2, Gabriel Studer1,2, Gerardo Tauriello1,2, Rafal Gumienny1,2, Florian T Heer1,2, Tjaart A P de Beer1,2, Christine Rempfer1,2, Lorenza Bordoli1,2, Rosalba Lepore1,2, Torsten Schwede1,2.
Abstract
Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29788355 PMCID: PMC6030848 DOI: 10.1093/nar/gky427
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Performance comparison between ProMod-II and ProMod3 modelling engines. Performance is measured on a benchmark dataset of 250 targets collected during the CAMEO time range 20 October 2017–13 January 2018. For each target, the same template and target-template alignment were used as input for both modelling engines. Each data point represents the difference in model accuracy in terms of all-atom IDDT score. ProMod3 shows a statically significant improvement of 2.65 IDDT points on average (P-value = 1.1E–43) based on paired t-test.
Performance comparison in the context of the CAMEO continuous evaluation platform
| Server | Response time (hh:mm:ss) ( | lDDT total ( | lDDT easy ( | lDDT medium ( | lDDT hard ( | lDDT BS ( | QS-Score ( | Model confidence ( |
|---|---|---|---|---|---|---|---|---|
| SWISS-MODEL | 00:15:48 | 66.22 | 86.01 | 69.71 | 40.67 | 70.88 | 63.95 | 0.85 |
| HHpredB | 01:16:15* | 65.95 | 82.10* | 69.68 | 43.18 | 71.47 | - | 0.79* |
| NaiveBLAST | 01:20:27* | 58.93* | 82.86* | 64.20* | 25.76* | 63.88* | - | 0.68* |
| PRIMO | 02:12:08* | 60.26* | 84.51* | 65.07* | 27.82* | 67.30* | - | 0.67* |
| SPARKS-X | 02:35:21* | 63.14* | 80.06* | 65.57* | 42.53 | 67.76* | - | 0.54* |
| RaptorX | 06:28:57* | 69.15* | 83.35* | 72.10* | 49.88* | 68.85 | - | 0.65* |
| IntFOLD4-TS | 32:47:59* | 68.41* | 83.76* | 70.88 | 49.11* | 71.65 | - | 0.84 |
| Robetta | 37:00:07* | 71.60* | 85.17 | 74.00* | 54.08* | 67.48* | 60.20 | 0.81* |
Performance is measured based on a benchmark dataset of 250 targets collected during the CAMEO time range 20 October 2017–13 January 2018. Results from SWISS-MODEL and seven other modelling servers were collected from CAMEO and the performance evaluated on a common subset of targets where all compared servers returned a model. Each column indicates average performance values in terms of Response Time, model accuracy (IDDT, QS-score) and self-assessment of model quality (Model Confidence). lDDT evaluation has further been split according to CAMEOs definition of target difficulty; per column subset sizes are shown in brackets. Asterisks indicate a statistically significant difference (P-value < 0.05) compared to SWISS-MODEL based on paired t-test.
Figure 2.Modelling example of the Ferrodoxin/Ferredoxin-NADP(+) Reductase hetero dimeric complex. (A) Decision tree of templates clustered according to their quaternary structure features: oligomeric state, stoichiometry, topology and interface similarity. Three different clusters are formed based on interface similarity between templates. (B) PPI fingerprint analysis of available template structures. The ratio between interface and surface residue entropy (interface conservation, y-axis) is reported as a function of evolutionary distance (sequence identity, x-axis). Templates corresponding to SMTL ID: 1ewy.1 (in blue) and SMTL ID: 1gaq.1 (in green) show the typical conservation pattern observed for biologically relevant interfaces, with stronger conservation signal in the sequence identity range between 40 and 60%. Considering also remote homologs (below 40% sequence identity), only the interface in template SMTL ID: 1gaq.1 is deemed as conserved (interface/surface conservation ratio below zero). Template corresponding to SMTL ID: 3w5u.1 (in red) displays an interface/surface conservation ratio close to zero, as observed in crystal contacts/artefacts. (C) Structure superposition of available templates. Each template is coloured according to same colouring scheme of Figure 2A and B. Templates corresponding to SMTL ID: 1ewy.1 (in blue) and 1gaq.1 (in green) show similar arrangement of FNR and Fd in the complex. Template SMTL ID: 3w5u.1 (red) shows a different localization of the Fd moiety. Cross-linked cysteines are shown in sticks. (D) Structure superposition between model and native structure of the root FNR:Fd complex. The model is coloured according to its local quality using a colour gradient from blue (high quality) to red (low quality) as measured by all-atom IDDT score. The native structure of the complex is shown in light gray.