| Literature DB >> 29071738 |
Joerg Schaarschmidt1, Bohdan Monastyrskyy2, Andriy Kryshtafovych2, Alexandre M J J Bonvin1.
Abstract
Following up on the encouraging results of residue-residue contact prediction in the CASP11 experiment, we present the analysis of predictions submitted for CASP12. The submissions include predictions of 34 groups for 38 domains classified as free modeling targets which are not accessible to homology-based modeling due to a lack of structural templates. CASP11 saw a rise of coevolution-based methods outperforming other approaches. The improvement of these methods coupled to machine learning and sequence database growth are most likely the main driver for a significant improvement in average precision from 27% in CASP11 to 47% in CASP12. In more than half of the targets, especially those with many homologous sequences accessible, precisions above 90% were achieved with the best predictors reaching a precision of 100% in some cases. We furthermore tested the impact of using these contacts as restraints in ab initio modeling of 14 single-domain free modeling targets using Rosetta. Adding contacts to the Rosetta calculations resulted in improvements of up to 26% in GDT_TS within the top five structures.Entities:
Keywords: CASP; co-variation; contact prediction; correlated mutations; de novo structure prediction; evolutionary coupling
Mesh:
Substances:
Year: 2017 PMID: 29071738 PMCID: PMC5820169 DOI: 10.1002/prot.25407
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Figure 1Color‐coded similarity matrix with a dendrogram on the left illustrating the similarity among different methods as judged by the number of common predicted contacts for all targets. The Jaccard distances used in the matrix are calculated on the union of the predicted top L/2 medium‐ and long‐range contacts for each pair of groups
Brief description of the methods participating in CASP12 contact prediction according to the CASP12 Abstracts (http://predictioncenter.org/casp12/doc/CASP12_Abstracts.pdf) if available
| Methods | Group ID | Remarks |
|---|---|---|
| AkbAR | G389 | A meta‐classifier combining several distinct approaches used for inferring contacts from multiple sequence alignments along with a broad range of sequence‐derived features |
|
BAKER_GREMLIN | G030, G157 | A pseudo‐likelihood‐based method using co‐evolution information from multiple sequence alignments |
| Deepfold‐Contact, iFold_1, naive | G219, G079, G109 | Deep learning‐ based contact prediction algorithms using amino acid composition from the MSA, secondary structure predicted by PSIPRED, |
| Distill | G407 | A system based on 2D‐ Recursive Neural Networks with sequence; MSA; PSI‐BLAST, |
| FALCON_COLORS | G348 | An approach predicting contacts by removing background correlations via low‐rank and sparse decomposition of a residue correlation matrix. The matrix is calculated by using local statistical models (e.g., MI and OMES) or global statistical models (e.g., mfDCA |
|
FLOUDAS | G040, G357 | The prediction of tertiary contacts is based on the integration of a random forest model that utilizes coevolution scores and sequence‐based features, and tertiary contacts extracted from the structural templates identified by conSSert |
| IGBteam | G310 | Deep learning approach combining evolutionary information, co‐evolution information, CCMpred |
| MetaPSICOV | G013 | A neural network‐based method inferring coevolutionary signals from multiple sequence alignments of three distinct approaches (PSICOV, |
| MULTICOM‐CLUSTER | G287 | a meta‐predictor that combines contacts predicted by the two other methods—CONSTRUCT and NOVEL |
| MULTICOM‐CONSTRUCT | G236 | A predictor employing MetaPSICOV |
| MULTICOM‐NOVEL (DNcon | G345 | A sequence‐based deep learning contact predictor. |
| Myprotein‐me (plmConv) | G251 | A method using a deep, convolutional neural network relying on the evolutionary coupling inference of a Potts model, using pseudolikelihood maximization |
| Pcons‐net, PconsC2, PconsC31 | G432, G024, G097 | A deep learning approach combining PSICOV |
| PLCT | G281 | A neural network method that has been trained using unbalanced training, oversampling the negative class. The training set has been built based on predicted solvent accessibilities and predicted secondary structures. |
| raghavagps (RRCpred2) | G320 | A simple method that utilizes predicted tertiary structures of a protein for identification of residue‐residue contacts. After ranking predicted structures using quality assessment software QASproCL, residue‐residue contacts are determined in the top 10 protein structures based on distance between residues. From these, residue‐residue contacts are predicted for the target protein based on consensus/average. |
| RaptorX‐Contact | G451 | A deep learning method predicting contacts by integrating both evolutionary coupling and sequence conservation information through an ultra‐deep residual neural network |
| RBO‐Epsilon | G020 | A deep learning method combining evolutionary, sequence‐based, and physicochemical information stemming from EPC‐map |
| RRCpred | G108 | A model using the Logistic classifier of WEKA |
| Shen‐Group | G431 | Updated version of R2C |
| Wang1–4 | G132, G206, G458, G195 | Methods trained on a set of features including PSIPRED |
| Yang‐Server | G044 | A Pipeline by combining existing contact prediction methods including SVMSEQ, |
|
Zhang_Contact | G373 | A neural network combining the contact prediction from three machine‐learning methods (BETACON, |
| ZHOU‐SPARKS‐X | G452 | A probabilistic‐based matching approach |
Figure 2Average precision of long range contacts on L/5 lists for free modeling targets in CASP10 (red), CASP11 (green), and CASP12 (blue) sorted by rank. Grey dashed lines indicate the levels of the best performing group in CASP10 and CASP11, respectively. While only one group showed a significantly better average precision than all the others in CASP 11 compared to CASP10, 26 groups showed an improved average precision in CASP12 compared to the best performing group of CASP11
Figure 3The Precision distribution of medium and long range contacts within the L/2 (blue) and L/5 (green) set for increasing levels of alignment depth (specified in parentheses) in the FM category. Precisions above 90% were reached for almost half of the targets in the L/5 list
Figure 4Plot of average precision by target for all groups versus (A) alignment depth (logarithmic scale) for the FM targets and (B) sequence length. While a correlation between precision and alignment depth can be observed (R 2∼0.56), there is no significant correlation between the sequence length and precision (R 2 ∼0.03)
Figure 5Contact maps for two distinct predictions of L/2 medium and long range contacts for target T0866. Both predictions have an identical precision of 94.23% but a quite a different distribution of contacts with the true predictions (blue) on the left‐hand side spreading equally over the true contacts (green and blue) while on the right‐hand side the predicted contacts cluster in three different regions. This is reflected in a difference in Entropy score of 19.1 for BAKER_GREMLIN and 14.1 for Pcons‐net
Figure 6Effect of alignment depth (x axis) and target length (gradient) on overall GDT_TS (A) and GDT_TS improvement (Δbest GDT_TS—B). An overall poor GDT_TS (A) is observed for the longest targets (white and light gray) regardless of alignment depth. The smallest changes in best GDT_TS are observed for the targets with an alignment depth below 0.05 (B)
Figure 7Improvement in Modeling for target T0915; while the best model from the run without restraints (green) reaches a GDT_TS of only 35 (A) to the reference structure (white), the best model from the run with restraints (blue) reaches a GDT_TS of 52 (B)
Figure 8The GDT_TS of contact‐guided Rosetta models built by us for different contact prediction groups as a function of the precision of underlying contact prediction on three representative targets—T0866, T0904 and T0941. The tertiary structure predictions were built separately for six lengths of contact lists (0.2–3 L) used to guide the modeling. Points in the graph represent the highest GDT_TS score within the top five structures built for each contact prediction group. The best GDT_TS of the Top five models without contacts is indicated by the dashed vertical line. In general, the best GDT_TS correlates with the precision. Hardly any improvement in respect to the run without constraint is observed for T0941 (right). While Precisions above 50% are associated with an increased best GDT_TS for target T0904 (middle), even precisions of 100% are not resulting in an improved best GDT_TS in all cases in T0866 (left)
Figure 9(A) Effect of the number of employed contacts on the improvement of GDT_TS within the top five models by score. Performance of the run without restraints is indicated by the dashed line. For the best five ranked groups (ranking according to Table 2) the list size with the biggest improvement varies between L/2 to 2*L for target T0866. (B) In contrast the best option based on the boxplot for the top 10 groups (ranking according to Table 2) on the seven targets with a GDT_TS above 30 is L/5 with a slight margin over L/2
Figure 10Distribution of delta GDT_TS values by group compared to the modeling without restraints for the best GDT_TS within the top five structures by score over all targets reaching ΔGDT_TS values above 25 (L/2 medium‐range and long‐range contacts). Interestingly groups performing well in the L/2 ranking in Table 2 like G079 and G219 (underlined) have a lower GDT_TS in the majority of targets compared to the reference, groups that are not in the Top 5 (G097 and G320) of the ranking show on average the biggest improvement in GDT_TS
z‐scores ranking based on the sum of z‐scores for various measures and list sizes covering reduced lists (L/2 and L/5) and the full prediction (FL)a
| L/2 | L/5 | Full List | Average rank ± SD | ||||
|---|---|---|---|---|---|---|---|
| F1+ | Prec | F1+ | MCC+ | AUC_PR | |||
| 0.5*ES | — | — | 0.5*ES | 0.5*ES | — | ||
| RaptorX‐Contact | 1 | 1 | 1 | 4 | 2 | 1 | 1.7 ± 1.2 |
| MetaPSICOV | 2 | 2 | 2 | 15 | 12 | 3 | 6.0 ± 5.9 |
| iFold_1 | 3 | 4 | 8 | 3 | 1 | 2 | 3.5 ± 2.4 |
| MULTICOM‐CONSTRUCT | 4 | 3 | 3 | 13 | 10 | 5 | 6.3 ± 4.2 |
| RBO‐Epsilon | 5 | 5 | 4 | 18 | 15 | 6 | 8.8 ± 6.0 |
| Deepfold‐Contact | 6 | 8 | 11 | 5 | 4 | 4 | 6.3 ± 2.7 |
| FALCON_COLORS | 7 | 6 | 7 | 19 | 16 | 8 | 10.5 ± 5.5 |
| Yang‐Server | 8 | 7 | 5 | 17 | 18 | 10 | 10.8 ± 5.4 |
| AkbAR | 9 | 14 | 15 | 22 | 21 | 15 | 16.0 ± 4.8 |
| raghavagps | 10 | 11 | 12 | 10 | 9 | 7 | 9.8 ± 1.7 |
| Pcons‐net | 11 | 9 | 6 | 14 | 13 | 9 | 10.3 ± 2.9 |
| naive | 12 | 13 | 16 | 6 | 6 | 13 | 11.0 ± 4.1 |
| Shen‐Group | 13 | 15 | 13 | 1 | 3 | 14 | 9.8 ± 6.1 |
| IGBteam | 14 | 10 | 9 | 9 | 7 | 11 | 10.0 ± 2.4 |
| PconsC31 | 15 | 12 | 10 | 16 | 17 | 12 | 13.7 ± 2.7 |
| MULTICOM‐CLUSTER | 16 | 16 | 14 | 8 | 8 | 17 | 13.2 ± 4.1 |
| MULTICOM‐NOVEL | 17 | 18 | 17 | 2 | 5 | 16 | 12.5 ± 7.1 |
| Zhang_Contact | 18 | 17 | 19 | 20 | 20 | 18 | 18.7 ± 1.2 |
| PLCT | 19 | 19 | 18 | 26 | 26 | 20 | 21.3 ± 3.7 |
| PconsC2 | 20 | 20 | 20 | 28 | 27 | 21 | 22.7 ± 3.8 |
| Distill | 21 | 21 | 21 | 21 | 22 | 19 | 20.8 ± 1.0 |
| ZHOU‐SPARKS‐X | 22 | 23 | 28 | — | — | 29 | 25.5 ± 3.5 |
| FLOUDAS_SERVER | 23 | 22 | 23 | 27 | 28 | 22 | 24.2 ± 2.6 |
| Wang4 | 24 | 26 | 25 | — | — | 31 | 26.5 ± 3.1 |
| BG2 | 25 | 29 | 24 | 24 | 23 | 24 | 24.8 ± 2.1 |
| BAKER_GREMLIN | 26 | 30 | 22 | 25 | 24 | 23 | 25.0 ± 2.8 |
| Wang2 | 27 | 24 | 27 | — | — | 27 | 26.2 ± 1.5 |
| myprotein‐me | 28 | 25 | 26 | 30 | 30 | 26 | 27.5 ± 2.2 |
| RRCpred | 29 | 28 | 29 | 12 | 14 | 25 | 22.8 ± 7.8 |
| Wang3 | 30 | 27 | 30 | 11 | 19 | 28 | 24.2 ± 7.6 |
| Wang1 | 31 | 31 | 32 | 7 | 11 | 32 | 24.0 ± 11.7 |
| FLOUDAS | 32 | 32 | 31 | 23 | 25 | 30 | 28.8 ± 3.9 |
The table includes rankings of the groups according to the scores illustrated in Figure 13 (see the caption to Figure 13 for details).
Figure 11Average Precision by group for the L/2 and L/5 list (medium and long range contacts, FM targets). While the average precision of the L/5 predictions is in most cases slightly higher than the L/2 precision, the ranking on either metric will be similar
Figure 12Paired t test comparing average precision (L2/FM/ML) per target for each of the 32 groups. P values <0.05 are shaded indicating a significant difference between the groups. White indicates no significant difference between the groups while darker shades represent higher significance. Based on the matrix, there is no significant difference between the predictions of the best performing groups
Figure 13Top 10 predictors by cumulative z‐scores on (A) metrics assessing ranking of probabilities and (B) metrics assessing binary contact classification based on the 0.5 cutoff according to the assessor‐selected scores in each category. The four predictors appearing in the top 10 of both rankings are underlined. The scores in panel A include three reduced list scores—the F1 + 0.5*ES combination of the F1 and entropy scores, and the precision on L/2 and L/5 data, and one full list score—the area under the curve in the precision‐recall analysis (AUC_PR). The scores in panel B include the F1 + 0.5*ES and MCC + 0.5*ES combinations of the F1, MCC and the entropy scores. For the FL assessment of MCC and F1, only the residue pairs predicted with the probability >0.5 were considered as contacts. The results in panel (B) are therefore affected by the way some groups scaled their contacts, not submitting predictions with probabilities above 0.5 for several targets
Figure 14Boxplot showing statistics on the contact probabilities submitted FM and FM/TBM targets for CASP12. One group submitted only confident contacts (G108), while others did not appear to take the requested format into consideration by submitting almost all the contacts with probabilities below 0.5 (e.g., G206 and G458)
Figure 15boxplots per group depicting the fraction of True Positive contacts for the 10 intervals between 0 and 1 (step = 0.1). The perfect correlation between the intervals and the true positive fraction of the prediction (TP/[TP + FP]) is indicated by the dashed line. In the majority of groups TP/[TP + FP] corresponds roughly to the probability interval
Figure 16Distribution of sequence depth over all targets for CASP11 (white light gray) and CASP12 (black) for targets with low (<0.5), medium (0.5‐2), and high alignment depth. Alignment depth values were calculated from the outputs of HHblits and PSI‐BLAST as described previously10 using the latest databases available after closure of the prediction window for each target