
APOLLO: a quality assessment service for single and multiple protein models.

Zheng Wang, Jesse Eickholt, Jianlin Cheng.

Abstract

SUMMARY: We built a web server named APOLLO, which can evaluate the absolute global and local qualities of a single protein model using machine learning methods or the global and local qualities of a pool of models using a pair-wise comparison approach. Based on our evaluations on 107 CASP9 (Critical Assessment of Techniques for Protein Structure Prediction) targets, the predicted quality scores generated from our machine learning and pair-wise methods have an average per-target correlation of 0.671 and 0.917, respectively, with the true model quality scores. Based on our test on 92 CASP9 targets, our predicted absolute local qualities have an average difference of 2.60 Å with the actual distances to native structure. AVAILABILITY: http://sysbio.rnet.missouri.edu/apollo/. Single and pair-wise global quality assessment software is also available at the site.

Substances：
Proteins

Year: 2011 PMID: 21546397 PMCID: PMC3106203 DOI: 10.1093/bioinformatics/btr268

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

1 INTRODUCTION

Protein model quality assessment plays an important role in protein structure prediction and application. Assessing the quality of protein models is essential for ranking, refining and using models (Cheng, 2008). Model Quality Assessment Programs (MQAPs) predict model quality from two perspectives: the global quality of the entire model and the residue-specific local qualities. The techniques commonly used by MQAPs include multiple-model (clustering) methods (Ginalski et al., 2003; McGuffin, 2007, 2008; Paluszewski and Karplus, 2008; Wallner and Elofsson, 2007; Zhang and Skolnick, 2004a), single-model methods (Archie and Karplus, 2009; Benkert et al.; Cline et al.; Qiu et al.; Wallner and Elofsson, 2003; Wang et al.) and hybrid methods (Cheng et al.; McGuffin, 2009). According to the CASP experiments, multiple-model clustering methods are currently more accurate than single-model methods. However, they do not work well when only a small number of models are available. A hybrid quality assessment method (Cheng et al.) was recently developed to combine the two approaches and integrate their respective strengths. Here, we build a web server to provide the community with access to all three model quality assessment approaches (i.e. single, clustering and hybrid).

2 METHODS

2.1 Input and output

Users need only upload or paste a single model file in Protein Data Bank (PDB) format, or upload a zipped file containing multiple models. If a single model is submitted, APOLLO predicts the absolute global and local qualities. If multiple models are submitted, APOLLO outputs the absolute global qualities, average pair-wise GDT-TS scores, refined average pair-wise Q-scores, refined absolute scores and pair-wise local qualities. All global quality scores range from 0 to 1, where 1 indicates a perfect model and 0 indicates the worst case.

2.2 Algorithms

The absolute global quality score is generated by our single-model QA predictor, ModelEvaluator (Wang et al.). Given a single model, ModelEvaluator (MULTICOM-NOVEL server in CASP9) extracts secondary structure, solvent accessibility, beta-sheet topology and a contact map from the model, and then compares these features with those predicted from the primary sequence by the SCRATCH program (Cheng et al., 2005). These comparisons generate match scores, which are fed into an SVM model trained on CASP6 and CASP7 data to predict the absolute global quality of the model in terms of GDT-TS score.

To predict the absolute local quality score of a residue, the secondary structure and solvent accessibility predicted from the sequence are compared with those parsed from the model in a 15-residue window around the residue. For each residue in the window, we also gather its contact residues, which are ≥ 6 residues away in sequence and have a Euclidean distance ≤ 8 Å in the model. Their probabilities of being in contact according to the predicted contact probability map are averaged. The averaged contact probabilities, the match scores from the secondary structure and solvent accessibility comparisons and the residue encoding are fed into an SVM to predict local quality. The SVM is trained on the models of 30 CASP8 single-domain targets.

The average pair-wise GDT-TS score is generated using our latest implementation (MULTICOM-CLUSTER server in CASP9) of the widely used pair-wise comparison approach (Larsson et al.). Taking a pool of models as input, it first filters out illegal characters and chain-break characters in the corresponding PDB files. It then uses TM-Score (Zhang and Skolnick, 2004b) to perform a full pair-wise comparison between these models. The average GDT-TS score between a model and all other models is used as the predicted GDT-TS score of the model. One caveat is that the GDT-TS score of a partial model is scaled down by the ratio of its length to the full target length.

The refined global and local quality scores are generated using a hybrid approach (MULTICOM-REFINE server in CASP9) (Cheng et al.) that integrates single-model ranking methods with structural comparison-based methods. It first selects several top models (i.e. top five or top ten) as reference models. Each model in the ranking list is superposed onto the reference models by TM-Score. The average GDT-TS score of these superpositions is taken as the predicted quality score. The superpositions with the reference models are also used to calculate Euclidean distances between the same residues in the superposed models. The average distance is used as the predicted pair-wise local quality of the residue (Fig. 1). Larger distances correspond to poorer local quality.
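The pair-wise and hybrid scoring steps described above can be illustrated with a minimal Python sketch. It assumes the pair-wise GDT-TS values have already been computed (for example with TM-Score) and stored in a nested dictionary. The function names, the dictionary layout, the point at which the length scaling is applied and the choice not to compare a reference model with itself are illustrative assumptions, not details of the MULTICOM servers.

```python
from typing import Dict, List

Similarity = Dict[str, Dict[str, float]]  # sim[a][b]: GDT-TS between models a and b


def average_pairwise_scores(sim: Similarity,
                            lengths: Dict[str, int],
                            target_length: int) -> Dict[str, float]:
    """Score each model by its mean similarity to all other models in the pool."""
    if len(sim) < 2:
        raise ValueError("pair-wise scoring needs at least two models")
    scores = {}
    for a in sim:
        others = [sim[a][b] for b in sim if b != a]
        mean = sum(others) / len(others)
        # Assumption: a partial model is penalised by scaling its averaged
        # score with (model length / full target length).
        coverage = min(1.0, lengths[a] / target_length)
        scores[a] = mean * coverage
    return scores


def refine_with_references(initial: Dict[str, float],
                           sim: Similarity,
                           top_k: int = 5) -> Dict[str, float]:
    """Re-score each model by its mean similarity to the top_k ranked models."""
    ranked: List[str] = sorted(initial, key=initial.get, reverse=True)
    references = ranked[:top_k]
    refined = {}
    for m in initial:
        # Assumption: a reference model is not compared with itself.
        partners = [r for r in references if r != m]
        if partners:
            refined[m] = sum(sim[m][r] for r in partners) / len(partners)
        else:
            refined[m] = initial[m]  # nothing to compare against
    return refined


# Toy pool of three models; all numbers are made up for illustration.
sim = {
    "A": {"B": 0.8, "C": 0.4},
    "B": {"A": 0.8, "C": 0.5},
    "C": {"A": 0.4, "B": 0.5},
}
lengths = {"A": 100, "B": 100, "C": 50}  # model C covers half of the target
pairwise = average_pairwise_scores(sim, lengths, target_length=100)
refined = refine_with_references({"A": 0.7, "B": 0.6, "C": 0.3}, sim, top_k=2)
```

In this toy pool, model C is penalised twice: it agrees less with the other models, and its averaged score is halved because it covers only half of the target.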

Fig. 1.

A local quality example for CASP9 target T0563. On the left is a plot of predicted local quality scores (colored line) and actual distances (black line) against residue position. On the right is the superposition of the native structure (grey) and the model. Regions of the model with different local quality are shown in different colors, matching the colors of the line segments in the plot on the left. Disordered regions are not plotted in the actual distance line.

The refined average pair-wise Q-scores are generated using a consensus approach (MULTICOM-CONSTRUCT server in CASP9). APOLLO first uses the average pair-wise similarity scores, calculated in terms of Q-score (Ben-David et al.; McGuffin and Roche, 2010), to generate an initial ranking of all the models. The Q-score between a pair of residues (i, j) in the two models is computed as $Q_{ij} = \exp[-(r_{ij}^{a} - r_{ij}^{b})^{2}]$, where $r_{ij}^{a}$ and $r_{ij}^{b}$ are the distances between the Cα atoms at residue positions i and j in models a and b, respectively. The overall Q-score between models a and b is the average of the Q-scores of all residue pairs in the entire model. The average Q-score between a model and all other models is used as the predicted quality score of the model. The initial quality scores are then refined by the same refinement process used by our hybrid method in MULTICOM-REFINE.
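As a sketch of the Q-score computation described above, the following Python function averages the per-pair term exp[−(r_a − r_b)^2] over residue pairs. It assumes both models are supplied as equal-length lists of Cα coordinates for the same residues in the same order, and that every pair i < j contributes to the average. The function name and input layout are hypothetical.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def q_score(ca_a: Sequence[Coord], ca_b: Sequence[Coord]) -> float:
    """Average exp(-(r_ij^a - r_ij^b)^2) over all residue pairs i < j."""
    n = len(ca_a)
    if n != len(ca_b):
        raise ValueError("models must contain the same residues")
    if n < 2:
        raise ValueError("at least two residues are required")
    total = 0.0
    pairs = 0
    for i in range(n):
        for j in range(i + 1, n):
            r_a = math.dist(ca_a[i], ca_a[j])  # C-alpha distance in model a
            r_b = math.dist(ca_b[i], ca_b[j])  # same residue pair in model b
            total += math.exp(-(r_a - r_b) ** 2)
            pairs += 1
    return total / pairs


# Two-residue toy models whose C-alpha distances differ by exactly 1 Angstrom.
model_a = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
model_b = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0)]
q_ab = q_score(model_a, model_b)  # equals exp(-1)
q_aa = q_score(model_a, model_a)  # identical models give 1.0
```

Because the score depends only on internal distances, no superposition of the two models is needed, which is consistent with the alignment-free approach cited in the text.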

3 RESULTS

We assessed most of the methods used by APOLLO on 107 valid CASP9 targets. We downloaded all the CASP9 models from the CASP9 website (http://predictioncenter.org/download_area/CASP9/) and the experimental structures from the PDB (Berman et al., 2000). These PDB files were preprocessed to select the correct chains and residues matching the CASP9 target sequences. TM-Score was used to align each model with the corresponding native structure and generate its real quality score (GDT-TS). The CASP9 QA predictions made by our methods were evaluated against the actual quality scores by four criteria: the average per-target correlation (Cozzetto et al.), the average GDT-TS score of the top one ranked models, the overall correlation on all targets and the average loss, i.e. the difference in GDT-TS score between the top-ranked model and the best model (Cozzetto et al.) (Table 1). The results show that the average correlation can be as high as 0.92 for the multiple-model method (0.67 for the single-model method) and the average loss can be as low as 0.057 for the multiple-model method (0.095 for the single-model method). Our multiple- and single-model global QA methods were ranked among the most accurate QA methods of their respective kind according to the CASP9 official assessment (http://www.predictioncenter.org/casp9/doc/presentations/CASP9_QA.pdf).

The average per-target correlation of our pair-wise local quality predictions is ~0.53, which is also among the top local quality predictors in CASP9. We also conducted a blind test of the absolute local quality predictor (trained on the CASP8 dataset) on the CASP9 models of 92 CASP9 single-domain proteins. For residues whose actual distances to the native structure are ≤ 10 Å and ≤ 20 Å, the average absolute difference between our predicted distances and the actual distances is 2.60 Å and 3.18 Å, respectively.
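The four evaluation criteria can be sketched in Python as follows. This is an illustration under stated assumptions: Pearson correlation is assumed for both correlation measures, scores are held in nested dictionaries keyed by target and model, and the function names and toy numbers are made up rather than taken from the CASP9 evaluation.

```python
import math
from typing import Dict, List

Scores = Dict[str, Dict[str, float]]  # scores[target][model]


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / (sx * sy)


def evaluate(predicted: Scores, actual: Scores) -> Dict[str, float]:
    """Compute the four global QA criteria over all targets."""
    per_target, top1, losses = [], [], []
    all_pred, all_true = [], []
    for target, pred in predicted.items():
        models = sorted(pred)
        p = [pred[m] for m in models]
        t = [actual[target][m] for m in models]
        per_target.append(pearson(p, t))
        top_model = max(models, key=pred.get)  # model ranked first by the QA method
        top1.append(actual[target][top_model])
        # Loss: real score of the best model minus that of the top-ranked model.
        losses.append(max(t) - actual[target][top_model])
        all_pred.extend(p)
        all_true.extend(t)
    n = len(per_target)
    return {
        "average_correlation": sum(per_target) / n,
        "average_top1": sum(top1) / n,
        "overall_correlation": pearson(all_pred, all_true),
        "average_loss": sum(losses) / n,
    }


# Toy data for two targets with three models each (illustrative values only).
predicted = {
    "T1": {"m1": 0.9, "m2": 0.5, "m3": 0.1},
    "T2": {"m1": 0.2, "m2": 0.4, "m3": 0.6},
}
actual = {
    "T1": {"m1": 0.6, "m2": 0.7, "m3": 0.2},
    "T2": {"m1": 0.1, "m2": 0.3, "m3": 0.5},
}
result = evaluate(predicted, actual)
```

For target T1 the QA method ranks m1 first although m2 is the better model, which produces a non-zero loss; for T2 the ranking is perfect and the loss is zero.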

Table 1.

Results of global quality assessment methods used by APOLLO server on 107 CASP9 targets

Methods	Average correlation	Average top 1	Overall correlation	Average % loss
Absolute score	0.671	0.552	0.767	0.095
Average pair-wise GDT-TS	0.917	0.591	0.943	0.057
Refined absolute score	0.870	0.567	0.928	0.081
Refined pair-wise Q-score	0.835	0.572	0.904	0.076

Funding: National Institutes of Health (NIH) (grant 1R01GM093123 to J.C.).

Conflict of Interest: none declared.

REFERENCES (10 of 22 listed)

1. The Protein Data Bank.

Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971

2. Can correct protein models be identified?

Authors: Björn Wallner; Arne Elofsson
Journal: Protein Sci Date: 2003-05 Impact factor: 6.725

3. 3D-Jury: a simple approach to improve protein structure predictions.

Authors: Krzysztof Ginalski; Arne Elofsson; Daniel Fischer; Leszek Rychlewski
Journal: Bioinformatics Date: 2003-05-22 Impact factor: 6.937

4. SPICKER: a clustering approach to identify near-native protein folds.

Authors: Yang Zhang; Jeffrey Skolnick
Journal: J Comput Chem Date: 2004-04-30 Impact factor: 3.376

5. Scoring function for automated assessment of protein structure template quality.

Authors: Yang Zhang; Jeffrey Skolnick
Journal: Proteins Date: 2004-12-01

6. Prediction of global and local model quality in CASP7 using Pcons and ProQ.

Authors: Björn Wallner; Arne Elofsson
Journal: Proteins Date: 2007

7. The ModFOLD server for the quality assessment of protein structural models.

Authors: Liam J McGuffin
Journal: Bioinformatics Date: 2008-01-09 Impact factor: 6.937

8. Rapid model quality assessment for protein structure predictions using the comparison of multiple models without structural alignments.

Authors: Liam J McGuffin; Daniel B Roche
Journal: Bioinformatics Date: 2009-11-06 Impact factor: 6.937

9. Benchmarking consensus model quality assessment for protein fold recognition.

Authors: Liam J McGuffin
Journal: BMC Bioinformatics Date: 2007-09-18 Impact factor: 3.169

10. SCRATCH: a protein structure and structural feature prediction server.

Authors: J Cheng; A Z Randall; M J Sweredoski; P Baldi
Journal: Nucleic Acids Res Date: 2005-07-01 Impact factor: 16.971
