| Literature DB >> 20525783 |
Raquel Norel1, Donald Petrey, Barry Honig.
Abstract
The construction of a homology model for a protein can involve a number of decisions requiring the integration of different sources of information and the application of different modeling tools depending on the particular problem. Functional information can be especially important in guiding the modeling process, but such information is not generally integrated into modeling pipelines. Pudge is a flexible, interactive protein structure prediction server, which is designed with these issues in mind. By dividing the modeling into five stages (template selection, alignment, model building, model refinement and model evaluation) and providing various tools to visualize, analyze and compare the results at each stage, we enable a flexible modeling strategy that can be tailored to the needs of a given problem. Pudge is freely available at http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:PUDGE.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20525783 PMCID: PMC2896183 DOI: 10.1093/nar/gkq475
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The Pudge protein structure prediction pipeline. Starting with a sequence, a model is constructed in five stages: TS, AL, MB, MR and ME. Menu options allow a user to select the methods applied at each stage but models can also be generated in an entirely automated fashion. Results from each stage can be examined and edited on the web page (A, E). A variety of tools can be applied to analyze the edited results. Those illustrated here are taken from a case study modeling the protein PqsE described in the text. (B) The ‘annotation map’. Functional properties of all the templates can be examined, queried and filtered using the annotation map. A wide range of properties can be displayed (see text). For example, the blue rectangle highlights residues in metal-binding active sites (magenta squares), which are conserved in PqsE (black bars). (C). Simultaneous comparison of the alignments of PqsE to two templates (PDB codes 2zwr and 2zo4). The figure indicates that the models built on these templates would be equivalent in the gray region and would have an alignment shift in the yellow region. Models built using 2zwr in the red region would be built ab initio, whereas that region is structured and well-aligned for the template 2zo4 (green, see also Figure 1G). (D) Any individual model can be submitted to our MarkUs protein function annotation server for additional bioinformatic and biophysical analysis. The figure shows a molecular viewer within MarkUs displaying predicted functional cavities. (F) A ProSa2003 evaluation of models of PqsE built on templates 2zo4 and 2zwr. The C-terminal of the model built on 2zwr has lower quality (red rectangle) than the model built on 2zo4 (green rectangle). (G) Worm representations of the C-termini of the native structure of PqsE, and the models built on 2zwr and 2zo4. The models were largely equivalent (e.g. gray regions) but the C-terminal of 2zo4 (green) was a close structural match to the PqsE native structure (blue), whereas the same region had to be built ab initio in the model built on 2zwr (red) even though PqsE was more similar to this template overall.
Modeling and analysis methods used in the PUDGE pipeline
| TS | AL | MB | MR | ME |
|---|---|---|---|---|
| BLAST ( | BLAST ( | NEST ( | SCAP ( | SVM (Zhu and Honig, manuscript in preparation) DFIRE ( |
All methods used have been independently validated and appropriate references are cited. Additional details on how the methods are specifically used within Pudge are provided as ‘tool tips’ on the Pudge web site itself.