| Literature DB >> 22570420 |
Ambrish Roy1, Jianyi Yang, Yang Zhang.
Abstract
We have developed a new COFACTOR webserver for automated structure-based protein function annotation. Starting from a structural model, given by either experimental determination or computational modeling, COFACTOR first identifies template proteins of similar folds and functional sites by threading the target structure through three representative template libraries that have known protein-ligand binding interactions, Enzyme Commission number or Gene Ontology terms. The biological function insights in these three aspects are then deduced from the functional templates, the confidence of which is evaluated by a scoring function that combines both global and local structural similarities. The algorithm has been extensively benchmarked by large-scale benchmarking tests and demonstrated significant advantages compared to traditional sequence-based methods. In the recent community-wide CASP9 experiment, COFACTOR was ranked as the best method for protein-ligand binding site predictions. The COFACTOR sever and the template libraries are freely available at http://zhanglab.ccmb.med.umich.edu/COFACTOR.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22570420 PMCID: PMC3394312 DOI: 10.1093/nar/gks372
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Illustration of structure-based function annotation by the COFACTOR server, starting from the query structure (shown in green).
Figure 2.Flowchart of functional site identification by the COFACTOR server. (i) Conserved residues in query sequence are identified based on Jensen–Shannon diverge score, which are then used to glean local 3D-fragments from the query structure. (ii) Each local 3D-motif of query is aligned with the fragments collected from functional site of template protein and the local similarity between query and template protein is evaluated using Lsim Equation (2). Finally, the best match among all the probable sets with the best local match (i.e. highest Lsim) is selected. The residues of query protein (yellow) are shown in cyan, while those in template protein (gray) are shown in magenta.
Figure 3.An excerpt of the result page showing ligand-binding site analysis for a Glyoxalase family protein from Bacillus anthracis (PDB ID: 2qqz). The server identifies high global and local similarity to Lactoylglutathione lyase of Agrobacterium tumefaciens, suggesting that the query also has a similar metal-ion binding site, which is required for catalysis in Glyoxalase I enzymes. The protein–ligand interactions are visualized using the Jmol applet.