| Literature DB >> 28459991 |
João P A Moraes1, Gisele L Pappa2, Douglas E V Pires3, Sandro C Izidoro1.
Abstract
Enzyme active sites are important and conserved functional regions of proteins whose identification can be an invaluable step toward protein function prediction. Most of the existing methods for this task are based on active site similarity and present limitations including performing only exact matches on template residues, template size restraints, despite not being capable of finding inter-domain active sites. To fill this gap, we proposed GASS-WEB, a user-friendly web server that uses GASS (Genetic Active Site Search), a method based on an evolutionary algorithm to search for similar active sites in proteins. GASS-WEB can be used under two different scenarios: (i) given a protein of interest, to match a set of specific active site templates; or (ii) given an active site template, looking for it in a database of protein structures. The method has shown to be very effective on a range of experiments and was able to correctly identify >90% of the catalogued active sites from the Catalytic Site Atlas. It also managed to achieve a Matthew correlation coefficient of 0.63 using the Critical Assessment of protein Structure Prediction (CASP 10) dataset. In our analysis, GASS was ranking fourth among 18 methods. GASS-WEB is freely available at http://gass.unifei.edu.br/.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28459991 PMCID: PMC5570142 DOI: 10.1093/nar/gkx337
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.GASS method: data is extracted from PDB, CSA and NCBI and pre-processed. GASS performs a heuristic search to find matching active sites in the proteins of interest or, given an active site template it searches for it in the database of protein structures using a genetic algorithm. Matching active sites are then returned to the user.
Figure 2.(A) Protein search for catalytic or binding sites requires the protein PDB file (Step 1) and the template size (Step 2). (B) NCBI-VAST Database search requires the PDB file (Step 1) and a template (Step 2), and has an optional field for email allowing the user to be emailed once the search finishes (Step 3). (C) Protein search for catalytic sites results page.
Figure 3.CMS score of catalytic sites found correctly according to the CSA—Datasets DS1:DS6 (blue). CMS score of catalytic sites found correctly according to the CSA—NCBI-VAST database experiment (red).