Konda Mani Saravanan1, Renganathan Senthil2. 1. Centre of Excellence in Bioinformatics, School of Biotechnology, Madurai Kamaraj University, Madurai, India. 2. Department of Bioinformatics, School of Biosciences, The Marudupandiyar Institutions (Affiliated to Bharathidasan University), Thanjavur, Tamil Nadu, India.
Abstract
AIM: The PreFRP web server extracts sequence and basic information of a protein structure and groups amino acid residues in a protein into three important types such as high, moderate, and weak fluctuating residues. MATERIALS AND METHODS: The server takes a protein data bank file or an amino acid sequence as input and prints the probability of amino acid residues to fluctuate. The server also provides a link to Jmol, a molecular visualization program to visualize the high, moderate, and weak fluctuating residues in three different colors. RESULTS: Prediction and visualization of fluctuating amino acid residues in proteins may help to understand the complex three-dimensional structure of proteins and may further help in docking and mutation experiments. AVAILABILITY: The web server is freely accessible through the web page of the author's institution http://www.mpi.edu.in/prefrp/link.html.
AIM: The PreFRP web server extracts sequence and basic information of a protein structure and groups amino acid residues in a protein into three important types such as high, moderate, and weak fluctuating residues. MATERIALS AND METHODS: The server takes a protein data bank file or an amino acid sequence as input and prints the probability of amino acid residues to fluctuate. The server also provides a link to Jmol, a molecular visualization program to visualize the high, moderate, and weak fluctuating residues in three different colors. RESULTS: Prediction and visualization of fluctuating amino acid residues in proteins may help to understand the complex three-dimensional structure of proteins and may further help in docking and mutation experiments. AVAILABILITY: The web server is freely accessible through the web page of the author's institution http://www.mpi.edu.in/prefrp/link.html.
Identifying potential binding sites of the proteins is a challenging task for the computational molecular biologists. There are different methods to identify the binding sites of the given protein structure such as homology-based approach,[1] energy-based approach,[2] and surface area-based approach.[3] All the methods incorporate the information about the amino acid flexibility or rigidity to predict the binding sites of a protein. Conformational changes upon protein-protein binding play a vital role in the functioning of proteins in the cell.[4] There is a variety of conformational changes such as residue fluctuations, domain motions, physico-chemical changes, secondary structural changes, and disordered to ordered transitions.[5678910] Several statistical models are available to study residue fluctuations in protein structures to understand molecular recognition processes, which are discussed in many papers.[111213]The fluctuating nature of intrinsically disordered proteins is believed to contribute to binding with other proteins to perform numerous cellular functions. Identification of fluctuation residues can be done by studying various parameters such as size, B-factors, crystal packing, and physico-chemical properties of amino acid residues.[12] In the present work, we have developed a probability-based method which relies on carbon content and grouping of high, moderate, and weak fluctuating residues in a given protein databank file. The carbon content of amino acid residues in proteins may help to classify residues into hydrophobic and hydrophilic.[1415161718] Considering the above facts, we have developed a web server to identify the fluctuation residues in proteins which may help in finding binding sites, disordered regions, and understand the role of amino acid residues in forming compact functional three-dimensional structures of proteins.
METHODS
The amino acid sequence and Protein Data Bank[19] file is used as input. The input coordinate file is then processed to extract basic information such as protein name, number of residues, source organism, and experiment used to solve the structure. The number of carbons present in each residue is calculated. Based on the carbon content of the residue and classification of 20 amino acid residues into high (G, A, S, P, and D), moderate (T, E, N, K, C, Q, R, and V), and weak fluctuating residues (H, L, M, I, Y, F, and W), amino acid residues are ranked in a protein by using the following formula.Where,P is the probability of an amino acid to fluctuateNc is the Number of carbon atoms in a residue iN is the total number of atoms in a residue iRc is the Rank constant based on Ruvinsky et al., (2010) classification of fluctuating residuesFor high fluctuating, Rc is –2; moderate, Rc is –1; and weak, Rc is 2
WEB SERVER
The web server allows a user to upload a sequence or protein data bank file to identify fluctuating residues in proteins. The server computes sequence composition, probability of the residue fluctuations, prediction of secondary structure propensity, percentage of carbon atom and fluctuated residues in protein sequence, and prediction of residue-residue mediation potential. The schematic design of the web server is represented as a flowchart in Figure 1. The screen shot of the web server is shown in Figure 2. Furthermore, the fluctuation residues can be visualized in three colors by clicking the “View fluctuation residues” link present in the left panel of the main page.
Figure 1
Schematic design of webserver
Figure 2
Screenshot of the webserver
Schematic design of webserverScreenshot of the webserver
IMPLEMENTATION AND ACCESS
The web server has been developed using CGI PERL programs. Processes such as the extraction of amino acid sequences, calculation of propensity, prediction of secondary structure, and mediation potential were done using PERL in Linux. The main page “form,” results, and information page were designed using HTML. The web server is freely accessible from the institution page of the author, at http://www.mpi.edu.in/prefrp/link.html.
ADVANTAGES
The information provided by the web server may help the user to carry out further structural and conformational analysis on proteins. Furthermore, the probability scores provided on the results page are very helpful for users who wish to perform docking, mutation, or dynamics studies. A link to Jmol is also presented using which a user can view the fluctuation residues in three-dimension.[20]
LIMITATIONS
Due to storage limitations in the server, the protein coordinates files cannot be fetched from the server and should be accessed from RCSB. Furthermore, the server does not have precomputed results.
Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971
Authors: A K Dunker; J D Lawson; C J Brown; R M Williams; P Romero; J S Oh; C J Oldfield; A M Campen; C M Ratliff; K W Hipps; J Ausio; M S Nissen; R Reeves; C Kang; C R Kissinger; R W Bailey; M D Griswold; W Chiu; E C Garner; Z Obradovic Journal: J Mol Graph Model Date: 2001 Impact factor: 2.518