| Literature DB >> 30108414 |
Rifat Nawaz Ul Islam1, Chittran Roy2, Parth Sarthi Sen Gupta3, Shyamashree Banerjee2, Debanjan Mitra2, Sahini Banerjee4, Amal Kumar Bandyopadhyay2.
Abstract
Residues in allelic positions, in the local segment of aligned sequences of proteins show wide variations. Here, we describe PROPAB that computes the propensity tables for helix, strand and coil types from multiple 3D structure files following ab initio statistical procedure. It also classifies them in range specific and chain specific manners. It further computes percentage composition and physicochemical properties along with residues propensities. It also prepares FASTA files for different segments (helix, strand and coil) in the exact order that they follow in the sequence. Representative analyses on orthologous (homologous across species) proteins demonstrate wide segmental variations of physicochemical properties. Such variations provide insights to relate the adaptation of these proteins in a given functional constraint under diverse environmental conditions. Thus, the program finds applications in the structural and evolutionary analysis of proteins. AVAILABILITY: PROPAB is freely available at http://sourceforge.net/projects/propab/for worldwide user.Entities:
Keywords: Chou and Fasman; Program; Protein; propensity; properties; secondary structure
Year: 2018 PMID: 30108414 PMCID: PMC6077827 DOI: 10.6026/97320630014190
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Flowchart for the functioning of the program PROPAB. Upon start of the program, it first checks for NMR files in the working directory. If these are present, they are screened out and then a list of X-ray structure files is made. Each PDB file is processed separately. Once completed, three kinds of outputs: O1 (six itemed one output per PDB), O2 (all chains, all PDB files specific outputs for Helix, Strand and Coil i.e. four outputs) and O3 (one output, FASTA files for Total, Helix, Strand and Coil containing sequences from all chains and all PDBs) are produced. PDB: protein Data Bank; SSSs: Secondary Structure Sequences; H: Helix; S: Strand; C: Coil; CS-S: Chain Specific Sequence; CS-P: Chain Specific Propensity; Freq: Frequency.
Figure 2PROPAB extracted results show insightful observations. Running of PROPAB in CYGWIN 32 bit UNIX like environment, (A) that updates details of inputs and outputs in the screen. Program makes sequence from structure files (B) with GAP information, identify helix (red), strand (blue) and thus the Coil (black) regions (B). FASTA files are prepared for entire, helix, strand and coil sequences for all PDBs and all chains. Normalized composition (in %) and physicochemical properties are then computed, which are overcastted during the formation of range specific and chain specific propensity table for strand (C), helix (D) and coil (not shown) regions. Comparison of physicochemical properties of 2AZ3 (halophilic) and 2HUR (mesophilic) for different segments (E) shows that GRAVY for strand segment is positive (GVa) whereas it is negative for entire sequence (GVb). Similarly, pI for strand is much higher (pIa) than the entire protein (pIb; 2AZ3). Although known propensity is lower than unity, certain residues (e.g. R1, R2 for 2AZ3 and R3 for HUR2) show propensity at higher range with their normalized compositions for strand and helix segments. PROPAB presents segments (helix, strand, coil) propensity in range specific (DR) and chain specific manner (D), wherein %-compositions (CP) and physicochemical properties (PC) are also included.