| Literature DB >> 28479743 |
Parth Sarthi Sen Gupta1, Shyamashree Banerjee1, Rifat Nawaz Ul Islam1, Vishma Pratap Sur2, Amal K Bandyopadhyay1.
Abstract
Orthologous proteins, form due to divergence of parental sequence, perform similar function under different environmental and biological conditions. Amino acid changes at locus specific positions form hetero-pairs whose role in BLOCK evolution is yet to be understood. We involve eight protein BLOCKs of known divergence rate to gain insight into the role of hetero-pairs in evolution. Our procedure APBEST uses BLOCK-FASTA file to extract BLOCK specific evolutionary parameters such as dominantly used hetero-pair (D), usage of hetero-pairs (E), non-conservative to conservative substitution ratio (R), maximally-diverse residue (MDR), residue (RD) and class (CD) specific diversity. All these parameters show BLOCK specific variation. Conservative nature of D points towards restoration of function of BLOCK. While E sets the upper-limit of usage of hereto-pairs, strong correlation of R with divergence-rate indicates that the later is directly dependent on non-conservative substitutions. The observation that MDR, measure of positional diversity, occupy very limited positions in BLOCK indicates accommodation of diversity is positionally restricted. Overall, the study extract observed hetero-pair related quantitative and multi-parametric details of BLOCK, which finds application in evolutionary biology.Entities:
Keywords: conservative; divergence rate; evolution; hetero-pairs; non-conservative; substitution
Year: 2017 PMID: 28479743 PMCID: PMC5405086 DOI: 10.6026/97320630013001
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Flowchart describing methodology and operation of APBEST for extraction of analytical parameters from orthologous protein family.
Figure 2APBEST implemented equations and their clarity.
BLOCK specific quantitative parameters for SHPs as obtained by APBEST analysis
| Name of Protein BLOCK | Divergence Rate* | Computed Ratio parameters | Residue diversity RD | € Dominant pair (D) | Class specific diversity CD | |||||||
| R (%) | E (%) | N (%) | MDR or RDMAX | RDMIN | Acidic | Basic | Non-polar | Hydrophobic | Hydrophilic | |||
| Ubiquitin | 0.1 | 42 | 42 | 18 | K (18.8) | C (0.5) | IV (5.1) | 30.5 | 34.8 | 49.8 | 50.7 | 78.6 |
| G3PDH | 2.2 | 53 | 46 | 1 | V (21.2) | W (1.4) | IV (9.2) | 23.3 | 24 | 36.2 | 69.8 | 64.9 |
| LDH | 3.4 | 55 | 56 | 0 | V (20.9) | W (1.8) | IV (7.6) | 18.8 | 21.9 | 44 | 69.4 | 66.1 |
| Acid-protease | 9 | 62 | 52 | 2 | S (23.5) | C (1.6) | IV (4.7) | 16.5 | 13.5 | 57.3 | 68.6 | 69.6 |
| Hemoglobin | 12 | 63 | 52 | 0 | L (26.7) | P (2.2) | FL (6.7) | 5.61 | 16.2 | 47.2 | 81.5 | 56.9 |
| Ribonucleases | 21 | 67 | 57 | 0 | S (23.4) | W (3.3) | TS (4.1) | 15.9 | 38 | 56.4 | 58.2 | 81.8 |
| Somatotropin | 25 | 76 | 44 | 13 | S (20.0) | W (0.1) | ED (5.0) | 21.6 | 22.9 | 55.2 | 62.8 | 80.4 |
| Kappa-casein | 33 | 92 | 32 | 22 | V(26.4) | C (0.6) | VA(9.6) | 12.7 | 12.9 | 48.7 | 80.6 | 48.8 |
| *percent/100 MYr; Divergence rates (second column of the table) for protein BLOCKs (first column) are taken from (Marks, 1988; Dayhoff and Schwartz, 1978). LDH: Lactate dehydrogenase; G3PDH: Glyceraldehyde 3-phosphate dehydrogenase; €Dominant pair indicates the hetero-pair type whose observed frequency is maximum for Block. | ||||||||||||
Figure 3190 SHP types (upper-half of diagonal) and observed frequencies (lower-half of diagonal) are shown. Substitution-homo-pair frequencies (i.e. 20) are at the diagonal position. Both these types and their frequencies divided into three categories: a] HB-HB category: total 36 (upper dark shade), b] HL-HL category: total 55 (lower white shade) and HB-HL category: 99 in number (middle gray shade region). Residue Q is shown by gray-strip for explanation of the calculation of diversity of a given hetero-pair.
Figure 4Plot of derived ratios (R and E) vs. divergence rate and observed hetero-pair frequency vs. probability range. Two typical frequency distributions are shown (Graph A1 and Graph A2) where the observed-data are fitted with Weibull distribution function. Used (E) fraction of hetero-pair is plotted against divergence rate (Graph B). Graph C, shows the correlation between R with divergence rate [6,7].
Positional analysis of BLOCKs for invariant line (only one type of residue), conserved position (Shannon entropy ≤1.0) and type of amino acid classes (such as HB, Ac, Bs, PC, ST, HB+HL and PU+PC). Normalized values (in %) are presented for comparison among BLOCKs
| Blocks | Dv Rate | INV | CONV | HB | Ac | Bs | PC | ST | HB+HL | PU+PC |
| Ubiquitin | 0.1 | 7 | 43.7 | 11.3 | - | 2.8 | - | - | 62 | 16.9 |
| G3PDH | 2.2 | 15.2 | 39 | 19.5 | 0.6 | 0.3 | 2.8 | 1.5 | 56 | 4 |
| LDH | 3.4 | 3.6 | 21.5 | 8.8 | 0.4 | - | 0.4 | 0.4 | 84.3 | 2.2 |
| Acid-protease | 9 | 1.9 | 24.8 | 5.7 | - | - | 1 | - | 88.6 | 2.9 |
| Hemoglobin | 12 | - | 29 | 6.5 | - | - | - | - | 93.5 | - |
| Ribonuclease | 21 | - | 10.5 | 5.3 | - | - | 2.6 | - | 89.5 | 2.6 |
| Somatotropin | 25 | 14.4 | 39.4 | 7.5 | 2.5 | 3.1 | 0.6 | 0.6 | 60 | 11.3 |
| Kappa-casein | 33 | 1 | 64.6 | 15.2 | 2 | - | - | - | 76.8 | 5.1 |
| INV Invariant position; CONV Conserve position; HB position contains only hydrophobic amino acids; Similarly Ac acidic, Bs basic, PC Polar charge, ST serine plus threonine; HB+HL position contains hydrophobic and hydrophilic amino acids; similarly PU+PC polar uncharged and polar charged; - absent. | ||||||||||