| Literature DB >> 26672478 |
Ngoc Tuan Tran1, Ivan Jakovlić2, Wei-Min Wang3.
Abstract
BACKGROUND: Heat shock proteins play an important role in protection from stress stimuli and metabolic insults in almost all organisms.Entities:
Keywords: Functional annotation; Homology modelling; Hsc70; Hsp70; Physicochemical characteristics; Structural similarity
Year: 2015 PMID: 26672478 PMCID: PMC4678458 DOI: 10.1186/s40781-015-0077-x
Source DB: PubMed Journal: J Anim Sci Technol ISSN: 2055-0391
Amino acid composition of Ma-Hsp70 and Ma-Hsc70
| Amino acid |
|
| Amino acid |
|
| ||||
|---|---|---|---|---|---|---|---|---|---|
| N | % | N | % | N | % | N | % | ||
| Alanine | 54 | 8.4 | 47 | 7.2 | Lysine | 53 | 8.2 | 53 | 8.2 |
| Arginine | 28 | 4.4 | 29 | 45 | Methionine | 14 | 2.2 | 15 | 2.3 |
| Asparagine | 36 | 5.6 | 34 | 5.2 | Phenylalanine | 22 | 3.4 | 23 | 3.5 |
| Aspartic acid | 47 | 7.3 | 49 | 7.6 | Proline | 20 | 3.1 | 25 | 3.9 |
| Cysteine | 6 | 0.9 | 4 | 0.6 | Serine | 36 | 5.6 | 37 | 5.7 |
| Glutamine | 28 | 4.4 | 25 | 3.9 | Threonine | 41 | 6.4 | 48 | 7.4 |
| Glutamic acid | 47 | 7.3 | 47 | 7.2 | Tryptophan | 2 | 0.3 | 2 | 0.3 |
| Glycine | 52 | 8.1 | 55 | 8.5 | Tyrosine | 15 | 2.3 | 16 | 2.5 |
| Histidine | 7 | 1.1 | 7 | 1.1 | Valine | 44 | 6.8 | 45 | 6.9 |
| Isoleucine | 45 | 7.0 | 46 | 7.1 | Pyrrolysine | 0 | 0.0 | 0 | 0.0 |
| Leucine | 46 | 7.2 | 42 | 6.5 | Selenocysteine | 0 | 0.0 | 0 | 0.0 |
N represents the total number and % the numeric percentage of each amino acid
Cysteine occurrence pattern and probability of cysteine residue pairing in Ma-Hsp70 and Ma-Hsc70 proteins
| Protein | Position | Status | Score |
|---|---|---|---|
|
| Cys19 | no SS-bond | −58.8 |
| Cys269 | no SS-bond | −34.4 | |
| Cys308 | no SS-bond | −24.9 | |
| Cys576 | no SS-bond | −29.6 | |
| Cys605 | no SS-bond | −27.5 | |
| Cys622 | probably no SS-bond | −8.00 | |
|
| Cys17 | no SS-bond | −55.6 |
| Cys267 | no SS-bond | −37.8 | |
| Cys574 | no SS-bond | −32.2 | |
| Cys603 | no SS-bond | −23.6 |
Fig. 1Alignment of the deduced Ma-Hsp70 a and Ma-Hsc70 b with bovine Hsc70 (PDB ID: 4 fl9.1.A) amino acid sequence. Amino acid positions are numbered on the right, conserved substitutions are indicated by (:), semi-conserved by (.) and deletions by (−)
Fig. 2Ma-Hsp70 a and Ma-Hsc70 b protein tertiary structure model and validation results: a 3-D homology model rendered by the SWISS-MODEL program. b Ramachandran plot analysis, indicating residues in the favoured regions (red), allowed regions (yellow), generously allowed regions (light yellow) and disallowed regions (white). c Overall quality of the model evaluated by the ERRAT program. On the error axis, two lines (95 and 99 %) indicate the confidence with which it is possible to reject regions that exceed that error value. Regions of the structure highlighted in grey and black can be rejected at 95 % and 99 % confidence level, respectively. d Z-score (highlighted as a black dot) is displayed in a plot that contains the Z-scores of all experimentally determined protein chains currently available in the Protein Data Bank. Groups of structures from different sources (X-ray and NMR) are distinguished by different colours (light- and dark-blue, respectively). e Plot of single residue energies, where window sizes of 40 and 10 residues are distinguished by dark- and light-green lines, respectively. Positive values indicate problematic or erroneous parts of the structure
Assessment of the predicted three-dimensional structures of Ma-Hsp70 and Ma-Hsc70 proteins
| Validation Index |
|
|
|---|---|---|
| Ramachandran plot | ||
| Residues in most favoured regions | 92.7 | 92.4 |
| Residues in additional allowed regions | 6.7 | 7.1 |
| Residues in generously allowed regions | 0.4 | 0.2 |
| Residues in disallowed regions | 0.2 | 0.2 |
| Overall G-factor | 0.16 | 0.16 |
| ProQ | ||
| Lgscore | 5.417 | 5.467 |
| MaxSub | 0.448 | 0.427 |
| ProSA Z-Score | −11.01 | −11.19 |
| ERRAT | 95.547 | 92.63 |
Top five identified structural analogs in the Protein Data Bank (PDB) library
| PDB ID | Protein | Species | TM-score | RMSDa | IDENa | Cov. | |
|---|---|---|---|---|---|---|---|
|
| 1yuwA | Hsc70 |
| 0.986 | 0.95 | 0.896 | 0.996 |
| 4j8fA | Hsc70 |
| 0.688 | 1.78 | 0.676 | 0.711 | |
| 3cqxB | Hsc70 |
| 0.678 | 0.90 | 0.902 | 0.685 | |
| 3iucC | Hsp70 |
| 0.677 | 0.95 | 0.687 | 0.685 | |
| 3qmlA | Hsp70 |
| 0.669 | 1.17 | 0.656 | 0.682 | |
|
| 1yuwA | Hsc70 |
| 0.986 | 1.24 | 0.958 | 1.000 |
| 4j8fA | Hsc70 |
| 0.688 | 1.81 | 0.654 | 0.711 | |
| 3cqxB | Hsc70 |
| 0.678 | 0.90 | 0.960 | 0.685 | |
| 3iucC | Hsp70 |
| 0.677 | 0.95 | 0.698 | 0.685 | |
| 3qmlA | Hsp70 |
| 0.669 | 1.17 | 0.659 | 0.682 |
Analogs were inferred by COFACTOR analysis, based on the TM-score of the structural alignment between the query structure and known structures in the PDB. RMSDa is the average root mean square deviation between residues that are structurally aligned by TM-align; IDENa is the percentage sequence identity in the structurally aligned region; Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein
Fig. 3Bos taurus Hsc70 (PDB ID: 1yuw-A) structural analog (backbone trace) superimposed upon the Ma-Hsp70 a and Ma-Hsc70 b proteins (shown in cartoon), rendered by COFACTOR server
Residue-specific ligand binding probability
| PDB ID | C-score | Clust. size | Ligand | Ligand-binding site residues | |
|---|---|---|---|---|---|
|
| 1kax-A | 0.98 | 176 | ATP | 14, 15, 16, 17, 203, 204, 205, 206, 232, 270, 273, 274, 277, 340, 341, 342, 344, 345, 368 |
| 1hpm-A | 0.22 | 33 | PO4 | 14, 15, 73, 149, 177, 231 | |
|
| 1yag-A | 0.96 | 146 | ATP | 12, 13, 14, 15, 17, 201, 202, 203, 204, 230, 268, 271, 272, 338, 339, 340, 342, 343, 366 |
| 3l4i-B | 0.23 | 32 | PO4 | 12, 13, 71, 147, 175, 229 |
Predicted by COACH analysis, based on the C-score of the structural alignment between the query structure and known structures in the PDB. C-score is the confidence score of the prediction, where a higher score (min-0, max-1) indicates a more reliable prediction; Clust. size is the total number of templates in a cluster; ligand is the name of a possible binding ligand
Enzyme Commission (EC) predictions for Ma-Hsp70 and Ma-Hsc70 proteins
| CscEC | PDB ID | TM-sc | RMSDa | IDENa | Cov | EC No. | EC Name | |
|---|---|---|---|---|---|---|---|---|
|
| 0.241 | 3cjc-A | 0.421 | 4.08 | 0.137 | 0.493 | 3.1.21.1 | Deoxyribonuclease I |
| 0.230 | 1qha-A | 0.405 | 5.2 | 0.08 | 0.509 | 2.7.1.1 | Hexokinase | |
| 0.222 | 3f9m-A | 0.386 | 4.08 | 0.099 | 0.457 | 2.7.1.2 | Glucokinase | |
| 0.221 | 3hm8-A | 0.381 | 4.25 | 0.106 | 0.456 | 2.7.1.1 | Hexokinase | |
| 0.221 | 2e2o-A | 0.362 | 3.8 | 0.103 | 0.417 | 2.7.1.1 | Hexokinase | |
|
| 0.221 | 1qha-A | 0.390 | 5.27 | 0.089 | 0.491 | 2.7.1.1 | Hexokinase |
| 0.216 | 3hm8-A | 0.378 | 4.12 | 0.111 | 0.447 | 2.7.1.1 | Hexokinase | |
| 0.215 | 3f9m-A | 0.382 | 4.08 | 0.110 | 0.452 | 2.7.1.2 | Glucokinase | |
| 0.185 | 1v4t-A | 0.281 | 4.95 | 0.064 | 0.347 | 2.7.1.2 2.7.1.1 | Glucokinase Hexokinase | |
| 0.168 | 3cjc-A | 0.411 | 4.13 | 0.131 | 0.484 | 3.1.21.1 | Deoxyribonuclease I |
CscoreEC is the confidence score for the enzyme commission number (EC No.) prediction (0–1), TM-sc is the TM-score, Cov represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein. See Table 4 for other term explanations