| Literature DB >> 15804350 |
Doreen Pahlke1, Christian Freund, Dietmar Leitner, Dirk Labudde.
Abstract
BACKGROUND: A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15804350 PMCID: PMC1087856 DOI: 10.1186/1472-6807-5-8
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Figure 1Occurrence of secondary structure. Relative occurrence of the secondary structure types compared for cis and trans prolines.
Chou-Fasman parameters (i). Chou-Fasman parameters for the cis/trans classes at position i regarding the secondary structure of proline.
| Structure | ||
| Bend | 4.249 | 0.683 |
| Coil | 0.389 | 1.060 |
| Helix | 0.036 | 1.094 |
| Strand | 0.051 | 1.093 |
| Turn | 1.452 | 0.956 |
Chou-Fasman parameters (i-1) (i+1). Chou-Fasman parameters for the cis/trans classes regarding the secondary structure of proline's predecessors and successors.
| structure | ||||
| bend | 2.054 | 0.897 | 1.263 | 0.974 |
| coil | 0.462 | 1.052 | 1.437 | 0.957 |
| helix | 0.086 | 1.089 | 0.321 | 1.066 |
| strand | 0.794 | 1.020 | 1.207 | 0.980 |
| turn | 2.261 | 0.877 | 0.786 | 1.021 |
Occurrence matrix. Occurrence matrix for the amino acid combinations of trans and cis proline in the fixed position (i). The total number Nis the occurrence of an amino acid in all investigated sequence fragments. The number of the proline at the position (i) in the trans conformation is 14388 and in the cis conformation it is 1390. The absolute and relative occurrence of one amino acid is shown at all positions (rel and abs). The relative occurrence is the relation of the number of the amino acid at the position to the total number of all amino acid at this position.
| AA | Total | |||||||||||||||||
| abs | rel | |||||||||||||||||
| A: | 4343 | 0.075 | 1024 | 0.071 | 1030 | 0.072 | 1169 | 0.081 | 1120 | 0.078 | 90 | 0.065 | 111 | 0.080 | 134 | 0.096 | 79 | 0.057 |
| C: | 810 | 0.014 | 204 | 0.014 | 256 | 0.018 | 166 | 0.012 | 184 | 0.013 | 38 | 0.027 | 25 | 0.018 | 24 | 0.017 | 33 | 0.024 |
| D: | 3565 | 0.062 | 818 | 0.057 | 972 | 0.068 | 956 | 0.066 | 819 | 0.057 | 57 | 0.041 | 59 | 0.042 | 52 | 0.037 | 83 | 0.060 |
| E: | 3604 | 0.063 | 799 | 0.056 | 685 | 0.048 | 1208 | 0.084 | 912 | 0.063 | 70 | 0.050 | 84 | 0.060 | 47 | 0.034 | 61 | 0.044 |
| F: | 2426 | 0.042 | 656 | 0.046 | 562 | 0.039 | 596 | 0.041 | 612 | 0.043 | 59 | 0.042 | 82 | 0.059 | 96 | 0.069 | 58 | 0.042 |
| G: | 4275 | 0.074 | 1202 | 0.084 | 824 | 0.057 | 1113 | 0.077 | 1136 | 0.079 | 112 | 0.081 | 154 | 0.111 | 112 | 0.081 | 106 | 0.076 |
| H: | 1418 | 0.025 | 323 | 0.022 | 418 | 0.029 | 313 | 0.022 | 364 | 0.025 | 32 | 0.023 | 29 | 0.021 | 40 | 0.029 | 32 | 0.023 |
| I: | 3178 | 0.055 | 780 | 0.054 | 915 | 0.064 | 711 | 0.049 | 772 | 0.054 | 68 | 0.049 | 38 | 0.027 | 54 | 0.039 | 62 | 0.045 |
| K: | 2804 | 0.049 | 654 | 0.045 | 745 | 0.052 | 703 | 0.049 | 702 | 0.049 | 76 | 0.055 | 66 | 0.047 | 57 | 0.041 | 70 | 0.050 |
| L: | 5279 | 0.092 | 1336 | 0.093 | 1423 | 0.099 | 1179 | 0.082 | 1341 | 0.093 | 96 | 0.069 | 90 | 0.065 | 134 | 0.096 | 96 | 0.069 |
| M: | 1131 | 0.020 | 292 | 0.020 | 272 | 0.019 | 263 | 0.018 | 304 | 0.021 | 35 | 0.025 | 21 | 0.015 | 22 | 0.016 | 28 | 0.020 |
| N: | 2683 | 0.047 | 684 | 0.048 | 816 | 0.057 | 608 | 0.042 | 575 | 0.040 | 65 | 0.047 | 79 | 0.057 | 56 | 0.040 | 70 | 0.050 |
| P: | 3121 | 0.054 | 896 | 0.062 | 687 | 0.048 | 691 | 0.048 | 847 | 0.059 | 98 | 0.071 | 82 | 0.059 | 78 | 0.056 | 149 | 0.107 |
| Q: | 2105 | 0.037 | 480 | 0.033 | 556 | 0.039 | 574 | 0.040 | 495 | 0.034 | 72 | 0.052 | 56 | 0.040 | 53 | 0.038 | 45 | 0.032 |
| R: | 2591 | 0.045 | 644 | 0.045 | 675 | 0.047 | 655 | 0.046 | 617 | 0.043 | 61 | 0.044 | 49 | 0.035 | 54 | 0.039 | 64 | 0.046 |
| S: | 3596 | 0.062 | 886 | 0.062 | 887 | 0.062 | 877 | 0.061 | 946 | 0.066 | 108 | 0.078 | 84 | 0.060 | 91 | 0.065 | 79 | 0.057 |
| T: | 3479 | 0.060 | 876 | 0.061 | 948 | 0.066 | 818 | 0.057 | 837 | 0.058 | 84 | 0.060 | 61 | 0.044 | 77 | 0.055 | 102 | 0.073 |
| V: | 4165 | 0.072 | 1021 | 0.071 | 1036 | 0.072 | 1073 | 0.075 | 1035 | 0.072 | 101 | 0.073 | 65 | 0.047 | 97 | 0.070 | 101 | 0.073 |
| W: | 815 | 0.014 | 221 | 0.015 | 169 | 0.012 | 206 | 0.014 | 219 | 0.015 | 18 | 0.013 | 44 | 0.032 | 23 | 0.017 | 19 | 0.014 |
| Y: | 2164 | 0.038 | 592 | 0.041 | 512 | 0.036 | 509 | 0.035 | 551 | 0.038 | 50 | 0.036 | 111 | 0.080 | 89 | 0.064 | 53 | 0.038 |
Normalized relative occurrences. Normalized relative occurrences of all amino acids at the different positions. For the normalization we used the coefficient of each observed occurrence and the natural occurrence for each amino acid at each position. Highlighted in bold are the most probable amino acid (H > 1) occurring at the four positions.
| AA | ||||||||
| A: | 0.95 | 0.87 | 0.96 | 0.76 | ||||
| C: | 1.00 | 0.86 | 0.93 | |||||
| D: | 0.92 | 0.66 | 0.68 | 0.60 | 0.92 | 0.97 | ||
| E: | 0.89 | 0.79 | 0.76 | 0.95 | 0.54 | 1.00 | 0.70 | |
| F: | 1.00 | 0.93 | 0.98 | 1.00 | ||||
| G: | 0.77 | |||||||
| H: | 0.88 | 0.92 | 0.84 | 0.88 | 1.00 | 0.92 | ||
| I: | 0.98 | 0.89 | 0.49 | 0.89 | 0.71 | 0.98 | 0.82 | |
| K: | 0.92 | 0.96 | 1.00 | 0.84 | 1.00 | |||
| L: | 0.75 | 0.71 | 0.89 | 0.75 | ||||
| M: | 1.00 | 0.95 | 0.75 | 0.90 | 0.80 | 1.00 | ||
| N: | 1.00 | 0.89 | 0.85 | 0.85 | ||||
| P: | 0.89 | 0.89 | ||||||
| Q: | 0.89 | 0.92 | 0.86 | |||||
| R: | 1.00 | 0.98 | 0.78 | 0.87 | 0.96 | |||
| S: | 1.00 | 1.00 | 0.97 | 0.98 | 0.92 | |||
| T: | 1.00 | 0.73 | 0.95 | 0.92 | 0.97 | |||
| V: | 0.99 | 1.00 | 0.65 | 0.97 | 1.00 | |||
| W: | 0.93 | 0.86 | 1.00 | 1.00 | ||||
| Y: | 0.95 | 0.95 | 0.92 | 1.00 | 1.00 |
Secondary structure matrix. The occurrence of the secondary structure types of proline in the cis and trans conformation is shown. For the five defined secondary structure elements (bend, coil, helix, strand and turn) we analyzed the surrounding of the fixed proline in the mid-position. Rows indicate the secondary structure type with fixed secondary structure of the proline. Columns show the relative occurrences of the secondary structure at the 4 positions around proline. Bold face highlights the most probable secondary structure. For the cis peptide bond conformation of the central proline in helix and strand not enough entries could be collected from the PDB to reach significance.
| secondary | structure of trans Pro | structure of cis Pro | ||||||||||||||||
| position | position | |||||||||||||||||
| bend: | 257 | 0.21 | 317 | 0.26 | 1211 | 684 | 229 | 0.19 | 176 | 0.24 | 467 | 742 | 143 | 0.19 | 105 | 0.14 | ||
| coil: | 462 | 813 | 1211 | 334 | 0.28 | 545 | 329 | 180 | 0.24 | 742 | 433 | 321 | ||||||
| helix: | 77 | 0.06 | 2 | 0.00 | 1211 | 22 | 0.02 | 107 | 0.09 | 31 | 0.04 | 0 | 0.00 | 742 | 28 | 0.04 | 70 | 0.09 |
| strand: | 254 | 0.21 | 74 | 0.06 | 1211 | 135 | 0.11 | 228 | 0.19 | 164 | 0.22 | 95 | 0.13 | 742 | 104 | 0.14 | 174 | 0.23 |
| turn: | 161 | 0.13 | 5 | 0.00 | 1211 | 36 | 0.03 | 102 | 0.08 | 42 | 0.06 | 0 | 0.00 | 742 | 34 | 0.05 | 72 | 0.10 |
| bend: | 1401 | 0.22 | 1614 | 0.25 | 6505 | 952 | 0.15 | 901 | 0.14 | 41 | 0.18 | 80 | 0.35 | 231 | 29 | 0.13 | 37 | 0.16 |
| coil: | 2966 | 4430 | 6505 | 3749 | 2656 | 112 | 137 | 231 | 127 | 81 | 0.35 | |||||||
| helix: | 377 | 0.06 | 8 | 0.00 | 6505 | 532 | 0.08 | 862 | 0.13 | 9 | 0.04 | 0 | 0.00 | 231 | 9 | 0.04 | 14 | 0.06 |
| strand: | 934 | 0.14 | 360 | 0.06 | 6505 | 788 | 0.12 | 1312 | 0.20 | 41 | 0.18 | 13 | 0.06 | 231 | 62 | 0.27 | 91 | |
| turn: | 827 | 0.13 | 93 | 0.01 | 6505 | 484 | 0.07 | 774 | 0.12 | 28 | 0.12 | 1 | 0.00 | 231 | 4 | 0.02 | 8 | 0.03 |
| bend: | 361 | 0.14 | 193 | 0.07 | 2603 | 1 | 0.00 | 12 | 0.00 | 0 | 0.00 | 0 | 0.00 | 8 | 0 | 0.00 | 1 | 0.13 |
| coil: | 752 | 913 | 2603 | 1 | 0.00 | 38 | 0.01 | 0 | 0.00 | 0 | 0.00 | 8 | 0 | 0.00 | 2 | 0.25 | ||
| helix: | 722 | 935 | 2603 | 2597 | 2489 | 8 | 1.00 | 8 | 1.00 | 8 | 8 | 1.00 | 0 | 0.00 | ||||
| strand: | 165 | 0.06 | 43 | 0.02 | 2603 | 0 | 0.00 | 6 | 0.00 | 0 | 0.00 | 0 | 0.00 | 8 | 0 | 0.00 | 0 | 0.00 |
| turn: | 603 | 0.23 | 519 | 0.20 | 2603 | 4 | 0.00 | 58 | 0.02 | 0 | 0.00 | 0 | 0.00 | 8 | 0 | 0.00 | 5 | 0.63 |
| bend: | 124 | 0.09 | 98 | 0.07 | 1352 | 123 | 0.09 | 202 | 0.15 | 0 | 0.00 | 2 | 0.22 | 9 | 1 | 0.11 | 1 | 0.11 |
| coil: | 268 | 0.20 | 157 | 0.12 | 1352 | 391 | 0.29 | 322 | 0.24 | 4 | 0.44 | 2 | 0.22 | 9 | 2 | 0.22 | 2 | 0.22 |
| helix: | 10 | 0.01 | 0 | 0.00 | 1352 | 80 | 0.06 | 115 | 0.09 | 0 | 0.00 | 0 | 0.00 | 9 | 0 | 0.00 | 0 | 0.00 |
| strand: | 788 | 1096 | 1352 | 703 | 603 | 3 | 0.33 | 4 | 0.44 | 9 | 6 | 0.67 | 4 | 0.44 | ||||
| turn: | 162 | 0.12 | 1 | 0.00 | 1352 | 55 | 0.04 | 110 | 0.08 | 2 | 0.22 | 1 | 0.11 | 9 | 0 | 0.00 | 2 | 0.22 |
| bend: | 334 | 0.12 | 247 | 0.09 | 2717 | 12 | 0.00 | 450 | 0.17 | 57 | 0.14 | 0 | 0.00 | 400 | 54 | 0.14 | 39 | 0.10 |
| coil: | 777 | 1266 | 2717 | 26 | 0.01 | 835 | 142 | 9 | 0.02 | 400 | 88 | 0.22 | 138 | |||||
| helix: | 534 | 0.20 | 127 | 0.05 | 2717 | 239 | 0.09 | 414 | 0.15 | 34 | 0.09 | 0 | 0.00 | 400 | 57 | 0.14 | 74 | 0.19 |
| strand: | 447 | 0.16 | 155 | 0.06 | 2717 | 20 | 0.01 | 192 | 0.07 | 38 | 0.10 | 13 | 0.03 | 400 | 20 | 0.05 | 69 | 0.17 |
| turn: | 625 | 0.23 | 922 | 0.34 | 2717 | 2420 | 826 | 129 | 0.32 | 378 | 400 | 181 | 80 | 0.20 | ||||
10 most frequent cis secondary structure combinations. Comparison of the 10 most frequent cis secondary structure combinations versus the corresponding trans secondary structure combinations.
| Structure | cis Pro | Trans Pro | ||
| absolute | relative(%) | absolute | relative(%) | |
| cbbcc | 93 | 6.6906 | 24 | 0.1668 |
| bbbcc | 58 | 4.1727 | 28 | 0.1946 |
| cbbss | 45 | 3.2374 | 7 | 0.0487 |
| cttcc | 31 | 2.2302 | 3 | 0.0209 |
| ssbcc | 30 | 2.1583 | 12 | 0.0834 |
| ttttt | 28 | 2.0144 | 166 | 1.1537 |
| ccbcc | 25 | 1.7986 | 48 | 0.3336 |
| cbbbc | 25 | 1.7986 | 22 | 0.1529 |
| bbbss | 25 | 1.7986 | 21 | 0.146 |
| sbbcc | 24 | 1.7266 | 5 | 0.0348 |
10 most frequent trans secondary structure. Comparison of the 10 most frequent trans secondary structure combinations versus the corresponding cis secondary structure combinations.
| structure | trans Pro | Cis Pro | ||
| absolute | relative(%) | absolute | relative(%) | |
| ccccc | 1311 | 9.1118 | 23 | 1.6547 |
| cchhh | 524 | 3.6419 | 0 | 0 |
| hhhhh | 398 | 2.7662 | 0 | 0 |
| tthhh | 291 | 2.0225 | 0 | 0 |
| ccttc | 251 | 1.7445 | 1 | 0.0719 |
| cccss | 241 | 1.675 | 9 | 0.6475 |
| bbccc | 221 | 1.536 | 5 | 0.3597 |
| hthhh | 218 | 1.5152 | 0 | 0 |
| ccchh | 213 | 1.4804 | 3 | 0.2158 |
| ccttt | 210 | 1.4595 | 2 | 0.1439 |