| Literature DB >> 12495440 |
Sharmila Anishetty1, Gautam Pennathur, Ramesh Anishetty.
Abstract
BACKGROUND: An efficient building block for protein structure prediction can be tripeptides. 8000 different tripeptides from a dataset of 1220 high resolution (<or= 2.0 degrees A) structures from the Protein Data Bank (PDB) have been looked at, to determine which are structurally rigid and non-rigid. This data has been statistically analyzed, discussed and summarized. The entire data can be utilized for the building of protein structures.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12495440 PMCID: PMC140318 DOI: 10.1186/1472-6807-2-9
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Figure 1Tripeptide "R1 R2 R3" with Cα and Cβ positions
Column 1 shows the amino acid. Columns 2 to 4 show percentage of occurrence of the 20 amino acids in Intermediate (I), Rigid (R) And Non-rigid (N) categories respectively.
| Glycine G | 18 | 5 | 18 |
| Alanine A | 16 | 13 | 18 |
| Valine V | 16 | 12 | 11 |
| Leucine L | 16 | 15 | 8 |
| IsoLeucine I | 15 | 16 | 10 |
| Methionine M | 10 | 23 | 10 |
| Proline P | 13 | 23 | 10 |
| Phe Alanine F | 15 | 14 | 11 |
| Tryptophan W | 10 | 20 | 12 |
| Serine S | 16 | 6 | 34 |
| Threonine T | 16 | 7 | 25 |
| Asparagine N | 16 | 11 | 14 |
| Glutamine Q | 13 | 20 | 10 |
| Tyrosine Y | 14 | 15 | 15 |
| Cysteine C | 9 | 16 | 16 |
| Lysine K | 15 | 14 | 13 |
| Arginine R | 14 | 16 | 14 |
| Histidine H | 13 | 15 | 11 |
| Asp acid D | 16 | 9 | 19 |
| Glu acid E | 15 | 17 | 10 |
Column 1 has the mean (α1,α3) distance ranges (first bin 5.2 to 5.4°A and so on) Columns 2 to 4 have counts of intermediate, rigid and non-rigid tripeptides respectively, falling in each bin.
| 5.4 | 2 | 5 | 0 |
| 5.6 | 22 | 121 | 0 |
| 5.8 | 772 | 335 | 6 |
| 6 | 1744 | 276 | 40 |
| 6.2 | 1755 | 200 | 128 |
| 6.4 | 1052 | 209 | 89 |
| 6.6 | 330 | 113 | 23 |
| 6.8 | 52 | 27 | 9 |
| 7 | 2 | 8 | 5 |
| 7.2 | 0 | 0 | 1 |
| 7.4 | 0 | 0 | 0 |
| 7.6 | 0 | 0 | 1 |
Intermediate, rigid and non-rigid tripeptide occurrence in relative percentages ± Standard deviation (in %), in secondary structures and in the entire dataset.
| Helix | ≥ 5 | 82 ± 16 | 14 ± 14 | 3 ± 7 |
| Helix | ≥ 12 | 82 ± 11 | 15 ± 11 | 3 ± 5 |
| β Strands | ≥ 3 | 84 ± 24 | 13 ± 22 | 3 ± 11 |
| β Strands | ≥ 7 | 85 ± 15 | 11 ± 13 | 3 ± 7 |
| Entire dataset | > 0 | 12 | 4 | 84 |
Figure 2Legend: Series1 (All tripeptides) Series2 (Intermediate tripeptides) Series3 (Rigid tripeptides) Series 4 (Non-rigid tripeptides).
Sample of the Mean (M), Standard Deviation (SD) in Angstroms and Frequency (F).
| R1R2R3 | (α1,α2) | (α1,β2) | (β1,α2) | (β1,β2) | ||||||||
| M | SD | F | M | SD | F | M | SD | F | M | SD | F | |
| AAA | 3.80 | 0.02 | 292 | 4.82 | 0.10 | 291 | 4.50 | 0.10 | 291 | 5.37 | 0.18 | 291 |
| AAC | 3.80 | 0.03 | 37 | 4.81 | 0.12 | 37 | 4.51 | 0.12 | 37 | 5.39 | 0.23 | 37 |
| AAD | 3.80 | 0.02 | 123 | 4.79 | 0.11 | 122 | 4.55 | 0.15 | 123 | 5.43 | 0.25 | 122 |
| AAE | 3.80 | 0.02 | 138 | 4.81 | 0.11 | 138 | 4.50 | 0.10 | 138 | 5.38 | 0.21 | 138 |
Only the R1R2 set for four tripeptides is shown here. Similar data for R1R2, R1R3 and R2R3 for 7964 tripeptides is available at
Column 1 shows the standard deviation ranges in °A. For example the first range is between 0 and 0.1°A and the second range is between 0.1 and 0.2°A and so on. Columns 2 to 12 show for the 12 distances, number of tripeptides, which fall in the range.
| 0.1 | 594 | 192 | 130 | 109 | 151 | 123 | 604 | 188 | ||||
| 0.2 | 153 | 632 | 82 | 50 | 157 | 69 | 190 | 601 | ||||
| 0.3 | 165 | 128 | 141 | 67 | 255 | 116 | 159 | 118 | ||||
| 0.4 | 89 | 31 | 112 | 459 | 160 | 615 | 203 | 69 | 23 | 96 | ||
| 0.5 | 38 | 26 | 57 | 352 | 397 | 434 | 31 | 22 | 52 | 290 | ||
| 0.6 | 12 | 11 | 19 | 48 | 795 | 604 | 15 | 11 | 15 | 29 | ||
| 0.7 | 16 | 8 | 12 | 10 | 12 | 13 | 15 | 17 | ||||
| 0.8 | 7 | 4 | 9 | 3 | 424 | 909 | 6 | 3 | 9 | 9 | ||
| 0.9 | 4 | 1 | 5 | 9 | 62 | 711 | 12 | 9 | 4 | 5 | ||
| 1 | 3 | 4 | 0 | 3 | 20 | 624 | 350 | 773 | 2 | 4 | 6 | 6 |
| 1.1 | 2 | 5 | 4 | 2 | 12 | 223 | 162 | 542 | 4 | 2 | 5 | 1 |
| 1.2 | 2 | 2 | 2 | 3 | 7 | 88 | 48 | 427 | 3 | 2 | 2 | 4 |
| 1.3 | 2 | 3 | 4 | 2 | 5 | 27 | 28 | 306 | 1 | 2 | 1 | 2 |
| 1.4 | 4 | 5 | 3 | 1 | 7 | 7 | 6 | 188 | 1 | 2 | 1 | 2 |
| 1.5 | 4 | 1 | 0 | 5 | 4 | 6 | 9 | 83 | 0 | 0 | 2 | 2 |
| 1.6 | 2 | 3 | 0 | 0 | 3 | 3 | 6 | 36 | 2 | 2 | 1 | 0 |
| 1.7 | 2 | 1 | 3 | 2 | 4 | 2 | 2 | 19 | 3 | 4 | 1 | 4 |
| 1.8 | 3 | 1 | 2 | 1 | 1 | 2 | 2 | 11 | 1 | 1 | 4 | 2 |
| 1.9 | 0 | 0 | 2 | 0 | 1 | 2 | 0 | 7 | 1 | 1 | 1 | 0 |
| 2 | 1 | 0 | 1 | 1 | 2 | 0 | 0 | 3 | 3 | 0 | 2 | 0 |