| Literature DB >> 21342550 |
Ashish V Tendulkar1, Pramod P Wangikar.
Abstract
BACKGROUND: The structure conservation in various α-helix subclasses reveals the sequence and context dependent factors causing distortions in the α-helix. The sequence-structure relationship in these subclasses can be used to predict structural variations in α-helix purely based on its sequence. We train support vector machine(SVM) with dot product kernel function to discriminate between regular α-helix and non-regular α-helices purely based on the sequences, which are represented with various overall and position specific propensities of amino acids.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21342550 PMCID: PMC3044275 DOI: 10.1186/1471-2105-12-S1-S20
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Mean and standard deviation of gross structure descriptors for helix subclasses
| Subclass | ||
|---|---|---|
| Regular | 10.64+/-0.34 | 10.40+/-1.50 |
| Extended helix | 11.51+/-0.59 | 10.66+/-4.04 |
| Helix with c-cap | 10.35+/-0.65 | 15.21+/-2.32 |
| Kinked helix | 11.39+/-0.82 | 18.52+/-4.25 |
| Curved Helix | 8.64+/-1.58 | 12.12+/-6.23 |
(i) d18 denotes distance between of the octapeptide helix conformations. (ii) A158 denotes area of triangle formed by .
Figure 1Plots for finer geometric invariants for different helix subclasses. The octapeptide helical structures are divided into five overlapping tetrahedrons consisting of . For each of the tetrahedra, we plot: (a) d,+3 denotes distance between the first and the last C in tetrahedron (b) vol,+1,+2,+3 represent volume of the tetrahedron.
Distribution of helices by their lengths
| Subclass | 8 < | 15 < | |
|---|---|---|---|
| Regular | 14.41% | 59.60% | 25.98% |
| Extended helix | 72.72% | 27.24% | 0.02% |
| Helix with c-cap | 79.03% | 20.95% | 0.01% |
| Kinked helix | 99.31% | 0.69% | 0% |
| Curved Helix | 93.02% | 6.98% | 0% |
The table shows the distribution of helices in different subclasses by their lengths: (i) l=8 contains helices having length equal to 8 (ii) 8
Figure 2Overall amino acid propensities for different helix subclasses. Overall amino acid propensities for different helix subclasses.
Preferred Amino Acids at N terminus of helix
| Subclass | ||||
|---|---|---|---|---|
| Regular | S/T/D/N | P | E | E/Q |
| Extended Helix | D/N | P/A/W/E/L | E/A/Q | E/L/F/Q |
| Helix with c-cap | D/E/A | Q/A/E | E/W/A/L | L/F/I/M |
| Kinked helix | P/E | E/P/D | N/H/Y/F | W/A/L/F |
| Curved Helix | Y/F/W | E/K/R | H/N/Q/K/E | N/D/H/C/G |
We have listed the most preferred amino acids at N-terminus positions in the helix. At each position, the amino acids are arranged in descending order of their propensities. The positions in the N-terminus are denoted by N1 to N4 from left to right.
Preferred Amino Acids at C terminus of helix
| Subclass | ||||
|---|---|---|---|---|
| Regular | A/I/L | K/A/R/E/Q | L/A/Q/K/R | G |
| Extended Helix | L/M/A | L/Q/K | L/Q/K | L/F/Y/Q/M |
| Helix with c-cap | L/A/R/M | K/ R/ E/ Q/ L | H/ Y/ F | G |
| Kinked helix | P | E/Q/W | L/I/F/Y/V | L/M/I/A |
| Curved Helix | P | P/E | H/P/W | F/Y/L/W/A |
We have listed the most preferred amino acids at C-terminus positions in the helix. At each position, the amino acids are arranged in descending order of their propensities. The position in the C-terminus are denoted by C1 to C4 from right to left.
Preferred Amino Acids in the middle of helix
| Subclass | ||||
|---|---|---|---|---|
| Regular | I/L/A/M/F | A/K/R/Q/E | A/R/K/Q/E | L/M/A |
| Extended Helix | L/M/A | L/A | Q/K/R/L/M | L/F |
| Helix with c-cap | L/A/M/R | K/L/A/Q/E | H/F/Y/N | G/F/L/Y/H |
The propensity for the middle four positions are calculated for the sequences containing minimum 12 amino acids. The first middle position represent the fifth position in the sequence from the N-terminus end. The amino acids at each position are arranged in descending order of their propensities. The kinked and curved helix do not have sequences more than length 8 and do not figure out in this list.