| Literature DB >> 25080018 |
Jingran Wen, Daniel R Scoles, Julio C Facelli.
Abstract
BACKGROUND: The expansion of polyglutamine (poly-Q) repeats in several unrelated proteins is associated with at least ten neurodegenerative diseases. The length of the poly-Q regions plays an important role in the progression of the diseases. The number of glutamines (Q) is inversely related to the onset age of these polyglutamine diseases, and the expansion of poly-Q repeats has been associated with protein misfolding. However, very little is known about the structural changes induced by the expansion of the repeats. Computational methods can provide an alternative to determine the structure of these poly-Q proteins, but it is important to evaluate their performance before large scale prediction work is done.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25080018 PMCID: PMC4110737 DOI: 10.1186/1471-2105-15-S7-S11
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Length of poly-Qs in polyglutamine diseases.
| Gene | Wild-type allele | Mutant allele | |
|---|---|---|---|
| SCA1 | ATXN1 | 6-39 | 41-83 |
| SCA2 | ATXN2 | 13-31 | > = 32 |
| SCA3 | ATXN3 | 12-43 | 60-89 |
| SCA6 | CACNA1A | <18 | 20-33 |
| SCA7 | ATXN7 | <19 | 36-460 |
| SCA8 | ATXN8 | 15-50a | 80-250a |
| SCA17 | TBP | 25-42 | 49-66 |
| HD | HTT | 10-26 | >40 |
| DRPLA | ATN1 | 6-35 | >48 |
| SBMA | AR | < =36 | >38 |
The non-coding gene ATXN8OS gene may contribute to SCA8 pathogenesis. The number of CAG repeats in normal and mutant ATXN8 are assumed to be the same as the number of CTG repeats in normal and mutant ATXN8OS, as in [4].
Figure 1The sequence construction of HTT17Q-EX1. (a) sequence structure of the PDB records; (b) sequence used for structure prediction.
C-scores for the best I-TASSER models.
| Model | # 1 | # 2 | # 3 | # 4 | # 5 |
|---|---|---|---|---|---|
| Run 1 | -2.91 | -3.69 | -3.33 | -3.62 | -4.72 |
| Run 2 | -2.84 | -3.71 | -3.31 | -3.5 | -4.42 |
| Run 3 | -2.81 | -3.76 | -3.48 | -3.89 | -3.74 |
| Run 4 | -2.62 | -3.69 | -3.32 | -3.76 | -3.49 |
| Run 5 | -3.02 | -3.21 | -3.91 | -4.11 | -3.42 |
| Run 6 | -2.67 | -3.76 | -3.48 | -3.62 | -4.37 |
| Run 7 | -2.77 | -3.42 | -3.96 | -3.3 | -3.51 |
| Run 8 | -3.09 | -3.45 | -4.09 | -4.22 | -4.27 |
| Run 9 | -2.73 | -3.61 | -3.38 | -3.76 | -4.42 |
| Run 10 | -2.62 | -3.49 | -3.75 | -4.33 | -4.01 |
Energy for the best Rosetta models.
| Model | # 1 | # 2 | # 3 | # 4 | # 5 |
|---|---|---|---|---|---|
| Run 1 | 16.061 | 16.349 | 18.609 | 19.656 | 19.956 |
| Run 2 | 17.881 | 18.309 | 18.373 | 18.386 | 19.215 |
| Run 3 | 16.943 | 17.598 | 17.639 | 18.306 | 19.436 |
| Run 4 | 18.414 | 18.662 | 18.691 | 18.812 | 19.076 |
| Run 5 | 16.74 | 18.004 | 18.192 | 18.3 | 19.015 |
| Run 6 | 18.353 | 18.388 | 18.572 | 18.766 | 18.96 |
| Run 7 | 17.435 | 18.897 | 19.571 | 19.603 | 19.617 |
| Run 8 | 18.128 | 19.111 | 19.521 | 19.643 | 19.707 |
| Run 9 | 17.317 | 17.586 | 17.655 | 17.916 | 18.69 |
| Run 10 | 19.329 | 19.899 | 19.928 | 20.104 | 20.13 |
Figure 2Secondary structure WebLogo. (a) PDB structures; (b) I-TASSER models; (c) Rosetta models. In (a) M represents the number of structures with missing values due to lack of resolution in the experimental data. The codes for secondary structure are as follows: H: α-helix; B: β-bridge; E: Strand; G: Helix-3; I: Helix-5; T: Turn; S: Bend; M: Missing data.
Figure 3Distribution of TM-scores of any two models from I-TASSER and Rosetta respectively.
TM-scores between the best models from I-TASSER.
| run | # 2 | # 3 | # 4 | # 5 | # 6 | # 7 | # 8 | # 9 | # 10 |
|---|---|---|---|---|---|---|---|---|---|
| # 1 | 0.48 | 0.48 | 0.63 | 0.48 | 0.54 | 0.57 | 0.49 | 0.51 | 0.46 |
| # 2 | 0.56 | 0.54 | 0.55 | 0.48 | 0.54 | 0.45 | 0.58 | 0.46 | |
| # 3 | 0.54 | 0.63 | 0.53 | 0.51 | 0.54 | 0.56 | 0.51 | ||
| # 4 | 0.50 | 0.52 | 0.66 | 0.49 | 0.56 | 0.49 | |||
| # 5 | 0.60 | 0.51 | 0.58 | 0.59 | 0.54 | ||||
| # 6 | 0.51 | 0.68 | 0.54 | 0.49 | |||||
| # 7 | 0.50 | 0.57 | 0.45 | ||||||
| # 8 | 0.50 | 0.52 | |||||||
| # 9 | 0.50 |
TM-scores between best models from Rosetta.
| run | # 2 | # 3 | # 4 | # 5 | # 6 | # 7 | # 8 | # 9 | # 10 |
|---|---|---|---|---|---|---|---|---|---|
| # 1 | 0.30 | 0.43 | 0.37 | 0.42 | 0.42 | 0.53 | 0.40 | 0.38 | 0.43 |
| # 2 | 0.30 | 0.34 | 0.31 | 0.39 | 0.28 | 0.34 | 0.32 | 0.38 | |
| # 3 | 0.52 | 0.53 | 0.53 | 0.65 | 0.47 | 0.57 | 0.50 | ||
| # 4 | 0.48 | 0.53 | 0.47 | 0.68 | 0.49 | 0.43 | |||
| # 5 | 0.63 | 0.52 | 0.48 | 0.58 | 0.61 | ||||
| # 6 | 0.52 | 0.47 | 0.64 | 0.59 | |||||
| # 7 | 0.41 | 0.56 | 0.55 | ||||||
| # 8 | 0.46 | 0.42 | |||||||
| # 9 | 0.60 |
Number of pairs with TM-score greater than 0.5 between any two runs of I-TASSER.
| run | # 2 | # 3 | # 4 | # 5 | # 6 | # 7 | # 8 | # 9 | # 10 |
|---|---|---|---|---|---|---|---|---|---|
| # 1 | 2 | 3 | 3 | 1 | 3 | 3 | 0 | 5 | 1 |
| # 2 | 2 | 2 | 2 | 2 | 3 | 2 | 2 | 1 | |
| # 3 | 3 | 3 | 2 | 3 | 2 | 1 | 4 | ||
| # 4 | 3 | 2 | 6 | 3 | 5 | 1 | |||
| # 5 | 3 | 5 | 3 | 2 | 4 | ||||
| # 6 | 4 | 1 | 5 | 2 | |||||
| # 7 | 4 | 5 | 2 | ||||||
| # 8 | 3 | 2 | |||||||
| # 9 | 2 |
Number of pairs with TM-score greater than 0.5 between any two runs of Rosetta.
| run | # 1 | # 2 | # 3 | # 4 | # 5 | # 6 | # 7 | # 8 | # 9 | # 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| # 1 | 4 | 6 | 10 | 11 | 8 | 5 | 11 | 12 | 8 | 9 |
| # 2 | 2 | 11 | 8 | 7 | 6 | 13 | 11 | 11 | 10 | |
| # 3 | 6 | 15 | 9 | 13 | 20 | 16 | 18 | 16 | ||
| # 4 | 3 | 9 | 7 | 14 | 13 | 16 | 14 | |||
| # 5 | 4 | 8 | 15 | 5 | 16 | 15 | ||||
| # 6 | 1 | 8 | 5 | 13 | 9 | |||||
| # 7 | 5 | 15 | 16 | 14 | ||||||
| # 8 | 8 | 7 | 13 | |||||||
| # 9 | 9 | 17 | ||||||||
| #10 | 6 |
Numbers of Qs in the PDB structures.
| PDB structure | Number of Qs |
|---|---|
| 3io4_b | 10 |
| 3io4_c | 11 |
| 3io6_b | 14 |
| 3io6_c | 10 |
| 3ior_c | 13 |
| 3iot_b | 12 |
| 3iou_c | 14 |
| 3iov_b | 11 |
| 3iov_c | 15 |
| 3iow_b | 17 |
Distribution of structure superposition parameters between predicted models and PDB structures
| I-TASSER | Rosetta | p-value | |
|---|---|---|---|
| TM-score | 0.50 ± 0.06 | 0.45 ± 0.06 | <0.0001 |
| RMSD (Å) | 1.53 ± 0.34 | 1.74 ± 0.34 | <0.0001 |
| Aligned number | 24.05 ± 2.14 | 25.56 ± 2.41 | <0.0001 |
| Sequence Identity a | (0.30,0.71) | (0.38,0.52) | <0.0001 |
| Exact Match (<5.0 Å) a | (0,16) | (0,0) | <0.0001 |
| Exact Qs Match(<5.0 Å) a | (0,1) | (0,0) | <0.0001 |
| Total Qs Match(<5.0 Å) a | (5,8) | (8,11) | <0.0001 |
| Exact Match (other) a | (0,0) | (0,0) | <0.0001 |
| Exact Qs Match(other) a | (0,0) | (0,0) | <0.0001 |
| Total Qs Match(other) a | (0,1) | (0,1) | <0.0001 |
| Exact Match (all) a | (6,25) | (0,0) | <0.0001 |
| Exact Qs Match (all) a | (0,1) | (0,0) | <0.0001 |
| Total Qs Match (all) a | (6,8) | (9,12) | <0.0001 |
| ESOP a | (0,9.09) | (0,0) | <0.0001 |
| ESO a | (0, 53.13) | (0,0) | <0.0001 |
a The values between brackets represent the value of the property for the best structure superposition at the first and third quartile, respectively, of their distributions.
Figure 4Structure superposition of predicted models and PDB structures. Structure superposition of predicted models and PDB structures with TM-score>0.5 and ESOP = 100. (a) I-TASSER third model of the tenth run with 3io6_b; (b) Rosetta forth model in first run with 3iou_c; (c) Rosetta third model in fifth run with 3iow_b. tan: PDB structure, sky-blue: predicted models. The N-terminal end of each structure is shown on the left.