| Literature DB >> 17324296 |
Oscar Conchillo-Solé1, Natalia S de Groot, Francesc X Avilés, Josep Vendrell, Xavier Daura, Salvador Ventura.
Abstract
BACKGROUND: Protein aggregation correlates with the development of several debilitating human disorders of growing incidence, such as Alzheimer's and Parkinson's diseases. On the biotechnological side, protein production is often hampered by the accumulation of recombinant proteins into aggregates. Thus, the development of methods to anticipate the aggregation properties of polypeptides is receiving increasing attention. AGGRESCAN is a web-based software for the prediction of aggregation-prone segments in protein sequences, the analysis of the effect of mutations on protein aggregation propensities and the comparison of the aggregation properties of different proteins or protein sets.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17324296 PMCID: PMC1828741 DOI: 10.1186/1471-2105-8-65
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List and ranking of the predicted aggregation-prone regions in the different disease-linked polypeptides analyzed in this study and comparison with the available experimental data.
| 1–34 | 4–9 | 2/2 | [52] | |
| 15–28 | 1/2 | |||
| 1–34 | 4–9 | 2/2 | [53] | |
| 15–24 | 1/2 | |||
| 68–78 | 66–77 | 1/6 | [54–56] | |
| 31–109 | 36–42 | 2/6 | [56] | |
| 49–55 | 4/6 | |||
| 87–94 | 5/6 | |||
| 12–27 | 14–22 | 2/3 | [57] | |
| 17–21 | 17–22 | 2/2 | [58] | |
| 31–36/38–42 | 30–42 | 1/2 | ||
| 1–83 | 13–21 | 2/2 | [59] | |
| N-terminal fragments | 1–19 | 1/3 | [11] | |
| N-terminal fragments | 1–19 | 1/6 | [11] | |
| 57–74 | 60–67 | 2/3 | [60] | |
| 69–76 | 1/3 | |||
| 21–41 | 22–30 | 2/2 | [61] | |
| 59–79 | 59–70 | 1/2 | [62] | |
| 1–25 | 1–7 | 2/2 | [63] | |
| 9/18 | 1/2 | |||
| 501–506 | 499–521 | 1/6 | [64] | |
| 482–504 | 501–506 | 1/5 | ||
| 70–98 | 74–98 | 1/4 | [65] | |
| 1–38 | 12–19 | 1/3 | [66] | |
| 21–27 | 3/3 | |||
| 8–20 | 13–18 | 1/2 | [67] | |
| 20–29 | 24–28 | 2/2 | [68] | |
| 40–64 | 54–62 | 2/4 | [69] | |
| 49–101 | 76–84 | 3/4 | [70] | |
| 47–54 | 49–55 | 1/3 | [71] | |
| 101–118 | 101–115 | 1/4 | [72] | |
| 106–147 | 117–136 | 3/6 | [73] | |
| 138–142 | 6/6 | |||
| 1–34 | 10–32 | 2/9 | [74] | |
| 24–58 | 31–59 | 1/5 | [25] | |
| 2–12 | 1–9 | 1/2 | [75] | |
| 301–320 | 304–311 | 1/2 | [27] | |
| 10–20 | 12–19 | 2/7 | [76] | |
| 105–115 | 105–112 | 3/7 | [77] | |
| 114–123 | 4/7 |
aSequence stretches experimentally identified as critical for protein aggregation.
bCoincident aggregation-prone segments as predicted by AGGRESCAN.
cThe rank position refers to the entire protein and reflects the importance of this specific "hot spot" (HS) relative to all the aggregation-prone regions identified by AGGRESCAN in the protein. (i.e., 1/4 indicates that this HS has the highest aggregation propensity of the four detected in a particular sequence by the software)
Figure 1Hot spot area graphics. Hot spot area plots for a) lung surfactant protein C, b) serum amyloid A protein and c) Tau protein.
Comparison of AGGRESCAN predictions with the structural composition of different amyloid fibrils.
| β 1: 12–24 | 17–22 | [78] | |
| β 2: 30–40 | 30–40 | ||
| β 1: 12–17 | 13–18 | [79] | |
| β 2: 22–27 | 24–28 | ||
| β 3: 31–37 | - | [80] | |
| β 1: 226–234 | - | ||
| β 2: 237–245 | 238–248 | ||
| β 3: 262–270 | 263–267 | ||
| β 4: 273–282 | 272–276 | ||
| β 1: 112–124 | 115–129 | [81] | |
| β 1: 21–30 | 22–30 | [82] | |
| β 2: 33–40 | - | ||
| β 1: 105–115 | 105–112 | [83] |
Correlation coefficients (R) between the individual amino acid aggregation propensities used by AGGRESCAN and those used by other predictive methods.
| AGGRESCAN | AMYLOID1a | AMILOYD2b | AMYLOID3c | |
| AGGRESCAN | * | 0.849 | 0.794 | 0.867 |
| AMYLOID1a | 0.849 | * | 0.764 | 0.837 |
| AMILOYD2b | 0.794 | 0.764 | * | 0.807 |
| AMYLOID3c | 0.867 | 0.837 | 0.807 | * |
a AMYLOID1 corresponds to the method described in Ref. [22]
bAMYLOID2 corresponds to the method described in Ref. [19]
cAMYLOID3 corresponds to the method described in Ref. [31]
Figure 2Comparative prediction performance of AGGRESCAN and structure-based methods. Comparative predictions of AGGRESCAN (solid circles), packing density profile [31] (no symbols), 3D Profile [32] using the NNQQNY template (solid squares) and 3D Profile using an ensemble of templates (empty squares). Predictions were tested in a Database of Fibril Formers and Non-Formers hexa-peptides. Predictions are shown as receiver-operator characteristic curves.
Comparison of the predicted and experimentally tested effects of mutations on the aggregation propensity of amyloidogenic proteins.
| -16 | - | [84] | |
| 15 | + | [84] | |
| 29 | + | [84] | |
| 5 | + | [84] | |
| -68 | - | [85] | |
| -63 | - | [35] | |
| 16 | + | [86] | |
| -118 | - | [12] | |
| -15 | - | [87] | |
| -15 | - | [87] | |
| -62 | - | [87] | |
| -49 | - | [87] | |
| -12 | - | [87] | |
| -10 | - | [87] | |
| 32 | + | [87] | |
| 59 | + | [88] | |
| 237 | + | [88] | |
| -63 | - | [88] | |
| -34 | - | [36] | |
| 89 | + | [36] | |
| 111 | + | [36] | |
| 167 | + | [36] | |
| -312 | - | [87] | |
| -123 | - | [87] | |
| 2 | + | [89] | |
| 2 | + | [90] | |
| 2 | + | [90] | |
| 0 | = | [39] | |
| 1 | + | [40] | |
| 2 | + | [91] | |
| -1 | = | [92] | |
| 2 | + | [92] | |
| -1 | + | [92] | |
| -5 | - | [93] | |
| -3 | - | [93] | |
| 9 | + | [94] | |
| 17 | + | [94] | |
| 11 | + | [94] | |
| 34 | + | [94] | |
| 21 | + | [68] | |
| -59 | - | [68] | |
| 16 | + | [68] | |
| -61 | - | [68] | |
| -23 | - | [68] | |
| -106 | + | [95] | |
| -90 | +? | [96] | |
| 5 | +/= | [97] | |
| 0 | -/= | [97] | |
| 7 | + | [97] | |
| 1 | + | [98] | |
| 37 | + | [41] | |
| -6 | - | [41] | |
| 17 | + | [46] |
aRelative change in Na4vSS upon mutation, expressed as percentage.
ΔNavSS = ((NavSS - NavSS)/|NavSS|)*100
NavSSrefers to the Na4vSS value of the mutant sequence.
NavSSrefers to the Na4vSS value of the wild type sequence.
bChanges in aggregation determined experimentally.
Symbols: + increase; - decrease; = no significant change.
Figure 3Changes in the hot spot area plot caused by point mutations in amyloidogenic proteins. a) Aβ42 wild type (red) and Aβ42 F19T mutant (green). b) SH3 wild type (red), SH3 D48G (green) and SH3 N47G (blue). c) TAU wild type (red), TAU P301L (green) and TAU S320F (blue).
Comparison of the different AGGRESCAN parameters for globular, natively unstructured, amyloidogenic, soluble and insoluble proteins.
| 9.54 | 5.63 | 5.86 | 11.97 | 10.34 | |
| 29.94 | 18.21 | 24.51 | 41.27 | 34.43 | |
| 25.58 | 14.97 | 21.26 | 36.00 | 29.61 | |
| -5.17 | -60.95 | -26.42 | -5.00 | -5.55 | |
In bold and italics are shown those parameters that are normalized by the number of residues, allowing direct comparison of datasets independently of protein size.
1Natively globular proteins: 160 proteins randomly selected from SCOP (the ASTRAL40 set)
2Natively intrinsically unstructured proteins: 51 proteins
3Amyloidogenic proteins: 57 proteins
4Proteins forming inclusion bodies when overexpressed in bacteria: 121 proteins
5Proteins which are soluble when overexpressed in bacteria: 38 proteins
Figure 4"Hot spots" distribution in different protein groups. Distribution of the number of "hot spots" relative to sequence length in the following protein datasets: natively globular proteins, intrinsically unstructured proteins, amyloidogenic proteins, soluble proteins when overexpressed in bacteria and proteins forming inclusion bodies when overexpressed in bacteria.
Figure 5Modulation of hot spot nucleation specificity by global aggregation propensity. The black solid line represents a standard amyloidogenic protein aggregation profile, with only one "hot spot" and low global aggregation propensity. The pink discontinuous line corresponds to a typical aggregation profile from an inclusion-body-forming protein, with many "hot spots" and high global aggregation propensity. The horizontal lines represent the aggregation-propensity average thresholds for each sequence. The coloured regions indicate the area of each "hot spot" over the aggregation propensity threshold. It is proposed that a higher area over the threshold promotes a more specific aggregation reaction, resulting in highly ordered deposits.