| Literature DB >> 26924748 |
Sandeep Kumar1, A Mary Thangakani2, R Nagarajan3, Satish K Singh1, D Velmurugan2, M Michael Gromiha3.
Abstract
Why do patients suffering from neurodegenerative diseases generate autoantibodies that selectively bind soluble aggregates of amyloidogenic proteins? Presently, molecular basis of interactions between the soluble aggregates and human immune system is unknown. By analyzing sequences of experimentally validated T-cell autoimmune epitopes, aggregating peptides, amyloidogenic proteins and randomly generated peptides, here we report overlapping regions that likely drive aggregation as well as generate autoantibodies against the aggregates. Sequence features, that make short peptides susceptible to aggregation, increase their incidence in human T-cell autoimmune epitopes by 4-6 times. Many epitopes are predicted to be significantly aggregation prone (aggregation propensities ≥10%) and the ones containing experimentally validated aggregating regions are enriched in hydrophobicity by 10-20%. Aggregate morphologies also influence Human Leukocyte Antigen (HLA)--types recognized by the aggregating regions containing epitopes. Most (88%) epitopes that contain amyloid fibril forming regions bind HLA-DR, while majority (63%) of those containing amorphous β-aggregating regions bind HLA-DQ. More than two-thirds (70%) of human amyloidogenic proteins contain overlapping regions that are simultaneously aggregation prone and auto-immunogenic. Such regions help clear soluble aggregates by generating selective autoantibodies against them. This can be harnessed for early diagnosis of proteinopathies and for drug/vaccine design against them.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26924748 PMCID: PMC4770294 DOI: 10.1038/srep22258
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Incidence of experimentally validated aggregating peptides within the experimentally validated human T-cell autoimmune epitopes.
| 359 β-strandforming hexapeptides from globular proteins | 158 amorphous β-aggregatesforming hexapeptides | 194 amyloid fibrilforming hexapeptides | 341 amyloid fibrilforming peptides of length 7–141 | 536 amyloid fibrilforming peptides of all lengths | |
|---|---|---|---|---|---|
| 155 HLA-DP bindingAutoimmune epitopes | 0 | 0 | 0 | 0 | 0 |
| 543 HLA-DQ bindingAutoimmune epitopes | 0 | 31 (19.6%) | 12 (6.2%) | 13 (3.8%) | 25 (4.7%) |
| 3243 HLA-DR bindingAutoimmune epitopes | 21 (5.8%) | 18 (11.4%) | 34 (17.5%) | 150 (44%) | 184 (34.3%) |
| All 3941 T-cellautoimmune epitopes | 21 (5.8%) | 49 (31%) | 46 (23.7%) | 163 (47.8%) | 209 (39%) |
| 430 MHCIInonbindpeptides | 0 | 39 (24.7%) | 7 (3.6%) | 10 (2.9%) | 17 (3.2%) |
Percentages were calculated with respect to the number of aggregating peptides in each column.
Figure 1Box and Whisker plots showing average aggregation propensities of the peptide sequences in experimentally validated human T-cell autoimmune epitopes and MHCIInonbind datasets, predicted by (a) TANGO (PAgg-TANGO) and (b) WALTZ (PAgg-WALTZ). All epitope classes (HLA-DP, HLA-DQ and HLA-DR) show long tails suggesting that a number of epitopes in these datasets are aggregation prone.
Enrichment in hydrophobicity for human T-cell autoimmune epitopes that contain aggregating peptides.
| Dataset | Average Hydrophobicity |
|---|---|
| All amorphous β-aggregates forming hexapeptides | 35.6 ± 13.2 |
| All amyloid fibril forming hexapeptides | 38.0 ± 12.9 |
| All amyloid fibril forming peptides of all lengths | 32.3 ± 11.2 |
| All human T-cell autoimmune epitopes | 28.4 ± 7.6 |
| All human T-cell autoimmune epitopes that contain amorphous β-aggregating hexapeptides | 31.3 ± 4.3 |
| All human T-cell autoimmune epitopes that contain amyloid fibril forming hexapeptides | 31.9 ± 4.0 |
| All human T-cell autoimmune epitopes that contain amyloid fibril forming peptides of all lengths | 30.5 ± 4.4 |
| All HLA-DP binding epitopes | 30.3 ± 8.9 |
| All HLA-DQ binding epitopes | 26.9 ± 7.3 |
| HLA-DQ binding epitopes that contain amorphous β-aggregating hexapeptides | 31.3 ± 3.8 |
| HLA-DQ binding epitopes that contain amyloid fibril forming hexapeptides | 32.4 ± 3.3 |
| HLA-DQ binding epitopes that contain amyloid fibril forming peptides of all lengths | 31.8 ± 3.1 |
| All HLA-DR binding epitopes | 28.5 ± 7.5 |
| HLA-DR binding epitopes that contain amorphous β-aggregating hexapeptides | 31.3 ± 5.3 |
| HLA-DR binding epitopes that contain amyloid fibril forming hexapeptides | 31.6 ± 4.5 |
| HLA-DR binding epitopes that contain amyloid fibril forming peptides of all lengths | 30.1 ± 4.7 |
Average and standard deviation values of intrinsic hydrophobicity of peptide sequences in a given dataset of aggregating peptides and epitopes were computed as described in Methods.
Figure 2Overlap between aggregation prone and T-cell autoimmune epitope regions in human amyloidogenic proteins.
Human amyloidogenic proteins whose amino acid sequences contain experimentally validated aggregating peptides (bold font) as well as experimentally validated T-cell autoimmune epitope peptides (underlined) are shown. All available experimental data on these proteins is consolidated into the regions highlighted in the sequences. The PDB26 was queried for available structural information on these proteins and information obtained was used to map overlapping aggregation prone and autoimmune epitope regions in the respective protein structures. These regions are shown in magenta. The magenta regions in structural images correspond to the sequence regions that are simultaneously shown in bold magenta font and underlined. (a) Human Major Prion Protein. Structural information is from the PDB entry 4 KML chain A. The N-terminal residues 1–116 are not present in the structure. (b) Human Amyloid-β peptide 1–42. Structural information is from the PDB entry 2BEG, which shows this peptide in fibrillary form. N-terminal residues 1–16 are not present in the structure. (c) Human Insulin. Structural information was taken from the PDB entry 3EY7. The chains A and B are shown in yellow and blue ribbons. The disulfide bonds are also shown. (d) Human Pmel. No structural information is available for this protein.
Strongly predicted T-cell autoimmune epitope regions in human amyloidogenic proteins that contain experimentally validated aggregating peptides.
| Protein name, UniProt ID and Disease | T-cell autoimmune epitope region(s) |
|---|---|
| Apolipoprotein A-I,P02647 [25–267], Amyloidosis 8 | 219-LPVLESFKVSFLS |
| Apolipoprotein C-II,P02655 [23–101],Hyperlipoproteinemia 1B | 48-KLRDLYSK |
| α-Synuclein P37840[1–140] Parkinsondisease 1 | 7-GLS |
| β2-Microglobulin,P61769 [21–119],Hypercatabolic hypoproteinemia | 55-EHS |
| β2-Microglobulin,P61769 [21–119],Hypercatabolic hypoproteinemia | 79-ACRV |
| Calcitonin, P01258 [85–116],Medullary carcinomaof thyroid | 13-TQ |
| Cystatin-C,P01034 [27–146],Amyloidosis 6 | 41-MYHSRALQVVRARKQIV |
| Gelsolin P06396 [28–782],Amyloidosis 5 | 183- |
| IAPP (Amylin) P10997 [34–70],Type 2 diabetes | 17-V |
| Insulin B-chain P01308 [25–54],Diabetes mellitus,Injection localized amyloidosis | 5-HLCGSH |
| Kerato-epithelin,Q15582 [24–683],Corneal dystrophy | 462-RGRYGTLFTMDRVLTPPMGTVMDVLKGDNR |
| Lactoferrin, P02788 [20–710],Familial subepithelial cornealamyloidosis | 538- |
| Lung Surfactant Protein C,P11686 [24–58],Pulmonary surfactant metabolism dysfunction 2 | 6-CPV |
| Lysozyme C, P61626 [19–148],Amyloidosis 8 | 4-E |
| Major Prion Protein,P04156 [23–230],Creutzfeldt-Jakob disease (CJD) | 81-SKPKTN |
| Pmel, P40967 [25–661] | 74-PD |
| Natriuretic peptides B,P16860 [27–134],Isolated Atrial Amyloidosis | 55-EVATEGIRGHR |
| Transthyretin, P02766 [21–147], Hyperthyroxinemia, dystransthyretinemic, Carpal tunnel syndrome 1 | 9-K |
Strongly predicted (percentile ≤1.0) T-cell autoimmune epitopes in each amyloidogenic protein are overlapping and were therefore clubbed into T-cell autoimmune epitope regions. These regions were then searched for the incidence of experimentally validated aggregating peptides and are shown in bold fonts. The protein sequence codes and associated disease names were primarily taken from UniProt29. In the cases where this information is not available in UniProt, the disease names were taken from literature searches and citations are provided in this table.
Figure 3Additional examples of overlap between aggregation prone and T-cell autoimmune epitope regions in human amyloidogenic proteins are presented.
This figure is prepared in the same way as the Fig. 2, except that the T-cell autoimmune epitope shown here are the strongly predicted ones. (a) Human α-Synuclein. Structural information is from the PDB entry 1XQ8. This structure is for the micelle bound form of α-Synuclein. In the unbound form, it is an intrinsically disordered protein. (b) Human β2-microglobulin. Structural information is from the PDB entry 2D4F. (c) Human Cystatin C. The structural information is from the PDB entry 1TIJ chains A and B. This PDB entry contains domain swapped form of Cystatin C dimer. Both the chains are shown here in yellow and blue ribbons.
Predicted T-cell autoimmune epitopes and aggregation prone regions in 100,000 randomly generated 15-residues long peptides.
| Sequences | Actual number andfrequency of TANGOpredicted APRs | Expected number andfrequency of TANGOpredicted APRs | Preference | Actual number andfrequency of WALTZpredicted APRs | Expected number andfrequency of WALTZpredicted APRs | Preference |
|---|---|---|---|---|---|---|
| 100,000 Randomly generated Peptides | 12,179 (12.2%) | NA | NA | 9,582 (9.6%) | NA | NA |
| 16,384 peptides predicted T-cell autoimmune epitopes | 3,620 (22.1%) | 1,995 (12.2%) | 1.8 | 2,321 (14.2%) | 1,570 (9.6%) | 1.5 |
| Remaining 83,616 peptides | 8,559 (10.2%) | 10,184 (12.2%) | 0.8 | 7,261 (8.7%) | 8,012 (9.6%) | 0.9 |
The 100,000 randomly generated peptides have the same amino acid composition as 3,243 experimentally validated HLA-DR binding T-cell autoimmune epitopes. The incidences of TANGO and WALTZ predicted aggregation prone regions in all 100,000 randomly generated peptides were used to compute the expected incidences of TANGO and WALTZ predicted aggregation prone regions within 16,384 predicted T-cell autoimmune epitopes. In each case, the expected number of aggregation prone regions was rounded off to its closest integer value. Preference was computed by dividing the actual numbers of TANGO or WALTZ predicted aggregation prone regions in the 16,384 autoimmune epitopes by their respective expected numbers. Similar calculations were also performed for the remaining 83,616 peptides that do not meet the percentile cut off used in this study (see methods for more details). NA stands for not applicable.
Figure 4Sequence patterning synergies between the T-cell autoimmune epitopes and aggregation prone regions.
Sequence logos are shown for (a) 100,000 randomly generated 15-residues long peptides, (b) 83,616 random peptides that were not predicted to be T-cell autoimmune epitopes (non-epitopes), (c) 16,384 strongly predicted T-cell autoimmune epitopes, (d) 12,179 random peptides that contain TANGO predicted APRs, (e) 8,559 non-epitopes that contain TANGO APRs, (f) 3,620 T-cell autoimmune epitopes that contain TANGO APRs, (g) 9,582 random peptides that contain WALTZ predicted APRs, (h) 7,261 non-epitopes that contain WALTZ APRs, and (i) 2,331 T-cell autoimmune epitopes that contain WALTZ APRs. Note the enrichment of hydrophobic β-branched and aromatic residues, particularly Val, Ile, and Phe, at initial positions of T-cell autoimmune epitope peptides (see panels c,f,i).