| Literature DB >> 24988487 |
Abstract
BACKGROUND: Many aspects of autoimmune disease are not well understood, including the specificities of autoimmune targets, and patterns of co-morbidity and cross-heritability across diseases. Prior work has provided evidence that somatic mutation caused by gene conversion and deletion at segmentally duplicated loci is relevant to several diseases. Simple tandem repeat (STR) sequence is highly mutable, both somatically and in the germ-line, and somatic STR mutations are observed under inflammation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24988487 PMCID: PMC4079513 DOI: 10.1371/journal.pone.0101093
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Twenty-one of the most prevalent human autoimmune diseases, in approximately decreasing order of prevalence [49], [275].
| Abbrev. | Name |
| B-cell Autoantigens | Refs. |
| GD | Graves' Disease | Yes |
|
|
| RA | Rheumatoid Arthritis | Yes |
|
|
| HT | Hashimoto's Thyroiditis | Yes |
|
|
| CEL | Celiac disease | Unclear | TG2, HP, actin, CALR, TG3, ganglioside, collagen |
|
| PSO | Psoriasis | No | PALLD, AGAP3, DSP, collagen-21, ATXN3 |
|
| VIT | Vitiligo | Yes | TYR, TH, TYRP1, MCHR1, lamin A |
|
| SJ | Sjogren syndrome | Unclear |
|
|
| UC | Ulcerative Colitis | Yes |
|
|
| AS | Ankylosing Spondylitis | No | multispecific |
|
| T1D | Type-1 diabetes | Yes |
|
|
| AA | Alopecia Areata | Yes | TH, TCHH, KRT16 |
|
| JIA | Juvenile Idiopathic Arthritis | Yes | DEK, HSP70, citrullinated peptides |
|
| PA | Pernicious Anemia | Unknown | ATP4A/ATP4B, pepsinogen A |
|
| MS | Multiple Sclerosis | No |
|
|
| CD | Crohn's disease | Opposite | GP2, CUDZ1 |
|
| SLE | Systemic Lupus Erythematosus | Yes |
|
|
| UV | Uveitis | No | CRALBP, CRYAA, CRYAB, CRYBB1 |
|
| AD | Addison's disease | Yes |
|
|
| MG | Myasthenia Gravis | Yes | AChR, MUSK, LRP4, AGRN, ColQ, TTN, KCNA1, RYR |
|
| DM | Dermatomyositis | Yes | Mi-2-complex, IFIH1, TRIM33, MORC3, Ro52(TRIM21) |
|
| SSc | Systemic Sclerosis | Yes | RNA Polymerase III, CENPB, CENPA, RNA Polymerase I, RNA Polymerase II, TOP1, PM/Scl-complex, Ro52(TRIM21), SNRNP70, NOR-90, Ku, Th/To, U3RNP/FBL, IFI16, ANXA5, NPM1, HMGB1, HMGB2, Mitochondrial-M2 |
|
Autoantibodies to antigens in bold are known to be primary antibodies that occur early in disease progression, often prior to the appearance of symptoms. The tryptophan allele of the Arg620Trp polymorphism at rs2476601 in the PTPN22 gene is associated with many autoimmune diseases, as indicated in the “PTPN22 Assn.” column. Atherosclerosis (CAD) is not universally considered an autoimmune disease, and is therefore not listed. Nevertheless, CAD does have autoimmune features [376] and an association with PTPN22 [377]–[379]. The initial pathology in some MS lesions is associated with MAG loss [329], [380], [381].
The tryptophan PTPN22 allele is protective from CD [333].
In MG, two studies conflict about whether PTPN22 is specifically associated with the subset of cases having anti-TTN antibodies.
Autoantigens for selected low-prevalence autoimmune diseases.
| Abbrev. | Name | Autoantigens | Refs. |
| PV | Pemphigus Vulgaris | DSG3, DSG1, HLA-DRA, DSC3, DSC1, ATP2C1, PKP3, CHRM3, COL21A1, ANXA8L1, CD88, CHRNE |
|
| RHF | Rheumatic Fever | VIM, MYBPC3, tropomyosin, collagen |
|
| LEMS | Lambert-Eaton Myasthenic Syndrome | CACNA1A, CACNB2 |
|
| AH1 | Autoimmune Hepatitis (type 1) | HMGB1, HMGB2 |
|
| AH2 | Autoimmune Hepatitis (type 2) | CYP2D6, CES1, PDIA3 |
|
| HA | Autoimmune hemolytic anemia | RHD, GYPA |
|
| AP | Autoimmune pancreatitis | AMY2A, CA2, LTF, HSP10, plasminogen-binding protein, trypsinogens, SPINK1 |
|
| PBC | Primary Biliary Cirrhosis | Mitochondrial-M2, SP100, PML, NUP210, Ro52(TRIM21), CENPB, SUMO2, SUMO1, CHRM3 |
|
| NMO | Neuromyelitis Optica | AQP4 |
|
| GPS | Goodpasture's Syndrome | COL4A3 |
|
Figure 1Genes with long internal repeats.
The x-axis denotes the total length of the tandem repeat (log-scale), and the y-axis represents the number of repeat units within the tandem repeat (log-scale). The degree of repeat identity reported by TMRF is indicated by the color of the data point. Genes in bold have exonic sequence overlapping the repeat. Genes containing multiple disjoint long repeats appear more than once.
Long tandem duplications with at least 96% identity that occur within a gene locus.
| Gene | Length | Copies | Gap | Coding |
|
| 76181 | 2 | 55 | Y |
|
| 65137 | 2 | 0 | Y |
|
| 54708 | 3 | 0 | Y |
|
| 47663 | 2 | 2 | Y |
|
| 47081 | 3 | 0 | Y |
|
| 44119 | 2 | 31 | Y |
|
| 37662 | 2 | 0 | Y |
|
| 35986 | 2 | 0 | N |
|
| 31945 | 2 | 0 | Y |
|
| 31782 | 3 | 0 | N |
|
| 30864 | 2 | 0 | Y |
|
| 27063 | 2 | 0 | N |
|
| 26136 | 2 | 14 | Y |
|
| 22675 | 2 | 22 | N |
|
| 21690 | 2 | 0 | Y |
|
| 12620 | 2 | 17 | Y |
|
| 12568 | 2 | 4 | Y |
|
| 11321 | 2 | 0 | N |
|
| 10103 | 2 | 58 | Y |
|
| 8924 | 2 | 0 | Y |
|
| 8521 | 2 | 0 | Y |
|
| 8383 | 2 | 1 | N |
|
| 8282 | 2 | 0 | Y |
|
| 7627 | 2 | 1 | Y |
|
| 7020 | 5 | 0 | Y |
|
| 5850 | 2 | 23 | N |
|
| 5637 | 2 | 0 | Y |
|
| 4289 | 2 | 66 | Y |
|
| 4144 | 2 | 0 | N |
|
| 3907 | 2 | 12 | N |
|
| 3431 | 2 | 4 | Y |
Duplications were identified as described in the Methods. The length indicates the total length of the high-identity tandem duplicons. The gap is the separation between the two highest-identity (long) duplicons, which was required to be less than 100 bp. The duplication is “coding” if a duplicon overlaps at least one exon.
FCGBP has a third duplicon, but with less than 96% identity.
These genes have duplicons that are themselves STRs of lower fidelity; only the copy number for the high-identity long tandem duplication is reported in this table.
The segmental duplication containing NKG2-E overlaps the three genes KLRC1, KLRC2 and KLRC3.
Long (5 kb) regions of self-alignment within protein-coding genes.
| Gene | Length | Gene | Length |
|
| 45133 |
| 10090 |
|
| 40083 |
| 8739 |
|
| 39623 |
| 8649 |
|
| 38211 |
| 8495 |
|
| 36567 |
| 8343 |
|
| 35017 |
| 7969 |
|
| 32343 |
| 7934 |
|
| 30167 |
| 7650 |
|
| 26579 |
| 6583 |
|
| 24595 |
| 6447 |
|
| 23181 |
| 6346 |
|
| 22943 |
| 6303 |
|
| 20252 |
| 6290 |
|
| 18603 |
| 6161 |
|
| 14249 |
| 5794 |
|
| 13432 |
| 5789 |
|
| 13432 |
| 5498 |
|
| 13424 |
| 5345 |
|
| 13368 |
| 5304 |
|
| 12971 |
| 5229 |
|
| 12648 |
| 5227 |
|
| 12480 |
| 5091 |
|
| 12115 |
| 5021 |
Sequences with a self-similarity score of 60 or above having both query and target mapped within a protein-coding gene locus were obtained from the self-alignment track [267] of the UCSC database as described in the Methods, and ranked by match-length. In this table, the match length corresponds to the length of identity between the two duplicons. Note that self-aligned duplicons may overlap.
Correspondence of PPP4C localization with many known SLE autoantigens.
| Autoantigen(s) | Putative PPP4C function/localization |
|
| These are spliceosome proteins. PPP4C is involved in spliceosome assembly |
| SR proteins | SR proteins associate with the spliceosome |
| chromatin/histones | PPP4C binds to HDAC3, a histone deacetylase |
| ds-DNA | Stabilisation of stalled replication forks |
| Ku70, Ku80 | Ku70 and Ku80 associate with |
| PARP1 | PARP1 binds with Ku |
|
| dephosphorylated during apoptosis by a caspase-induced phosphatase |
| La(SSB) | La is dephosphorylated during apoptosis by a caspase-induced PP2A-like phosphatase |
|
| See La; Ro60 and La are components of a common protein/RNA complex |
|
| APOH (coding for beta 2 glycoprotein I) associates with ANXA2/TLR4/CALR/NCL complexes |
| VIM/cardiolipin-complex, VIM | Vimentin is also observed in analysis of APOH/ANXA2/TLR4/CALR/NCL complexes |
| NPM1 | NPM1 binds cardiolipin |
| CALR | CALR may be dephosphorylated by an okadaic-acid-sensitive protein phosphatase |
| Ro52(TRIM21) | See CALR; Ro52 and CALR are binding partners |
| NCL | See APOH/cardiolipin-complex. |
| ANXA2 | See APOH/cardiolipin-complex. |
| F2/Phosphatidyl-serine, F2 | Phosphatidylserine bound by ANXA2 |
| ANXA1 | Binds ANXA2 |
| ANXA5 | Binds phosphatidylserine as a monomer or dimer |
| HMGB1 | Binds phosphatidylserine |
| LTF | Binds to TLR4 and activates the TRAF6/MyD88 pathway |
Primary autoantigens are in bold.
Murine long (8 kb) STRs overlapping protein-coding RefSeq gene loci, including 5 kb upstream of the gene start site.
| Gene | Largest Repeat length (bp) | Smallest unit size |
|
| 19480 | 10 |
|
| 19106 | 1236 |
|
| 18426 | 734 |
|
| 16228 | 234 |
|
| 12289 | 37 |
|
| 8023 | 513 |
The smallest repeat unit for each region is given together with the total STR length. The Ttc34 repeat ends 3.2 kb upstream of the gene start site.
Two adjacent repeats reported by TMRF have similar repeat structure, and have been combined.
Figure 2Structure of the long ABCG8 repeat in the human reference genome.
A 10.8 kb repeat and a 4.1 kb repeat have closely related repeat unit sequence, and are separated by a 1.4 kb LINE insertion. The SNP rs4952688 occurs in the middle of the 4.1 kb repeat.
Known links between genes with long STRs and human autoimmune diseases.
| Gene(s) | Disease | Antigen type | CNV | Expr. changes |
|
| RA |
| ||
|
| HT |
| ||
|
| GD |
| ||
|
| T1D |
| ||
|
| SLE | Autoantigen | Yes | Yes |
|
| MS | Autoantigen | ||
|
| PA | Autoantigen | Yes | |
|
| MG | Autoantigen | ||
|
| SLE, SSc, RA, SJ | Autoantigen | Yes | |
|
| CEL | Autoantigen | Yes | Yes |
|
| SSc | Peri-antigen | ||
|
| SLE | Peri-antigen | ||
|
| CAD | Peri-antigen | Yes | Yes |
|
| CAD | Peri-antigen? | Yes | |
|
| CD | Peri-antigen | Yes | Yes |
|
| UC | Yes | ||
|
| CD, UC | Yes | ||
|
| SJ | Yes | ||
|
| RA, SLE, CD, CAD, SSc | Yes | ||
|
| T1D | Yes | Yes | |
|
| several | Yes | ||
|
| PSO | Yes | Yes | |
|
| MS | Yes | ||
|
| AS, UV | Yes |
Genes with long STRs come from Figure 1, Table 3 and Table 4. A bold autoantigen label corresponds to a known primary autoantigen. The CNV column indicates whether a germ-line STR length variant is associated with the disease. Gene expression changes during disease are also shown.
While many genes qualify as encoding peri-antigens in SLE, TTC34 encodes a peri-antigen for many autoantigens (Table 5).
Co-morbidity and/or familial associations between six autoimmune diseases and atherosclerosis.
| GD | RA | T1D | SLE | SSc | CAD | |
| HT |
|
|
|
|
|
|
| GD |
|
|
|
|
| |
| RA |
|
|
|
| ||
| T1D |
|
| ||||
| SLE |
|
| ||||
| SSc |
|
Comorbidity may reflect common susceptibility factors or secondary disease effects, such as inflammation in RA contributing to CAD risk [427]. Comorbidities with some of these diseases exist for alopecia areata [428], [429], vitiligo [430], [430,431], juvenile idiopathic arthritis [432], myasthenia gravis [433], [434], and Addison's disease [435], five additional PTPN22 -associated diseases, as well as celiac disease [436], [437] and pernicious anemia [438], [439].
A link between GD and CAD is potentially confounded by the anti-atherogenic properties of thyroid hormones [440].
Genes with intronic tandem repeats occupying more than 90% of an intron.
| Gene | Chrom. | Intron start | Intron end | Int. length | Rpt. Length | Occupancy | Copies |
|
| chr11 | 400706 | 403076 | 2371 | 2310 | 0.974 | 155.9 |
|
| chr19 | 3937287 | 3938599 | 1313 | 1274 | 0.970 | 27.8 |
|
| chr19 | 49337616 | 49339060 | 1445 | 1401 | 0.970 | 38.1 |
|
| chr19 | 55604602 | 55605711 | 1110 | 1066 | 0.960 | 29.4 |
|
| chrX | 1755454 | 1761694 | 6241 | 5827 | 0.934 | 35.4 |
|
| chr16 | 89973731 | 89975372 | 1642 | 1526 | 0.929 | 43.8 |
|
| chr9 | 140344708 | 140346815 | 2108 | 1948 | 0.924 | 33.4 |
|
| chr1 | 1223418 | 1225649 | 2232 | 2055 | 0.921 | 110.9 |
|
| chr14 | 105407316 | 105420216 | 12901 | 11844 | 0.918 | 24 |
|
| chr14 | 105695251 | 105707600 | 12350 | 11322 | 0.917 | 111 |
|
| chr8 | 144403557 | 144406167 | 2611 | 2367 | 0.907 | 164.1 |
Figure 3Genes with high copy number internal repeats.
The x-axis denotes the total length of the tandem repeat (log-scale), and the y-axis represents the number of repeat units within the tandem repeat (log-scale). The degree of homology between repeat units is indicated by the color of the data point. All repeats in this diagram reside in introns. Genes containing multiple disjoint repeats appear more than once.
Figure 4Structure of the TYR -related tandem duplications in the human reference genome.
The long, high-identity duplicons make the region susceptible to gene conversion [274].
Multiple mechanisms generating coherent somatic mutation, and possible examples where autoimmunity results.
| Mechanism | Possible Examples |
| Mutations at long tandem repeats | T1D, HT, RA, SLE, … |
| Gene conversion at segmental duplications | AD |
| Clonal expansion | Paraneoplastic autoimmune diseases, GD |
| Oxidative stress |
|
| RAG-dependent somatic mutation |
|
| Pathogen Binding/Modification |
|
| Retrotransposition | BOMS |
| Apoptotic protein cleavage | Many cleaved proteins are autoantigens |
| Dysregulation of protein modification | Anti-TOP1 SSc |
| Environmental mutagens |
|
Immunological features of autism and schizophrenia.
| Feature | Autism | Schizophrenia |
| HLA Association | Yes | Yes |
| Co-morbidity with autoimmune disease | Yes | Yes |
| Viral triggers for disease | Yes | Yes |
| Association with maternal infection during pregnancy. | Yes | Yes |
| Autoantibodies | Brain-specific antibodies in mothers and probands | Yes |
| Other | Gene expression changes reminiscent of autoimmunity | Various immunological abnormalities |