| Literature DB >> 34874980 |
Miko Valori1, Lilja Jansson1,2, Pentti J Tienari1,2.
Abstract
Somatic mutations have a central role in cancer but their role in other diseases such as common autoimmune disorders is not clear. Previously we and others have demonstrated that especially CD8+ T cells in blood can harbor persistent somatic mutations in some patients with multiple sclerosis (MS) and rheumatoid arthritis. Here we concentrated on CD8+ cells in more detail and tested (i) how commonly somatic mutations are detectable, (ii) does the overall mutation load differ between MS patients and controls, and (iii) do the somatic mutations accumulate non-randomly in certain genes? We separated peripheral blood CD8+ cells from newly diagnosed relapsing MS patients (n = 21) as well as matched controls (n = 21) and performed next-generation sequencing of the CD8+ cells' DNA, limiting our search to a custom panel of 2524 immunity and cancer related genes, which enabled us to obtain a median sequencing depth of over 2000x. We discovered nonsynonymous somatic mutations in all MS patients' and controls' CD8+ cell DNA samples, with no significant difference in number between the groups (p = 0.60), at a median allelic fraction of 0.5% (range 0.2-8.6%). The mutations showed statistically significant clustering especially to the STAT3 gene, and also enrichment to the SMARCA2, DNMT3A, SOCS1 and PPP3CA genes. Known activating STAT3 mutations were found both in MS patients and controls and overall 1/5 of the mutations were previously described cancer mutations. The detected clustering suggests a selection advantage of the mutated CD8+ clones and calls for further research on possible phenotypic effects.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34874980 PMCID: PMC8651110 DOI: 10.1371/journal.pone.0261002
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic features of the participants.
| Relapsing MS patients | Controls | |
|---|---|---|
|
| 21 | 21 |
|
| 35.0 yrs (23–55) | 35.2 yrs (23–57) |
|
| 76% | 76% |
| 1.64 (0–4.5) | n.a. |
*Expanded disability scale score (EDSS) at first neurologist visit.
Comparison of the discovered nonsynonymous somatic mutations in genes expressed in CD8+ cells in MS patients and controls.
| MS patients (n = 21) | Controls (n = 21) | |
|---|---|---|
| Median sequencing depth | 2317 | 2381 |
| Total number of mutations | 104 | 121 |
| Number of mutations per subject (median) | 4 | 4 |
| Number of mutations per subject (range) | 1–11 | 1–15 |
| Mutation allelic fraction (median) | 0.5% | 0.5% |
| Mutation allelic fraction (mean) | 0.7% | 1.1% |
| Mutation allelic fraction (range) | 0.2–3.6% | 0.2–8.6% |
| Number of mutated genes | 95 | 107 |
| Novel mutation | 63 (61%) | 82 (68%) |
| Mutation found in COSMIC database | 23 (22%) | 23 (19%) |
| Mutation found in ExAC/gnomAD (AF<10–5) | 18 (17%) | 16 (13%) |
|
| ||
| Stop codon | 9 (8.7%) | 8 (6.6%) |
| Frameshift insertion/deletion | 4 (3.8%) | 13 (10.7%) |
| Non-frameshift insertion/deletion | 1 (1.0%) | 7 (5.8%) |
| Splice site mutation | 0 | 2 (1.7%) |
| Single nucleotide variation | 90 (86.5%) | 91 (75.2%) |
| CADD ≥20 | 60/89* (67%) | 67/90* (74%) |
| PolyPhen2 D+P | 49/89 (55%) | 46/90 (51%) |
| PolyPhen Benign | 40/89 (45%) | 44/90 (49%) |
COSMIC = Catalogue of Somatic Mutations in Cancer. AF = allele frequency, CADD = Combined Annotation Dependent Depletion, CADD scores ≥20 can be considered predictably deleterious. PolyPhen2 D = deleterious, P = probably deleterious. *One nonsynonymous SNV could not be classified by CADD and PolyPhen21.
Fig 1Participants’ CD8+ cell somatic mutation allelic fraction distribution.
Genes enriched with somatic mutations.
| Gene | Enrichment p-value | Participants with mutation | Cases with mutation | Controls with mutation |
|---|---|---|---|---|
| STAT3 | 9.00E-08 | 6 | 2 | 4 |
| SMARCA2 | 0.001060894 | 4 | 2 | 2 |
| SOCS1 | 0.001550647 | 2 | 1 | 1 |
| DNMT3A | 0.002299316 | 3 | 3 | 0 |
| PPP3CA | 0.008899261 | 2 | 2 | 0 |
| CASP10 | 0.011354062 | 2 | 2 | 0 |
| IQCB1 | 0.011560434 | 2 | 1 | 1 |
| ARAF | 0.012015277 | 2 | 1 | 1 |
| IGSF8 | 0.012114642 | 2 | 0 | 2 |
| RFX5 | 0.012226858 | 2 | 1 | 1 |
| PIAS3 | 0.01268028 | 2 | 1 | 1 |
| TAB3 | 0.016054429 | 2 | 1 | 1 |
| TRPV2 | 0.018313686 | 2 | 1 | 1 |
| TNFAIP3 | 0.019490645 | 2 | 1 | 1 |
| VLDLR | 0.023452248 | 2 | 2 | 0 |
| PDE3B | 0.036479269 | 2 | 0 | 2 |
| NCKAP1L | 0.037371546 | 2 | 2 | 0 |
The enrichment p-value is obtained using a Poisson test, where it is calculated using the length of each gene and the number of mutations (out of total 225 possible) found in that gene, and the total size of the gene panel.
Top 5 mutated genes.
| Subject | Age | Gender | Mutation | CADD score | Found in COSMIC | Allelic fraction |
|---|---|---|---|---|---|---|
| MS-4 | 48 yrs | F | STAT3*D661Y | 34 | Yes | 0.5% |
| MS-8 | 52 yrs | M | STAT3*S614R | 25.5 | Yes | 0.4% |
| Control-55 | 39 yrs | F | STAT3*Y640F | 24.2 | Yes | 1.9% |
| Control-31 | 30 yrs | F | STAT3*H410R | 26.6 | Yes | 1.1% |
| Control-3 | 57 yrs | F | STAT3*A596V | 23.6 | - | 0.4% |
| Control-45 | 28 yrs | M | STAT3*R278L | 35 | - | 0.5% |
| MS-20 | 26 yrs | F | SMARCA*G366D | 25.7 | - | 0.4% |
| MS-44 | 34 yrs | F | SMARCA*P153S | 23.7 | - | 0.6% |
| Control-7 | 54 yrs | M | SMARCA*D208Y | 24.4 | - | 0.8% |
| Control-25 | 28 yrs | F | SMARCA* Q228Rfs | n.a. | - | 1.3% |
| MS-24 | 32 yrs | F | SOCS1*F148L | 23.2 | Yes | 3.4% |
| Control-21 | 47 yrs | F | SOCS1*P97_G99del | n.a. | - | 4.6% |
| MS-2 | 34 yrs | F | DNMT3A* M700Hfs | n.a. | - | 0.5% |
| MS-14 | 23 yrs | M | DNMT3A*R214S | 32 | Yes | 0.8% |
| MS-50 | 55 yrs | M | DNMT3A*G494W | 35 | - | 1.2% |
| MS-46 | 30 yrs | M | PPP3CA*R426Q | 22.9 | Yes | 0.3% |
| MS-52 | 41 yrs | F | PPP3CA* T458R | 24.1 | - | 0.7% |
F = female, M = male. CADD scores ≥20 can be considered predictably deleterious. COSMIC = Catalogue of Somatic Mutations in Cancer. fs = frameshift. Note: Only one SMARCA2 variant listed in control-25 and one SOCS1 variant listed in MS-24. Two insertion-deletion variants were found at the SMRCA2 site (rs757850599 and rs753013339), but these are not necessarily independent variants due to their occurrence in a CAG-repeat. Their frequencies in ExAC are 0.000032 (rs757850599) and 0.000031 (rs753013339) and allelic fractions were 1.2% and 1.3% in our data. The SOCS1*F148L variant occurred together with a frameshift SOCS1*E149Rfs*57 variant.