| Literature DB >> 30845214 |
Maëva Veyssiere1, Javier Perea1, Laetitia Michou2, Anne Boland3, Christophe Caloustian3, Robert Olaso3, Jean-François Deleuze3, François Cornelis4, Elisabeth Petit-Teixeira1, Valérie Chaudru1.
Abstract
The triggering and development of Rheumatoid Arthritis (RA) is conditioned by environmental and genetic factors. Despite the identification of more than one hundred genetic variants associated with the disease, not all the cases can be explained. Here, we performed Whole Exome Sequencing in 9 multiplex families (N = 30) to identify rare variants susceptible to play a role in the disease pathogenesis. We pre-selected 77 genes which carried rare variants with a complete segregation with RA in the studied families. Follow-up linkage and association analyses with pVAAST highlighted significant RA association of 43 genes (p-value < 0.05 after 106 permutations) and pinpointed their most likely causal variant. We re-sequenced the 10 most significant likely causal variants (p-value ≤ 3.78*10-3 after 106 permutations) in the extended pedigrees and 9 additional multiplex families (N = 110). Only one SNV in SUPT20H: c.73A>T (p.Lys25*), presented a complete segregation with RA in an extended pedigree with early-onset cases. In summary, we identified in this study a new variant associated with RA in SUPT20H gene. This gene belongs to several biological pathways like macro-autophagy and monocyte/macrophage differentiation, which contribute to RA pathogenesis. In addition, these results showed that analyzing rare variants using a family-based approach is a strategy that allows to identify RA risk loci, even with a small dataset.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30845214 PMCID: PMC6405192 DOI: 10.1371/journal.pone.0213387
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Description of the study design.
This schema of the study presents the 3 main steps of the analysis and, the resulting number of variants selected through each sub-analysis. (A) The selection of candidate variants was performed on the discovery set and resulted in the identification of 77 candidate SNVs (B) The association analysis was applied on the discovery set and 98 controls extracted from the 1000 genomes project. The 43 genes significantly associated with RA were re-tested after removing the candidate variant identified in step A and, the candidate variant was tested alone (C) The SNVs with the strongest RA association were re-sequenced in the extended families of the discovery, plus 9 new multiplex families.
Description of the discovery set.
| Pedigree id | Affected by RA | Unaffected by RA | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Only RA reported | ||||||||||||||||||||||
| 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | ||||||||||||
| 3 | 3 | 2 | 1 | 3 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | |||||||||||
| 4 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||||||||||
| 5 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | ||||||||||||
| 6 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | |||||||||||||
| 7 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | |||||||||||
| RA and other AID reported | ||||||||||||||||||||||
| 9 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | |||||||||||||||
| 10 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | ||||||||
| 11 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | |||||||||
a: Rheumatoid Arthritis
b: Anti-Citrullinated Peptides Antibodies
c: Rheumatoid Factor
Fig 2Principal component analysis (PCA) results of samples processed for pVAAST analysis.
The PCA is based on the 30 samples from the discovery set and 98 CEU controls extracted from the 1000 genomes databases. This plot represents the 2 first principal components which respectively account for 2.12% and 1.87% of the genetic variability. The color code represents the population source: each family sequenced by WES has its own color described in the legend, the CEU controls are in grey. The shape of each dot represents the RA status of the represented individual.
Genes showing association signal in pVAAST analysis.
| Name | |||||||
|---|---|---|---|---|---|---|---|
| 3: 33,191,537–33,260,707 | 0.0003 (22.22) | 0.00063 (22.22) | 3 | 7 | |||
| 15: 52,043,758–52,108,565 | 0.0094 (7.07) | 0.002 (7.07) | 1 | 1 | |||
| 17: 8,108,056–8,113,918 | 0.00053 (15.93) | 0.0023 (15.93) | 2 | 4 | |||
| 4: 170,314,426–170,533,780 | 0.0021 (8.56) | 0.0031 (8.56) | 1 | 7 | |||
| 17: 9,674,751–9,694,614 | 0.0094 (7.13) | 0.0041 (7.13) | 2 | 7 | |||
| 12: 133,345,495–133,405,444 | 0.0057 (13.15) | 0.0057 (13.15) | 2 | 10 | |||
| 9: 22,002,902–22,009,362 | 0.004 (8.37) | 0.0057 (8.37) | 1 | 1 | |||
| 16: 30,768,744–30,774,031 | 0.0032 (9.82) | 0.006 (9.82) | 1 | 1 | |||
| 11: 62,414,320–62,420,774 | 0.031 (13.35) | 0.0068 (13.35) | 1 | 1 | |||
| 13: 37,583,449–37,633,850 | 0.004 (15.15) | 0.0068 (15.15) | 1 | 3 | |||
| 5: 111,478,138–111,755,013 | 0.024 (18.19) | 0.0068 (18.19) | 2 | 8 | |||
| 13: 70,274,726–70,682,591 | 0.002 (12.31) | 0.0071 (12.31) | 2 | 5 | |||
| 13: 24,553,944–24,881,212 | 0.021 (13) | 0.0087 (13) | 2 | 12 | |||
| 8: 27,168,999–27,316,903 | 0.014 (12.72) | 0.0097 (12.72) | 2 | 12 | |||
| 10: 103,911,933–103,923,627 | 0.011 (12.89) | 0.01 (12.89) | 1 | 5 | |||
| 2: 73,441,350–73,454,365 | 0.016 (8.36) | 0.011 (8.36) | 2 | 3 | |||
| 2: 241,544,848–241,570,676 | 0.011 (13.72) | 0.012 (13.72) | 1 | 12 | |||
| 13: 100,741,269–101,182,686 | 0.036 (7.7) | 0.013 (7.7) | 1 | 3 | |||
| 9: 114,312,002–114,362,135 | 0.011 (7.89) | 0.014 (7.89) | 1 | 2 | |||
| 12: 123,773,656–123,849,390 | 0.0097 (11.01) | 0.015 (11.01) | 2 | 6 | |||
| 16: 16,043,434–16,236,931 | 0.014 (15.17) | 0.015 (15.17) | 2 | 8 | |||
| 2: 37,326,353–37,384,208 | 0.014 (9.01) | 0.015 (9.01) | 1 | 2 | |||
| 2: 202,241,930–202,316,302 | 0.014 (6.45) | 0.016 (6.45) | 1 | 6 | |||
| 10: 50,887,697–50,918,307 | 0.009 (13.76) | 0.017 (13.76) | 1 | 3 | |||
| 2: 96,778,707–96,781,984 | 0.019 (7.03) | 0.019 (7.03) | 1 | 3 | |||
| X: 135067958–135129423 | 0.015 (6.57) | 0.02 (6.57) | 1 | 1 | |||
| 1: 16,558,195–16,563,657 | 0.0074 (9.85) | 0.023 (9.85) | 1 | 2 | |||
| 3: 43,731,605–43,775,863 | 0.0094 (12.41) | 0.024 (12.41) | 1 | 3 | |||
| 11: 71,950,121–71,956,708 | 0.017 (12.36) | 0.026 (12.36) | 1 | 1 | |||
| 1: 44,457,031–44,462,200 | 0.012 (6.41) | 0.026 (6.41) | 1 | 3 | |||
| 19: 7,991,603–8,008,805 | 0.015 (11.37) | 0.027 (11.37) | 2 | 8 | |||
| 5: 172,741,716–172,756,506 | 0.013 (8.74) | 0.027 (8.74) | 1 | 2 | |||
| 15: 56,713,742–56,757,335 | 0.019 (12.17) | 0.028 (12.17) | 2 | 5 | |||
| 1: 111,214,310–111,217,655 | 0.036 (8.1) | 0.028 (8.1) | 1 | 1 | |||
| 17: 80,477,589–80,602,538 | 0.021 (11.71) | 0.03 (11.71) | 2 | 6 | |||
| 14: 31,363,005–31,495,607 | 0.034 (5.39) | 0.033 (5.39) | 1 | 4 | |||
| 18: 12,254,318–12,277,594 | 0.031 (9.33) | 0.034 (9.33) | 1 | 3 | |||
| 10: 75,757,872–75,879,918 | 0.031 (9.33) | 0.035 (9.33) | 1 | 3 | |||
| 15: 22,833,395–22,873,892 | 0.016 (7.21) | 0.036 (7.21) | 2 | 3 | |||
| 10: 70,237,756–70,287,231 | 0.024 (12.35) | 0.037 (12.35) | 1 | 1 | |||
| 5: 153,570,290–153,800,544 | 0.041 (3.93) | 0.046 (3.93) | 3 | 3 | |||
| 9: 91,933,421–91,974,557 | 0.02 (7.32) | 0.047 (7.32) | 1 | 3 | |||
| 19: 17,326,155–17,330,638 | 0.026 (12.36) | 0.049 (12.36) | 1 | 2 | |||
The presented genes are ordered according to the association p-value of pVAAST test with sex co-variable control.
a: position on the human reference genome hg19
b: p-values computed from 106 permutations (score)
Association analysis of candidate SNVs in 43 candidate genes.
| HGVS annotation | |||||
|---|---|---|---|---|---|
| 1 | 19: 7997576 | c.923C>A | 0.0017 (11.37) | 0.26 (2.58) | |
| 2 | X: 135122270 | c.1763C>T | 0.002 (6.57) | 1 (0) | |
| 2 | 15: 52060595 | c.263C>T | 0.0023 (7.07) | 1 (0) | |
| 2 | 11: 62415889 | c.1663C>T | 0.0024 (13.35) | 1 (0) | |
| 1 | 8: 27255130 | c.29G>A | 0.0028 (6.45) | 0.17 (5.15) | |
| 1 | 5: 111504714 | c.1828C>T | 0.003 (13.76) | 0.41 (4.03) | |
| 2 | 2: 241569447 | c.171C>G | 0.0031 (13.72) | 1 (0) | |
| 2 | 4: 170506525 | c.782G>A | 0.0035 (8.6) | 1 (0) | |
| 2 | 13: 100921021 | c.898G>A | 0.0037 (7.7) | 1 (0) | |
| 2 | 13:37622040 | c.73A>T | 0.0038 (15.15) | 1 (0) | |
| 1 | 15: 56736015 | c.724C>T | 0.0038 (15.15) | 0.036 (2.64) | |
| 1 | 17: 9676206 | c.608T>C | 0.0041 (6.98) | 0.41 (0.84) | |
| 2 | 10: 103917202 | c.331C>T | 0.0043 (12.89) | 1 (0) | |
| 2 | 1: 16558713 | c.607C>T | 0.0044 (10.72) | 0.93 (0.01) | |
| 2 | 3: 43753284 | c.590G>C | 0.0046 (12.41) | 1 (0) | |
| 1 | 17: 8109861 | c.637G>C | 0.0051 (10.39) | 0.14 (3.54) | |
| 2 | 16:30768844 | c.949C>T | 0.0051 (9.82) | 1 (0) | |
| 2 | 9: 22006147 | c.256G>A | 0.0052 (8.37) | 1 (0) | |
| 1 | 13: 70681732 | c.100G>T | 0.0068 (9.53) | 0.11 (2.77) | |
| 2 | 9: 114345759 | c.488A>T | 0.0074 (7.89) | 1 (0) | |
| 1 | 12: 133363022 | c.3026G>A | 0.0077 (8.59) | 0.24 (3.17) | |
| 2 | 2: 202252596 | c.1526G>A | 0.014 (6.45) | 1 (0) | |
| 1 | 16: 16208739 | c.3196C>T | 0.016 (9.33) | 0.28 (3.86) | |
| 2 | 10: 50901917 | c.195C>A | 0.017 (13.76) | 1 (0) | |
| 2 | 5: 172755066 | c.131G>T | 0.018 (8.74) | 1 (0) | |
| 2 | 18: 12262897 | c.214C>T | 0.018 (9.33) | 1 (0) | |
| 1 | 12: 123806235 | c.2170G>T | 0.019 (10.21) | 0.9 (0.17) | |
| 1 | 15: 22864369 | c.2327G>A | 0.019 (7.21) | 1 (0) | |
| 2 | 2: 37347149 | c.1201G>A | 0.02 (9.01) | 1 (0) | |
| 1 | 13: 24868977 | c.3367C>T | 0.02 (6.79) | 0.076 (6.46) | |
| 2 | 2: 96780655 | c.1234G>A | 0.021 (7.03) | 1 (0) | |
| 2 | 1: 111216792 | c.640C>T | 0.022 (8.1) | 1 (0) | |
| 2 | 19: 17330089 | c.490C>T | 0.023 (12.36) | 1 (0) | |
| 2 | 10: 75855428 | c.1558C>T | 0.023 (9.33) | 1 (0) | |
| 2 | 1: 44461756 | c.848G>A | 0.025 (6.41) | 1 (0) | |
| 1 | 5: 153765901 | c.967G>A | 0.026 (3.93) | 1 (0) | |
| 2 | 10: 70246944 | c.799C>T | 0.031 (12.35) | 1 (0) | |
| 2 | 11: 71950885 | c.763C>T | 0.032 (12.36) | 1 (0) | |
| 2 | 9: 91954819 | c.1253G>T | 0.033 (7.32) | 1 (0) | |
| 2 | 14: 31371744 | c.2135C>T | 0.034 (5.39) | 1 (0) | |
| 1 | 17: 80543821 | c.1321C>T | 0.037 (7.93) | 0.17 (4.17) | |
| 1 | 2: 73450610 | c.833G>A | 0.067 (1.69) | 0.025 (4.86) | |
| 1 | 3: 33216450 | c.526C>T | 0.21 (5.55) | 0.0037 (14.67) | |
The genes are sorted according to the association p-value of the candidate variant. For each gene, the candidate variant is the one with the highest score.
a: The two groups correspond to:
1 –genes containing multiple rare variants contributing to pVAAST score
2 –genes containing one rare variant leading the association
b: position on the human reference genome hg19
c: annotation on the canonical transcript
d: p-values computed from 106 permutations (score)
Top 10 RA associated SNVs and their re-sequencing results.
| 13 | 37622040 | T/A | . | 3 | 4/4 | 0/5 | |
| X | 135122270 | C/T | . | 6 | 4/4 | 1/3 | |
| 19 | 7997576 | G/T | rs139625465 | 10 | 2/2 | 1/3 | |
| 4 | 170506525 | C/T | rs200161705 | 3 | 4/4 | 2/5 | |
| 15 | 52060595 | C/T | . | 4 | 4/5 | 3/6 | |
| 13 | 100921021 | G/A | . | 3 | 4/4 | 2/5 | |
| 5 | 111504714 | G/A | rs200812889 | 4 | 2/5 | 2/6 | |
| 2 | 241569447 | C/G | rs141249079 | ||||
| 11 | 62415889 | G/A | . |
a: rs number from dbsnp138
Fig 3Gene SUPT20H and its product: p38IP.
Abbreviations: NLS: Nuclear Localization signal, PEST: Proline (P) Glutamic acid (E) Serine (S) Threonine. (A) Gene model of SUPT20H. The red arrow indicates the exon including the nonsense variant identified by WES. The exons color code refers to the protein domain it encodes for. (B) On top, p38IP protein model of 733 amino-acid and on the bottom, the positions of p38IP interactors binding sites.
Fig 4Representation of the pedigree 3 and the variants identified in his members.
All the individuals represented here were part of the validation set but only the ones marked with a red arrow were part of the discovery set. Each of them is represented by a circle (if female) or a square (if male). Their filling color corresponds to their status: black if affected by RA or white if unaffected by the disease. The presence of a variant is represented by its HGVS id. The absence of this id indicates a homozygote genotype for the reference allele.