| Literature DB >> 31672989 |
Elena López-Isac1, Marialbert Acosta-Herrera2, Martin Kerick2, Shervin Assassi3, Ansuman T Satpathy4,5, Jeffrey Granja4,5, Maxwell R Mumbach4,5, Lorenzo Beretta6, Carmen P Simeón7, Patricia Carreira8, Norberto Ortego-Centeno9, Ivan Castellvi10, Lara Bossini-Castillo11, F David Carmona12, Gisela Orozco13, Nicolas Hunzelmann14, Jörg H W Distler15, Andre Franke16, Claudio Lunardi17, Gianluca Moroncini18, Armando Gabrielli18, Jeska de Vries-Bouwstra19, Cisca Wijmenga20, Bobby P C Koeleman21, Annika Nordin22, Leonid Padyukov22, Anna-Maria Hoffmann-Vold23, Benedicte Lie24, Susanna Proudman25, Wendy Stevens26, Mandana Nikpour27, Timothy Vyse28, Ariane L Herrick29,30, Jane Worthington13, Christopher P Denton31, Yannick Allanore32, Matthew A Brown33, Timothy R D J Radstake34, Carmen Fonseca31, Howard Y Chang4,5, Maureen D Mayes3, Javier Martin35.
Abstract
Systemic sclerosis (SSc) is an autoimmune disease that shows one of the highest mortality rates among rheumatic diseases. We perform a large genome-wide association study (GWAS), and meta-analysis with previous GWASs, in 26,679 individuals and identify 27 independent genome-wide associated signals, including 13 new risk loci. The novel associations nearly double the number of genome-wide hits reported for SSc thus far. We define 95% credible sets of less than 5 likely causal variants in 12 loci. Additionally, we identify specific SSc subtype-associated signals. Functional analysis of high-priority variants shows the potential function of SSc signals, with the identification of 43 robust target genes through HiChIP. Our results point towards molecular pathways potentially involved in vasculopathy and fibrosis, two main hallmarks in SSc, and highlight the spectrum of critical cell types for the disease. This work supports a better understanding of the genetic basis of SSc and provides directions for future functional experiments.Entities:
Mesh:
Year: 2019 PMID: 31672989 PMCID: PMC6823490 DOI: 10.1038/s41467-019-12760-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Twenty-seven signals independently associated with systemic sclerosis in the meta-GWAS
| Chr | Locus | Bp | SNP | Index SNP | Ref. | MAF |
| OR |
|
|
| Func refgene | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 |
| 67814440 | rs3790566 | Yes | T | 0.24 | 13 | 3.84E-10 | 1.16 | 0.80 | 0 | - | Intronic |
| 1 |
| 167420425 | rs2056626 | Yes | G | 0.39 | 6 | 1.31E-11 | 0.81 | 0.57 | 0 | - | Intronic |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3.84E-07 |
|
| 2 |
| 191902758 | rs3821236 | Yes | A | 0.20 | 12 | 1.94E-23 | 1.31 | 0.03 | 48.21 | - | Intronic |
| 2 |
| 191959489 | rs4853458 | No | A | 0.23 | 9 | 4.86E-18 | 1.35 | 0.42 | 1.79 | 5.58E-08 | Intronic |
| 3 |
| 58131515 | rs7355798 | No | T | 0.24 | 13 | 1.24E-08 | 1.14 | 0.14 | 30.5 | 7.42E-07 | Intronic |
| 3 |
| 58375286 | rs4076852 | Yes | G | 0.26 | 13 | 1.04E-10 | 1.16 | 0.71 | 0 | - | Intronic |
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
| 3 |
| 159733527 | rs589446 | Yes | T | 0.35 | 11 | 1.95E-10 | 0.86 | 0.85 | 0 | - | ncRNA intronic |
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
| 5 |
| 150455732 | rs3792783 | Yes | G | 0.16 | 14 | 2.42E-12 | 1.20 | 0.03 | 47.41 | - | Intronic |
| 6 |
| 106734040 | rs633724 | Yes | T | 0.35 | 14 | 2.84E-09 | 1.13 | 0.31 | 13.41 | - | Intronic |
| 7 |
| 128651522 | rs36073657 | Yes | T | 0.10 | 12 | 3.10E-21 | 1.40 | 0.21 | 23.35 | - | Intronic |
| 7 |
| 128658739 | rs12155080 | No | G | 0.37 | 13 | 2.87E-13 | 0.85 | 0.69 | 0 | 2.22E-07 | Intronic |
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
| 15 |
| 75077367 | rs1378942 | Yes | C | 0.39 | 13 | 1.84E-14 | 1.18 | 0.90 | 0 | - | Intronic |
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
|
|
|
|
|
|
|
|
|
|
|
|
| - |
|
| 19 |
| 18193191 | rs2305743 | Yes | A | 0.20 | 12 | 4.64E-10 | 0.83 | 0.28 | 16.88 | - | Intronic |
The new genome-wide significant loci for systemic sclerosis are highlighted in bold. NAB1-rs16832798 p value conditioned on conditioned on STAT4-rs3821236 and STAT4-rs4853458. For those intronic or regulatory SNPs that are located in a high gene density region, the gene they lie in was underlined
Bp base pair, Chr chromosome, MAF minor allele frequency in the 1000 Genome Project European Population, N number of cohorts, OR odds ratio, Pcond p value conditioned on index SNP, Ref. reference allele, SNP single-nucleotide polymorphism
Fig. 1Association signals for systemic sclerosis in a large meta-GWAS. a Manhattan plot representing the meta-GWAS results. The −log10 of the p values are plotted against their physical chromosomal position. The red and blue lines represent the genome-wide level of significance (p < 5 × 10−8) and p value threshold at p < 1 × 10−5, respectively. The plot has been truncated at p < 1 × 10−30. The lowest p value was observed within the MHC region for rs6457617 (association test p = 3.25 × 10−43). b Locuszoom to depict independent association signals in IRF5-TNPO3. From left to right, locuszoom of the association signals in IRF5-TNPO3 for the global meta-analysis; association signals conditioned on the lead SNP (rs36073657), and conditioned on rs36073657 and the secondary signal at the locus (rs12155080)
Posterior probabilities of systemic sclerosis fine-mapped loci
| Chr | Credible set locus | SNPs Cred. Set | Index SNP | PP Index | SNP PP Max | PP Max | Func.refgene SNP PP Max | Func.refgene in the 95% credible set |
|---|---|---|---|---|---|---|---|---|
| 1 |
| 6 | rs3790566 | 0.195 | rs3790567 | 0.321 | Intronic | Intronic |
| 1 |
| 1 | rs2056626 | 0.999 | rs2056626 | 0.999 | Intronic | - |
| 1 |
| 6 | rs2022449 | 0.659 | rs2022449 | 0.659 | ncRNA Intronic | ncRNA_intronic |
| 1 |
| 43 | rs1857066 | 0.046 | rs11576547 | 0.265 | ncRNA intronic | ncRNA_intronic |
| 2 |
| 11 | rs16832798 | 0.191 | rs716254 | 0.242 | Intronic | Intronic; intergenic; downstream |
| 2 |
| 1 | rs3821236 | 1.000 | rs3821236 | 1.000 | Intronic | - |
| 2 |
| 2 | rs4853458 | 0.905 | rs4853458 | 0.905 | Intronic | Intronic |
| 3 |
| 6 | rs7355798 | 0.365 | rs7355798 | 0.365 | Intronic | Intronic |
| 3 |
| 27 | rs4076852 | 0.123 | rs7653734 | 0.292 | Intronic | Intronic; intergenic |
| 3 |
| 1 | rs9884090 | 0.956 | rs9884090 | 0.956 | Intronic | - |
| 3 |
| 23 | rs589446 | 0.385 | rs589446 | 0.385 | ncRNA intronic | ncRNA_intronic |
| 4 |
| 2 | rs11724804 | 0.793 | rs11724804 | 0.793 | Intronic | Intronic |
| 4 |
| 6 | rs230534 | 0.200 | rs230517 | 0.329 | Intronic | Intronic |
| 5 |
| 1 | rs3792783 | 0.999 | rs3792783 | 0.999 | Intronic | - |
| 6 |
| 3 | rs633724 | 0.588 | rs633724 | 0.588 | Intronic | Intronic |
| 8 |
| 1 | rs2736340 | 1.000 | rs2736340 | 1.000 | Intergenic | - |
| 8 |
| 80 | rs685985 | 0.003 | rs6987084 | 0.139 | Intronic | Intronic; UTR3; intergenic |
| 11 |
| 4 | rs6598008 | 0.760 | rs6598008 | 0.760 | Intronic | Intronic; exonic synonymous SNV; UTR3; exonic nonsynonymous |
| 11 |
| 20 | rs2651804 | 0.184 | rs2651804 | 0.184 | Intergenic | Intergenic |
| 11 |
| 7 | rs11217020 | 0.021 | rs10892286 | 0.775 | Intronic | Intronic; intergenic |
| 15 |
| 1 | rs1378942 | 0.993 | rs1378942 | 0.993 | Intronic | - |
| 16 |
| 6 | rs11117420 | 0.202 | rs11117422 | 0.54 | Intergenic | Intergenic |
| 17 |
| 17 | rs883770 | 0.032 | rs9303277 | 0.157 | Intronic | Intergenic; intronic; exonic synonymous SNV; exonic nonsynonymous |
| 17 |
| 2 | rs1005714 | 0.940 | rs1005714 | 0.940 | Intronic | Intronic |
| 19 |
| 2 | rs2305743 | 0.944 | rs2305743 | 0.944 | Intronic | Intronic |
Chr chromosome, PP posterior probability, SNP single-nucleotide polymorphism
1Name of the credible set that comprised the index SNP from STAT4 genomic region (2q32.2-q32.3)
2Name of the credible set that comprised the secondary association signal in STAT4 genomic region (2q32.2-q32.3)
H3K27ac HiChIP target genes and nominated target genes for the 27 systemic sclerosis association signals
| Ch | Locus | SNP PP Max | HiChIP target genes with SNP PP Max | Nominated genes by H3K27ac HiChIP + eQTL validation | Nominated genes by CHi-C + eQTL validation | Prioritized genes by DEPICT/other criteria |
|---|---|---|---|---|---|---|
| 1 |
| rs3790567 |
|
|
|
|
| 1 |
| rs2056626 |
|
|
|
|
| 1 |
| rs2022449 |
| |||
| 1 |
| rs11576547 |
| |||
| 2 |
| rs716254 |
|
|
| |
| 2 |
| rs3821236 |
|
|
| |
| 2 |
| rs4853458 |
|
| ||
| 3 |
| rs7355798 |
|
| ||
| 3 |
| rs7653734 |
|
| ||
| 3 |
| rs9884090 |
|
| ||
| 3 |
| rs589446 |
|
|
| |
| 4 |
| rs11724804 |
| |||
| 4 |
| rs230517 |
|
|
|
|
| 5 |
| rs3792783 |
|
|
| |
| 6 |
| rs633724 |
| |||
| 7 |
| rs36073657 |
|
| ||
| 7 |
| rs12155080 |
|
|
| |
| 8 |
| rs2736340 |
|
| ||
| 8 |
| rs6987084 |
| |||
| 11 |
| rs6598008 |
|
|
| |
| 11 |
| rs2651804 |
|
| ||
| 11 |
| rs10892286 |
|
| ||
| 15 |
| rs1378942 |
|
|
| |
| 16 |
| rs11117422 |
|
|
| |
| 17 |
| rs9303277 |
|
|
| |
| 17 |
| rs1005714 |
|
|
| |
| 19 |
| rs2305743 |
|
|
|
In column ‘HiChIP Target Genes withSNP PP Max’, genes with HiChIP and eQTL signals are highlighted in bold
Ch, chromosome; PP, posterior probability from the statistical fine-mapping; SNP, single-nucleotide polymorphism
*Credible set not feasible
**NO eQTL signals found but strong HiChIP signal
aGene nominated by proximity
bGenes nominated according to the results from the first Immunochip in systemic sclerosis (9)
Fig. 2H3K27ac HiChIP signals at systemic sclerosis loci in human CD4+ T cells. The SNPs with maximum Posterior Probabilities in each locus were set as anchor points to assess promoter-enhancer chromatin interactions. Representation of overlap with ATAC-seq peaks is included. Chr, chromosome; Kb, kilo base; Mb, mega base; Res, resolution
Fig. 3Tissue-specific enrichment for systemic sclerosis associations in epigenetic marks. The heatmap displays the significant enrichment (p value < 1.25 × 10−4) in 59 out of the 127 cell and tissue types in Roadmap Epigenomics Consortium and the Encyclopedia of DNA Elements (ENCODE) projects. The enrichment p values are plotted with different colors according to the strength of the significance. Since the enrichments were computed at various GWAS p value cutoffs (5 × 10−6, 5 × 10−7, 5 × 10−8), the most significant p value was selected if a cell type/epigenetic mark combination showed more than one significant enrichment across the different cutoffs. Supplementary Data 13 provides the correspondence between cell codes and cell types