| Literature DB >> 31467281 |
Cristina M Lanata1, Ishan Paranjpe2,3, Joanne Nititham1, Kimberly E Taylor1, Milena Gianfrancesco1, Manish Paranjpe2, Shan Andrews2, Sharon A Chung1, Brooke Rhead4, Lisa F Barcellos4, Laura Trupin1, Patricia Katz1, Maria Dall'Era1, Jinoos Yazdany1, Marina Sirota2, Lindsey A Criswell5,6.
Abstract
Systemic lupus erythematous (SLE) is a heterogeneous autoimmune disease in which outcomes vary among different racial groups. Here, we aim to identify SLE subgroups within a multiethnic cohort using an unsupervised clustering approach based on the American College of Rheumatology (ACR) classification criteria. We identify three patient clusters that vary according to disease severity. Methylation association analysis identifies a set of 256 differentially methylated CpGs across clusters, including 101 CpGs in genes in the Type I Interferon pathway, and we validate these associations in an external cohort. A cis-methylation quantitative trait loci analysis identifies 744 significant CpG-SNP pairs. The methylation signature is enriched for ethnic-associated CpGs suggesting that genetic and non-genetic factors may drive outcomes and ethnic-associated methylation differences. Our computational approach highlights molecular differences associated with clusters rather than single outcome measures. This work demonstrates the utility of applying integrative methods to address clinical heterogeneity in multifactorial multi-ethnic disease settings.Entities:
Mesh:
Year: 2019 PMID: 31467281 PMCID: PMC6715644 DOI: 10.1038/s41467-019-11845-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Integrative analysis pipeline. An overview of the omics data integration strategy used to characterize clinical clusters identified by K-means clustering. MCA = Multiple Component Analysis, HWE = Hardy-Weinberg Equilibrium, MAF = minor allele frequency, LD = linkage disequilibrium, FDR = false discovery rate, meQTL = cis-methylation quantitative trait loci
Fig. 2Characterization of clinical features between clusters. a Distribution of American College of rheumatology (ACR) classification criteria for SLE within each cluster where red indicates presence and blue absence of each criterion. Association between each criterion and cluster was evaluated by a Fisher exact test. b Criteria significantly associated with each cluster (FDR < 0.01). c Distribution of lupus severity index across clusters with p-value computed using an ANOVA test
Summary of significant clinical and demographic variables across clusters
| Cluster | |||||
|---|---|---|---|---|---|
| M ( | S1 ( | S2 ( | FDR | ||
|
| |||||
| Malar rash | 68.3 | 20.8 | 62.8 | 1.22E-15 | 4.87E-15 |
| Discoid rash | 14.9 | 5.2 | 20.5 | 1.41E-03 | 1.88E-03 |
| Oral ulcers | 70.3 | 26.6 | 48.7 | 4.72E-11 | 1.18E-10 |
| Arthritis | 93.1 | 66.2 | 88.5 | 1.14E-07 | 2.53E-07 |
| Pleuritis | 48.0 | 17.5 | 38.5 | 7.41E-07 | 1.48E-06 |
| Pericarditis | 13.0 | 13.6 | 35.9 | 4.71E-05 | 7.24E-05 |
| Seizure | 8.0 | 4.5 | 9.0 | 0.354 | 0.354 |
| Psychosis | 5.0 | 0.6 | 9.0 | 6.79E-03 | 7.54E-03 |
| anti-dsDNA antibodies | 35.6 | 76.0 | 92.3 | 9.79E-17 | 4.90E-16 |
| anti-Smith antibodies | 9.9 | 27.9 | 57.7 | 2.64E-11 | 7.55E-11 |
| ANA positivity | 89.1 | 98.7 | 98.7 | 3.20E-04 | 4.57E-04 |
| Hemolytic anemia | 1.0 | 8.5 | 15.4 | 1.65E-03 | 2.07E-03 |
| Leukopenia | 4.0 | 11.7 | 65.4 | 1.34E-25 | 2.68E-24 |
| Lymphopenia | 19.8 | 16.9 | 76.9 | 1.67E-21 | 1.67E-20 |
| Thrombocytopenia | 8.9 | 13.0 | 34.6 | 6.82E-06 | 1.24E-05 |
| Renal | 15.8 | 62.3 | 57.7 | 3.32E-13 | 1.11E-12 |
| Photosensitivity | 73.3 | 18.2 | 41.0 | 1.91E-17 | 1.27E-16 |
| APLA | 25.7 | 32.5 | 42.3 | 0.0641 | 0.0675 |
|
| |||||
| White | 48.0 | 22.7 | 16.7 | 4.76E-04 | 1.90E-03 |
| Hispanic | 18.0 | 22.1 | 30.8 | 0.115 | 0.23 |
| African–American | 11.0 | 9.7 | 12.8 | 0.642 | 0.64 |
| Asian | 20.0 | 44.8 | 37.2 | 1.43E-03 | 4.30E-03 |
| Other | 3.0 | 0.6 | 2.6 | 2.57E-05 | 1.29E-04 |
| Lupus Severity Index (SD) | 5.6 (1.37) | 7.39 (1.43) | 7.41 (1.42) | 2.93E-22 | 2.05E-21 |
| SLEDAI Score (SD) | 2.43 (2.94) | 2.82 (2.9) | 3.94 (3.44) | 3.58E-03 | 1.08E-02 |
ACR American college of Rheumatology, APL antiphospholipid antibodies, FDR false discovery rate, SLEDAI SLE disease activity index
False Discovery Rate (FDR) p-values were calculated for Kruskall–Wallis (continuous variables) or Fisher’s exact test (binary variables)
Fig. 3Cluster associated CpGs and meQTL associations. a Heatmap of CpGs significantly associated with clinical cluster (FDR < 0.1) b Manhattan plot shows −log10(p-value) for associations between cluster-associated CpGs and all SNPs within 1 Mb of each CpG. For each CpG with a significant meQTL (FDR < 0.05), the most significant variant is labelled with its corresponding gene
Significantly enriched pathways in cluster-associated CpGs
| Name | Source | FDR B&H | Genes from Input | Genes in Annotation | |
|---|---|---|---|---|---|
| Interferon signaling | REACTOME | 9.54E-32 | 8.19E-29 | 29 | 202 |
| Interferon alpha/beta signaling | REACTOME | 2.06E-30 | 8.84E-28 | 21 | 69 |
| Cytokine signaling in immune system | REACTOME | 2.26E-20 | 6.46E-18 | 34 | 763 |
| Interferon gamma signaling | REACTOME | 1.04E-15 | 2.23E-13 | 14 | 94 |
| Influenza A | KEGG | 3.35E-13 | 5.76E-11 | 15 | 173 |
| Herpes simplex infection | KEGG | 8.98E-13 | 1.29E-10 | 15 | 185 |
| Measles | KEGG | 1.09E-09 | 1.33E-07 | 11 | 134 |
| ISG15 antiviral mechanism | REACTOME | 1.67E-09 | 1.59E-07 | 9 | 77 |
| Antiviral mechanism by IFN-stimulated genes | REACTOME | 1.67E-09 | 1.59E-07 | 9 | 77 |
| Hepatitis C | KEGG | 1.33E-08 | 1.14E-06 | 10 | 131 |
| RIG-I/MDA5 mediated induction of IFN-alpha/beta pathways | REACTOME | 7.27E-08 | 5.68E-06 | 8 | 84 |
| Negative regulators of RIG-I/MDA5 signaling | REACTOME | 1.13E-07 | 8.08E-06 | 6 | 36 |
| TRAF3-dependent IRF activation pathway | REACTOME | 3.00E-06 | 1.87E-04 | 4 | 16 |
| TRAF6 mediated IRF7 activation | REACTOME | 3.05E-06 | 1.87E-04 | 5 | 35 |
| RIG-I-like receptor signaling pathway | KEGG | 6.35E-06 | 3.64E-04 | 6 | 70 |
| Antigen presentation: folding, assembly and peptide loading of class I MHC | REACTOME | 1.99E-05 | 1.01E-03 | 4 | 25 |
| TRAF6 mediated NF-kB activation | REACTOME | 1.99E-05 | 1.01E-03 | 4 | 25 |
| Viral carcinogenesis | KEGG | 5.15E-05 | 2.46E-03 | 8 | 201 |
| Endosomal/vacuolar pathway | REACTOME | 5.81E-05 | 2.62E-03 | 3 | 12 |
| NF-kB activation through FADD/RIP-1 pathway mediated by caspase-8 and -10 | REACTOME | 7.51E-05 | 3.23E-03 | 3 | 13 |
| NOD-like receptor signaling pathway | KEGG | 1.26E-04 | 5.14E-03 | 7 | 170 |
| Nicotinamide salvaging | REACTOME | 2.09E-04 | 8.17E-03 | 3 | 18 |
| ER-Phagosome pathway | REACTOME | 2.65E-04 | 9.91E-03 | 5 | 87 |
FDR B&H false discovery rate by Benjamini and Hochberg method
CpGs were mapped to genes using Illumina annotation file, and pathway analysis was performed using ToppFunn[92]
Summary of cluster-wise comparison and validation
| Comparison | Number of Differentially Methylated CpGs in study cohort (# CpGs on 450k Chip) | Significant in Validation Set (FDR < 0.1) | ΔBeta R-squared CLUES vs. Validation |
|---|---|---|---|
| Cluster S1 vs M | 53 (28) | 21 | 0.94 |
| Cluster S2 vs S1 | 18 (8) | 6 | 0.96 |
| Cluster S2 vs M | 247 (122) | 105 | 0.94 |
Rows indicate individual pairwise comparisons as performed using the nestedF method in Limma
Fig. 4Validation of differentially methylated CpGs between clusters. Differentially methylated CpGs between clusters were validated in an external dataset (19) a cluster S1 vs M. b cluster S2 vs S1 c cluster S2 vs M. Difference in CpG beta values for differentially methylated sites in CLUES data on x axis and corresponding delta beta value for CpG in validation data on y axis. Green color indicates significant association with cluster in validation dataset (FDR < 0.1)
Fig. 5Identification of candidate cluster-associated cg07259759 (USP35) that mediates genetic association of rs7104222 (GAB2) with clusters. Association of DNA methylation of cg07259759 and cluster (a) or genotype of rs7104222 (b). c Association between genotype rs7104222 and clusters. d Beta coefficient represents the dependence of cluster on genotype with or without adjusting for methylation. Error bars represent the 95% confidence interval for beta coefficient estimate. After adjusting for methylation, the observed dependence reduces toward zero
Fig. 6Enrichment of ethnicity-associated CpGs in set of cluster-associated CpGs. a Null distribution generated by randomly permuting ethnicity labels 1000 times and identifying the number of cluster associated CpGs that were also significantly with ethnicity (p < 0.05) in each sample. The red line indicates the number of significant ethnic-associated CpGs (238) found in the set of 256 cluster-associated CpGs. b Working model illustrating the role of ethnic-associated non-genetic factors in controlling both SLE disease subtypes and methylation signature