| Literature DB >> 35142878 |
Inna Starskaia1,2,3, Essi Laajala1,2,3,4, Toni Grönroos1,2, Taina Härkönen5,6, Sini Junttila1,2, Roosa Kattelus1,2, Henna Kallionpää1, Asta Laiho1,2, Veronika Suni1, Vallo Tillmann7,8, Riikka Lund1, Laura L Elo1,2,9, Harri Lähdesmäki4, Mikael Knip5,6,10, Ubaid Ullah Kalim11,12, Riitta Lahesmaa13,14,15.
Abstract
AIMS/HYPOTHESIS: Type 1 diabetes is a chronic autoimmune disease of complex aetiology, including a potential role for epigenetic regulation. Previous epigenomic studies focused mainly on clinically diagnosed individuals. The aim of the study was to assess early DNA methylation changes associated with type 1 diabetes already before the diagnosis or even before the appearance of autoantibodies.Entities:
Keywords: DNA methylation; Epigenetics; T cells; Type 1 diabetes
Mesh:
Substances:
Year: 2022 PMID: 35142878 PMCID: PMC8960578 DOI: 10.1007/s00125-022-05657-x
Source DB: PubMed Journal: Diabetologia ISSN: 0012-186X Impact factor: 10.122
Fig. 1Study design and analysis workflow schematics. (a) Study design. Each case individual is visualised as a red line above the corresponding control individual, visualised as a black line. The sample collection time points are marked as triangles. Diagnosis of type 1 diabetes and seroconversion are marked as blue and red asterisks, respectively. (b) The schematic shows the outline of the RRBS data analysis workflow. Both (a) and (b) were created with BioRender. M-biases, biased average methylation level at 5′ or 3′ of the reads; PC1, principal component 1; PC2, principal component 2; PCA, principal component analysis
Fig. 2DMCs. Methylation profiles of top-ranked DMCs between cases and controls identified in CD4+ T cells (a–d), CD8+ T cells (e, f) and CD4−CD8− fractions (g, h). The plots are coloured according to the pairwise methylation difference, as specified in the colour key. Blue and red indicate hypo- and hypermethylation of the CpG site in cases, compared with controls, respectively. Black represents missing values. The pairs of individuals are not in any particular order. A black dot in the boxes indicates a sample collected after seroconversion. m, months
Top-ranked DMCs at FDR <0.0001 identified between cases and controls in all longitudinal samples
| DMC (hg19) | Fraction | Methylation difference | FDR, <0.0001 | Nearest gene | Genomic region | |
|---|---|---|---|---|---|---|
| chr2:113192477 | CD4+ | 0.130 | 8.45× 10−13 | 1.25× 10−7 | Promoter | |
| chr11:42897003 | CD4+ | 0.456 | 1.42× 10−12 | 1.40× 10−7 | Intergenic | |
| chr7:22861070 | CD4+ | −0.160 | 2.61× 10−12 | 1.93× 10−7 | Intron | |
| chr1:44878291 | CD4+ | −0.104 | 4.46× 10−12 | 2.64× 10−7 | Exon | |
| chr11:118842572 | CD4+ | 0.101 | 1.25× 10−10 | 3.71× 10−6 | Promoter | |
| chr14:103227394 | CD4+ | −0.150 | 5.86× 10−10 | 1.24× 10−5 | Intergenic | |
| chr6:121069653 | CD4+ | −0.124 | 6.88× 10−10 | 1.34× 10−5 | Intergenic | |
| chr9:137674085 | CD4+ | −0.113 | 7.25× 10−10 | 1.34× 10−5 | Intron | |
| chr19:18118304 | CD4+ | −0.174 | 1.11× 10−9 | 1.94× 10−5 | Promoter | |
| chr22:39633525 | CD4+ | 0.189 | 1.77× 10−9 | 2.61× 10−5 | Intron | |
| chr16:32289963 | CD8+ | −0.130 | 5.21× 10−11 | 1.27× 10−5 | Intergenic | |
| chr2:27038339 | CD8+ | −0.186 | 2.78× 10−10 | 4.31× 10−5 | Intergenic | |
| chr9:79557441 | CD8+ | 0.154 | 5.37× 10−10 | 7.04× 10−5 | Intergenic | |
| chr14:45343058 | CD8+ | −0.207 | 8.55× 10−10 | 9.11× 10−5 | Intergenic | |
| chr18:54333583 | CD4−CD8− | 0.131 | 1.26× 10−17 | 7.62× 10−12 | Intron | |
| chr14:57523325 | CD4−CD8− | −0.299 | 5.17× 10−13 | 1.25× 10−7 | Intergenic | |
| chr3:194706028 | CD4−CD8− | −0.132 | 1.24× 10−12 | 2.51× 10−7 | Intergenic | |
| chr5:173991159 | CD4−CD8− | 0.143 | 7.59× 10−12 | 1.32× 10−6 | Intergenic | |
| chr19:38346420 | CD4−CD8− | 0.115 | 1.14× 10−11 | 1.72× 10−6 | Promoter | |
| chr3:194705954 | CD4−CD8− | −0.138 | 3.75× 10−11 | 5.06× 10−6 | Intergenic | |
| chr6:166419166 | CD4−CD8− | 0.104 | 5.30× 10−11 | 6.44× 10−6 | Intergenic | |
| chr11:134709660 | CD4−CD8− | 0.215 | 2.73× 10−10 | 2.55× 10−5 | Intergenic |
Fig. 3DMRs. (a) Venn diagrams of the numbers of DMRs detected in different cell fractions. A DMR is considered to be found in two cell fractions if at least one DMC within the DMR is found in both cell fractions. Positive methylation difference means the DMR was hypermethylated in cases, compared with controls; negative means hypomethylated in cases, compared with controls. The figures were created with BioRender. (b) Distribution of DMRs in genomic regions in CD4+ T cells, CD8+ T cells and CD4−CD8− cell fractions
Fig. 4Annotations of the type 1 diabetes-associated DMRs. The genomic loci harbouring the DMRs at CBFA2T3, IL32 and TOX were visualised with the UCSC Genome Browser. The DMRs are shown in red colour in the DMR track below the GeneHancer track in all of the panels. Six other reference tracks are displayed. (a) A DMR hypermethylated in cases vs controls (chr16:89050538–89,050,592) mapped in the intergenic region upstream of CBFA2T3 in CD4+ T cells. It overlaps with an enhancer region. A long-distance chromatin interaction between this active regulatory element and CBFA2T3 promoter is shown by a dashed line. (b) A DMR hypomethylated in cases vs controls at IL32 gene locus (chr16:3116115–3,116,616) in CD8+ T cells is located at the promoter of the gene. (c) A DMR hypermethylated in cases vs controls in CD4+ T cells (chr8:60032900–60,032,942) is located approximately 1 kb upstream of TOX and overlaps with its promoter. GH Reg Elems (DE), GeneHancer regulatory elements and gene interactions (Double Elite); Txn Factor ChIP, Transcription factor chromatin immunoprecipitation sequencing clusters
DMCs across all longitudinal time points showing the highest Spearman rank correlation >|0.5| with gene expression levels
| DMC (hg19) | Fraction | Methylation difference | FDR | Nearest gene | Spearman correlation | Correlating gene | |
|---|---|---|---|---|---|---|---|
| chr4:961599 | CD4+ | 0.127 | 7.29 × 10−5 | 4.18 × 10−2 | 0.619 | ||
| chr4:962384 | CD4+ | 0.136 | 4.66 × 10−5 | 3.25 × 10−2 | 0.508 | ||
| chr21:47307692 | CD4+ | −0.171 | 1.33 × 10−5 | 1.59 × 10−2 | −0.591 | ||
| chr21:47307758 | CD4+ | −0.210 | 2.25 × 10−4 | 7.82 × 10−2 | −0.537 | ||
| chr21:47308375 | CD4+ | −0.244 | 3.46 × 10−4 | 9.76 × 10−2 | −0.537 | ||
| chr21:47308422 | CD4+ | −0.208 | 3.80 × 10−6 | 7.98 × 10−3 | −0.517 | ||
| chr21:46714810 | CD4+ | −0.109 | 2.66 × 10−6 | 6.57 × 10−3 | −0.561 | ||
| chr7:76129434 | CD4+ | −0.165 | 1.29 × 10−4 | 5.64 × 10−2 | −0.539 | ||
| chr16:433855 | CD8+ | −0.174 | 1.14 × 10−9 | 1.10 × 10−4 | −0.538 | ||
| chr10:135092506 | CD8+ | −0.118 | 5.12 × 10−6 | 3.96 × 10−2 | −0.520 | ||
| chr4:961576 | CD4−CD8− | 0.164 | 2.01 × 10−6 | 2.65 × 10−2 | 0.654 | ||
| chr4:961573 | CD4−CD8− | 0.119 | 2.37 × 10−5 | 9.88 × 10−2 | 0.528 |
Fig. 5Methylation and gene expression correlation analysis. The most highly correlated expression–methylation pairs in CD4+ and CD8+ T cells. Spearman ρ (Spearman’s rank correlation coefficient) is between methylation proportion (x-axis) and reads per kb per million (RPKM)-normalised gene expression level (y-axis). (a, b) Expression–methylation pairs in the CD4+ T cells. The methylation of CpG site chr4:961599 at an intron of DGKQ gene positively correlated (Spearman ρ = 0.619) with gene expression. The methylation of CpG site chr21: 47307692 at an intron of PCBP3 gene showed negative correlation (Spearman ρ = −0.591) with gene expression. (c, d) Expression–methylation pairs in the CD8+ T cells. The methylation of CpG site chr16:433855 near TMEM8A gene inversely correlated (Spearman ρ = −0.538) with gene expression. The methylation of CpG site chr10:135092506 at an exon of ADAM8 gene showed negative correlation (Spearman ρ = −0.520) with gene expression. For a complete list of CpG sites and their correlation with gene expression, please see ESM Tables 3–5, 9–11
Fig. 6DNA methylation analysis prior to seroconversion. (a) Venn diagrams of the numbers of DMRs detected in different cell fractions for all longitudinal samples (indicated as All) and prior to seroconversion (indicated as Pre-sc). A DMR is considered to be found in two analyses if at least one DMC within the DMR is found in both cell fractions. Seroconversion was defined based on the measurements of autoantibodies IAA, GADA, IA-2A and ZnT8A in the participant’s serum (details in the Methods section). The figure was created with BioRender. (b) UCSC Genome Browser visualisation of a DMR hypermethylated in cases vs control participants at IRF5 gene identified in CD4+ T cells prior to seroconversion. The genomic locus (chr7:128568936–128,609,758) includes the DMR (chr7:128580072–128,580,230). (c) The DMR (chr10:74082048–74,082,090) identified in this study and the DMC (chr10:74058002, cg01674036) found in the Paul et al study [12] located in the intergenic region between DDIT4 and DNAJB12 in CD4+ T cells prior to seroconversion; the DMC is shown below the DMR track, and the DMC and DMR are shown as vertical lines in respective tracks. Six other reference tracks are displayed. Long-distance chromatin interactions between the gene promoter and other active regulatory elements (indicated in grey colour in GeneHancer regulatory element track) are shown by curves connecting promoters and other regulatory elements. GH Reg Elems (DE), GeneHancer regulatory elements and gene interactions (Double Elite); Txn Factor ChIP, Transcription factor chromatin immunoprecipitation sequencing clusters
Top-ranked DMCs at FDR <0.0001 identified between cases and controls prior to seroconversion
| DMC (hg19) | Fraction | Methylation difference | FDR, <0.0001 | Nearest gene | Genomic region | |
|---|---|---|---|---|---|---|
| chr19:18118304 | CD4+ | −0.185 | 4.22 × 10−22 | 1.64 × 10−16 | Promoter | |
| chr7:3169658 | CD4+ | 0.269 | 4.76 × 10−13 | 4.62 × 10−8 | Intergenic | |
| chr10:99209697 | CD4+ | 0.541 | 1.95 × 10−12 | 1.52 × 10−7 | Intron | |
| chr14:65542789 | CD4+ | −0.505 | 7.00 × 10−12 | 3.88 × 10−7 | Intron | |
| chr8:530556 | CD4+ | −0.171 | 1.00 × 10−11 | 4.87 × 10−7 | Intergenic | |
| chr21:47285711 | CD8+ | −0.124 | 9.16 × 10−22 | 4.78 × 10−16 | Intron | |
| chr17:71322457 | CD8+ | −0.113 | 1.92 × 10−11 | 2.28 × 10−6 | Intergenic | |
| chr19:12035022 | CD8+ | −0.150 | 3.10 × 10−10 | 2.31 × 10−5 | Promoter | |
| chr7:3169674 | CD8+ | −0.174 | 3.00 × 10−10 | 2.31× 10−5 | Intergenic | |
| chr14:45343058 | CD8+ | 0.189 | 6.22 × 10−10 | 4.33 × 10−5 | Intergenic | |
| chr11:134709660 | CD4−CD8− | 0.228 | 5.41 × 10−17 | 4.72 × 10−11 | Intergenic | |
| chr2:113192534 | CD4−CD8− | 0.132 | 6.45 × 10−13 | 1.87 × 10−7 | Promoter | |
| chr8:144788521 | CD4−CD8− | 0.18 | 1.09 × 10−12 | 2.37 × 10−7 | Intron | |
| chr3:194706028 | CD4−CD8− | −0.135 | 9.52 × 10−12 | 1.38 × 10−6 | Intergenic | |
| chr6:163570666 | CD4−CD8− | −0.219 | 5.53 × 10−10 | 4.25 × 10−5 | Intron |
DMCs in the analysis of only pre-seroconversion samples showing the highest Spearman rank correlation >|0.5| with gene expression levels
| DMC (hg19) | Fraction | Methylation difference | FDR | Nearest gene | Spearman correlation | Correlating gene | |
|---|---|---|---|---|---|---|---|
| chr2:1817466 | CD4+ | 0.269 | 1.88 × 10−4 | 8.42 × 10−2 | −0.695 | ||
| chr2:1817753 | CD4+ | 0.174 | 9.85 × 10−5 | 6.05 × 10−2 | −0.629 | ||
| chr16:452426 | CD4+ | −0.124 | 7.79 × 10−7 | 3.36 × 10−3 | −0.623 | ||
| chr8:144659988 | CD4+ | 0.104 | 1.43 × 10−4 | 7.35 × 10−2 | −0.600 | ||
| chr11:67351952 | CD4+ | −0.104 | 1.92 × 10−5 | 2.53 × 10−2 | −0.567 | ||
| chr5:179193600 | CD4+ | −0.197 | 4.95 × 10−5 | 4.17 × 10−2 | −0.548 | ||
| chr19:37807762 | CD4+ | 0.138 | 1.49 × 10−5 | 2.17 × 10−2 | −0.540 | ||
| chr7:76129273 | CD4+ | −0.174 | 3.28 × 10−5 | 3.34 × 10−2 | −0.536 | ||
| chr2:131695535 | CD4+ | −0.214 | 6.79 × 10−5 | 5.00 × 10−2 | 0.600 | ||
| chr4:961405 | CD4+ | 0.325 | 1.29 × 10−6 | 4.84 × 10−3 | 0.504 | ||
| chr8:144660722 | CD8+ | 0.142 | 2.53 × 10−6 | 1.87 × 10−2 | −0.543 | ||
| chr21:47307825 | CD8+ | −0.142 | 5.84× 10−5 | 9.98× 10−2 | −0.534 | ||
| chr21:47285711 | CD8+ | −0.427 | 9.16 × 10−22 | 4.78 × 10−16 | −0.512 | ||
| chr21:47307753 | CD8+ | −0.129 | 6.48 × 10−9 | 3.38 × 10−4 | −0.501 | ||
| chr17:71322457 | CD8+ | 0.548 | 1.92 × 10−11 | 2.28 × 10−6 | 0.516 | ||
| chr11:65359968 | CD4−CD8− | −0.320 | 6.92 × 10−9 | 3.14 × 10−4 | −0.545 | ||
| chr16:1480869 | CD4−CD8− | 0.135 | 7.38 × 10−6 | 3.04 × 10−2 | 0.611 | ||
| chr1:3811367 | CD4−CD8− | 0.238 | 9.27 × 10−10 | 5.77 × 10−5 | 0.558 | ||
| chr19:50962487 | CD4−CD8− | 0.121 | 5.31 × 10−5 | 9.00 × 10−2 | 0.527 | ||
| chr19:1047251 | CD4−CD8− | 0.171 | 4.93 × 10−5 | 8.68 × 10−2 | 0.501 |
Validation of the RRBS results by bisulphite pyrosequencing
| Selected targets for validation | Validation results | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| DMC (hg19) | Nearest gene | Methylation difference | FDR | Fraction | Concordant direction of difference | Correlation | FDR | ||
| chr19:18118304 | −0.174 | 1.11 × 10−9 | 1.94 × 10−5 | CD4+ | Yes | 0.85 | 0.03 | 0.15 | |
| chr21:47307815 | −0.207 | 7.69 × 10−8 | 6.48 × 10−4 | CD4+ | Yes | 0.86 | 0.03 | 0.15 | |
| chr14:103227394 | −0.15 | 5.86 × 10−10 | 1.24 × 10−5 | CD4+ | Yes | 0.82 | 0.05 | 0.15 | |
| chr4:81128398 | −0.143 | 4.71 × 10−6 | 8.92 × 10−3 | CD4+ | Yes | 0.71 | 0.11 | 0.23 | |
| chr1:228659024 | 0.147 | 3.33 × 10−8 | 3.40 × 10−4 | CD4+ | Yes | 0.74 | 0.09 | 0.23 | |
| chr16:3116115 | −0.141 | 1.68 × 10−7 | 3.82 × 10−3 | CD8+ | Yes | 0.14 | 0.79 | 0.79 | |
| chr22:39633525 | 0.189 | 1.77 × 10−9 | 2.61 × 10−5 | CD4+ | Yes | 0.62 | 0.19 | 0.32 | |
| chr8:60032942 | 0.113 | 3.20 × 10−5 | 2.63 × 10−2 | CD4+ | Yes | 0.25 | 0.63 | 0.70 | |
| chr18:77398359 | 0.144 | 2.46 × 10−7 | 1.26 × 10−3 | CD4+ | No | 0.46 | 0.36 | 0.49 | |
| chr2:113192477 | 0.13 | 8.45 × 10−13 | 1.25 × 10−7 | CD4+ | No | 0.43 | 0.39 | 0.49 | |
The first six columns contain information about selected targets. In column 7 we indicate whether the methylation difference between the cases and controls was concordant (to the same direction) in the pyrosequencing results compared with RRBS results. Column 8 contains the Pearson correlation coefficient between pyrosequencing and RRBS results (methylation proportions of the six samples that were pyrosequenced). Columns 9 and 10 contain raw and adjusted p values (FDR), respectively