| Literature DB >> 27876072 |
David Gomez-Cabrero1,2,3,4, Malin Almgren1,5,6, Louise K Sjöholm1,5, Aase H Hensvold1,7, Mikael V Ringh1,5, Rakel Tryggvadottir6, Juha Kere8, Annika Scheynius9, Nathalie Acevedo10, Lovisa Reinius8, Margaret A Taub6,11, Carolina Montano12, Martin J Aryee13,14,15,16, Jason I Feinberg6,17, Andrew P Feinberg6,11,18, Jesper Tegnér1,2, Lars Klareskog1,7, Anca I Catrina1,7, Tomas J Ekström19,20.
Abstract
BACKGROUND: Twin studies are powerful models to elucidate epigenetic modifications resulting from gene-environment interactions. Yet, commonly a limited number of clinical twin samples are available, leading to an underpowered situation afflicted with false positives and hampered by low sensitivity. We investigated genome-wide DNA methylation data from two small sets of monozygotic twins representing different phases during the progression of rheumatoid arthritis (RA) to find novel genes for further research.Entities:
Keywords: ACPA; Bioinformatics; DNA methylation; Epigenetics; Rheumatoid arthritis
Mesh:
Substances:
Year: 2016 PMID: 27876072 PMCID: PMC5120506 DOI: 10.1186/s13073-016-0374-0
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Summary information of the individuals selected for the experimental design
| Discordance type | Number of twin pairs | Females | Ever smokers | SE occurrence | Median age (years) at blood sampling (IQR) | Median CRP at blood sampling (IQR) |
|---|---|---|---|---|---|---|
| ACPA-positive healthy | 5 | 80% | 30% | 60% | 63 (62–74) | 1.7 (1.2–4.2) |
| ACPA-positive RA | 7 | 43% | 36% | 71% | 70 (68–72) | 3.7 (2.6–7.9) |
SE shared epitope, IQR interquartile range, CRP C-reactive protein
Fig. 1Differential methylation after correction for cell proportion. Log-transformed p value (y-axis) versus log-transformed fold change (logFC; x-axis) from the associated linear model. A negative logFC denotes hypomethylation and a positive logFC hypermethylation in TS1 and TS2 comparisons; the logFC was computed using M values in a linear model and using estimated cell proportions as covariates (see “Methods”). The horizontal and vertical lines are arbitrary thresholds selected to highlight possible tendencies. a Results from ACPA-positive healthy versus ACPA-negative healthy twin siblings (TS1). b Results from ACPA-positive RA versus ACPA-negative healthy twin siblings (TS2)
Fig. 2Schematic representation of the analysis. TS1 and TS2 denote the tests comparing ACPA-positive healthy twin versus ACPA-negative healthy twin and ACPA-positive RA twin versus ACPA-negative healthy twin, respectively. Steps 1 and 2 denote the first step of the analysis, DMR identification in TS1 and TS2 without cell proportion adjustment. In step 3 we computed DMRs between each pair of cell types (neutrophils, CD4+ T cells, CD8+ T cells, and CD56+ NK cells) and observed that DMRs identified without cell proportion adjustment were associated with cell type, so likely to be associated with changes in cell proportion. For this reason in step 4 we estimated cell proportion in each sample by adapting the method of Houseman et al. [31] (see the “Cell proportion estimation” section in the “Methods”). In step 5 and 6 we used cell proportion estimations as covariates in the identification of DMRs in TS1 and TS2. A DMR is considered statistically significant if it is significant both globally (FWER <0.10) and locally (permuted p-value <0.10); details are provided in the “Methods”
Estimation of cell proportion
For each sample the estimation of neutrophils, CD4+ T cells, CD8+ T cells, and CD56+ NK cells is provided, as described by Houseman et al. [31]. For each twin pair the difference in cell proportion (computed in ratios from 0 to 1, equivalent to percentages) for each cell type was computed in DIF; the purple cells indicate the p values from the paired t-test comparing differences. The sum of all ratios for a sample may differ from 1 due to rounding in the table; the analysis was performed without rounding
DMRs identified after cell type correction
| DMR name | Chromosome | Start | End | nprobes | FWER average | FWER maximum | FWER area | Permuted | Type |
|---|---|---|---|---|---|---|---|---|---|
| DMR1 | Chr5 | 140582954 | 140584018 | 31 |
|
|
| 0.06 | TS1 |
| DMR2 | Chr11 | 74630937 | 74631216 | 7 |
|
|
| 0. 03 | TS1 |
| DMR3 | ChrX | 51087402 | 51089195 | 32 |
|
|
| 0. 03 | TS1 |
| DMR4 | Chr14 | 54101669 | 54102157 | 6 |
|
| 0.03 | TS1 | |
| DMR5 | Chr2 | 85215637 | 85215811 | 6 |
| 0.06 | TS1 | ||
| DMR6 | Chr14 | 96038274 | 96038760 | 11 |
| 0.06 | TS1 | ||
| DMR7 | Chr9 | 4731364 | 4731640 | 6 |
| 0.03 | TS1 | ||
| DMR8 | Chr1 | 198645284 | 198645642 | 7 |
| 0.03 | TS1 | ||
| DMR9 | Chr1 | 116184060 | 116185215 | 33 |
| 0.06 | TS1 | ||
| DMR10 | Chr15 | 75897924 | 75898101 | 6 |
| 0.01 | TS1 | ||
| DMR11 | Chr2 | 171280327 | 171280711 | 10 |
| 0.03 | TS1 | ||
| DMR12 | Chr16 | 66835947 | 66836364 | 12 |
| 0.09 | TS1 | ||
| DMR13 | Chr13 | 113192663 | 113193262 | 8 |
| 0.06 | TS1 | ||
| DMR14 | Chr22 | 19603181 | 19604444 | 33 |
| 0.06 | TS1 | ||
| DMR15 | Chr17 | 41620864 | 41621503 | 16 |
| 0.06 | TS1 | ||
| DMR16 | Chr8 | 61353541 | 61354034 | 15 |
| 0.06 | TS1 | ||
| DMR17 | ChrX | 113720992 | 113721797 | 22 |
| 0.06 | TS1 | ||
| DMR18 | Chr10 | 99200020 | 99200806 | 23 |
|
|
| <0.01 | TS2 |
DMRs (differentially methylated probes) identified after cell type correction in TS1 and TS2; the double criteria for selection is first a FWER <0.20 in at least one of the three statistics (italics) and secondly a permuted p value <0.1. “Start” and “End” denote the start and end location of the DMR in the chromosome. “nprobes” denotes the number of probes in the DMR. “FWER average”, “FWER maximum”, and “FWER area” denote the FWER associated with each one of the statistics used (see “Methods”). “Permuted p value” denotes the p value computed for each DMR locally, comparing each DMR statistic to random permutations of the samples. “Type” denotes if the DMR is associated with TS1 (ACPA/healthy discordant) or TS2 (ACPA-positive RA/healthy discordant). Bold highlights DMRs whose FWER was <0.005. Chromosomal locations are based on the hg18 build
Characterization of DMRs identified after cell type correction
| DMR name | Gene (distance to TSS) | Projection | Location: gene | Location: CGI | EIRA |
|---|---|---|---|---|---|
| DMR1 | PCDHB14 (+224) |
| Promoter | Shore | No |
| DMR2 | SLCO2B1 (+91266), ARRB1 (+109444) | 0.28 | Intergenic | CGI/Shore | Yesa |
| DMR3 | NUDT10 (−3524) | 0.48 | Gene body | - | No |
| DMR4 | SAMD4A (−2474) | 0.67 | Gene body | CGI | Yes + |
| DMR5 | TCF7L1 (+1479), TGOLN2 (+193161) | 0.30 | Promoter | CGI | Yes a + |
| DMR6 | PAPOLA (+44) | 0.56 | Promoter | CGI | Yes a + |
| DMR7 | AK3 (−275) | 0.84 | Promoter | CGI | Yes + |
| DMR8 | ZNF281 (+326) | 0.89 | Promoter | CGI | Yes + |
| DMR9 | NHLH2 (+632) | 0.55 | Promoter | Shore | Yes |
| DMR10 | LINGO1 (−186249), TBC1D2B (+259036) | 0.45 | Intergenic | CGI/Shore | No |
| DMR11 | SP5 (+412) | 0.83 | Promoter | CGI | Yes + |
| DMR12 | PLA2G15 (−592) | 0.31 | Promoter | CGI/Shore | Yes + |
| DMR13 | TMCO3 (−346) | 0.73 | Promoter | CGI | Yes + |
| DMR14 | LZTR1 (−62745), CRKL (+2099) | 0.13 | Extended promoter | Shore | * |
| DMR15 | LRRC37A (−107090), KIAA1267 (−15809) | 0.94 | Gene body | - | * |
| DMR16 | CA8 (+2720) | 0.91 | Extended promoter | - | * |
| DMR17 | HTR2C (−3412) | 0.20 | Gene body | Shore | * |
| DMR18 | EXOSC1 (−4655), ZDHHC16 (+4493) | 0.69 | Gene body | Shore | No |
“Gene” provides information on the closest genes and the distance to the transcription start site (TSS). “Location: gene” and “Location: CGI” denote the locations of the center of the DMR in relation to a gene or a CpG island. “EIRA” is “yes” if the DMR overlaps with a DMP from the EIRA cohort [2] or if it is alocated within 1000 bp of one; a plus sign in the EIRA column denotes the same direction of change. “Projection” denotes the permuted p value of the DMRs of TS1 and TS2 when using methylation data from TS2 and TS1, respectively (see the “Projection analysis” section in the “Methods”); p values in bold are those <0.05. An asterisk denotes that no probes within 500 bp of the DMR are available in the Illumina 450 K array and no comparison is thus possible
Fig. 3DMR1, from TS1 after cell proportion correction, at the promoter region of PCDHB14 (chr5 140582954–140584018). DMR denotes the DMR location (grey box); CpGdens denotes CpG density as computed by CHARM [27] (CpG); CGI denotes the location of CpG islands (black box). TS1 Δbeta and TS2 Δbeta shows the smoothed linear slope (differences in methylation or delta) associated with ACPA-positive healthy and ACPA-positive RA twin, respectively, in the linear model which is used CHARM [27] to identify DMR candidates. Every point denotes a probe location. The location of the gene PCDHB14 is shown in a dotted box; the black square on the left denotes the location of the transcription start site