| Literature DB >> 35765087 |
Emilie Willoch Olstad1,2, Hedvig Marie Egeland Nordeng3,4,5, Geir Kjetil Sandve4,6, Robert Lyle4,7,8, Kristina Gervin3,4,9.
Abstract
BACKGROUND: There is an increasing interest in the role of epigenetics in epidemiology, but the emerging research field faces several critical biological and technical challenges. In particular, recent studies have shown poor correlation of measured DNA methylation (DNAm) levels within and across Illumina Infinium platforms in various tissues. In this study, we have investigated concordance between 450 k and EPIC Infinium platforms in cord blood. We could not replicate our previous findings on the association of prenatal paracetamol exposure with cord blood DNAm, which prompted an investigation of cross-platform DNAm differences.Entities:
Keywords: EWAS; Epigenetic epidemiology; Epigenetics; Illumina Infinium platforms; MBRN; Microarrays; MoBa; Reliability; Replication; Validity
Mesh:
Substances:
Year: 2022 PMID: 35765087 PMCID: PMC9238140 DOI: 10.1186/s13148-022-01299-3
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 7.259
Fig. 1Overview of the ICC distribution computed from raw data and from data pre-processed using the default settings of five common EWAS analysis pipelines. Additionally, we included one common analysis pipeline (“RnBeads (customised)”, including the normalisation methods ENmix.oob and BMIQ). All pipelines examined also exhibited ICCs lower than –2, but these were removed from the illustration for visualisation purposes. The default settings of each analysis pipeline are detailed in Additional file 1: Table S1
Fig. 2(A–C) Scatter plots of the first three principal components (PC1–3) from PCA of DNAm data from samples with repeated measurements (n = 17 samples) using the 450 k and EPIC platforms, and (D) a scree plot showing the amount of variance explained by the first nine PCs
Fig. 3Mean absolute difference in measured DNA methylation (Δβ) per CpG, on the 450 k and EPIC platforms. Red dotted lines indicate a mean Δβ > 0.1, > 0.25 and > 0.5. Illumina CpG IDs are named if the mean Δβ > 0.5
Fig. 4Pearson’s correlation coefficients of DNAm in replicates of the 450 k and EPIC platforms, for (A) per-sample correlations in a correlogram, and (B) per-CpG correlations as distributions stratified by variance quartiles, based on the variance of the respective CpGs on the EPIC platform
Fig. 5(A) Histogram of the ICCs computed from the 17 samples assessed on both the 450 k and EPIC platforms. (B) Density distribution of mean difference in DNAm level, stratified by ICC category. (C) Density distribution of Pearson’s correlation coefficient, stratified by ICC category. The ICC categories are defined as follows: poor: ICC < 0.5; moderate: 0.5 ≤ ICC < 0.75; good: 0.75 ≤ ICC < 0.9; and excellent: ICC ≥ 0.9. The dark grey, dotted line indicates the median ICC, and the light grey, dotted line indicates the mean ICC. Outlying CpGs with ICCs less than the mean ICC minus three standard deviations were removed for visualisation purposes, but were included for summary statistic calculations
Overview of retained probes upon filtering of data from the EPIC and 450 k microarray platforms
| EPIC probes | 450 k probes | |
|---|---|---|
| Raw data | 866,091 | 485,512 |
| > 5% low-quality values | 857,144 | 479,914 |
| SNP-enriched probe removal | 827,968 | 463,111 |
| Cross-reactive probe removal | 813,047 | 441,548 |
| Sex chromosome removal | 795,515 | 431,536 |