| Literature DB >> 30255778 |
Haakon E Nustad1,2,3, Marcio Almeida4, Angelo J Canty5, Marissa LeBlanc6, Christian M Page6,7, Phillip E Melton8.
Abstract
BACKGROUND: Longitudinal data and repeated measurements in epigenome-wide association studies (EWAS) provide a rich resource for understanding epigenetics. We summarize 7 analytical approaches to the GAW20 data sets that addressed challenges and potential applications of phenotypic and epigenetic data. All contributions used the GAW20 real data set and employed either linear mixed effect (LME) models or marginal models through generalized estimating equations (GEE). These contributions were subdivided into 3 categories: (a) quality control (QC) methods for DNA methylation data; (b) heritability estimates pretreatment and posttreatment with fenofibrate; and (c) impact of drug response pretreatment and posttreatment with fenofibrate on DNA methylation and blood lipids.Entities:
Keywords: Bayesian; DNA methylation; Epigenetics; Heritability; Linear mixed effect models; Repeated measurements; Variance components
Mesh:
Substances:
Year: 2018 PMID: 30255778 PMCID: PMC6156830 DOI: 10.1186/s12863-018-0648-1
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
GAW20 quality control and statistical models, data sets, and software used by this group
| Contribution | Phenotype | Normalization | h2 | Covariates | CpG probes | Model(s) | Software |
|---|---|---|---|---|---|---|---|
| Almeida et al. [ | HDL | Inverse | Pre and post fenofibrate HDL and CpG sites | 20 PCs | Epigenome-wide | VC-LME | SOLAR |
| Canty and Paterson [ | TG, QC | Probe type strata | – | 4 PCs | Epigenome-wide | Standard linear model (t-test) | |
| Fernandez-Rhodes et al. [ | Metabolic syndrome | Type II probes | Metabolic syndrome, 4 CpG sites | age, sex, SNPs, center, smoking, PCs | 4 CpG sites | VC-LME | SOLAR |
| LeBlanc et al. [ | QC only | BMIQ | Used breeding values from a heritability model | Age, sex, SNPs | Epigenome-wide | Bayesian LME | R-INLA |
| Lim et al. [ | TG | BMIQ | Age, sex, study, center, smoking, 10 PCs | 14,850 CpG sites showing | LME | WGCNA, missmethyl | |
| Nustad et al. [ | TG, HDL | BMIQ | P-to-e and post fenofibrate, TG HDL and CpG sites | Age, sex | Epigenome-wide | Bayesian LME | R-package INLA |
| Yu et al. [ | TG | – | Age, sex, study center, smoking status, HDL | 349,755 CpG sites | GEE | R 3.2 |
All contributions in this group used the GAW20 real data set
BMIQ beta-mixture quantile normalization method for correcting probe design bias, GEE generalized estimating equation, HDL high-density lipoprotein, h denotes heritability and indicates if the paper has estimated this quantity, INLA integrated nested Laplace approximation, PC DNA methylation-derived principal component and indicates this study employed PCs as covariates in their analysis, QC quality control, SOLAR sequential oligogenic linkage analysis routines, SNP single-nucleotide polymorphism, TG triglyceride, VC-LME variance component linear mixed effect, WGCNA weighted gene coexpression network analysis
Fig. 1The figure shows a comparison of pretreatment DNA methylation heritability estimates from Nustad et al. [24] and Almeida et al. [22]. Each dot represents a 0.01 × 0.01 square with the color indicating the number of estimates that fall within the square. The red line is the 1-to-1 line, while the dark blue contour lines present the estimated 2-dimensional density. The displayed heritability estimates are those that passed the model selection step in Nustad et al. [24]
Fig. 2The figure shows a comparison of posttreatment DNA methylation heritability estimates from Nustad et al. [24] and Almeida et al. [22]. Each dot represents a 0.01 × 0.01 square with the color indicating the number of estimates that fall within the square. The red line is the 1-to-1 line, while the dark blue contour lines present the estimated 2-dimensional density. The displayed heritability estimates are those that passed the model selection step in Nustad et al. [24]