| Literature DB >> 26925097 |
Lindsay L Waite1, Benjamin Weaver2, Kenneth Day2, Xinrui Li3, Kevin Roberts2, Andrew W Gibson3, Jeffrey C Edberg3, Robert P Kimberly3, Devin M Absher2, Hemant K Tiwari4.
Abstract
DNA methylation levels vary markedly by cell-type makeup of a sample. Understanding these differences and estimating the cell-type makeup of a sample is an important aspect of studying DNA methylation. DNA from leukocytes in whole blood is simple to obtain and pervasive in research. However, leukocytes contain many distinct cell types and subtypes. We propose a two-stage model that estimates the proportions of six main cell types in whole blood (CD4+ T cells, CD8+ T cells, monocytes, B cells, granulocytes, and natural killer cells) as well as subtypes of T and B cells. Unlike previous methods that only estimate overall proportions of CD4+ T cell, CD8+ T cells, and B cells, our model is able to estimate proportions of naïve, memory, and regulatory CD4+ T cells as well as naïve and memory CD8+ T cells and naïve and memory B cells. Using real and simulated data, we are able to demonstrate that our model is able to reliably estimate proportions of these cell types and subtypes. In studies with DNA methylation data from Illumina's HumanMethylation450k arrays, our estimates will be useful both for testing for associations of cell type and subtype composition with phenotypes of interest as well as for adjustment purposes to prevent confounding in epigenetic association studies. Additionally, our method can be easily adapted for use with whole genome bisulfite sequencing (WGBS) data or any other genome-wide methylation data platform.Entities:
Keywords: B cell subtypes; DNA methylation; T cell subtypes; cell-type composition; deconvolution; epigenetics; whole blood
Year: 2016 PMID: 26925097 PMCID: PMC4757643 DOI: 10.3389/fgene.2016.00023
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Comparison of model estimates for main cell types from our two-stage model and the Jaffe and Irizarry model. Plots of measured vs. model-predicted cell type proportions for each cell type for the Absher whole blood data and associated correlation and MSE for each cell type for each of three data sets (Absher whole blood, Reinius whole blood, and Reinius PBMC) using our two-stage model (A,C) and the Jaffe and Irizarry model (B,D).
Figure 2Model performance for CD4+ T cell subtypes. True vs. model-estimated proportions for CD4+ T cell subtypes for simulated whole blood data (A) simulated sorted CD4+ T cell data (B). Correlations and mean square errors (MSE) of true vs. model-estimated CD4+ T cell subtype proportions (C).
Figure 4Model performance for B cell subtypes. True vs. model-estimated proportions for CD19+ B cell subtypes for simulated whole blood data (A) and simulated sorted B cell data (B). Correlations and mean square errors (MSE) of true vs. model-estimated CD19+ B cell subtype proportions (C).