| Literature DB >> 35003108 |
Shivanthan Shanthikumar1,2,3, Melanie R Neeland2,4, Richard Saffery2,4, Sarath C Ranganathan1,2,3, Alicia Oshlack5,6,7, Jovana Maksimovic2,3,5.
Abstract
In epigenome-wide association studies analysing DNA methylation from samples containing multiple cell types, it is essential to adjust the analysis for cell type composition. One well established strategy for achieving this is reference-based cell type deconvolution, which relies on knowledge of the DNA methylation profiles of purified constituent cell types. These are then used to estimate the cell type proportions of each sample, which can then be incorporated to adjust the association analysis. Bronchoalveolar lavage is commonly used to sample the lung in clinical practice and contains a mixture of different cell types that can vary in proportion across samples, affecting the overall methylation profile. A current barrier to the use of bronchoalveolar lavage in DNA methylation-based research is the lack of reference DNA methylation profiles for each of the constituent cell types, thus making reference-based cell composition estimation difficult. Herein, we use bronchoalveolar lavage samples collected from children with cystic fibrosis to define DNA methylation profiles for the four most common and clinically relevant cell types: alveolar macrophages, granulocytes, lymphocytes and alveolar epithelial cells. We then demonstrate the use of these methylation profiles in conjunction with an established reference-based methylation deconvolution method to estimate the cell type composition of two different tissue types; a publicly available dataset derived from artificial blood-based cell mixtures and further bronchoalveolar lavage samples. The reference DNA methylation profiles developed in this work can be used for future reference-based cell type composition estimation of bronchoalveolar lavage. This will facilitate the use of this tissue in studies examining the role of DNA methylation in lung health and disease.Entities:
Keywords: DNA methylation; bronchoalveolar lavage; cystic fibrosis; epigenetics; paediatrics; pulmonary disease
Mesh:
Year: 2021 PMID: 35003108 PMCID: PMC8727592 DOI: 10.3389/fimmu.2021.788705
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Study outline. (A) BAL samples were collected, cryopreserved and sorted using fluorescence-activated single cell sorting, resulting in 4 purified cell populations (Alveolar Epithelial Cells, Alveolar Macrophages, Granulocytes, and Lymphocytes). Their DNA was extracted and pooled and DNAm profiled on EPIC arrays. Unique DNAm profiles for constituent cell types were identified. These DNAm profiles were then used to estimate cell type compositions in subsequent validation experiments. (B) Data from a publicly available blood-derived artificial cell mixture with known cell composition, and matching DNAm data, was used to validate our BAL-derived reference panel. The BAL-derived DNAm profiles we developed were used in conjunction with a reference-based deconvolution method to estimate cell composition from the DNAm data of the artificial cell mixtures. This was then compared to the known cell composition. (C) BAL samples were collected and divided into a raw fraction and a cryopreserved fraction. DNA was extracted from the raw fraction, DNAm profiled, and our BAL-derived DNAm profiles used in conjunction with a reference-based deconvolution method to estimate cell composition. The cryopreserved fraction was profiled using flow cytometry, and cell composition estimated. The methylation-based estimates were compared to the flow-cytometry cell composition measurements.
Figure 2MDS plots showing the first 3 principal components of methylation data from pooled BAL macrophage (n=5), granulocyte (n=4), lymphocyte (n=3) and AEC (n=2) samples. Each pool contained purified cells from multiple individuals (see Methods). Clear separation of the different cell populations is seen in the first 3 principal components, which account for 60.1% of the total variation, and there is no evidence of a significant batch affect related to samples run at different service providers (Erasmus MC or GenomeScan).
Figure 3Heatmaps showing unique DNA methylation profiles for constituent cell types. The probes shown were algorithmically selected based on their ability to discriminate between the different cell types. The heatmaps demonstrate the different probe set selected when the probeSelect parameter is set to (A) “any” or (B) “both”. “Both” selects the top 50 hypermethylated and 50 hypomethylated probes (F-stat p-value < 1E-8) with the greatest methylation difference between each cell type compared to all the others. “Any” selects the top 100 probes (F-stat p-value < 1E-8) with the greatest methylation difference between each cell type compared to all the others, regardless of direction of effect.
Figure 4Comparison of known proportions of artificial cell mixtures (with T Cells, B Cells, and Natural Killer cells combined into a Lymphocyte population), to methylation-based estimated cell proportions derived using BAL purified cell population data. Cell type discriminating probes were selected using “any” and “both” approaches (see methods for details). (A) Comparison of known proportions of artificial cell mixtures (with T Cells, B Cells, and Natural Killer cells combined into a Lymphocyte population), to methylation-based estimated cell proportions derived using BAL purified cell population data. Cell type discriminating probes were selected using “any” and “both” approaches (see methods for details). (B, C) Bland Altman Plots Comparing Known Artificial Cell Mixture Proportions (“Truth”) and methylation-based cell proportion estimates (“Estimate”). The data point numbers represent which cell mixture the data pertains to. Cell type discriminating probes were selected using either (B) “any” and (C) “both” approaches (see Methods for details). The mean squared error (MSE) between the known proportion and estimated proportion was calculated for each cell type and probe selection Method.
Cell composition of BAL determined by both DNA methylation-based estimate and flow cytometry.
| Alveolar Macrophage | Lymphocyte | Granulocyte | Alveolar Epithelial Cell | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scaled Flow Cytometry | DNAm Estimate (Any) | DNAm Estimate (Both) | Scaled Flow Cytometry | DNAm Estimate (Any) | DNAm Estimate (Both) | Scaled Flow Cytometry | DNAm Estimate (Any) | DNAm Estimate (Both) | Scaled Flow Cytometry | DNAm Estimate (Any) | DNAm Estimate (Both) | |
| CF1 | 5.2 | 0.7 | 0.5 | 5.5 | 1.6 | 1.8 | 86.1 | 88.8 | 89.8 | 3.2 | 9.2 | 8.7 |
| CF2 | 55.7 | 29.9 | 33.7 | 8.9 | 19.7 | 16.3 | 32.5 | 43.1 | 48.0 | 2.9 | 11.2 | 10.5 |
| CF3 | 64.6 | 37.3 | 39.3 | 8 | 22.3 | 20.7 | 23.9 | 22.1 | 25.7 | 3.4 | 21.0 | 20.0 |
| CF4 | 72.5 | 66.6 | 68.6 | 7.2 | 11.6 | 10.0 | 19.7 | 13.7 | 15.4 | 0.6 | 9.9 | 9.7 |
| CF5 | 63.2 | 50.7 | 52.8 | 7.2 | 11.0 | 9.5 | 29.4 | 34.5 | 35.7 | 0.2 | 5.6 | 5.4 |
| Control | 51.1 | 13.7 | 13.1 | 20.5 | 17.8 | 16.7 | 21.5 | 58.7 | 61.7 | 6.9 | 12.9 | 12.8 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| CF1 | 2.4 | 0.7 | 0.5 | 2.5 | 1.6 | 1.8 | 39.7 | 88.8 | 89.8 | 1.5 | 9.2 | 8.7 |
| CF2 | 46.1 | 29.9 | 33.7 | 7.4 | 19.7 | 16.3 | 26.9 | 43.1 | 48.0 | 2.4 | 11.2 | 10.5 |
| CF3 | 42.5 | 37.3 | 39.3 | 5.3 | 22.3 | 20.7 | 15.8 | 22.1 | 25.7 | 2.3 | 21.0 | 20.0 |
| CF4 | 50.2 | 66.6 | 68.6 | 5 | 11.6 | 10.0 | 13.6 | 13.7 | 15.4 | 0.4 | 9.9 | 9.7 |
| CF5 | 31.3 | 50.7 | 52.8 | 3.6 | 11.0 | 9.5 | 14.5 | 34.5 | 35.7 | 0.1 | 5.6 | 5.4 |
| Control | 34.8 | 13.7 | 13.1 | 13.9 | 17.8 | 16.7 | 14.6 | 58.7 | 61.7 | 4.7 | 12.9 | 12.8 |
For DNA methylation-based estimates cell type discriminating probes were selected using “Any” and “Both” approaches (see Methods for details). Regarding flow cytometry the “scaled” proportions were calculated not including the CD45-, EpCAM – cells which are likely Red Blood Cells that do not contribute to DNA methylation data.
Figure 5(A) Comparison of cell proportions measured via flow cytometry to methylation-based estimated cell proportions derived using BAL purified cell population data. Cell type discriminating probes were selected using “any” and “both” approaches (see methods for details). “Original” refers to the proportion of cells from flow cytometry in the total live cell population. “Scaled” refers to the proportion of cells when limited to just the four cell types of interest. (B, C) Bland Altman Plots Comparing cell proportions measured by flow cytometry (“Flow”) and methylation-based cell proportion estimates (“Meth. Est.”). The shapes indicate use of “original” (circle) or “scaled” (cross) flow cytometry proportions. The colour indicates the cell type (see legend). Probes were selected using either (B) “any” or (C) “both” approaches (see Methods for details). The mean squared error (MSE) between the known proportion and estimated proportion was calculated for each cell type and probe selection Method.