| Literature DB >> 26799745 |
Yuh Shiwa1,2, Tsuyoshi Hachiya1,2, Ryohei Furukawa2, Hideki Ohmomo2, Kanako Ono2, Hisaaki Kudo3, Jun Hata4,5, Atsushi Hozawa6, Motoki Iwasaki7, Koichi Matsuda8, Naoko Minegishi3, Mamoru Satoh1,2,9,10, Kozo Tanno11, Taiki Yamaji7, Kenji Wakai12, Jiro Hitomi13,14, Yutaka Kiyohara15, Michiaki Kubo16, Hideo Tanaka17, Shoichiro Tsugane7, Masayuki Yamamoto18,19, Kenji Sobue20,21, Atsushi Shimizu2.
Abstract
Differences in DNA collection protocols may be a potential confounder in epigenome-wide association studies (EWAS) using a large number of blood specimens from multiple biobanks and/or cohorts. Here we show that pre-analytical procedures involved in DNA collection can induce systematic bias in the DNA methylation profiles of blood cells that can be adjusted by cell-type composition variables. In Experiment 1, whole blood from 16 volunteers was collected to examine the effect of a 24 h storage period at 4°C on DNA methylation profiles as measured using the Infinium HumanMethylation450 BeadChip array. Our statistical analysis showed that the P-value distribution of more than 450,000 CpG sites was similar to the theoretical distribution (in quantile-quantile plot, λ = 1.03) when comparing two control replicates, which was remarkably deviated from the theoretical distribution (λ = 1.50) when comparing control and storage conditions. We then considered cell-type composition as a possible cause of the observed bias in DNA methylation profiles and found that the bias associated with the cold storage condition was largely decreased (λ adjusted = 1.14) by taking into account a cell-type composition variable. As such, we compared four respective sample collection protocols used in large-scale Japanese biobanks or cohorts as well as two control replicates. Systematic biases in DNA methylation profiles were observed between control and three of four protocols without adjustment of cell-type composition (λ = 1.12-1.45) and no remarkable biases were seen after adjusting for cell-type composition in all four protocols (λ adjusted = 1.00-1.17). These results revealed important implications for comparing DNA methylation profiles between blood specimens from different sources and may lead to discovery of disease-associated DNA methylation markers and the development of DNA methylation profile-based predictive risk models.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26799745 PMCID: PMC4723336 DOI: 10.1371/journal.pone.0147519
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow of the study design in two experiments.
DNA collection protocols used in Experiment 2.
| Protocol | Blood collection | Pre-process | DNA extraction | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Anticoagulant | Vol. collected (mL) | Storage/transport | Centrifugation | Separation of buffy coat | Storage of buffy coat | Blood cell fraction used | DNA extraction kit (Supplier) | Vol. used | Elution volume | |
| Control condition (Ctrl1, Ctrl2) | EDTA-2Na | 7 | - | - | - | - | Whole blood | Maxwell16 Blood DNA Purification Kit (Promega) | 400 μL | 300 μL |
| Tohoku Medical Megabank (TMM) | EDTA-2Na | 7 | 4°C (16 h) | 2300 × | BC 490 μL (Automatic) | −80°C | Buffy coat | Autopure LS (Qiagen) | 490 μL | 350 μL |
| BBJ | EDTA-2Na | 7 | 1–15°C (1 day), 4°C (4 days) | - | - | - | Whole blood | Gentra Puregene Blood Kit (Qiagen) | 7 mL | Adjust to 100 ng/μL |
| Hisayama | EDTA-2Na | 7 | −80°C (7 days) | - | - | - | Whole blood | QIAamp DNA Blood Maxi Kit (Qiagen) | 7 mL | 1.5 mL |
| JPHC | Heparin sodium | 10 | 4°C (12 h) | 10 min | BC 1–1.5 mL (Manual) | −80°C | Buffy coat | FlexiGene DNA Kit (Qiagen) | 300 μL | 200–500 μL |
*DNA extraction outsourced to an external company.
Fig 2Correction of systematic biases in DNA methylation profile caused by cold storage using cell-type composition.
A. Quantile-quantile (QQ) plot for the comparison of paired β-values from the 16 individuals between duplicates (Ctrl1 vs. Ctrl2). The genomic inflation factor lambda (median P-value of obs/exp) is shown. B. QQ plot for the comparison of 16 individuals between Ctrl1 and 4°C-24 h conditions. C. QQ plot for the comparison of 16 individuals between Ctrl1 and 4°C-24 h conditions after adjustment for the change in the estimated proportion of granulocytes. D. Differences of cell proportion between conditions (Ctrl2: Ctrl1 vs. Ctrl2; 4°C-24 h: Ctrl1 vs. 4°C-24 h) within the same individual are estimated by the cell-type composition from DNA methylation profiles. CD8T, CD8+ T cells; CD4T, CD4+ T cells; NK, natural killer cells; Bcell, B cells; Mono, monocytes; Gran, granulocytes. *, P < 0.05; **, P < 0.01; ***, P < 0.001 (Wilcoxon sighed rank test compared with Ctrl1) E. Gating strategy used to analyze populations of lymphocytes, monocytes, and granulocytes. F. Differences of cell proportion between conditions (Ctrl2: Ctrl1 vs. Ctrl2; 4°C-24 h: Ctrl1 vs. 4°C-24 h) within the same individual are measured by FACS using samples derived another 6 individuals. *, P < 0.05 (Wilcoxon signed-rank test compared with Ctrl1).
Fig 3Systematic biases in DNA methylation profile caused by difference of DNA collection protocols.
A. Differences of cell proportion between conditions (Ctrl1 vs. Ctrl2, TMM, BBJ, Hisayama, and JPHC) within the same individual are estimated by the cell-type composition from DNA methylation profiles. CD8T, CD8+ T cells; CD4T, CD4+ T cells; NK, natural killer cells; Bcell, B cells; Mono, monocytes; Gran, granulocytes. *, P < 0.05; **, P < 0.01; ***, P < 0.001 (Wilcoxon sighed rank test compared with Ctrl1). B-F. QQ plots for the comparison of 16 individuals between conditions (Ctrl1 vs. Ctrl2, TMM, BBJ, Hisayama, and JPHC) before (brown points) and after adjustment for the change in the estimated proportion of granulocytes (blue points). The genomic inflation factor lambda (median P-value of obs/exp) is shown.