| Literature DB >> 23631413 |
Ruth Pidsley1, Chloe C Y Wong, Manuela Volta, Katie Lunnon, Jonathan Mill, Leonard C Schalkwyk.
Abstract
BACKGROUND: As the most stable and experimentally accessible epigenetic mark, DNA methylation is of great interest to the research community. The landscape of DNA methylation across tissues, through development and in disease pathogenesis is not yet well characterized. Thus there is a need for rapid and cost effective methods for assessing genome-wide levels of DNA methylation. The Illumina Infinium HumanMethylation450 (450K) BeadChip is a very useful addition to the available methods for DNA methylation analysis but its complex design, incorporating two different assay methods, requires careful consideration. Accordingly, several normalization schemes have been published. We have taken advantage of known DNA methylation patterns associated with genomic imprinting and X-chromosome inactivation (XCI), in addition to the performance of SNP genotyping assays present on the array, to derive three independent metrics which we use to test alternative schemes of correction and normalization. These metrics also have potential utility as quality scores for datasets.Entities:
Mesh:
Year: 2013 PMID: 23631413 PMCID: PMC3769145 DOI: 10.1186/1471-2164-14-293
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Example of density distribution of values profiled using Type I and Type II probes.
Figure 2Density plots of the values generated from the raw and dasen preprocessing methods for Type I and II probes. Plots represent the loci investigated in each of the three performance tests: a DMRSE, b GCOSE, and c Seabird. Black line=raw, red line=dasen.
Table summarising nomenclature of preprocessing methods
| naten | n | t | n |
| nanet | n | n | t |
| nanes | n | n | s |
| danes | d | n | s |
| danet | d | n | t |
| danen | d | n | n |
| daten1 | d | t | n |
| daten2 | d | t | n |
| nasen | n | s | n |
| dasen | d | s | n |
We have used a naming convention that encodes key aspects of each method. Vowels were added between the letters for ease of pronunciation. The 1st letter (column 1) indicates whether background adjustment was performed (i.e. the offset between Type I and II probe intensities added to Type I intensities), with ‘d’ indicating background adjustment and ‘n’ indicating no background adjustment. Note daten2 is identical to daten1 but with the addition of a linear model of Sentrix position to obtain smooth background offsets. The 3rd letter (column 2) specifies whether between-array normalization was performed (i.e. between-sample quantile normalization of M and U separately), with ‘s’ indicating between-array normalization applied to Type I and Type II probes separately, ‘t’ indicating between-array normalization applied to Type I and Type II probes together and ‘n’ indicating no between-array normalization. The 5th letter (column 3) specifies whether the dye-bias correction was performed (i.e. quantile normalization of M against U), with ‘s’ indicating dye bias correction applied to Type I and Type II probes separately, ‘t’ indicating dye bias correction applied to Type I and Type II probes together and ‘n’ indicating no dye bias correction.
Figure 3Results of the performance tests for Cohort 1C. All values range between 0 and 1. Lower values are indicative of a more sensitive preprocessing method. :a) DMRSE (x10-3), b) GCOSE (x10-4) and c) Seabird (x10-2). Type I probes are denoted by circles and Type II probes by triangles.
Overall rank scores of each preprocessing method
| raw | 6.5 | 11 | 8.75 |
| betaqn | 14 | 13 | 13.5 |
| naten | 12 | 9 | 10.5 |
| nanet | 11 | 3 | 7 |
| nanes | 9.5 | 7.5 | 8.5 |
| danes | 2.5 | 7.5 | 5 |
| danet | 1 | 6 | 3.5 |
| danen | 5 | 12 | 8.5 |
| daten1 | 4 | 4 | 4 |
| daten2 | 8 | 5 | 6.5 |
| nasen | 9.5 | 1.5 | 5.5 |
| dasen | 2.5 | 1.5 | 2 |
| fuks | 6.5 | 15 | 10.75 |
| tost | 13 | 14 | 13.5 |
| swan | 15 | 10 | 12.5 |
Ranks calculated from the three performance metrics applied to 11 datasets. Ranks are shown for Type I and Type II probes separately as well as averaged across both probe types. Dasen is shown to be the best preprocessing method across both probe types.