| Literature DB >> 24884391 |
Martin A Rijlaarsdam, Yvonne G van der Zwan, Lambert C J Dorssers, Leendert H J Looijenga1.
Abstract
BACKGROUND: Array based methylation profiling is a cost-effective solution to study the association between genome methylation and human disease & development. Available tools to analyze the Illumina Infinium HumanMethylation450 BeadChip focus on comparing methylation levels per locus. Other tools combine multiple probes into a range, identifying differential methylated regions (DMRs). These tools all require groups of samples to compare. However, comparison of unique, individual samples is essential in situations where larger sample sizes are not possible.Entities:
Mesh:
Year: 2014 PMID: 24884391 PMCID: PMC4046028 DOI: 10.1186/1471-2105-15-141
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flowchart and overview of DMRforPairs results of the Illumina data. (A) The subsequent steps of recoding probe classes, identifying regions, quantifying and testing methylation differences and exporting the results are described in detail in the main text. Briefly, 473,151 probes remained after quality control. Subsequently, probes not associated to any of the 11 classes or not included in any of the regions are discarded. 145,537 probes (35%) were included in 29,404 potential regions of interest. Finally, these were assessed for methylation differences. (B) Number of regions identified in the various pairwise analyses. * = relevant indicates regions with |ΔM| > 1.4, ** = significant indicates relevant + padjusted ≤ 0.05. “repl.” indicates technical replicates. (C,D) The density plots illustrate the distribution of all/the relevant/the significant regions with regard to the number of probes in each region. Only the comparisons of the two cancer cell lines (C) and the pair of lymphoblastoid cell lines (D) are depicted as the technical replicates yielded no significant DMRs.
Figure 2Tuning of the of dand nparameters. (A) Number of regions identified and (B) fraction of all probes included in these regions using different settings of dmin and nmin. dmin denotes the maximal distance in bp allowed between two adjacent probes to be accepted in the same region. nmin denotes the minimal number of probes in a region (per sample). All runs of the algorithm were done using the 415,712 probes annotated to at least one Illumina class grouped according to gene/transcription start site/CpG island (recode parameter = 1). These benchmark statistics can be generated using the tune_parameters function in the algorithm (optional parallelization).
Figure 3DMRforPairs output. (A) One row of the HTML table describing one DMR. Thumbnail, genomic annotation and descriptive statistics regarding (the difference between) the samples are presented as well as links to figures/tables illustrating the methylation patterns in the samples in detail. Direct links to the genomic region in two genome browsers are also provided (Ensembl & UCSC). Region IDs are generated on the fly by the regionfinder function and are specific to a dataset and to a set of DMRforPairs parameters. They are therefore not interchangeable between datasets/experiments and serve mainly as identifiers during exploration of the dataset. (B) Methylation level per probe (M and β values) plotted against its genomic position. These plots are generated for all relevant and significant regions. (C) Annotated visualization of DMRs (β values) ±10 kb. Black box indicates the DMR. Transcripts overlapping/near the region are retrieved from Ensembl. These plots are optionally generated for all relevant and significant regions. (D) Additional statistics (STATS link in table) as provided for each region.