| Literature DB >> 31105932 |
Clarence K Mah1, Jill P Mesirov1,2, Lukas Chavez1.
Abstract
Illumina Infinium DNA methylation arrays are a cost-effective technology to measure DNA methylation at CpG sites genome-wide and across cohorts of normal and cancer samples. While copy number alterations are commonly inferred from array-CGH, SNP arrays, or whole-genome DNA sequencing, Illumina Infinium DNA methylation arrays have been shown to detect copy number alterations at comparable sensitivity. Here we present an accessible, interactive GenePattern notebook for the analysis of copy number variation using Illumina Infinium DNA methylation arrays. The notebook provides a graphical user interface to a workflow using the R/Bioconductor packages minfi and conumee. The environment allows analysis to be performed without the installation of the R software environment, the packages and dependencies, and without the need to write or manipulate code.Entities:
Keywords: DNA methylation; GenePattern Notebook; Illumina Infinium methylation arrays; Jupyter Notebook; R/Bioconductor; conumee; copy number variation; interactive; minfi; open-source; pre-processing; visualization
Mesh:
Year: 2018 PMID: 31105932 PMCID: PMC6498745 DOI: 10.12688/f1000research.16338.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Analysis workflow.
The flowchart shows the main inputs and outputs necessary for the copy number variation analysis.
Figure 2. Copy number variation analysis GenePattern Notebook interface.
The “MethylationCNVAnalysis” module is presented as an input form using the GenePattern Notebook graphical user interface. The user links or uploads input files and selects analysis parameters before pressing “Run” to execute the workflow.
Figure 3. Plots of query and control samples.
( A) Median intensity plot of query & control samples. Log median intensity of the methylated channel is along the x-axis and log median intensity of unmethylated channel is along the y-axis. Bad-quality samples fall under the threshold and are colored red. There is no bad quality sample in this plot. ( B) DNA methylation (Beta-value) density plot of query & control samples. A density plot showing the distribution of beta values across each sample. Beta values should be bimodal and peak around 0 and 1.0.
Figure 4. Plots of copy numbers.
( A) Copy number plot of the entire genome in the example glioblastoma sample. A plot of all chromosomes across the genome. Intensity values of each bin are plotted as colored dots, green indicating above normal copy number, red indicating below normal copy number, and grey indicating close to normal copy number. Blue lines indicate the median intensity of each bin. Specified genes to be highlighted are annotated. ( B) Copy number plot of common cancer genes in the example glioblastoma sample. An overview of the genomic loci of common cancer genes are shown in more detail. Copy number values are visualized as described in Figure 4A.