Pei Fen Kuan1, Sijian Wang, Xin Zhou, Haitao Chu. 1. Department of Biostatistics, Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA. pfkuan@bios.unc.edu
Abstract
MOTIVATION: The Illumina BeadArray is a popular platform for profiling DNA methylation, an important epigenetic event associated with gene silencing and chromosomal instability. However, current approaches rely on an arbitrary detection P-value cutoff for excluding probes and samples from subsequent analysis as a quality control step, which results in missing observations and information loss. It is desirable to have an approach that incorporates the whole data, but accounts for the different quality of individual observations. RESULTS: We first investigate and propose a statistical framework for removing the source of biases in Illumina Methylation BeadArray based on several positive control samples. We then introduce a weighted model-based clustering called LumiWCluster for Illumina BeadArray that weights each observation according to the detection P-values systematically and avoids discarding subsets of the data. LumiWCluster allows for discovery of distinct methylation patterns and automatic selection of informative CpG loci. We demonstrate the advantages of LumiWCluster on two publicly available Illumina GoldenGate Methylation datasets (ovarian cancer and hepatocellular carcinoma). AVAILABILITY: R package LumiWCluster can be downloaded from http://www.unc.edu/∼pfkuan/LumiWCluster.
MOTIVATION: The Illumina BeadArray is a popular platform for profiling DNA methylation, an important epigenetic event associated with gene silencing and chromosomal instability. However, current approaches rely on an arbitrary detection P-value cutoff for excluding probes and samples from subsequent analysis as a quality control step, which results in missing observations and information loss. It is desirable to have an approach that incorporates the whole data, but accounts for the different quality of individual observations. RESULTS: We first investigate and propose a statistical framework for removing the source of biases in Illumina Methylation BeadArray based on several positive control samples. We then introduce a weighted model-based clustering called LumiWCluster for Illumina BeadArray that weights each observation according to the detection P-values systematically and avoids discarding subsets of the data. LumiWCluster allows for discovery of distinct methylation patterns and automatic selection of informative CpG loci. We demonstrate the advantages of LumiWCluster on two publicly available Illumina GoldenGate Methylation datasets (ovarian cancer and hepatocellular carcinoma). AVAILABILITY: R package LumiWCluster can be downloaded from http://www.unc.edu/∼pfkuan/LumiWCluster.
Authors: Rafael A Irizarry; Christine Ladd-Acosta; Benilton Carvalho; Hao Wu; Sheri A Brandenburg; Jeffrey A Jeddeloh; Bo Wen; Andrew P Feinberg Journal: Genome Res Date: 2008-03-03 Impact factor: 9.043
Authors: Sahar Houshdaran; Sarah Hawley; Chana Palmer; Mihaela Campan; Mari N Olsen; Aviva P Ventura; Beatrice S Knudsen; Charles W Drescher; Nicole D Urban; Patrick O Brown; Peter W Laird Journal: PLoS One Date: 2010-02-22 Impact factor: 3.240
Authors: Thomas A Down; Vardhman K Rakyan; Daniel J Turner; Paul Flicek; Heng Li; Eugene Kulesha; Stefan Gräf; Nathan Johnson; Javier Herrero; Eleni M Tomazou; Natalie P Thorne; Liselotte Bäckdahl; Marlis Herberth; Kevin L Howe; David K Jackson; Marcos M Miretti; John C Marioni; Ewan Birney; Tim J P Hubbard; Richard Durbin; Simon Tavaré; Stephan Beck Journal: Nat Biotechnol Date: 2008-07 Impact factor: 54.908
Authors: Dan Wang; Li Yan; Qiang Hu; Lara E Sucheston; Michael J Higgins; Christine B Ambrosone; Candace S Johnson; Dominic J Smiraglia; Song Liu Journal: Bioinformatics Date: 2012-01-16 Impact factor: 6.937
Authors: Min A Jhun; Jennifer A Smith; Erin B Ware; Sharon L R Kardia; Thomas H Mosley; Stephen T Turner; Patricia A Peyser; Sung Kyun Park Journal: Am J Epidemiol Date: 2017-11-15 Impact factor: 4.897
Authors: Jennifer Z J Maccani; Devin C Koestler; Eugene Andrés Houseman; Carmen J Marsit; Karl T Kelsey Journal: Epigenomics Date: 2013-12 Impact factor: 4.778
Authors: Carmen M Koch; Christoph V Suschek; Qiong Lin; Simone Bork; Maria Goergens; Sylvia Joussen; Norbert Pallua; Anthony D Ho; Martin Zenke; Wolfgang Wagner Journal: PLoS One Date: 2011-02-08 Impact factor: 3.240
Authors: Devin C Koestler; Jing Li; John A Baron; Gregory J Tsongalis; Lynn F Butterly; Martha Goodrich; Corina Lesseur; Margaret R Karagas; Carmen J Marsit; Jason H Moore; Angeline S Andrew; Amitabh Srivastava Journal: Mod Pathol Date: 2013-07-19 Impact factor: 7.842