Ya Wang1, Min Qian1, Deliang Tang2, Julie Herbstman2, Frederica Perera2, Shuang Wang1. 1. Department of Biostatistics, New York, NY 10032, USA. 2. Columbia Center for Children's Environmental Health, Department of Environmental Health Science, Mailman School of Public Health, Columbia University, New York, NY 10032, USA.
Abstract
MOTIVATION: Deoxyribonucleic acid (DNA) methylation plays a crucial role in human health. Studies have demonstrated associations between DNA methylation and environmental factors with evidence also supporting the idea that DNA methylation may modify the risk of environmental factors on health outcomes. However, due to high dimensionality and low study power, current studies usually focus on finding differential methylation on health outcomes at CpG level or gene level combining multiple CpGs and/or finding environmental effects on health outcomes but ignoring their interactions on health outcomes. Here we introduce the idea of a pseudo-data matrix constructed with cross-product terms between CpGs and environmental factors that are able to capture their interactions. We then develop a powerful and flexible weighted distance-based method with the pseudo-data matrix where association strength was used as weights on CpGs, environmental factors and their interactions to up-weight signals and down-weight noises in distance calculations. RESULTS: We compared the power of this novel approach and several comparison methods in simulated datasets and the Mothers and Newborns birth cohort of the Columbia Center for Children's Environmental Health to determine whether prenatal polycyclic aromatic hydrocarbons interacts with DNA methylation in association with Attention Deficit Hyperactivity Disorder and Mental Development Index at age 3. AVAILABILITY AND IMPLEMENTATION: An R code for the proposed method Dw-M-E-int together with a tutorial and a sample dataset is available for downloading from http://www.columbia.edu/∼sw2206/softwares.htm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Deoxyribonucleic acid (DNA) methylation plays a crucial role in human health. Studies have demonstrated associations between DNA methylation and environmental factors with evidence also supporting the idea that DNA methylation may modify the risk of environmental factors on health outcomes. However, due to high dimensionality and low study power, current studies usually focus on finding differential methylation on health outcomes at CpG level or gene level combining multiple CpGs and/or finding environmental effects on health outcomes but ignoring their interactions on health outcomes. Here we introduce the idea of a pseudo-data matrix constructed with cross-product terms between CpGs and environmental factors that are able to capture their interactions. We then develop a powerful and flexible weighted distance-based method with the pseudo-data matrix where association strength was used as weights on CpGs, environmental factors and their interactions to up-weight signals and down-weight noises in distance calculations. RESULTS: We compared the power of this novel approach and several comparison methods in simulated datasets and the Mothers and Newborns birth cohort of the Columbia Center for Children's Environmental Health to determine whether prenatal polycyclic aromatic hydrocarbons interacts with DNA methylation in association with Attention Deficit Hyperactivity Disorder and Mental Development Index at age 3. AVAILABILITY AND IMPLEMENTATION: An R code for the proposed method Dw-M-E-int together with a tutorial and a sample dataset is available for downloading from http://www.columbia.edu/∼sw2206/softwares.htm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Alexandra J White; Jia Chen; Lauren E McCullough; Xinran Xu; Yoon Hee Cho; Susan L Teitelbaum; Alfred I Neugut; Mary Beth Terry; Hanina Hibshoosh; Regina M Santella; Marilie D Gammon Journal: Cancer Causes Control Date: 2015-09-25 Impact factor: 2.506
Authors: Jonathan Mill; Thomas Tang; Zachary Kaminsky; Tarang Khare; Simin Yazdanpanah; Luigi Bouchard; Peixin Jia; Abbas Assadzadeh; James Flanagan; Axel Schumacher; Sun-Chong Wang; Arturas Petronis Journal: Am J Hum Genet Date: 2008-03 Impact factor: 11.025
Authors: Julie B Herbstman; Deliang Tang; Deguang Zhu; Lirong Qu; Andreas Sjödin; Zheng Li; David Camann; Frederica P Perera Journal: Environ Health Perspect Date: 2012-01-17 Impact factor: 9.031
Authors: Bram G Janssen; Lode Godderis; Nicky Pieters; Katrien Poels; Michał Kiciński; Ann Cuypers; Frans Fierens; Joris Penders; Michelle Plusquin; Wilfried Gyselaers; Tim S Nawrot Journal: Part Fibre Toxicol Date: 2013-06-07 Impact factor: 9.400
Authors: R Kumsta; S J Marzi; J Viana; E L Dempster; B Crawford; M Rutter; J Mill; E J S Sonuga-Barke Journal: Transl Psychiatry Date: 2016-06-07 Impact factor: 6.222
Authors: Monica D Nye; Katherine E King; Thomas H Darrah; Rachel Maguire; Dereje D Jima; Zhiqing Huang; Michelle A Mendez; Rebecca C Fry; Randy L Jirtle; Susan K Murphy; Cathrine Hoyo Journal: Environ Epigenet Date: 2016-02-15
Authors: Frederica P Perera; Virginia Rauh; Wei-Yann Tsai; Patrick Kinney; David Camann; Dana Barr; Tom Bernert; Robin Garfinkel; Yi-Hsuan Tu; Diurka Diaz; Jessica Dietrich; Robin M Whyatt Journal: Environ Health Perspect Date: 2003-02 Impact factor: 9.031
Authors: Nelly D Saenen; Karen Vrijens; Bram G Janssen; Harry A Roels; Kristof Y Neven; Wim Vanden Berghe; Wilfried Gyselaers; Charlotte Vanpoucke; Wouter Lefebvre; Patrick De Boever; Tim S Nawrot Journal: Environ Health Perspect Date: 2016-09-13 Impact factor: 9.031