Jing Xu1,2, Linna Zhao1,2, Di Liu1,2, Simeng Hu1,2, Xiuling Song1,2, Jin Li1, Hongchao Lv1, Lian Duan1, Mingming Zhang1, Qinghua Jiang3, Guiyou Liu3, Shuilin Jin4, Mingzhi Liao5, Meng Zhang2,6, Rennan Feng2,6, Fanwu Kong7, Liangde Xu1,2, Yongshuai Jiang1,2. 1. College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China. 2. Training Center for Students Innovation and Entrepreneurship Education, Harbin Medical University, Harbin, China. 3. Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China. 4. Department of Mathematics, Harbin Institute of Technology, Harbin, China. 5. College of Life Science, Northwest A&F University, Yangling, Shaanxi, China. 6. Department of Nutrition and Food Hygiene, Public Health College, Harbin Medical University, Harbin, China. 7. Department of Nephrology, The Second Affiliated Hospital, Harbin Medical University, Harbin, China.
Abstract
Motivation: With the development of biotechnology, DNA methylation data showed exponential growth. Epigenome-wide association study (EWAS) provide a systematic approach to uncovering epigenetic variants underlying common diseases/phenotypes. But the EWAS software has lagged behind compared with genome-wide association study (GWAS). To meet the requirements of users, we developed a convenient and useful software, EWAS2.0. Results: EWAS2.0 can analyze EWAS data and identify the association between epigenetic variations and disease/phenotype. On the basis of EWAS1.0, we have added more distinctive features. EWAS2.0 software was developed based on our 'population epigenetic framework' and can perform: (i) epigenome-wide single marker association study; (ii) epigenome-wide methylation haplotype (meplotype) association study and (iii) epigenome-wide association meta-analysis. Users can use EWAS2.0 to execute chi-square test, t-test, linear regression analysis, logistic regression analysis, identify the association between epi-alleles, identify the methylation disequilibrium (MD) blocks, calculate the MD coefficient, the frequency of meplotype and Pearson's correlation coefficients and carry out meta-analysis and so on. Finally, we expect EWAS2.0 to become a popular software and be widely used in epigenome-wide associated studies in the future. Availability and implementation: The EWAS software is freely available at http://www.ewas.org.cn or http://www.bioapp.org/ewas.
Motivation: With the development of biotechnology, DNA methylation data showed exponential growth. Epigenome-wide association study (EWAS) provide a systematic approach to uncovering epigenetic variants underlying common diseases/phenotypes. But the EWAS software has lagged behind compared with genome-wide association study (GWAS). To meet the requirements of users, we developed a convenient and useful software, EWAS2.0. Results: EWAS2.0 can analyze EWAS data and identify the association between epigenetic variations and disease/phenotype. On the basis of EWAS1.0, we have added more distinctive features. EWAS2.0 software was developed based on our 'population epigenetic framework' and can perform: (i) epigenome-wide single marker association study; (ii) epigenome-wide methylation haplotype (meplotype) association study and (iii) epigenome-wide association meta-analysis. Users can use EWAS2.0 to execute chi-square test, t-test, linear regression analysis, logistic regression analysis, identify the association between epi-alleles, identify the methylation disequilibrium (MD) blocks, calculate the MD coefficient, the frequency of meplotype and Pearson's correlation coefficients and carry out meta-analysis and so on. Finally, we expect EWAS2.0 to become a popular software and be widely used in epigenome-wide associated studies in the future. Availability and implementation: The EWAS software is freely available at http://www.ewas.org.cn or http://www.bioapp.org/ewas.
Epigenome-wide association study (EWAS) is an effective tool to identify the association between epigenetic variation and common disease/phenotype (Rakyan ; Wahl ). Compared with genome-wide association study (GWAS), the analysis tools of EWAS have lagged behind. To fill this gap, we developed novel and unique features, and improved upon the previous version EWAS1.0 (Xu ).EWAS1.0 was originally designed only for identifying the association between combinations of methylation levels (beta-value) and diseases. EWAS2.0 (http://www.ewas.org.cn) is a fully functional software.
2 Features
EWAS2.0 software can perform: (i) epigenome-wide single marker association study; (ii) epigenome-wide methylation haplotype (meplotype) association study and (iii) epigenome-wide association meta-analysis. The methylation data should be cleaned and normalized.For each DNA methylation loci, EWAS2.0 can carry out t-test or logistic regression analysis to identify the significant associations with case/control or binomial phenotype, perform linear regression analysis to identify the significant results associated with continuous phenotype, and calculate the Pearson's correlation coefficients between beta-value and continuous phenotype.According to our ‘population epigenetic framework’ (Zhao ), EWAS2.0 can analyze the methylation genotypes (menotypes: MM, MU and UU, where M is methylation epi-allele and U is unmethylation epi-allele) data, calculate the epi-allele frequency and identify risk epi-allele (calculate Chi-square, P-value, odd ratio and 95% confidential interval). EWAS2.0 can also analyze the association between two epi-alleles (M and U) in the same locus, and label the type of the relationships: synergic (two members of homologous chromosomes tend to be methylated simultaneously) or exclusive (one member of homologous chromosomes is methylated, the other member of homologous chromosomes tends to be unmethylated) (Zhao ).For multiple DNA methylation loci that are physically close to each other, there are non-random associations of epi-alleles between these loci, which we call methylation disequilibrium (MD) (Zhao ). EWAS2.0 can calculate the MD coefficients (Zhao ), identify the MD blocks using Gabriel et al.’s algorithm (Barrett ; Gabriel ) and estimate the frequency of meplotype (a group of specific epi-alleles on a chromosome) using Excoffier et al.’s Maximum Likelihood Estimate method (Excoffier and Slatkin, 1995). For case/control data, EWAS2.0 can scan the whole epigenome and identify the disease-related meplotype (calculate Chi-square, P-value, odd ratio and 95% confidential interval). We suggest that users perform meplotype analysis to identify the combinations of some SMP loci related to diseases/phenotypes after performing the single SMP analysis.Since the results of the similar EWAS studies are often inconsistent, we developed an epigenome-wide meta-analysis module. At first, EWAS2.0 test the heterogeneity between individual studies using Cochran’s Q-statistics. Then, the fixed effects model (all studies share a common effect size) and a random effects model (each study has a specific effect size) were used to evaluate the association between marker and disease/phenotype. We suggest that users select fixed effects model for low heterogeneity, and random effects model for high heterogeneity.EWAS2.0 software is a JAVA application based on JAVA 1.7 and is freely available at: http://www.ewas.org.cn. The current status of EWAS2.0 is depicted in Table 1. More functions will be added in the future version (such as EWAS for gene region, KEGG pathway, GO categories, network, interacting with genetic marker, regulation of gene expression, RNA modification, histone modification). Some comparisons between different methods can be found in a supplement (http://www.bioapp.org/ewas/supplement.html). We expect it to become a useful tool.
Table 1.
Overview of novel functions in EWAS2.0
Category
Description
-t.test
T-test for case/control or binomial phenotype
-linear
Linear regression analysis for continuous phenotype
-logistic
Logistic regression analysis for case/control or binomial phenotype
-cor
The Pearson's correlation coefficients for continuous phenotype
-SMP.allele_chisq
Chisq-square test for epi-allele: 2 (phenotype)*2 (M vs. U) table
-SMP.aa
Identify the type of epi-allele association: synergic or exclusive
-meplotype
Epigenome-wide meplotype association analysis
-MD
Calculate the MD coefficient
-block
Identify the MD blocks and calculate the frequency of meplotype
Authors: Stacey B Gabriel; Stephen F Schaffner; Huy Nguyen; Jamie M Moore; Jessica Roy; Brendan Blumenstiel; John Higgins; Matthew DeFelice; Amy Lochner; Maura Faggart; Shau Neen Liu-Cordero; Charles Rotimi; Adebowale Adeyemo; Richard Cooper; Ryk Ward; Eric S Lander; Mark J Daly; David Altshuler Journal: Science Date: 2002-05-23 Impact factor: 47.728
Authors: Simone Wahl; Alexander Drong; Benjamin Lehne; Marie Loh; William R Scott; Sonja Kunze; Pei-Chien Tsai; Janina S Ried; Weihua Zhang; Youwen Yang; Sili Tan; Giovanni Fiorito; Lude Franke; Simonetta Guarrera; Silva Kasela; Jennifer Kriebel; Rebecca C Richmond; Marco Adamo; Uzma Afzal; Mika Ala-Korpela; Benedetta Albetti; Ole Ammerpohl; Jane F Apperley; Marian Beekman; Pier Alberto Bertazzi; S Lucas Black; Christine Blancher; Marc-Jan Bonder; Mario Brosch; Maren Carstensen-Kirberg; Anton J M de Craen; Simon de Lusignan; Abbas Dehghan; Mohamed Elkalaawy; Krista Fischer; Oscar H Franco; Tom R Gaunt; Jochen Hampe; Majid Hashemi; Aaron Isaacs; Andrew Jenkinson; Sujeet Jha; Norihiro Kato; Vittorio Krogh; Michael Laffan; Christa Meisinger; Thomas Meitinger; Zuan Yu Mok; Valeria Motta; Hong Kiat Ng; Zacharoula Nikolakopoulou; Georgios Nteliopoulos; Salvatore Panico; Natalia Pervjakova; Holger Prokisch; Wolfgang Rathmann; Michael Roden; Federica Rota; Michelle Ann Rozario; Johanna K Sandling; Clemens Schafmayer; Katharina Schramm; Reiner Siebert; P Eline Slagboom; Pasi Soininen; Lisette Stolk; Konstantin Strauch; E-Shyong Tai; Letizia Tarantini; Barbara Thorand; Ettje F Tigchelaar; Rosario Tumino; Andre G Uitterlinden; Cornelia van Duijn; Joyce B J van Meurs; Paolo Vineis; Ananda Rajitha Wickremasinghe; Cisca Wijmenga; Tsun-Po Yang; Wei Yuan; Alexandra Zhernakova; Rachel L Batterham; George Davey Smith; Panos Deloukas; Bastiaan T Heijmans; Christian Herder; Albert Hofman; Cecilia M Lindgren; Lili Milani; Pim van der Harst; Annette Peters; Thomas Illig; Caroline L Relton; Melanie Waldenberger; Marjo-Riitta Järvelin; Valentina Bollati; Richie Soong; Tim D Spector; James Scott; Mark I McCarthy; Paul Elliott; Jordana T Bell; Giuseppe Matullo; Christian Gieger; Jaspal S Kooner; Harald Grallert; John C Chambers Journal: Nature Date: 2016-12-21 Impact factor: 49.962
Authors: Sultan Nilay Can; Adam Nunn; Dario Galanti; David Langenberger; Claude Becker; Katharina Volmer; Katrin Heer; Lars Opgenoorth; Noe Fernandez-Pozo; Stefan A Rensing Journal: Epigenomes Date: 2021-05-04