Literature DB >> 33016200

CUE: CpG impUtation ensemble for DNA methylation levels across the human methylation450 (HM450) and EPIC (HM850) BeadChip platforms.

Gang Li1, Laura Raffield2, Mark Logue3,4,5,6, Mark W Miller3,4, Hudson P Santos7,8, T Michael O'Shea9, Rebecca C Fry8,10,11, Yun Li2,12,13.   

Abstract

DNA methylation at CpG dinucleotides is one of the most extensively studied epigenetic marks. With technological advancements, geneticists can profile DNA methylation with multiple reliable approaches. However, profiling platforms can differ substantially in the CpGs they assess, consequently hindering integrated analysis across platforms. Here, we present CpG impUtation Ensemble (CUE), which leverages multiple classical statistical and modern machine learning methods, to impute from the Illumina HumanMethylation450 (HM450) BeadChip to the Illumina HumanMethylationEPIC (HM850) BeadChip. Data were analysed from two population cohorts with methylation measured both by HM450 and HM850: the Extremely Low Gestational Age Newborns (ELGAN) study (n = 127, placenta) and the VA Boston Posttraumatic Stress Disorder (PTSD) genetics repository (n = 144, whole blood). Cross-validation results show that CUE achieves the lowest predicted root-mean-square error (RMSE) (0.026 in PTSD) and the highest accuracy (99.97% in PTSD) compared with five individual methods tested, including k-nearest-neighbours, logistic regression, penalized functional regression, random forest, and XGBoost. Finally, among all 339,033 HM850-only CpG sites shared between ELGAN and PTSD, CUE successfully imputed 289,604 (85.4%) sites, where success was defined as RMSE < 0.05 and accuracy >95% in PTSD. In summary, CUE is a valuable tool for imputing CpG methylation from the HM450 to HM850 platform.

Entities:  

Keywords:  DNA methylation; HM450; HM850; ensemble learning; imputation; placenta; whole blood

Mesh:

Year:  2020        PMID: 33016200      PMCID: PMC8330997          DOI: 10.1080/15592294.2020.1827716

Source DB:  PubMed          Journal:  Epigenetics        ISSN: 1559-2294            Impact factor:   4.528


  48 in total

Review 1.  Missing value imputation for gene expression data: computational techniques to recover missing data from available information.

Authors:  Alan Wee-Chung Liew; Ngai-Fong Law; Hong Yan
Journal:  Brief Bioinform       Date:  2010-12-14       Impact factor: 11.622

2.  iDNA-Methyl: identifying DNA methylation sites via pseudo trinucleotide composition.

Authors:  Zi Liu; Xuan Xiao; Wang-Ren Qiu; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2015-01-14       Impact factor: 3.365

3.  Epigenome-wide association of PTSD from heterogeneous cohorts with a common multi-site analysis pipeline.

Authors:  Andrew Ratanatharathorn; Marco P Boks; Adam X Maihofer; Allison E Aiello; Ananda B Amstadter; Allison E Ashley-Koch; Dewleen G Baker; Jean C Beckham; Evelyn Bromet; Michelle Dennis; Melanie E Garrett; Elbert Geuze; Guia Guffanti; Michael A Hauser; Varun Kilaru; Nathan A Kimbrel; Karestan C Koenen; Pei-Fen Kuan; Mark W Logue; Benjamin J Luft; Mark W Miller; Colter Mitchell; Nicole R Nugent; Kerry J Ressler; Bart P F Rutten; Murray B Stein; Eric Vermetten; Christiaan H Vinkers; Nagy A Youssef; Monica Uddin; Caroline M Nievergelt; Alicia K Smith
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2017-07-10       Impact factor: 3.568

4.  Prediction of methylated CpGs in DNA sequences using a support vector machine.

Authors:  Manoj Bhasin; Hong Zhang; Ellis L Reinherz; Pedro A Reche
Journal:  FEBS Lett       Date:  2005-08-15       Impact factor: 4.124

Review 5.  Stress, burnout and depression: A systematic review on DNA methylation mechanisms.

Authors:  Jelena Bakusic; Wilmar Schaufeli; Stephan Claes; Lode Godderis
Journal:  J Psychosom Res       Date:  2016-11-23       Impact factor: 3.006

Review 6.  Genotype imputation.

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

Review 7.  A comprehensive overview of Infinium HumanMethylation450 data processing.

Authors:  Sarah Dedeurwaerder; Matthieu Defrance; Martin Bizet; Emilie Calonne; Gianluca Bontempi; François Fuks
Journal:  Brief Bioinform       Date:  2013-08-29       Impact factor: 11.622

8.  BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues.

Authors:  Luli S Zou; Michael R Erdos; D Leland Taylor; Peter S Chines; Arushi Varshney; Stephen C J Parker; Francis S Collins; John P Didion
Journal:  BMC Genomics       Date:  2018-05-23       Impact factor: 3.969

9.  CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure.

Authors:  Christoph Bock; Martina Paulsen; Sascha Tierling; Thomas Mikeska; Thomas Lengauer; Jörn Walter
Journal:  PLoS Genet       Date:  2006-03-03       Impact factor: 5.917

10.  Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling.

Authors:  Ruth Pidsley; Elena Zotenko; Timothy J Peters; Mitchell G Lawrence; Gail P Risbridger; Peter Molloy; Susan Van Djik; Beverly Muhlhausler; Clare Stirzaker; Susan J Clark
Journal:  Genome Biol       Date:  2016-10-07       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.