Literature DB >> 35419567

A Novel Framework for the Identification of Reference DNA Methylation Libraries for Reference-Based Deconvolution of Cellular Mixtures.

Shelby Bell-Glenn1, Jeffrey A Thompson1, Lucas A Salas2, Devin C Koestler1.   

Abstract

Reference-based deconvolution methods use reference libraries of cell-specific DNA methylation (DNAm) measurements as a means toward deconvoluting cell proportions in heterogeneous biospecimens (e.g., whole-blood). As the accuracy of such methods depends highly on the CpG loci comprising the reference library, recent research efforts have focused on the selection of libraries to optimize deconvolution accuracy. While existing approaches for library selection work extremely well, the best performing approaches require a training data set consisting of both DNAm profiles over a heterogeneous cell population and gold-standard measurements of cell composition (e.g., flow cytometry) in the same samples. Here, we present a framework for reference library selection without a training dataset (RESET) and benchmark it against the Legacy method (minfi:pickCompProbes), where libraries are constructed based on a pre-specified number of cell-specific differentially methylated loci (DML). RESET uses a modified version of the Dispersion Separability Criteria (DSC) for comparing different libraries and has four main steps: (1) identify a candidate set of cell-specific DMLs, (2) randomly sample DMLs from the candidate set, (3) compute the Modified DSC of the selected DMLs, and (4) update the selection probabilities of DMLs based on their contribution to the Modified DSC. Steps 2-4 are repeated many times and the library with the largest Modified DSC is selected for subsequent reference-based deconvolution. We evaluated RESET using several publicly available datasets consisting of whole-blood DNAm measurements with corresponding measurements of cell composition. We computed the RMSE and R 2 between the predicted cell proportions and their measured values. RESET outperformed the Legacy approach in selecting libraries that improve the accuracy of deconvolution estimates. Additionally, reference libraries constructed using RESET resulted in cellular composition estimates that explained more variation in DNAm as compared to the Legacy approach when evaluated in the context of epigenome-wide association studies (EWAS) of several publicly available data sets. This finding has implications for the statistical power of EWAS. RESET combats potential challenges associated with existing approaches for reference library assembly and thus, may serve as a viable strategy for library construction in the absence of a training data set.

Entities:  

Keywords:  DNA Methylation; EWAS; IDOL; cell heterogeneity; cell proportion estimation; reference-based deconvolution

Year:  2022        PMID: 35419567      PMCID: PMC9004796          DOI: 10.3389/fbinf.2022.835591

Source DB:  PubMed          Journal:  Front Bioinform        ISSN: 2673-7647


  25 in total

Review 1.  Recommendations for the design and analysis of epigenome-wide association studies.

Authors:  Karin B Michels; Alexandra M Binder; Sarah Dedeurwaerder; Charles B Epstein; John M Greally; Ivo Gut; E Andres Houseman; Benedetta Izzi; Karl T Kelsey; Alexander Meissner; Aleksandar Milosavljevic; Kimberly D Siegmund; Christoph Bock; Rafael A Irizarry
Journal:  Nat Methods       Date:  2013-10       Impact factor: 28.547

2.  On perfect clustering of high dimension, low sample size data.

Authors:  Soham Sarkar; Anil K Ghosh
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-04-23       Impact factor: 6.226

3.  Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays.

Authors:  Martin J Aryee; Andrew E Jaffe; Hector Corrada-Bravo; Christine Ladd-Acosta; Andrew P Feinberg; Kasper D Hansen; Rafael A Irizarry
Journal:  Bioinformatics       Date:  2014-01-28       Impact factor: 6.937

4.  Fractional Norms and Quasinorms Do Not Help to Overcome the Curse of Dimensionality.

Authors:  Evgeny M Mirkes; Jeza Allohibi; Alexander Gorban
Journal:  Entropy (Basel)       Date:  2020-09-30       Impact factor: 2.524

5.  An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray.

Authors:  Lucas A Salas; Devin C Koestler; Rondi A Butler; Helen M Hansen; John K Wiencke; Karl T Kelsey; Brock C Christensen
Journal:  Genome Biol       Date:  2018-05-29       Impact factor: 13.583

6.  Guidelines for cell-type heterogeneity quantification based on a comparative analysis of reference-free DNA methylation deconvolution software.

Authors:  Clémentine Decamps; Florian Privé; Raphael Bacher; Daniel Jost; Arthur Waguet; Eugene Andres Houseman; Eugene Lurie; Pavlo Lutsik; Aleksandar Milosavljevic; Michael Scherer; Michael G B Blum; Magali Richard
Journal:  BMC Bioinformatics       Date:  2020-01-13       Impact factor: 3.169

7.  Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis.

Authors:  Yun Liu; Martin J Aryee; Leonid Padyukov; M Daniele Fallin; Espen Hesselberg; Arni Runarsson; Lovisa Reinius; Nathalie Acevedo; Margaret Taub; Marcus Ronninger; Klementy Shchetynsky; Annika Scheynius; Juha Kere; Lars Alfredsson; Lars Klareskog; Tomas J Ekström; Andrew P Feinberg
Journal:  Nat Biotechnol       Date:  2013-01-20       Impact factor: 54.908

8.  DNA methylation arrays as surrogate measures of cell mixture distribution.

Authors:  Eugene Andres Houseman; William P Accomando; Devin C Koestler; Brock C Christensen; Carmen J Marsit; Heather H Nelson; John K Wiencke; Karl T Kelsey
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

9.  Accounting for cellular heterogeneity is critical in epigenome-wide association studies.

Authors:  Andrew E Jaffe; Rafael A Irizarry
Journal:  Genome Biol       Date:  2014-02-04       Impact factor: 13.583

10.  Pre-diagnosis neutrophil-to-lymphocyte ratio and mortality in individuals who develop lung cancer.

Authors:  Carmen J Marsit; Jennifer A Doherty; Laurie Grieshober; Stefan Graw; Matt J Barnett; Gary E Goodman; Chu Chen; Devin C Koestler
Journal:  Cancer Causes Control       Date:  2021-07-08       Impact factor: 2.506

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.