| Literature DB >> 34252972 |
Sapir Margalit1,2, Yotam Abramson1,2, Hila Sharim1,2, Zohar Manber2,3, Surajit Bhattacharya4, Yi-Wen Chen4,5, Eric Vilain4,5, Hayk Barseghyan4,5, Ran Elkon2,3, Roded Sharan2,6, Yuval Ebenstein1,2.
Abstract
MOTIVATION: While promoter methylation is associated with reinforcing fundamental tissue identities, the methylation status of distant enhancers was shown by genome-wide association studies to be a powerful determinant of cell-state and cancer. With recent availability of long reads that report on the methylation status of enhancer-promoter pairs on the same molecule, we hypothesized that probing these pairs on the single-molecule level may serve the basis for detection of rare cancerous transformations in a given cell population. We explore various analysis approaches for deconvolving cell-type mixtures based on their genome-wide enhancer-promoter methylation profiles.Entities:
Mesh:
Year: 2021 PMID: 34252972 PMCID: PMC8275347 DOI: 10.1093/bioinformatics/btab306
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Scheme 1.Hierarchy of terms used: dataset, sample, training/test set.
Fig. 1.Methylation states in predicted enhancer–promoter pairs. (A) schematic illustration of possible methylation states for a promoter and enhancers, and potential interaction between them. (B) Bionano Genomics optical methylation map of a region in chromosome 17 in GM12878 DNA. The region contains the gene TP53, its promoter (small blue box), and several predicted enhancers (pink boxes). Dark blue dots denote unmethylated sites and orange dots denote genetic tags used for alignment to the hg38 reference.
Fig. 2.Deconvolution of mixtures containing B-lymphocytes and myoblast cells by different methods using methylation states in promoters alone and enhancer–promoter pairs, accounting for one enhancer per promoter. (A) Calculated mixing ratio according to the different methods versus the known mixing ratio. (B) The mean error in calculated mixing ratio, calculated as the absolute distance from the known ratio, in the different methods.
Fig. 3.Deconvolution of B-lymphocytes and myoblast cells mixtures by different methods using methylation states in all predicted enhancer–promoter pairs. (A) calculated mixing ratio according to the different methods versus the known mixing ratio. (B) the mean error in calculated mixing ratio, calculated as the absolute distance from the known ratio, in the different methods.
Fig. 4.Deconvolution of myoblast cells from two different donors by different methods. (A) calculated mixing ratio according to the different methods versus the known mixing ratio. (B) the mean error in calculated mixing ratio, calculated as the absolute distance from the known ratio, in the different methods.
Fig. 5.Subsets of E–P pairs, selected by supervised selection methods. The mean error in calculated mixing ratio (distance from theoretical ratio) is displayed against the log10 of the number of best pairs selected in each combination of deconvolution method and ranking method. (A) A mixture of two cell types: B-lymphocytes and myoblasts. (B) A mixture of two myoblast cells derived from different individuals.