| Literature DB >> 34425889 |
Weitian Chen1,2, Zhe Weng2, Zhe Xie2,3, Yeming Xie2, Chen Zhang2, Zhichao Chen1,2, Fengying Ruan2, Juan Wang2, Yuxin Sun2, Yitong Fang2, Mei Guo2, Yiqin Tong2, Yaning Li2, Chong Tang4,5,6.
Abstract
BACKGROUND: Although extrachromosomal DNA (ecDNA) has been intensively studied for several decades, the mechanisms underlying its tumorigenic effects have been revealed only recently. In most conventional sequencing studies, the high-throughput short-read sequencing largely ignores the epigenetic status of most ecDNA regions except for the junctional areas.Entities:
Keywords: Chromatin accessibility; Methylation; Methyltransferase; ecDNA; m6A
Mesh:
Substances:
Year: 2021 PMID: 34425889 PMCID: PMC8383416 DOI: 10.1186/s13072-021-00416-5
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Fig. 1CCDA-seq for profiling chromatin accessibility and nucleosome position in ecDNAs. A Intact chromatin was treated with m6A methyltransferase (EcoGII), which preferentially methylates DNA bases in the open chromatin region on ecDNAs and linear DNAs. High molecular weight DNA was then isolated and subjected to exonuclease digestion to remove partially linear DNAs. The remaining DNA molecules were subjected to nanopore library construction and nanopore sequencing. The data were aligned to the hg19 genome to identify ecDNAs based on head-to-tail pattern. The methylated bases were used to reconstruct nucleosomes in ecDNAs and other linear DNAs. In contrast, the ATAC-seq used the transposon to attack the open chromatin. The tagmentated short fragments were amplified and subjected to NGS. The short reads were aligned with genome to identify ecDNA bases. The mapped reads were calling as peaks representing the open chromatin region. B CCDA-seq bioinformatics pipeline is illustrated. The signal data were processed through guppy base calling to generate sequence. The sequences were aligned to the genome to identify linear DNAs and ecDNAs. We assembled the ecDNA sequence reference. Based on the ecDNA and linear DNA references, we used Megalodon to call the m6A sites based on ecDNA and linear DNA sequences. Then, we performed the accessibility analysis, gene element annotation, gene expression analysis, and co-accessibility assessment. C Large aggregate CCDA-seq signal enrichments match closely with DNase-seq accessibility peaks. (Chr20: 49220090–58167461)
Fig. 2ecDNA and linear DNA have the different chromatin accessibility pattern. A Density distribution of the methylation ratio in ecDNA and linear DNA. B Average chromatin status around ecDNA neighboring regions is illustrated. The junction site and its right neighboring regions demonstrate the more open chromatin. C Average methylations of gene regions on ecDNA and linear DNA (from TSS to TES). The genes were classified into two groups: (I) the genes on linear DNA have more open chromatin structure than ecDNA carried genes (above centerline); (II) the ecDNA carried genes have more open chromatin structure than the genes on linear DNA (below centerline). D Average CCDA-seq profile around transcription start site on ecDNAs and linear DNA. E Average CCDA-seq profile around the transcription end site in the ecDNA and linear DNAs (aggregated over 50-bp windows sliding every 5 bp; the sequencing depth was normalized for ecDNA and linear DNA; see Method for details)
Fig. 3CCDA-seq reveals the distribution of alternative chromatin states in ecDNA arrays. A Shown are all reads covering the linear DNA region chr10: 42383201–42389201. The box highlights the active and inactive chromatins. B Shown are all reads covering the ecDNA region chr10: 42383201–42389201. The upper panel indicates the positive strand, and the lower panel indicates the negative strand
Fig. 4Chromatin co-accessibility profiles for the chr10: 42383201–42389201 show high and low correlation with ecDNA and linear DNA. A Chromatin co-accessibility profiles for the chr10: 42383201–42389201 show correlation and low correlation with ecDNA. Red indicates the positive correlation and blue indicates the low correlation. B Chromatin co-accessibility profiles for the chr10: 42383201–42389201 show correlation and low correlation on linear DNA