| Literature DB >> 35030991 |
Ludwig Mann1, Kathrin M Seibt1, Beatrice Weber1, Tony Heitkam2.
Abstract
BACKGROUND: Extrachromosomal circular DNAs (eccDNAs) are ring-like DNA structures physically separated from the chromosomes with 100 bp to several megabasepairs in size. Apart from carrying tandemly repeated DNA, eccDNAs may also harbor extra copies of genes or recently activated transposable elements. As eccDNAs occur in all eukaryotes investigated so far and likely play roles in stress, cancer, and aging, they have been prime targets in recent research-with their investigation limited by the scarcity of computational tools.Entities:
Keywords: Extrachromosomal circular DNA; Mitochondrial minicircle; Mobilome-seq; TE activity; circDNA; circSeq; eccDNA
Mesh:
Substances:
Year: 2022 PMID: 35030991 PMCID: PMC8760651 DOI: 10.1186/s12859-021-04545-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Overview of the ECCsplorer pipeline (a) and it’s output (b, c). a Our pipeline consists of four main modules: one for the data preparation module and three for the analysis modules. The execution of the different modules depends on the given input data. Thick grey arrows represent multi-step bioinformatics computation processes, thin black arrows show data transfer. Semi-artificial circSeq and control data were simulated from a circularized retrotransposon and a random reference sequence to illustrate a typical pipeline output. A more detailed overview of the ECCsplorer pipeline is available in Additional file 1: Figure S1. b Mapping against the reference sequence revealed an eccDNA candidate region. c The comparative read clustering identified 13 clusters containing potential eccDNA candidates. d The comparative module links the retrotransposon candidate region from the mapping module to all 13 clusters from the clustering module
Fig. 2Outputs of ECCsplorer for different case studies representing the running modes and input data configurations. a–c An ECCsplorer run using A. thaliana circSeq (epi12) and control data (WT) [12] detected EVD/ATCOPIA93 as key eccDNA candidate (orange). a A mapping against the reference genome sequence (TAIR10) revealed eight eccDNA candidate regions. These were identified by applying three criteria: the region is flanked by split reads (at least five), contains discordant mapping reads (at least one pair), and shows a high coverage compared to its neighboring regions (peak prominence of at least one). b The comparative read clustering identified 41.04% of the clustered epi12 reads as potential eccDNA candidates. c To report eccDNA candidates with high confidence the outputs of the mapping and clustering strategies are compared by the ECCsplorer pipeline. The comparative module links the EVD/ATCOPIA93 candidate region from the mapping module to four clusters from the clustering module. d An ECCsplorer run using H. sapiens circSeq data [32] detected one eccDNA candidate region containing genic content (light blue) and many candidate region with low confidence (grey blue). e ECCsplorer analysis using B. vulgaris circSeq data and available sequenceing data [30] as control data detected multiple candidate clusters that were in the following annotated as mitchondrial minicircles a/d/pO (orange). f Overview on the running modes, their required input data and the corresponding case studies
Fig. 3Comparison of results from ECCsplorer with Circle-Map [18] and originally published eccDNA candidates. We reanalyzed data from A. thaliana [12] and H. sapiens [32] using available eccDNA tools, our ECCsplorer and Circle-Map. a Mapping plots from all three approaches (ECCsplorer, Circle-Map and original published data by Lanciano et al. 2017) of (re-) analyzed A. thaliana circSeq data. For each approach, the detected eccDNA candidates have been highlighted in blue with the key eccDNA candidate EVD/ATCOPIA93 shown in orange (experimentally validated: “ground truth”; maximal coverage depth of each individual plot corresponds to 100% relative depth). b Comparison of all three approaches (ECCsplorer, Circle-Map and original published data by Møller et al. 2018) of (re-)analyzed H. sapiens circSeq data. Bars in track 2 represent the detected eccDNA candidate regions by each approach. The connections highlight similar (overlapping) genomic regions detected on chromosome 16 of hg38 (arrow points to eccDNA candidate with high confidence). c Length distribution of by each approach detected eccDNA candidates using A. thaliana circSeq data (1 candidates with low confidence additionally shown in light grey). The ECCsplorer’s high-confidence hits show large overlap to the “ground truth”. d Venn-diagram of similarly detected eccDNA candidates using A. thaliana circSeq data with EVD/ATCOPIA93 candidate overlaps shown in orange. e Length distribution of by each approach detected eccDNA candidates using H. sapiens circSeq data. f Venn-diagram of similarly detected eccDNA candidates using H. sapiens circSeq data. The ECCsplorer shows a larger overlap to the original data than the CircleMap tool; yet “ground truth” data is still lacking