| Literature DB >> 35061087 |
Mariya Shtumpf1, Kristan V Piroeva1, Shivam P Agrawal1, Divya R Jacob1, Vladimir B Teif2.
Abstract
Nucleosome positioning is involved in many gene regulatory processes happening in the cell, and it may change as cells differentiate or respond to the changing microenvironment in a healthy or diseased organism. One important implication of nucleosome positioning in clinical epigenetics is its use in the "nucleosomics" analysis of cell-free DNA (cfDNA) for the purpose of patient diagnostics in liquid biopsies. The rationale for this is that the apoptotic nucleases that digest chromatin of the dying cells mostly cut DNA between nucleosomes. Thus, the short pieces of DNA in body fluids reflect the positions of nucleosomes in the cells of origin. Here, we report a systematic nucleosomics database - NucPosDB - curating published nucleosome positioning datasets in vivo as well as datasets of sequenced cell-free DNA (cfDNA) that reflect nucleosome positioning in situ in the cells of origin. Users can select subsets of the database by a number of criteria and then obtain raw or processed data. NucPosDB also reports the originally determined regions with stable nucleosome occupancy across several individuals with a given condition. An additional section provides a catalogue of computational tools for the analysis of nucleosome positioning or cfDNA experiments and theoretical algorithms for the prediction of nucleosome positioning preferences from DNA sequence. We provide an overview of the field, describe the structure of the database in this context, and demonstrate data variability using examples of different medical conditions. NucPosDB is useful both for the analysis of fundamental gene regulation processes and the training of computational models for patient diagnostics based on cfDNA. The database currently curates ~ 400 publications on nucleosome positioning in cell lines and in situ as well as cfDNA from > 10,000 patients and healthy volunteers. For open-access cfDNA datasets as well as key MNase-seq datasets in human cells, NucPosDB allows downloading processed mapped data in addition to the regions with stable nucleosome occupancy. NucPosDB is available at https://generegulation.org/nucposdb/ .Entities:
Keywords: Cell-free DNA; Liquid biopsy; Nucleosome positioning; Nucleosomics; cfDNA
Mesh:
Substances:
Year: 2022 PMID: 35061087 PMCID: PMC8776978 DOI: 10.1007/s00412-021-00766-9
Source DB: PubMed Journal: Chromosoma ISSN: 0009-5915 Impact factor: 2.919
Fig. 1The motivation for the use of nucleosome positioning in situ and cfDNA as a diagnostic marker. A) Nucleosome positioning acts as the cell memory at intermediate timescales between faster changes of gene expression and reaction metabolites and long-term changes such as the accumulation of mutations and changes of DNA methylation. B) cfDNA extracted from blood plasma or other body liquids reflects the nucleosome positioning landscape in the cells of origin. This is because enzymes that shred chromatin into pieces in processes such as apoptosis, necrosis or NETosis preferentially cut DNA between nucleosomes
Fig. 2The structure of NucPosDB containing six major sections (listed left to right in the scheme): (1) nucleosome maps measured in vivo in different cell types, (2) sequenced cfDNA datasets, (3) regions with stable nucleosome occupancy in the human genome for different conditions based on (1) and (2), (4) software for analysis of nucleosome mapping experiments, (5) software for predicting preferences of nucleosome formation from the DNA sequence and (6) software for cfDNA-specific analysis
Fig. 3The distribution of nucleosome positioning datasets across different biological species
Example open-access datasets from NucPosDB reporting whole genome sequencing of cfDNA
| Description | Medical conditions | |
|---|---|---|
| Generation of highly biomimetic quality control materials for non-invasive prenatal testing based on enzymatic digestion of matched mother–child cell lines (Zhang et al. | Prenatal testing | 2 |
| Sequencing of cfDNA derived from the plasma of individuals of different ages (Teo et al. | Ageing | 12 |
| Very short mitochondrial DNA fragments and heteroplasmy in human plasma (Zhang et al. | Sepsis, tissue transplantation | 7 |
| Cell-free DNA comprises an in vivo, genome-wide nucleosome footprint that informs its tissue(s)-of-origin (Snyder et al. | Healthy, lupus, Crohn’s disease, colitis, cancer | 60 |
| Cell-free DNA provides a good representation of the tumour genome despite its biased fragmentation patterns (Ma et al. | Cancer | 5 |
| The next-generation sequencing (NGS) technologies related assessments of circulating tumour DNA (ctDNA) in both primary brain tumours and metastatic brain tumours (Liang et al. | cancer | 28 |
| WGS of human pooled plasma cfDNA sampled from GI diseased individuals (PRJEB1791) | Healthy, cancer, inflammatory bowel disease | 24 |
| Decoding the evolutionary response to prostate cancer therapy by plasma genome sequencing (Ramesh et al. | Cancer | 23 |
Fig. 4Aggregate characteristics of cfDNA datasets across different medical conditions (A, B) and ages of healthy people (C, D). A GC content as a function of the distance from the end of cfDNA fragment (Snyder et al. 2016). B Distribution of lengths of cfDNA fragments (Snyder et al. 2016). C GC content as a function of the distance from the end of cfDNA fragment for 25-, 70- and 100-year-old people (Teo et al. 2019), compared with pooled healthy people from another study (Snyder et al. 2016). D Differences of cfDNA fragment sizes for cfDNA of breast cancer patients collected in three different studies (Snyder et al. 2016; Song et al., 2017 and Butler et al., 2015)