| Literature DB >> 33980276 |
Marcin W Wojewodzic1, Magnus Leithaug2,3, Marianne Lauritzen4, Robert Lyle2,3, Sofia Haglund5, Carl-Johan Rubin6, Philip A Ewels7, Tom Grotmol4, Trine B Rounge4,8.
Abstract
BACKGROUND: Long-term stored serum is considered challenging for epigenomic analyses: as there are no cells, circulating DNA is scarce, and amplification removes epigenetic signals. Additionally, pre-analytical treatments and storage might introduce biases and fragmentation to the DNA. In particular, starting with low-input DNA can result in low-diversity libraries. However, successful whole-genome bisulphite sequencing (WGBS) of such serum samples has the potential to open biobanks for epigenetic analyses and deliver novel prediagnostic biomarkers. Here, we perform WGBS using the Accel-NGS library preparation kit on ultralow amounts of DNA from long-term archived samples with diverse pretreatments from the Janus Serum Bank.Entities:
Keywords: Biobanks; Epigenetics; Methods; Methylome; Serum; Storage; WGBS
Mesh:
Year: 2021 PMID: 33980276 PMCID: PMC8114536 DOI: 10.1186/s13148-021-01097-3
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Yield of DNA isolated from 500 µl of serum for each group and quantified by Qubit. Box plot with median marked
Fig. 2Coverage and duplication information in the sequencing project for the 96 samples. a Distribution of the number of locations in the reference genome with a given depth of coverage. X axis is the coverage, and y-axis is the number of locations (millions). b Mean coverage box plot per sampling procedure. Median marked. c Fraction of the genome covered by at least X reads. X axis is the coverage, and Y axis is the fraction of the reference genome. d Complexity curves, total molecules vs unique molecules. The data up to x = 600 M are based on actual results, beyond that is a simulation/projection showing the expected diminishing returns of additional sequencing in terms of unique reads
Fig. 3Duplication rate as a total of reads produced for each sample. X axis is the total number of reads produced, and Y axis shows the proportion of duplicated reads. The amount of the DNA (ng) entering library preparation was labelled with different sizes of the circles and group by colours
Fig. 4Methylation in mitochondrial DNA and two random CpG Island locations for 12 randomly chosen samples, representing all sample groups (1–6) and two failed samples. The bedGraphs represent the methylation values at given genomic locations. Scale represents methylation value
Fig. 5Mean cell composition estimated from methylation signatures for a Houseman and b methylCC method. X-axis is the blood collection group (1–6) and cell type, and Y-axis shows the proportion of cell types
Samples groups from the Janus Serum Bank repository
| Group | Sample collection period | Sample source | Serum processing | Produced reads M range (mean) | Analysed Cs (M) range (mean) | Analysed for DNA yield ( |
|---|---|---|---|---|---|---|
| 1 | 1972–1978 | HE | Iodoacetate added | 287–889 (602) | 3642–12,188 (8102) | 10 |
| 2 | 1979–1986 | HE | No additives | 450–892 (707) | 6200–11,843 (9586) | 13 |
| 3 | 1987–2004 | HE | Separating gel tubes | 202–898 (644) | 2573–12,184 (8650) | 39 |
| 4 | 1973–1979 | RCBD | Lyophilisation | 376–889 (666) | 5005–12,270 (8983) | 14 |
| 5 | 1980–1990 | RCBD | No additives | 356–854 (626) | 4611–11,850 (8341) | 13 |
| 6 | 1997–2004 | RCBD | No additives | 327–685 (602) | 4881–9480 (8059) | 5 |
| Total | 94 out of 96 | 94 out of 96 |
The samples are collected as part of national health examinations (HE), or from red cross donors (RCBD). Pre-analytical handling differs between groups. The last columns show the number of reads produced (millions, range and mean indicated), number of C analysed (millions, range and mean indicated) and number of samples used for calculating DNA yield, respectively. Further description of the groups can be found somewhere else [2, 4, 6]