| Literature DB >> 24929644 |
Kristina Warton, Vita Lin, Tina Navin, Nicola J Armstrong, Warren Kaplan, Kevin Ying, Brian Gloss, Helena Mangs, Shalima S Nair, Neville F Hacker, Robert L Sutherland, Susan J Clark, Goli Samimi1.
Abstract
BACKGROUND: Free circulating DNA (fcDNA) has many potential clinical applications, due to the non-invasive way in which it is collected. However, because of the low concentration of fcDNA in blood, genome-wide analysis carries many technical challenges that must be overcome before fcDNA studies can reach their full potential. There are currently no definitive standards for fcDNA collection, processing and whole-genome sequencing. We report novel detailed methodology for the capture of high-quality methylated fcDNA, library preparation and downstream genome-wide Next-Generation Sequencing. We also describe the effects of sample storage, processing and scaling on fcDNA recovery and quality.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24929644 PMCID: PMC4078241 DOI: 10.1186/1471-2164-15-476
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Comparison of fcDNA isolation from plasma and serum. (A) PCR quantitation of fcDNA obtained from 200 μL plasma and serum samples. Bars represent the average concentration fold change in matched serum and plasma samples obtained from 3 separate donors ± SD. (B) Lane 1: fcDNA isolated from 4 mL serum; Lane 2: fcDNA isolated from 4 mL plasma; Lane 3: genomic DNA spiked into serum sample prior to fcDNA processing; Lane 4: genomic DNA spiked into plasma sample prior to fcDNA processing; Lane 5: genomic DNA. Arrow: high molecular weight genomic DNA; MWM: molecular weight marker.
Figure 2Contamination of fcDNA with genomic DNA during blood storage. Concentration change of DNA extracted from plasma (A) or serum (C) measured by PCR at 0 hr, 4 hr, 8 hr, 24 hr and 48 hr following blood collection. Each bar represents the average concentration fold change of triplicate (A) or duplicate (C) experiments ± SD. (B) fcDNA isolated from 4 mL plasma immediately (Lane 1), 8 hrs (Lane 2), 24 hrs (Lane 3) and 48 hrs (Lane 4) following blood collection. Lane 5: genomic DNA. (D) fcDNA isolated from 3 mL serum immediately (Lane 1), 4 hrs (Lane 2), 8 hrs (Lane 3), 24 hrs (Lane 4) and 48 hrs (Lane 5) following blood collection. Lane 6: genomic DNA. Arrow: High molecular weight genomic DNA; MWM: molecular weight marker.
Concentration of fcDNA in 5 control subjects and DNA quantitation at consecutive stages of sample processing
| Sample | Plasma fcDNA concentration (ng/mL) | MethylMiner input (ng) | MethylMiner recovery (ng) (%) | Illumina NGS* input for library generation (ng) | Amount of library generated (ng) |
|---|---|---|---|---|---|
| 1 | 6.9 | 48.7 | 5.04 (10.4) | 4 | 464 |
| 2 | 7.3 | 49.6 | 5.04 (10.2) | 4 | 245 |
| 3 | 7.8 | 49.5 | 5.73 (11.6) | 4 | 314 |
| 4 | 9.9 | 42.2 | 6.28 (14.9) | 4 | 39 |
| 5 | 10.7 | 43.2 | 5.32 (12.3) | 4 | 222 |
*NGS = Next-Generation Sequencing.
Figure 3Size distribution of fcDNA in 5 control samples and following fcDNA library construction (A and B) and DNA recovery following MBD-capture (C). Agilent Bioanalyzer chip with fcDNA isolated from 5 control subjects prior to (A) and following (B) methylation enrichment and library preparation. Increase in molecular weight reflects successful adapter ligation. MWM: molecular weight marker. DNA recovery following MBD-capture (C). Recovery of DNA (based on % of input DNA) as quantitated by qPCR following the standard (grey bars) or our modified (white bars) MBD-capture protocol. SFTA3 promoter is unmethylated in PBMC DNA and methylated in the methylation positive control SSSI. At low DNA inputs (100 ng), SFTA3 is recovered from both PBMC and SSSI DNA, suggesting high background MBD binding and low methylation enrichment. With our modified protocol, SFTA3 is minimally recovered from PBMC DNA (unmethylated) but is recovered from SSSI DNA (methylated), even at low DNA inputs. Data is presented as % DNA recovery ± range.
Descriptions of Next-Generation Sequencing read numbers
| Sample | Total reads 1 | Unaligned 2 | Multiple site aligned 3 | Single site aligned 4 | Unique 5 | % Unique |
|---|---|---|---|---|---|---|
| 1 | 197,921,529 | 16,066,294 | 44,319,996 | 137,535,239 | 76,590,535 | 55.7 |
| 2 | 199,302,552 | 15,885,135 | 43,676,951 | 139,740,466 | 77,836,232 | 55.7 |
| 3 | 108,955,707 | 8,729,606 | 26,671,313 | 73,554,788 | 43,869,724 | 59.6 |
| 4 | 109,132,802 | 11,042,561 | 24,138,010 | 73,952,231 | 37,270,330 | 50.4 |
| 5 | 200,639,214 | 15,558,416 | 44,708,970 | 140,371,828 | 85,538,368 | 60.9 |
1Total reads obtained.
2Reads which could not be aligned to a site within the human genome.
3Reads which could not be accurately mapped since they aligned to multiple sites within the human genome.
4Reads which aligned to a single site within the human genome.
5Unique reads which aligned to a single site within the human genome.
6Unique reads as a percentage of the reads which could be aligned at a single site within the genome.
Figure 4% GC distribution of sequenced reads. (A) Expected (blue line) and actual (red line) % GC content present in total sequenced reads. Representative reads from a single sample are shown. (B) Elucidation of source of high and low % GC peaks in the sequencing reads. Model depicts captured fcDNA fragments (blue lines with sequenced 50 base reads shown on ends) containing methylated CpG sites which allow for capture (red circles). Arrows indicate the contribution of each read to the low % GC peak (methylated CpG sites are located in the middle of the fragment and thus fall outside 50 base read) and the high % GC peak (methylated CpG sites are located near the edge of the fragment and thus contained within 50 base read). (C) Expected (blue line) and actual (red line) % GC content present in non-overlapping sequenced reads. Representative reads from a single sample are shown.
Figure 5Clonal bisulphite sequencing of fcDNA from sample 5. (A) Validation of 3 promoter regions found to be methylated in SSSI positive control (green) and in fcDNA (blue). (B) Validation of 3 promoter regions found to be methylated SSSI positive control (green) but not in fcDNA (blue). (C) Validation of 3 loci found to be unmethylated in SSSI positive control (green) but methylated in fcDNA (blue). Closed circles indicate methylated CpGs as determined by clonal bisulphite sequencing; open circles indicate unmethylated CpGs as determined by clonal bisulphite sequencing; grey arrows indicate transcriptional start sites; red rectangles indicate the regions sequenced for validation.