| Literature DB >> 27060140 |
Anders Ståhlberg1, Paul M Krzyzanowski2, Jennifer B Jackson3, Matthew Egyud3, Lincoln Stein2, Tony E Godfrey4.
Abstract
Detection of cell-free DNA in liquid biopsies offers great potential for use in non-invasive prenatal testing and as a cancer biomarker. Fetal and tumor DNA fractions however can be extremely low in these samples and ultra-sensitive methods are required for their detection. Here, we report an extremely simple and fast method for introduction of barcodes into DNA libraries made from 5 ng of DNA. Barcoded adapter primers are designed with an oligonucleotide hairpin structure to protect the molecular barcodes during the first rounds of polymerase chain reaction (PCR) and prevent them from participating in mis-priming events. Our approach enables high-level multiplexing and next-generation sequencing library construction with flexible library content. We show that uniform libraries of 1-, 5-, 13- and 31-plex can be generated. Utilizing the barcodes to generate consensus reads for each original DNA molecule reduces background sequencing noise and allows detection of variant alleles below 0.1% frequency in clonal cell line DNA and in cell-free plasma DNA. Thus, our approach bridges the gap between the highly sensitive but specific capabilities of digital PCR, which only allows a limited number of variants to be analyzed, with the broad target capability of next-generation sequencing which traditionally lacks the sensitivity to detect rare variants.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27060140 PMCID: PMC4914102 DOI: 10.1093/nar/gkw224
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.SiMSen-Seq. (A) Sequence composition of hairpin protected barcode primer. Different sequence elements are indicated by color. (B) Schematic design and structure of hairpin protected barcode primer. (C) Melting curve analysis of hairpin protected barcode primer using a temperature controlled spectrophotometer. The derivative of the absorption over time is shown. The melting temperature (Tm) where 50% of primers are in an open configuration is indicated. (D). Thirty-six different hairpin protected barcode primers were evaluated (Mean = 70.01, SD ± 0.24).
Figure 2.Schematic library construction workflow. In the first PCR consisting of three cycles, target DNA is amplified with hairpin protected barcode primers. The reaction is terminated with an incubation step that is a combined dilution and protease treatment step. In the second PCR that consists of 18–30 cycles, all individual amplicons are amplified to generate PCR products with Illumina adapter primers. Final libraries are purified with magnetic beads, normalized for concentration differences between samples and sequenced.
Figure 3.Library purity and uniformity. (A) Electropherogram of a purified final library targeting one DNA sequence using the Fragment Analyzer. (B) Electropherogram of a purified final library targeting 31 DNA sequences using the Fragment Analyzer. (C) Relative raw read depth of 31 multiplexed amplicons were analyzed. DNA from tumor cell line CP-A was used. The average raw read depth was 1.4 × 104 per amplicon.
Figure 4.SiMSen-Seq reduces PCR induced errors and enables rare mutant molecule detection. (A) Cumulative plot of total raw and consensus read errors. Total and consensus read error was calculated for of each of 417 base positions. Error frequency corresponds to the number of non-reference reads (raw reads or consensus reads) divided by the total number of reads for that base position. Data from five amplicons covering 417 nucleotides and 12 replicates using the same CP-A DNA source are shown. The average raw read depth was 2.3 × 106 per amplicon and the average consensus read depth was 7700 per amplicon when 30 raw reads with the same barcode was applied as cutoff. (B) Plot showing raw and consensus read error for each base position along with corresponding 95% confidence intervals. (C) Dot plot of total raw and consensus read errors for 13 amplicons and 1042 nucleotides. The average raw read depth was 5.5 × 105 per amplicon and the average consensus read depth was 4700 per amplicon when 10 raw reads with the same barcode was applied as cutoff. The amplicons are ranked from left to right by consensus read depth (Supplementary Figure S4) and the nucleotides within each amplicon are ranked from left to right by their total consensus read error. Consensus reads without any observed errors for a given nucleotide are plotted with half the value of the lowest detected read error. The difference between raw read error and consensus read error at each nucleotide position indicates the relative error correction using SiMSen-Seq. (D) Rare mutation detection in TP53. Pooled plasma DNA from more than 10 individuals and DNA from a clonally derived cell line (CP-A) were analyzed for a single TP53 amplicon using SiMSen-Seq (n = 3–4). The x-axis represents individual nucleotide positions in the amplicon. For each nucleotide position, we identified the most frequent, non-reference (variant) allele and determined both absolute variant count (left side y-axis) normalized to read depth and variant allele frequency (right side y-axis). Error bars indicate 95% confidence intervals for the consensus error observed at each position. Primary tumor DNA with a known TP53 mutation (marked *) was spiked into the plasma DNA at two different allele fractions with 10× separation (blue and red marked bar).