| Literature DB >> 34591958 |
Kan Xiong1, Douglas Shea1, Justin Rhoades1, Timothy Blewett1, Ruolin Liu1, Jin H Bae1, Erica Nguyen1, G Mike Makrigiorgos1,2, Todd R Golub1,3, Viktor A Adalsteinsson1.
Abstract
Accurate DNA sequencing is crucial in biomedicine. Underlying the most accurate methods is the assumption that a mutation is true if altered bases are present on both strands of the DNA duplex. We now show that this assumption can be wrong. We establish that current methods to prepare DNA for sequencing, via 'End Repair/dA-Tailing,' may substantially resynthesize strands, leading amplifiable lesions or alterations on one strand to become indiscernible from true mutations on both strands. Indeed, we discovered that 7-17% and 32-57% of interior 'duplex base pairs' from cell-free DNA and formalin-fixed tumor biopsies, respectively, could be resynthesized in vitro and potentially introduce false mutations. To address this, we present Duplex-Repair, and show that it limits interior duplex base pair resynthesis by 8- to 464-fold, rescues the impact of induced DNA damage, and affords up to 8.9-fold more accurate duplex sequencing. Our study uncovers a major Achilles' heel in sequencing and offers a solution to restore high accuracy.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34591958 PMCID: PMC8755016 DOI: 10.1093/nar/gkab855
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Characterization of Duplex-Repair using capillary electrophoresis. (A) overview of Duplex-Repair vs. conventional ER/AT methods. (B) Schematic of the major products of various synthetic duplexes subjected to each step of Duplex-Repair and conventional ER/AT as determined by capillary electrophoresis (raw traces are in Fig S2). The non-fluorophore-tagged ends of the synthetic molecules are depicted, and fragment sizes are drawn to scale. Duplexes demarcated by asterisks (*) do not contain fluorophores and were not directly observed by capillary electrophoresis; however, their presence is predicted due to the characterized activities of UDG and FPG. Regions of strand resynthesis are illustrated in light blue.
Figure 2.Quantification of strand resynthesis using Single-Molecule Real-Time (SMRT) sequencing. (A) Schematic of library construction for PacBio SMRT sequencing using modified dNTPs to aid in identifying resynthesis regions. (B) Estimated fractions of interior base pairs (>12 bp from either end of the original duplex fragment) that were resynthesized using conventional ER/AT and several variations of Duplex-Repair. (C) Observed average interpulse durations (IPD; in frames) for circular consensus sequence (CCS) read strands relative to the distance from the original 3′ end of those strands across three sample types. (D) Estimated fraction of interior base pairs resynthesized for both conventional ER/AT and Duplex-Repair across three sample types.
Figure 3.Targeted panel sequencing of cfDNA and FFPE tumor biopsies. (A) Measured duplex sequencing error rates of HD_78 cfDNA damaged with varied concentrations of DNase I (to induce nicks) and CuCl2/H2O2 (to induce oxidative damage) and then repaired by using Duplex-Repair or conventional ER/AT (three replicates per condition). (B) Duplex sequencing error rates of four healthy cfDNA samples (three replicates per condition), three cancer patient cfDNA samples (one replicate per condition), and five cancer patient FFPE tumor biopsies (three replicates per condition) treated with conventional ER/AT or Duplex-Repair. (C) Aggregate mutant bases and their position relative to the end of the original duplex fragment. Dashed line represents the threshold of the interior of the fragment (12 bp). (D) Error rates from (B) compared to their corresponding estimates of interior base pair resynthesis fractions from Figure 2D. Pearson's correlation calculated for all data points.