| Literature DB >> 28119419 |
Marie-Theres Gansauge1, Tobias Gerber1, Isabelle Glocke1, Petra Korlevic1, Laurin Lippik1, Sarah Nagel1, Lara Maria Riehl2, Anna Schmidt1, Matthias Meyer1.
Abstract
DNA library preparation for high-throughput sequencing of genomic DNA usually involves ligation of adapters to double-stranded DNA fragments. However, for highly degraded DNA, especially ancient DNA, library preparation has been found to be more efficient if each of the two DNA strands are converted into library molecules separately. We present a new method for single-stranded library preparation, ssDNA2.0, which is based on single-stranded DNA ligation with T4 DNA ligase utilizing a splinter oligonucleotide with a stretch of random bases hybridized to a 3΄ biotinylated donor oligonucleotide. A thorough evaluation of this ligation scheme shows that single-stranded DNA can be ligated to adapter oligonucleotides in higher concentration than with CircLigase (an RNA ligase that was previously chosen for end-to-end ligation in single-stranded library preparation) and that biases in ligation can be minimized when choosing splinters with 7 or 8 random nucleotides. We show that ssDNA2.0 tolerates higher quantities of input DNA than CircLigase-based library preparation, is less costly and better compatible with automation. We also provide an in-depth comparison of library preparation methods on degraded DNA from various sources. Most strikingly, we find that single-stranded library preparation increases library yields from tissues stored in formalin for many years by several orders of magnitude.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28119419 PMCID: PMC5449542 DOI: 10.1093/nar/gkx033
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Library preparation methods for highly degraded DNA. (A) In the single-stranded library preparation method described here (ssDNA2.0), DNA fragments (black) are 5΄ and 3΄ dephosphorylated and separated into single strands by heat denaturation. 3΄ biotinylated adapter molecules (red) are attached to the 3΄ ends of the DNA fragments via hybridization to a stretch of six random nucleotides (marked as ‘N’) belonging to a splinter oligonucleotide complementary to the adapter and nick closure with T4 DNA ligase. Following the immobilization of the ligation products on streptavidin-coated beads, the splinter oligonucleotide is removed by bead wash at an elevated temperature. Synthesis of the second strand is carried out using the Klenow fragment of Escherichia coli DNA polymerase I and a primer with phosphorothioate backbone modifications (red stars) to prevent exonucleolytic degradation. Unincorporated primers are removed through a bead wash at an elevated temperature, preventing the formation of adapter dimers in the subsequent blunt-end ligation reaction, which is again catalyzed by T4 DNA ligase. Adapter self-ligation is prevented through a 3΄ dideoxy modification in the adapter. The final library strand is released from the beads by heat denaturation. (B) In the single-stranded library preparation method originally described in Gansauge and Meyer, (4), the first adapter was attached through true single-stranded DNA ligation using CircLigase. The large fragment of Bst DNA polymerase was used to copy the template strand, leaving overhanging 3΄ nucleotides, which had to be removed in a blunt-end repair reaction using T4 DNA polymerase. (C) The ‘454’ method of double-stranded library preparation in the implementation of Meyer and Kircher, (23), is based on non-directional blunt-end ligation of a mixture of two adapters to blunt-end repaired DNA fragments using T4 DNA ligase. To prevent adapter self-ligation, no phosphate groups are present at the 5΄ ends of the adapters, resulting in the ligation of the adapter strands only and necessitating subsequent nick fill-in with a strand-displacing polymerase. Intermittent DNA purification steps are required in-between enzymatic reactions. (D) The ‘Illumina’ method of double-stranded library preparation, shown here as implemented in New England Biolabs’ NEBNext Ultra II kit, requires the addition of A-overhangs (marked as ‘A’) to blunt-end repaired DNA fragments using a 3΄-5΄ exonuclease deletion mutant of the Klenow fragment of E. coli DNA polymerase I. Both adapter sequences are combined into one bell-shaped structure, which carries a 3΄ T overhang to allow sticky end ligation with T4 DNA ligase. Following ligation, adapter strands are separated by excision of uracil. Excess adapters and adapter dimers are removed through size-selective purification.
Figure 2.Single-stranded DNA ligation with T4 DNA ligase and CircLigase. A pool of 60 nt acceptor oligonucleotides (‘60N’) were ligated to 10 pmol of a 3΄ biotinylated donor oligonucleotide (CL78) using either T4 DNA ligase in the presence of a splinter oligonucleotide (TL38) or CircLigase. Ligation products were visualized on a 10% denaturing polyacrylamide gel stained with SybrGold. Band shifts from 60 nt to 80 nt indicate successful ligation. Schematic overviews of the reaction schemes are shown on top. The scheme developed by Kwok et al. (19) is shown for comparison. M: Single-stranded DNA size marker.
Figure 3.Effects of single-stranded ligation schemes on library characteristics. (A) Informative sequence content of libraries prepared with CircLigase and T4 DNA ligase as a function of the input volume of ancient DNA extract used for library preparation. (B) Average GC content of the sequences obtained with the two ligation schemes. Note that the average GC content exceeds that of a typical mammalian genome because most sequences derive from microbial DNA, which is the dominant source of DNA in most ancient bones. (C) Fragment size distribution in the libraries as inferred from overlap-merged paired-end reads. Short artifacts in the library prepared from extremely little input DNA (corresponding to ∼1 mg bone) are mainly due to the incorporation of splinter fragments. (D) Frequencies of damage-induced C to T substitutions near the 5΄ and 3΄ ends of sequences.
Figure 4.Performance of single- and double-stranded library preparation methods using DNA from different sources. The informative sequence content of each library is provided in percent of that obtained with the best performing method for each sample. In addition, the number of nuclear genomes present in each library was calculated by dividing the informative sequence content by the size of the reference genome used for mapping.