| Literature DB >> 31287019 |
Rishvanth K Prabakar1, Liya Xu2, James Hicks2, Andrew D Smith3.
Abstract
We present SMURF-seq, a protocol to efficiently sequence short DNA molecules on a long-read sequencer by randomly ligating them to form long molecules. Applying SMURF-seq using the Oxford Nanopore MinION yields up to 30 fragments per read, providing an average of 6.2 and up to 7.5 million mappable fragments per run, increasing information throughput for read-counting applications. We apply SMURF-seq on the MinION to generate copy number profiles. A comparison with profiles from Illumina sequencing reveals that SMURF-seq attains similar accuracy. More broadly, SMURF-seq expands the utility of long-read sequencers for read-counting applications.Entities:
Keywords: Copy number variation; Long-read sequencing; Nanopore sequencing; Read-counting applications
Mesh:
Year: 2019 PMID: 31287019 PMCID: PMC6615205 DOI: 10.1186/s13059-019-1732-1
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1SMURF-seq efficiently sequences short fragments of DNA for read-counting applications with a reference genome on long-read sequencers and yields up to 30 countable fragments per sequenced read. SMURF-seq sequences short DNA molecules by generating long concatenated molecules from these. SMURF-seq reads are aligned by splitting them into multiple fragments, each aligning to a distinct region in the genome
Fig. 2Accurate copy number profiles with SMURF-seq. a CNV profile of a normal diploid genome. Each blue point is a bin ratio to mean and the red line is the segmented bin ratio. b Superimposed CNV profiles of SK-BR-3 genome generated using SMURF-seq and Illumina WGS reads. c Venn diagram illustrating the accuracy of event calls using SMURF-seq compared with Illumina WGS. d Zoom-in of copy number changes on chromosome 8. e Scatter plot of bin ratio of SK-BR-3 genome using SMURF-seq and Illumina WGS reads. Pearson correlation of the data is shown
Fig. 3Multiple SMURF-seq CNV profiles by multiplexing in a single run. a CNV profile of SK-BR-3 genome with down-sampled 10k SMURF-seq reads. b Scatter plot of normalized bin counts of the original SMURF-seq data and data down-sampled to 10k SMURF-seq reads. Pearson correlation of the data is shown. c CNV profile of barcode01 (Normal diploid genome) reads. d CNV profile of barcode02 (SK-BR-3 cancer genome) reads. e Scatter plot of bin ratios of SK-BR-3 genome using multiplexed SMURF-seq and Illumina WGS reads