| Literature DB >> 31340783 |
Syukri Shukor1, Alfred Tamayo1, Lorenzo Tosi1, H Benjamin Larman2, Biju Parekkadan3,4,5.
Abstract
BACKGROUND: Long Adapter Single-Stranded Oligonucleotide (LASSO) probes were developed as a novel tool for massively parallel cloning of kilobase-long genomic DNA sequences. LASSO dramatically improves the capture length limit of current DNA padlock probe technology from approximately 150 bps to several kbps. High-throughput LASSO capture involves the parallel assembly of thousands of probes. However, malformed probes are indiscernible from properly formed probes using gel electrophoretic techniques. Therefore, we used next-generation sequencing (NGS) to assess the efficiency of LASSO probe assembly and how it relates to the nature of DNA capture and amplification. Additionally, we introduce a simplified single target LASSO protocol using classic molecular biology techniques for qualitative and quantitative assessment of probe specificity.Entities:
Keywords: Genomic libraries; Long adapter single-stranded oligonucleotides (LASSO); Multiplex PCR; Multiplex cloning
Mesh:
Substances:
Year: 2019 PMID: 31340783 PMCID: PMC6657055 DOI: 10.1186/s12896-019-0547-1
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Fig. 1LASSO probe de novo synthesis and capture. a Schematic of LASSO probe assembly protocol outline. b LASSO target capture outlines probe-template hybridization, target capture and dissociation in a single reaction followed by post-capture PCR
Fig. 2Probe assembly NGS data analysis. a Mean read depth of all sequencing reads mapped to the LASSO probe library. The reference probe library sequences (N = 3164) were grouped according to ranges of expected capture size in increasing order to highlight biases in probe formation and predict downstream capture performance. Read depth is defined as the number of reads that map to a specific reference sequence. b On the horizontal axis, probe library sequences were grouped according to expected probe capture size ranges. The percentages of ORFs represented by concordant probes within these expected capture size ranges were plotted for both LASSO assembly methods. Concordant probes are properly formed probes with paired-end reads that map to a unique probe reference sequence
Fig. 3Multiplexed LASSO capture NGS data analysis a Distribution of reads aligned to E. coli genome and LASSO probe backbone reference sequences. Unaligned reads were also included to reflect total raw sequencing data. b Target enrichment values were derived from the ratio of median normalized read depths of targets over median normalized read depths of non-targets. Statistical significance of target enrichment over non-targets was determined using Student’s t-test with a 0.99 confidence interval (p-value < 0.01. c The lengths of LASSO capture targets longer than 1.0 kb with full sequencing coverage were normalized to 1 (Base-pair position normalization). The sequenced depth for each base in captures was then plotted for each normalized position along gene to visualize for target enrichment levels and capture efficiency
High-quality LASSO captures for both target and non-target sequences
| Captured Sequences (> 10-fold Coverage) | Expected Sequences | ||
|---|---|---|---|
| 50 μl | 2 ml | ||
| Targets | 590 | 1282 | 3164 |
| Non-Targets | 454 | 388 | 4434 |
Fold-coverage refers to the number of times that a certain sequence is fully sequenced. High-quality captures were strictly defined as sequences with more than ten-fold sequencing coverage, calculated as a function of sequencing read length (50 bp for HiSeq sequencing platform) multiplied by aligned read depth divided by sequence length
Fig. 4Single-Target LASSO capture gel resolvability. a Single target captures resolved on agarose (left) and polyacrylamide gels (right). Both gels are scaled to PAGE DNA ladder, with the major band slightly underneath the 300 bp ladder band. For control lanes, GDNA only control only contains M13 genomic DNA while LASSO only controls only contain M13_137 probes. Annealing temperatures for captures are denoted by “Ta”, while gDNA and LASSO probe only controls were done at 60 °C b Pixel density plots of single target LASSO captures on agarose and PAGE were obtained by ImageJ plot lane functions. In both plots, LASSO captures (60 °C Ta) are overlaid with a LASSO probe only control to compare non-specific capture with baseline pixel density
Fig. 5Possible ligation events within probe assembly. Both Probe A and Probe B are formed after the Fusion PCR step prior to restriction enzyme digest and self-circularization. Examples of intermolecular ligation errors formed during self-circularization step (ligation reaction) of the LASSO probe assembly. Resultant discordant probes from intermolecular ligations shown here have mismatched arms from separate probe sequences
LASSO long adapter sequence
| Sequence Name | Sequence (5′-3′) |
|---|---|
| Long Adapter for Multiplexed LASSO Probes | AGAGAAGTCCTAGCACGGTAACCTCCGAGGATGTCATCAAAGAGTTTAAAGAGTTTATGAGATTTAAGGTCAAGATGGAGGGAAGCGTCAACGGACACGAGTTCGAGATTGAGGGAGAAGGAGAAGGCCGGCCTTACGAGGGCACACAAACCGCTAAGCTCAAGGTCACAAAAGGAGGACTAACTATAACGGTCCTAAGGTAGCGAACCCTCCCCTTCTCCTGGGATATTCTGAGCCCTCAGTTCCAGTACGGAAGCGAATTCCAGCTT |
Primer sequences for LASSO assembly and capture
| Primer | Sequence (5′-3′) |
|---|---|
| BlaF | GAGTATTACCGCGGCGAATTC |
| RFPR200EcoRI | AAGCTGGAATTCGCTTCCGTACTGGAACTGAGGGC |
| SapINew | GGTTCCTGGCTCTTCGATC |
| TiolNew | A*T*C*GCCGCAAGAAGTGTU |
| PCR1kbCaptF400 | GTGAAACTCAGAGGAACCAACTTCC |
| ICeul200CaptF | CTCCCCTTCTCCTGGGATATTCTG |
| M13_137_R(Tiol) | TTCAAAGCGAACCAGACCGG |
| M13_137_F(SapINew) | /5Phos/GCAATCCGCTTTGCTTCTGAC |
“*” symbol denotes phosphorothioate bonds while /5Phos/ indicates a phosphorylated 5′ end
Single-target M13 LASSO probe design
| Sequence Name | Sequence (5′-3′) |
|---|---|
| M13_137 | GCAATCCGCTTTGCTTCTGACTATAATAGTCAGGGTAAAGACCTAGAGAAGTCCTAGCACGGTAACCTCCGAGGATGTCATCAAAGAGTTTAAAGAGTTTATGAGATTTAAGGTCAAGATGGAGGGAAGCGTCAACGGACACGAGTTCGAGATTGAGGGAGAAGGAGAAGGCCGGCCTTACGAGGGCACACAAACCGCTAAGCTCAAGGTCACAAAAGGAGGACTAACTATAACGGTCCTAAGGTAGCGAACCCTCCCCTTCTCCTGGGATATTCTGAGCCCTCAGTTCCAGTACGGAAGCAAAGCCTATGTTAAACACCCTGCCGACATCCCTGACTATCTGAAGCTCTCCTTCCCTGAAGGCTTCAAGTGGGAGAGATTCATGAACTTCGAGGACGGAGGCGTGGTGACAGTCACACAAGATAGCACCCTCCAGGACGGAGAGTTTATTTATAAGGTGAAACTCAGAGGAACCAACTTCCCCTCCGATGGCCCTGTCATGAATTCTTGGAGTTTGCTTCCGGTCTGGTTCGCTTTGAA |