| Literature DB >> 26999359 |
Tomasz Suchan1, Camille Pitteloud1, Nadezhda S Gerasimova2,3, Anna Kostikova3, Sarah Schmid1, Nils Arrigo1, Mila Pajkovic1, Michał Ronikier4, Nadir Alvarez1.
Abstract
In the recent years, many protocols aimed at reproducibly sequencing reduced-genome subsets in non-model organisms have been published. Among them, RAD-sequencing is one of the most widely used. It relies on digesting DNA with specific restriction enzymes and performing size selection on the resulting fragments. Despite its acknowledged utility, this method is of limited use with degraded DNA samples, such as those isolated from museum specimens, as these samples are less likely to harbor fragments long enough to comprise two restriction sites making possible ligation of the adapter sequences (in the case of double-digest RAD) or performing size selection of the resulting fragments (in the case of single-digest RAD). Here, we address these limitations by presenting a novel method called hybridization RAD (hyRAD). In this approach, biotinylated RAD fragments, covering a random fraction of the genome, are used as baits for capturing homologous fragments from genomic shotgun sequencing libraries. This simple and cost-effective approach allows sequencing of orthologous loci even from highly degraded DNA samples, opening new avenues of research in the field of museum genomics. Not relying on the restriction site presence, it improves among-sample loci coverage. In a trial study, hyRAD allowed us to obtain a large set of orthologous loci from fresh and museum samples from a non-model butterfly species, with a high proportion of single nucleotide polymorphisms present in all eight analyzed specimens, including 58-year-old museum samples. The utility of the method was further validated using 49 museum and fresh samples of a Palearctic grasshopper species for which the spatial genetic structure was previously assessed using mtDNA amplicons. The application of the method is eventually discussed in a wider context. As it does not rely on the restriction site presence, it is therefore not sensitive to among-sample loci polymorphisms in the restriction sites that usually causes loci dropout. This should enable the application of hyRAD to analyses at broader evolutionary scales.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26999359 PMCID: PMC4801390 DOI: 10.1371/journal.pone.0151651
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Lab-work procedure used for hyRAD development.
Homologous reads from shotgun genomic libraries are captured through hybridization on random RAD-based probes. These fragments are then separated using streptavidin-coated beads and sequenced.
Fig 2Bioinformatic pipeline used for processing hyRAD sequences.
First, the reads are demultiplexed and cleaned. Different types of references were built and the captured fragments were mapped on the reference. The SNPs are then called after correcting for post-mortem DNA damages.
Lycaena helle samples used in the study.
| Type of preservation | Year of collection | DNA concentration [ng/ul] | Locality |
|---|---|---|---|
| dry (pin-mounted) | 1957 | 2.12 | Kuusamo, Finland |
| dry (pin-mounted) | 1957 | 3.02 | Kuusamo, Finland |
| dry (pin-mounted) | 1957 | 1.48 | Kuusamo, Finland |
| dry (pin-mounted) | 1985 | 29.2 | Kuusamo, Finland |
| dry (pin-mounted) | 1985 | 19.7 | Kuusamo, Finland |
| dry (pin-mounted) | 1985 | 17.5 | Kuusamo, Finland |
| dry (pin-mounted) | 1985 | 8.84 | Kuusamo, Finland |
| ethanol | 2007 | 2.12 | Dumbrava Vadului, Romania |
Summary of Oedaleus decorus samples used in the study.
| Type of preservation | Mean year of collection (range) | Mean DNA concentration [ng/ul] ± SD (range) | Localities |
|---|---|---|---|
| ethanol | 2006 (2005–2009) | 28.34 ± 28.04 (3.2–105.7) | Croatia, France, Italy, Russia, Spain, Switzerland |
| dry (pin-mounted) | 1952 (1908–1997) | 18.31 ± 21.73 (0.3–121.4) | Algeria, France, Greece, Italy, Madeira, Portugal (mainland and Madeira), Spain (mainland and Canary islands), Switzerland, Turkey |
Oligonucleotides used in the protocol.
x = barcode sequence in the adapters; barcode sequences can be designed using published scripts [24], available at: https://bioinf.eva.mpg.de/multiplex/; I = inosine in the region complementary to the barcode in blocking oligonucleotides sequences.
| RAD probes P1 adapters, SbfI-compatible (RAD-P1) | |
| RAD-P1.1 | ACACTCTTTCCCTACACGACGCTCTTCCGATCTxxxxxxCCTGCA |
| RAD-P1.2 | GGxxxxxxAGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT |
| RAD probes P2 adapter, MseI-compatible (RAD-P2) | |
| RAD-P2.1 | GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT |
| RAD-P2.2 | TAAGATCGGAAGAGCGAGAACAA |
| Shotgun library P1 adapters | |
| P1.1 | ACACTCTTTCCCTACACGACGCTCTTCCGATCTxxxxxx |
| P1.2 | xxxxxxAGATCGGAAGAGC |
| Shotgun library P2 oligonucleotide | |
| P2 | GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTCCCCC |
| PCR primers | |
| ILLPCR1 | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTT |
| ILLPCR2_01 | CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_02 | CAAGCAGAAGACGGCATACGAGATACATCGGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_03 | CAAGCAGAAGACGGCATACGAGATGCCTAAGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_04 | CAAGCAGAAGACGGCATACGAGATTGGTCAGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_05 | CAAGCAGAAGACGGCATACGAGATCACTGTGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_06 | CAAGCAGAAGACGGCATACGAGATATTGGCGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_07 | CAAGCAGAAGACGGCATACGAGATGATCTGGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_08 | CAAGCAGAAGACGGCATACGAGATTCAAGTGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_09 | CAAGCAGAAGACGGCATACGAGATCTGATCGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_10 | CAAGCAGAAGACGGCATACGAGATAAGCTAGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_11 | CAAGCAGAAGACGGCATACGAGATGTAGCCGTGACTGGAGTTCAGACGTGTGC |
| ILLPCR2_12 | CAAGCAGAAGACGGCATACGAGATTACAAGGTGACTGGAGTTCAGACGTGTGC |
| Blocking oligonucleotides | |
| BO1.P5.F | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT |
| BO2.P5.R | AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT |
| BO3.P7.F | CAAGCAGAAGACGGCATACGAGATIIIIIIGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT |
| BO4.P7.R | AGATCGGAAGAGCACACGTCTGAACTCCAGTCACIIIIIIATCTCGTATGCCGTCTTCTGCTTG |
Data on obtained references for L. helle (RAD-ref, RAD-ref-ext, assembly-ref) and O. decorus (assembly-ref).
| Reference | Number of contigs | Largest contig (bp) | Total length (bp) | N50 |
|---|---|---|---|---|
| RAD-ref ( | 25 478 | 544 | 5 445 942 | 209 |
| RAD-ref-ext ( | 24 820 | 851 | 2 613 024 | 98 |
| assembly-ref ( | 304 161 | 2 352 | 35 579 979 | 666 |
| assembly-ref ( | 408 851 | 13 103 | 119 789 911 | 321 |
Fig 3Percentage of the captured reads showing unique mapping events for different types of DNA preparations and bioinformatic pipelines.
Fig 4Mean number of SNPs per sample obtained for different types of DNA preparations and bioinformatic pipelines.
Fig 5Number of loci obtained using different bioinformatic approaches, identified using the OrthoMCL [42] pipeline for orthology detection.
Fig 6Spatial genetic structure of O. decorus inferred using fastStructure with k = 2 and simple priors.
Colours denote the two different genetic groups supported by a previous study relying on mtDNA markers [22].