| Literature DB >> 30716066 |
Stephen M Richards1, Nelli Hovhannisyan2, Matthew Gilliham3, Joshua Ingram1, Birgitte Skadhauge4, Holly Heiniger1, Bastien Llamas1, Kieren J Mitchell1, Julie Meachen5, Geoffrey B Fincher6, Jeremy J Austin1, Alan Cooper1.
Abstract
Hybridization capture with in-solution oligonucleotide probes has quickly become the preferred method for enriching specific DNA loci from degraded or ancient samples prior to high-throughput sequencing (HTS). Several companies synthesize sets of probes for in-solution hybridization capture, but these commercial reagents are usually expensive. Methods for economical in-house probe synthesis have been described, but they do not directly address one of the major advantages of commercially synthesised probes: that probe sequences matching many species can be synthesised in parallel and pooled. The ability to make "phylogenetically diverse" probes increases the cost-effectiveness of commercial probe sets, as they can be used across multiple projects (or for projects involving multiple species). However, it is labour-intensive to replicate this with in-house methods, as template molecules must first be generated for each species of interest. While it has been observed that probes can be used to enrich for phylogenetically distant targets, the ability of this effect to compensate for the lack of phylogenetically diverse probes in in-house synthesised probe sets has not been tested. In this study, we present a refined protocol for in-house RNA probe synthesis and evaluated the ability of probes generated using this method from a single species to successfully enrich for the target locus in phylogenetically distant species. We demonstrated that probes synthesized using long-range PCR products from a placental mammal mitochondrion (Bison spp.) could be used to enrich for mitochondrial DNA in birds and marsupials (but not plants). Importantly, our results were obtained for approximately a third of the cost of similar commercially available reagents.Entities:
Year: 2019 PMID: 30716066 PMCID: PMC6361428 DOI: 10.1371/journal.pone.0209499
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic of probe synthesis and hybridization capture.
Long-range PCR primers.
| Primer Name | Amplicon Name | Primer Sequence (‘5-‘3) | Amplicon Length (bp) |
|---|---|---|---|
| GTTGCACCAAAGTTTTTGGTTCCTAAGACC | |||
| AGAATRTCAGCTTTGGGTGTTGATGG | |||
Primer sets to amplify a partial mitogenome of the northern hairy-nosed wombat and complete mitogenome of modern bison. The T7 RNA polymerase promoter is attached to the 5’ end of one primer of each primer pair to allow transcription of the amplicon for probe synthesis. Bold nucleotides = T7 RNA polymerase promoter sequence
Samples.
| ACAD | Scientific name | Common Name | Country Origin | Location Found | Tissue | Age—Years | Divergence Time |
|---|---|---|---|---|---|---|---|
| Steppe bison | Yukon Territory, Canada | Irish Gulch | Astragalus bone | Unknown | |||
| Bighorn sheep | WY, USA | Natural Trap Cave | Tooth | 12,000 to 20,000 | 24.6 MYA | ||
| Thylacine | Tas, Australia | Ouse, Central Highlands | Drilled tooth and bone power, inner jaw | ≈ 145 | 159 MYA | ||
| Emu | Tas, Australia | Mole Creek | Bone | 320 MYA | |||
| Broomcorn millet | Vayots Dazor, Armenia | Areni-1 Cave | Seed | 1496 MYA | |||
| Northern hairy-nosed wombat | QLD, Australia | Bullamon Station, Moonie River | Drilled bone powder, lower jaw | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest Station | Drilled bone powder from leg bone | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest Station | Fragments from nasal passage | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest Station | Bone fragments nasal passage + proximate end teeth | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Drilled bone powder from leg bone | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Post-cranial, mostly vertebrae, drilled powder | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Drilled bone powder from leg bone | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Drilled bone powder from leg bone | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Drilled bone powder from leg bone | ≈ 120 | - | ||
| Northern hairy-nosed wombat | QLD, Australia | Epping Forest National Park | Drilled bone powder from pelvis | ≈ 120 | - |
Divergence times were generated with TimeTree (http://www.timetree.org) [19]. Calibrated carbon dates are given in bold. “≈” approximately, “-”not applicable
PCR cycle number for indexing amplification of mtDNA enriched libraries (determined using qPCR).
| 20 | 26 | |
| 26 | 30 | |
| 21 | 28 | |
| 26 | 31 | |
| 23 | 27 |
Fig 2MapDamage plots.
Levels of cytosine deamination in mitogenome mapped data were assayed with mapDamage 2 [27]. Only plots from hybridization capture enrichments that used high stringency washes are shown. Additionally, only a single northern hairy-nosed wombat sample is shown as an example. All samples exhibit increased levels of C → T and G → A substitutions that are typical of aDNA. Hybridization capture enrichments with low stringency washes and the other wombat samples gave similar results as those shown above.
Wombat mapping statistics.
| 4219884 | 43 | 43 | 0 | 2 | 0.23 | 20.93 | - | - | |
| 1755334 | 88 | 86 | 0 | 4 | 0.54 | 39.94 | - | - | |
| 2993544 | 11 | 11 | 0 | 2 | 0.07 | 6.61 | - | - | |
| 5445759 | 9 | 9 | 0 | 1 | 0.05 | 4.71 | - | - | |
| 1446171 | 160 | 157 | 0 | 8 | 0.95 | 55.19 | - | - | |
| 2790106 | 3 | 3 | 0 | 3 | 0.01 | 0.54 | - | - | |
| 2339861 | 215 | 214 | 0 | 8 | 1.17 | 64.49 | - | - | |
| 3627332 | 43 | 43 | 0 | 3 | 0.25 | 20.82 | - | - | |
| 2533495 | 282 | 276 | 0 | 7 | 1.60 | 78.95 | - | - | |
| 2907911 | 715 | 691 | 0 | 14 | 4.01 | 96.13 | - | - | |
| 4940987 | 286958 | 7030 | 0 | 57 | 34.83 | 99.96 | 139.63 | 151.43 | |
| 316534 | 18500 | 3098 | 0 | 51 | 19.13 | 99.87 | 199.77 | 35.43 | |
| 1083154 | 1522 | 943 | 0 | 21 | 5.28 | 97.80 | 236.93 | 75.43 | |
| 4774242 | 41996 | 1945 | 0 | 32 | 10.24 | 99.13 | 246.51 | 204.8 | |
| 4137981 | 538950 | 12843 | 0 | 153 | 85.07 | 99.99 | 28.59 | 89.55 | |
| 3626269 | 5173 | 82 | 0 | 73 | 0.33 | 1.44 | 21.03 | 84.53 | |
| 3449102 | 577350 | 15649 | 1 | 159 | 98.90 | 100.00 | 49.61 | 249.84 | |
| 3952352 | 49720 | 10372 | 0 | 141 | 62.46 | 99.99 | 221.37 | 61.9 | |
| 1396352 | 328930 | 15426 | 0 | 161 | 99.04 | 99.98 | 101.41 | 49.97 | |
| 3428373 | 1139995 | 27564 | 6 | 239 | 200.36 | 100.00 | 33.83 | 151.43 | |
Ancient DNA from museum hairy-nosed northern wombat specimens was converted into sequencing libraries and enriched for mtDNA. Shotgun and mtDNA enriched libraries were mapped to a truncated wombat mitogenome reference (GenBank: KJ868118.1, bp 1 to 15420) using BWA aln [25].
Fig 3Correlation of fraction unique mapped reads in wombat libraries.
The fraction of unique mapped reads for the wombat shotgun and mtDNA enriched libraries was generated by dividing the number of unique mapped reads by the number of collapsed reads (Table 4). The high correlation between the fraction of unique reads in shotgun and enriched libraries suggest that initial screening of a sample with shallow sequencing and adjusting sequencing effort accordingly may reduce the cost of studies involving hybridization capture enrichment.
Mapping statistics for subsampled divergent taxa libraries.
| 144 | 143 | 0 | 5 | 0.97 | 63.36 | - | - | |
| 57 | 56 | 0 | 3 | 0.19 | 17.71 | - | - | |
| 3714 | 3327 | 0 | 57 | 14.37 | 93.09 | - | - | |
| 26 | 26 | 0 | 3 | 0.11 | 10.25 | - | - | |
| 35500 | 32138 | 0 | 82 | 5.26 | 39.50 | - | - | |
| 13408 | 7891 | 0 | 202 | 60.96 | 99.89 | 55.18 | 62.85 | |
| 65992 | 3447 | 0 | 42 | 13.20 | 97.25 | 61.55 | 69.47 | |
| 181291 | 11171 | 0 | 310 | 70.39 | 94.04 | 3.36 | 4.90 | |
| 5943 | 1231 | 0 | 75 | 8.10 | 70.70 | 47.35 | 73.64 | |
| 30790 | 24969 | 0 | 204 | 4.12 | 38.86 | 0.78 | 0.78 | |
| 71992 | 15320 | 1 | 339 | 144.92 | 100.00 | 107.13 | 149.40 | |
| 27076 | 1057 | 0 | 32 | 4.20 | 82.17 | 18.88 | 22.11 | |
| 481887 | 9306 | 0 | 335 | 58.14 | 93.06 | 2.80 | 4.05 | |
| 7634 | 339 | 0 | 69 | 2.45 | 42.04 | 13.04 | 22.27 | |
| 34030 | 21829 | 0 | 99 | 3.55 | 38.85 | 0.68 | 0.67 | |
In-house hybridization capture probes made from modern bison DNA were used to enrich mtDNA from ancient specimens of divergent taxa. To eliminate the effect of sequencing depth, analysis was performed on 2.5 x 106 reads randomly subsampled from each library. The bison probes were able to enrich divergent animal mtDNA but not plant. The effect of low stringency washes (low temperature and high salt concentration) and high stringency washes (high temperature and low salt concentration) on enrichment efficiency with in-house bison probes indicates that high stringency washes are more effective with closely related taxa while low stringency washes are more effective with distantly related species. Mapping statistics for the entire sequencing data set are given in S2 Table.
Fig 4Read depth of animal mtDNA enriched libraries.
Read depths of animal sequencing libraries enriched for mtDNA and mapped to a mitogenome reference. The small black graph above the read depth plot represents the fraction of GC nucleotides in reference mitogenome in a 200 bp sliding window. Sequence composition did not appear to have a consistent impact on the recovery of mtDNA with the RNA probes. Some regions of references with low GC content did appear to have reduced levels of read depth, which include nucleotides 16,151 to 16,743 of the thylacine reference where the GC content drops to ≈ 13%. The large variability in GC content of the thylacine mitogenome may have contributed to the reduced enrichment efficiency observed with the marsupial sample.