| Literature DB >> 34582623 |
Tomasz Suchan1, Mariya A Kusliy1,2, Naveed Khan1,3, Loreleï Chauvey1, Laure Tonasso-Calvière1, Stéphanie Schiavinato1, John Southon4, Marcel Keller5, Keiko Kitagawa6,7, Johannes Krause8,9, Alexander N Bessudnov10, Alexander A Bessudnov11, Alexander S Graphodatsky2, Silvia Valenzuela-Lamas12, Jarosław Wilczyński13, Sylwia Pospuła13, Krzysztof Tunia14, Marek Nowak15, Magdalena Moskal-delHoyo16, Alexey A Tishkin17, Alexander J E Pryor18, Alan K Outram18, Ludovic Orlando1.
Abstract
DNA hybridization-capture techniques allow researchers to focus their sequencing efforts on preselected genomic regions. This feature is especially useful when analysing ancient DNA (aDNA) extracts, which are often dominated by exogenous environmental sources. Here, we assessed, for the first time, the performance of hyRAD as an inexpensive and design-free alternative to commercial capture protocols to obtain authentic aDNA data from osseous remains. HyRAD relies on double enzymatic restriction of fresh DNA extracts to produce RNA probes that cover only a fraction of the genome and can serve as baits for capturing homologous fragments from aDNA libraries. We found that this approach could retrieve sequence data from horse remains coming from a range of preservation environments, including beyond radiocarbon range, yielding up to 146.5-fold on-target enrichment for aDNA extracts showing extremely low endogenous content (<1%). Performance was, however, more limited for those samples already characterized by good DNA preservation (>20%-30%), while the fraction of endogenous reads mapping on- and off-target was relatively insensitive to the original endogenous DNA content. Procedures based on two instead of a single round of capture increased on-target coverage up to 3.6-fold. Additionally, we used methylation-sensitive restriction enzymes to produce probes targeting hypomethylated regions, which improved data quality by reducing post-mortem DNA damage and mapping within multicopy regions. Finally, we developed a fully automated hyRAD protocol utilizing inexpensive robotic platforms to facilitate capture processing. Overall, our work establishes hyRAD as a cost-effective strategy to recover a set of shared orthologous variants across multiple ancient samples.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34582623 PMCID: PMC9291508 DOI: 10.1111/1755-0998.13518
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 8.678
Samples used in the study and endogenous DNA content prior to capture
| Sample name | Sample age | Site | Material | Endogenous DNA content (all/unique) |
|---|---|---|---|---|
| PLMie10 | Lusatian culture, Bronze Age/Early Iron Age (1,200–600 | Miechów, Poland | Metatarsus | 0.34/0.34% |
| PLSla2 | Funnelbeaker culture, Eneolithic (3,700–3,000 | Sławęcinek, Poland | Molar | 0.51/0.51% |
| PLMie8 | Found in Funnelbeaker culture feature, Eneolithic (3,700–3,000 | Miechów, Poland | Metacarpus | 6.58/6.55% |
| DIV9 | Upper Palaeolithic, Late Glacial period (13,000–14,500 uncal | Divnogor'ye, Voronezh Region, Russia | Molar | 27.42/1.72% |
| KB217 | Upper Palaeolithic | Medvezhiya cave, upper course of the river Pechora, Urals, Russia | Metapodium | 16.83/11.20% |
| PLMie3 | Przeworsk culture, Roman period (1st–4th century | Miechów, Poland | Femur | 13.09/13.85% |
| LOG3 | Upper Palaeolithic (infinite 14C date, >52,200 uncal | Hyena's Lair, Altai Republic, Russia | Metatarsus | 20.64/20.55% |
| SV2019‐22 | Iron Age, Middle Numidian period (5th–6th century | Althiburos, Tunisia | Tooth | 33.84/33.66% |
| SV2019‐18 | Iron Age, Middle Numidian period (5th–6th century | Althiburos, Tunisia | Tooth | 47.19/46.97% |
| PLKaz4 | Trzciniec culture, Early Bronze Age (1,900–1,200 | Kazimierza Wielka, Poland | Petrous temporal bone | 51.67/51.48% |
| PLKaz1 | Trzciniec culture, Early Bronze Age (1,900–1,200 | Kazimierza Wielka, Poland | Petrous temporal bone | 60.43/60.15% |
| PLKaz2 | Trzciniec culture, Early Bronze Age (1,900–1,200 | Kazimierza Wielka, Poland | Petrous temporal bone | 73.78/73.59% |
bce = Before Common Era. ce = Common Era. bp = Before Present. Uncal = uncalibrated radiocarbon range.
FIGURE 1Percentage of endogenous, unique endogenous (after removing PCR duplicates), reads flagged as PCR duplicates, on‐target reads (i.e., showing at least 1 nucleotide overlap with the target regions) and unique on‐target reads. Results are shown for shotgun genomic libraries (pre) and the same libraries with one (capture 1) or two rounds (capture 2) of capture with three types of hyRAD probes, filtering for a minimum mapping quality of 25 (analyses relaxing the mapping quality filter are shown in Figure S10 so as to illustrate the impact of repeated elements in the sequence data)
Sequence characteristics for the sequenced probes libraries (*in‐silico calculations for HpaII–MseI probes do not take methylation‐sensitivity into account)
| Enzyme combination | Raw reads | Properly paired reads | Number of multimapping properly paired reads | Number of targets | Number of targets after merging overlapping targets | Percentage of the genome targeted | Percentage of the genome predicted |
|---|---|---|---|---|---|---|---|
|
| 502,815 | 380,495 | 43,713 | 247,716 | 246,973 | 1.81% | 4.77% |
|
| 502,815 | 381,803 | 55,672 | 260,579 | 257,588 | 2.07% | 7.16% |
|
| 502,815 | 387,275 | 32,569 | 234,924 | 223,789 | 1.55% | 7.16%* |
FIGURE 2Percentage of sites with non‐null coverage, considering unique on‐target reads. Results are shown for shotgun genomic libraries (pre) and the same libraries with one (capture 1) or two rounds (capture 2) of capture with the three types of hyRAD probes
FIGURE 3The effect of one (capture 1) or two rounds (capture 2) of capture with three types of hyRAD probes on the %GC content and sequenced fragment lengths, as compared with the shotgun libraries (pre)
FIGURE 4Enrichment‐fold values for 12 samples enriched in two consecutive rounds on PstI–MseI probes, for all and unique fragments (i.e., after PCR duplicate removal). The enrichment is calculated as the proportion of mapped‐to‐all reads, on‐target‐to‐all reads, and on‐target‐to‐mapped reads. Note that the two top panels are shown on log‐scale
FIGURE 5Enrichment‐fold values as a function of the fragment length for 12 samples enriched in two consecutive rounds on PstI–MseI probes
FIGURE 6(a) Number of mapped nucleotides shared by n samples without and after two consecutive rounds on PstI–MseI probes. Note the log scale on the y‐axis. (b) Number of shared genomic variants shared by n samples without and after two consecutive rounds of capture on PstI–MseI probes. (c) Number of genomic variants detected in each sample without and after two consecutive rounds of capture on PstI–MseI probes. Note that the upper and lower plots show different scales in both (b) and (c)
FIGURE 7Post‐mortem DNA damage estimates of the unique on‐target reads after one and two rounds of capture on MspI–MseI vs HpaII–MseI probes. The plot shows 5′‐end cytosine deamination rates in CpG context as estimated by pmdtools and cumulated across the first 10 read positions
FIGURE 8Percentage of endogenous, unique endogenous reads, reads flagged as PCR duplicates, on‐target reads, and unique on‐target reads as compared between manual and robotic capture procedures, using PstI–MseI probes