| Literature DB >> 27881876 |
H M Nistelberger1, O Smith2, N Wales3, B Star1, S Boessenkool1.
Abstract
The majority of archaeological plant material is preserved in a charred state. Obtaining reliable ancient DNA data from these remains has presented challenges due to high rates of nucleotide damage, short DNA fragment lengths, low endogenous DNA content and the potential for modern contamination. It has been suggested that high-throughput sequencing (HTS) technologies coupled with DNA enrichment techniques may overcome some of these limitations. Here we report the findings of HTS and target enrichment on four important archaeological crops (barley, grape, maize and rice) performed in three different laboratories, presenting the largest HTS assessment of charred archaeobotanical specimens to date. Rigorous analysis of our data - excluding false-positives due to background contamination or incorrect index assignments - indicated a lack of endogenous DNA in nearly all samples, except for one lightly-charred maize cob. Even with target enrichment, this sample failed to yield adequate data required to address fundamental questions in archaeology and biology. We further reanalysed part of an existing dataset on charred plant material, and found all purported endogenous DNA sequences were likely to be spurious. We suggest these technologies are not suitable for use with charred archaeobotanicals and urge great caution when interpreting data obtained by HTS of these remains.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27881876 PMCID: PMC5121605 DOI: 10.1038/srep37347
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sample details and characteristics including: sequencing platform, paired-end (pe) or single-end (se), maximum read length (bp); whether libraries were shotgun sequenced or subjected to target enrichment, whole genome (WG) or solid-state(SS); double stranded (ds) or single stranded (ss) library build; average read length (bp); total number of raw reads, or read pairs if paired-end; percentage of reads that mapped to sample genomes after duplicate removal.
| Sample | HTS platform | Method | Library build | Avg. read length (bp) | # Raw reads | % Reads mapped to target |
|---|---|---|---|---|---|---|
| Barley1a | HiSeq2500/pe/125 bp | target enrichment | ds | 92 | 9363464 | 0.008 |
| Barley1b | Miseq/pe/75 bp | target enrichment | ss | 78 | 4052951 | 0.029 |
| Barley2a | HiSeq2500/pe/125 bp | target enrichment | ds | 100 | 6943249 | 0.007 |
| Barley2b | Miseq/pe/75 bp | target enrichment | ss | 76 | 4577402 | 0.040 |
| Barley3a | HiSeq2500/pe/125 bp | shotgun | ds | 80 | 1445024 | 0.010 |
| Barley3b | Miseq/pe/75 bp | shotgun | ss | 72 | 5006428 | 0.022 |
| Barley4a | HiSeq2500/pe/125 bp | shotgun | ds | 95 | 6406123 | 0.008 |
| Barley4b | Miseq/pe/75 bp | shotgun | ss | 83 | 353474 | 0.014 |
| Barley5 | Miseq/pe/75 bp | target enrichment | ss | 69 | 3641058 | 0.123 |
| Barley6 | Miseq/pe/75 bp | target enrichment | ss | 68 | 3031164 | 0.085 |
| Barley7 | Miseq/pe/75 bp | shotgun | ss | 68 | 2384691 | 0.076 |
| Barley8 | Miseq/pe/75 bp | shotgun | ss | 78 | 639883 | 0.015 |
| Grape1 | HiSeq2500/se/100 bp | shotgun | ds | 69 | 6337515 | 0.003 |
| Grape2 | HiSeq2500/se/100 bp | shotgun | ds | 72 | 8657842 | 0.003 |
| Grape3 | HiSeq2500/se/81 bp | shotgun | ds | 72 | 6491713 | 0.008 |
| Grape4 | HiSeq2500/se/81 bp | shotgun | ds | 67 | 9147375 | 0.009 |
| Grape5 | HiSeq2500/se/81 bp | shotgun | ds | 66 | 5794427 | 0.015 |
| Maize1 | HiSeq2500/se/81 bp | shotgun | ds | 72 | 5052523 | 0.006 |
| Maize2 | HiSeq2500/se/81 bp | shotgun | ds | 73 | 7351039 | 0.007 |
| Maize3 | HiSeq2500/se/81 bp | shotgun | ds | 67 | 135982 | 0.011 |
| Maize4 | HiSeq2500/se/81 bp | shotgun | ds | 65 | 10742918 | 0.013 |
| Maize5 | HiSeq2500/se/81 bp | shotgun | ds | 66 | 7822641 | 0.016 |
| Maize6 | HiSeq2500/se/81 bp | shotgun | ds | 67 | 7645073 | 0.012 |
| Maize7 | HiSeq2500/se/81 bp | shotgun | ds | 67 | 6588747 | 0.007 |
| Maize8a | HiSeq2500/se/100 bp | shotgun | ds | 72 | 28239900 | 0.026 |
| Maize8b | HiSeq2500/se/100 bp | target enrichment | ds | 78 | 21895732 | 0.016 |
| Rice1 | MiSeq/se/250 bp | target enrichment WG | ds | 193 | 390515 | 0.005 |
| Rice2 | MiSeq/se/250 bp | target enrichment | ds | 63 | 500538 | 0.015 |
| Rice3 | MiSeq/se/250 bp | target enrichment SS | ds | 141 | 1783367 | 0.001 |
| Rice4 | MiSeq/se/250 bp | target enrichment WG | ds | 228 | 499372 | 0.001 |
| Rice5 | MiSeq/se/250 bp | target enrichment SS | ds | 114 | 4953182 | 0.001 |
| Rice6 | MiSeq/se/250 bp | target enrichment WG | ds | 219 | 567506 | 0.002 |
| Rice7 | MiSeq/se/250 bp | target enrichment SS | ds | 131 | 6101994 | 0.001 |
| Rice8 | MiSeq/se/250 bp | target enrichment WG | ds | 194 | 447690 | 0.002 |
| Rice9 | MiSeq/se/250 bp | target enrichment WG | ds | 227 | 301980 | 0.002 |
| Rice10 | MiSeq/se/250 bp | target enrichment SS | ds | 177 | 3425508 | 0.000 |
| Rice11 | MiSeq/se/250 bp | target enrichment WG | ds | 172 | 592182 | 0.013 |
| Rice12a | MiSeq/se/250 bp | shotgun | ds | 206 | 2608916 | 0.000 |
| Rice12b | MiSeq/se/250 bp | target enrichment WG | ds | 179 | 1476971 | 0.001 |
| Rice13 | MiSeq/se/250 bp | target enrichment | ds | 70 | 396830 | 0.012 |
| Rice14 | MiSeq/se/250 bp | target enrichment | ds | 74 | 1096766 | 0.018 |
| Rice15 | MiSeq/se/250 bp | target enrichment | ds | 78 | 595556 | 0.006 |
| Rice16 | MiSeq/se/250 bp | target enrichment | ds | 64 | 338135 | 0.009 |
| Rice17 | MiSeq/se/250 bp | target enrichment | ds | 93 | 373420 | 0.002 |
| Blank_barley1 | HiSeq2500/se/81 bp | shotgun | ds | 54 | 33740465 | NA |
| Blank_barley2 | HiSeq2500/se/81 bp | shotgun | ds | 66 | 7126362 | NA |
| Blank_barley3 | HiSeq2500/se/81 bp | shotgun | ds | 68 | 671521 | NA |
| Blank_grape1 | HiSeq2500/se/100 bp | shotgun | ds | 58 | 33090380 | NA |
| Blank_maize1 | HiSeq2500/se/100 bp | shotgun | ds | 52 | 43339302 | NA |
| Blank_maize2 | HiSeq2500/se/81 bp | shotgun | ds | 71 | 15727571 | NA |
| Blank_rice1 | MiSeq/se/250 bp | shotgun | ds | 91 | 250044 | NA |
Average percentage of reads from all barley, grape, maize, rice and blank samples that map to the four reference genomes.
| Samples | Reference genome | |||
|---|---|---|---|---|
| Barley | Grape | Maize | Rice | |
| 5300 Mbp | 500 Mbp | 2500 Mbp | 430 Mbp | |
| Barley samples | 0.036 | 0.019 | 0.025 | 0.022 |
| Grape samples | 0.018 | 0.007 | 0.021 | 0.012 |
| Maize samples | 0.018 | 0.011 | 0.013 | 0.009 |
| Rice samples | 0.010 | 0.008 | 0.004 | 0.005 |
| Extraction blanks | 0.051 | 0.037 | 0.038 | 0.028 |
Genome size is listed under each reference genome in megabase pair (Mbp).
Percentage taxonomic assignments of reads averaged across each species and the series of blanks determined using MEGAN.
| Barley | Maize | Grape | Rice | Blanks | |
|---|---|---|---|---|---|
| Bacteria | 55.66 | 67.25 | 72.26 | 42.92 | 9.36 |
| Eukaryotes | 4.06 | 8.06 | 4.59 | 1.82 | 5.74 |
| Plants | 0.65 | 1.62 | 2.04 | 0.36 | 0.19 |
| Target | 0.01 | 0.14 | 0.16 | 0.03 | 0.00 |
| Not assigned | 39.61 | 22.94 | 20.95 | 54.86 | 84.70 |
Results indicate assignment prior to PIA filtering.
Results of BLASTn and Phylogenetic Intersect Analysis (PIA) showing the number of reads BLASTED after duplicate removal using Prinseq, the number of metagenomic BLAST hits on the sample species, Post PIA filtering hits at 95% coverage (>0.2 taxon diversity) and 99% coverage (>0.2 taxon diversity).
| Sample | # Reads | BLAST hits target species | Post-PIA hits (95%) cvg. | Post-PIA hits (99%) cvg. |
|---|---|---|---|---|
| Barley1a | 8433351 | 73 | 2 | 2 |
| Barley1b | 2855828 | 20 | 0 | 0 |
| Barley2a | 6525081 | 64 | 0 | 0 |
| Barley2b | 3163453 | 18 | 0 | 0 |
| Barley3a | 1415373 | 13 | 0 | 0 |
| Barley3b | 4001313 | 9 | 0 | 0 |
| Barley4a | 6019807 | 40 | 0 | 0 |
| Barley4b | 242478 | 1 | 0 | 0 |
| Barley5 | 2561257 | 22 | 1 | 1 |
| Barley6 | 2283844 | 4 | 0 | 0 |
| Barley7 | 1741206 | 7 | 0 | 0 |
| Barley8 | 409890 | 3 | 0 | 0 |
| Grape1 | 6157219 | 32 | 0 | 0 |
| Grape2 | 8477672 | 28 | 0 | 0 |
| Grape3 | 6315327 | 755 | 2 | 1 |
| Grape4 | 5685642 | 873 | 3 | 3 |
| Grape5 | 8672311 | 744 | 1 | 0 |
| Maize1 | 4911944 | 2 | 0 | 0 |
| Maize2 | 7254256 | 1 | 0 | 0 |
| Maize3 | 88796 | 0 | 0 | 0 |
| Maize4 | 8530317 | 71 | 0 | 0 |
| Maize5 | 5396523 | 128 | 1 | 1 |
| Maize6 | 6446171 | 74 | 0 | 0 |
| Maize7 | 6479527 | 2 | 0 | 0 |
| Maize8a | 27518813 | 2714 | 54 | 37 |
| Maize8b | 14715580 | 3096 | 114 | 82 |
| Rice1 | 390515 | 0 | 0 | 0 |
| Rice2 | 500538 | 0 | 0 | 0 |
| Rice3 | 1783367 | 13 | 0 | 0 |
| Rice4 | 499372 | 64 | 0 | 0 |
| Rice5 | 4953182 | 0 | 0 | 0 |
| Rice6 | 567506 | 0 | 0 | 0 |
| Rice7 | 6101994 | 166 | 0 | 0 |
| Rice8 | 447690 | 0 | 0 | 0 |
| Rice9 | 301980 | 1 | 0 | 0 |
| Rice10 | 3425508 | 99 | 0 | 0 |
| Rice11 | 592182 | 26 | 0 | 0 |
| Rice12a | 2608916 | 150 | 0 | 0 |
| Rice12b | 1476971 | 66 | 0 | 0 |
| Rice13 | 396830 | 217 | 0 | 0 |
| Rice14 | 1096766 | 99 | 0 | 0 |
| Rice15 | 595556 | 289 | 0 | 0 |
| Rice16 | 338135 | 111 | 0 | 0 |
| Rice17 | 373420 | 47 | 0 | 0 |
| Blank_barley1 | 3754102 | 50 | 0 | 0 |
| Blank_barley2 | 5390039 | 12 | 0 | 0 |
| Blank_barley3 | 567952 | 24 | 0 | 0 |
| Blank_grape1 | 2053566 | 13 | 0 | 0 |
| Blank_maize1 | 2687850 | 371 | 0 | 0 |
| Blank_maize2 | 10293162 | 11 | 0 | 0 |
| Blank_rice1 | 250044 | 0 | 0 | 0 |
Figure 1Examples of the charred material for each of the four species used in this study.
Clockwise from top left: barley seed (Hordeum vulgarum), grape seed (Vitis vinifera), maize partial cob (Zea mays) and rice grain (Oryza sativa) (scale bar indicates mm).