| Literature DB >> 23029156 |
Nathan Wales1, J Alberto Romero-Navarro, Enrico Cappellini, M Thomas P Gilbert.
Abstract
DNA extracted from ancient plant remains almost always contains a mixture of endogenous (that is, derived from the plant) and exogenous (derived from other sources) DNA. The exogenous 'contaminant' DNA, chiefly derived from microorganisms, presents significant problems for shotgun sequencing. In some samples, more than 90% of the recovered sequences are exogenous, providing limited data relevant to the sample. However, other samples have far less contamination and subsequently yield much more useful data via shotgun sequencing. Given the investment required for high-throughput sequencing, whenever multiple samples are available, it is most economical to sequence the least contaminated sample. We present an assay based on quantitative real-time PCR which estimates the relative amounts of fungal and bacterial DNA in a sample in comparison to the endogenous plant DNA. Given a collection of contextually-similar ancient plant samples, this low cost assay aids in selecting the best sample for shotgun sequencing.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23029156 PMCID: PMC3447772 DOI: 10.1371/journal.pone.0045644
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Primers for qPCR contamination assay.
| Targeted genome or organism | Primer sequence | Length (bp) | Ref. |
| Plant nuclear (tRNA-His gene) | F: | 50 | This study |
| R: | |||
| Plant chloroplast ( | F: | 138–140 |
|
| R: | |||
| Bacteria (16S rRNA gene) | F: | 65 |
|
| R: | |||
| Fungi (18S rRNA gene) | F: | 131–132 |
|
| R: |
Archaeobotanical samples tested in assay.
| Name | Species | Tissue | Context | Library PCR details |
| AR 6 |
| Desiccated branch | Areni I cave, Armenia. Medieval context. | 22 cycles |
| FE 2599 |
| Desiccated seed | Porta Remo-Via Vespergolo site, Ferrara, Italy.Stratigraphic unit 2599, dated by artifacts tofirst half 11th century AD. | 22 cycles |
| AZ 935 |
| Desiccated kernel | Turkey House Ruin, Arizona. 707±23 14CYBP. | 20 cycles |
| PLM 4 |
| Desiccated kernel | Playa Miller 4 site, Chile. Dated to 750-550 YearsBP. | 22 cycles |
| MEX 1 |
| Desiccated cob | Mexican archaeological site, unknownprovenance and unknown age. | 22 cycles |
| CMAG 10189 |
| Desiccated cob | Cueva del Maguey 1 site, Pueblo Nuevo, Durango,Mexico. Dated to 1410±25 14CYBP. | 18 cycles |
| CMAG 10237 |
| Desiccated cob | Cueva del Maguey 1 site, Pueblo Nuevo, Durango,Mexico. Dated to 1410±25 14CYBP. | 25 cycles |
Verification of assay on amplified libraries with shotgun sequencing data.
| qPCR assay results | Shotgun sequencing results | ||||
| Sample | Primer set | Relative to maximum | Endogenous DNA | Mapping and BLASTfindings | Relative to maximum |
| AR 6 | Plant genome | 2.26% | 5.04% |
| 13.85% |
| Chloroplast | 0.58% |
| 0.66% | ||
| Bacteria | Maximum | Bacteria | Maximum | ||
| Fungi | 0.75% | Fungi | 3.06% | ||
| Fe 2599 | Plant genome |
| 0.14% |
| 0.56% |
| Chloroplast | 11.34% |
| 0.01% | ||
| Bacteria | Maximum | Bacteria | Maximum | ||
| Fungi | 11.42% | Fungi | 2.66% | ||
| AZ 935 | Plant genome | Maximum | 92.38% |
| Maximum |
| Chloroplast | 25.53% |
| 0.30% | ||
| Bacteria | 20.73% | Bacteria | 0.32% | ||
| Fungi | 0.91% | Fungi | 0.06% | ||
| PLM 4 | Plant genome | Maximum | 90.59% |
| Maximum |
| Chloroplast | 7.75% |
| 0.19% | ||
| Bacteria | 32.99% | Bacteria | 0.55% | ||
| Fungi | 7.75% | Fungi | 0.41% | ||
| MEX 1 | Plant genome |
| 80.86% |
| Maximum |
| Chloroplast | Maximum |
| 0.15% | ||
| Bacteria | 11.10% | Bacteria | 3.89% | ||
| Fungi | 0.31% | Fungi | 0.46% | ||
| CMAG 10189 | Plant genome |
| 11.00% |
| 51.73% |
| Chloroplast |
|
| 0.08% | ||
| Bacteria | Maximum | Bacteria | Maximum | ||
| Fungi | 41.18% | Fungi | 18.49% | ||
| CMAG 10237 | Plant genome |
| 24.69% |
| 99.39% |
| Chloroplast | 2.52% |
| 0.05% | ||
| Bacteria | Maximum | Bacteria | Maximum | ||
| Fungi | 1.63% | Fungi | 6.98% | ||
As discussed in the methods section, the assay percentages are meant as a guide to compare samples and are not expected to match the absolute values yielded via shotgun sequencing.
The scaled shotgun sequencing results do not include reads without BLAST matches or reads which matched higher taxonomic levels (e.g., eukaryotes or metazoa).
N/A indicates the primer set did not fluoresce before the negative control for the sample.
Verification of assay on DNA extracts.
| qPCR assay results | ||
| Sample | Primer set | Relative to maximum |
| AR 6 | Plant genome | 1.24% |
| Chloroplast | 5.18% | |
| Bacteria | Maximum | |
| Fungi | 75.79% | |
| AZ 935 | Plant genome | Maximum |
| Chloroplast | 8.30% | |
| Bacteria | 3.56% | |
| Fungi | 1.20% | |
| CMAG 10189 | Plant genome |
|
| Chloroplast | 0.01% | |
| Bacteria | 5.63% | |
| Fungi | Maximum | |
Figure 1Directions to interpret qPCR assay data.
Follow the questions in the flowchart to compare the endogenous DNA content of different ancient plant samples. The least contaminated samples are best suited for shotgun sequencing.