| Literature DB >> 34931883 |
Clio Der Sarkissian1, Irina M Velsko2, Anna K Fotakis3, Åshild J Vågene4, Alexander Hübner2,5, James A Fellows Yates2,6,7.
Abstract
Like modern metagenomics, ancient metagenomics is a highly data-rich discipline, with the added challenge that the DNA of interest is degraded and, depending on the sample type, in low abundance. This requires the application of specialized measures during molecular experiments and computational analyses. Furthermore, researchers often work with finite sample sizes, which impedes optimal experimental design and control of confounding factors, and with ethically sensitive samples necessitating the consideration of additional guidelines. In September 2020, early career researchers in the field of ancient metagenomics met (Standards, Precautions & Advances in Ancient Metagenomics 2 [SPAAM2] community meeting) to discuss the state of the field and how to address current challenges. Here, in an effort to bridge the gap between ancient and modern metagenomics, we highlight and reflect upon some common misconceptions, provide a brief overview of the challenges in our field, and point toward useful resources for potential reviewers and newcomers to the field.Entities:
Keywords: ancient DNA; ancient metagenomics; archaeogenomics; archaeology; authentication; cultural heritage; ethics; metagenomics; microbe; microbiome; paleogenetics; peer review
Year: 2021 PMID: 34931883 PMCID: PMC8691190 DOI: 10.1128/msystems.01315-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
Common terms used in ancient metagenomics
| Term | Description | Relevant literature |
|---|---|---|
| SPAAM | Standards, Precautions, and Advances in Ancient Metagenomics, a community of researchers in ancient metagenomics who run regular discussion meetings and community projects. |
|
| Ancient | An organism or tissue is often considered to qualify as ancient when it is more than 100 years old. This cutoff, however, is arbitrary. | |
| Dental calculus | Mineralized dental plaque, also known as dental tartar, contains the petrified remains of an oral biofilm of microorganisms and other micro- and macroremains derived from the oral cavity. | |
| Paleofeces | Ancient human or animal feces that remain in an organic or partially organic state. Completely mineralized fecal remains are termed coprolites (however, note that the terms are often used interchangeably in archaeology). |
|
| Historic specimen | Any specimen that is no longer living and can no longer undergo molecular repair mechanisms. Sometimes used to distinguish natural history museum specimen collections aged less than 200 years old from mostly mineralized tissues found in archaeological excavations older than 200 years. | |
| Bio-cultural heritage | Biological specimens, tissues, or secondary substrates (e.g., dental calculus and paleofeces) or the habitats that are derived from (human) culture that are of cultural significance to a society or individual. |
|
| Necrobiome | The (micro)organisms that decompose dead organic materials, mostly referring to dead bodies. |
|
| Postdepositional | Changes or contamination affecting the specimen after deposition in the ground or environmental context not used during life (e.g., burial). | |
| Degradation | The process of biomolecules being broken up and damaged through a variety of chemical and mechanical processes. |
|
| Molecular preservation | State of the chemical preservation of the biomolecules, used to evaluate the feasibility of cost-effective hybridization capture, sequencing, and analysis. | |
| Destructive sampling | Sampling that removes part of the sample or specimen such that the removed/collected part cannot be returned after analysis. | |
| Ancient DNA | DNA from deceased organisms that has undergone some form of postmortem degradation processes (e.g., fragmentation, deamination damage). Often referred to as aDNA. | |
| Fragmentation | Breakage of DNA backbones that normally occurs at sites of base depurination and often results in single-stranded overhangs. Over time, aDNA reads become very short, typically 30–70 base pairs (bp). | |
| Misincorporation | Higher frequency of cytosine deamination-derived uracils (C-to-U) that are read as T by nonproofreading polymerases at DNA termini, caused by hydrolysis. This occurs on exposed single-stranded overhangs of fragmented DNA. | |
| Damage pattern | Can refer to either nucleotide misincorporation or fragment length distributions or both. | |
| Contamination | Ancient and modern DNA not deriving from the original organism or sample of interest (e.g., from the burial environment, museum collection, laboratory environment, curators, researchers). | |
| Endogenous DNA | DNA originally derived from the sample or tissue type that does not come from contaminating sources. | |
| Authentication | Determining whether a given set of DNA molecules is truly ancient and belonging to, or originating from, the sample in question. Normally based on characteristic damage patterns and endogenous content. | |
| Negative controls | Controls that do not contain samples for analysis (typically water) to act as indicators for the laboratory performance of reagents, or protocols and cross-contamination from the laboratory or samples in the same batch. |
|
| Radiocarbon dating | A means of annual age determination by measuring the decay of radioactive carbon, which occurs at a consistent rate. Also known as C-14 or carbon dating, with dates typically reported as “before present,” where “present” is usually defined as approximately 1950. |
|
| FAIR principles | A set of guidelines that focus on sharing data and metadata in a way that is machine readable, easily findable, and reusable to promote reproducibility. |
|
Nonexhaustive list of suggested reviews, laboratory comparisons, dedicated analysis tools, and benchmarking studies relevant for ancient metagenomics
| Publication type | Description | Reference |
|---|---|---|
| General review | General but comprehensive introduction to microbial archaeology, including ancient metagenomics and pathogen reconstruction and their challenges. Output of the first SPAAM meeting in 2016. |
|
| Descriptions of the challenges related to contamination in next-generation sequencing data sets of low-biomass samples, including ancient DNA. |
| |
| Review of approaches developed for estimating the level of contamination in ancient human DNA data sets, covering aspects also relevant for ancient metagenomics. |
| |
| State-of-the-art summary of ancient pathogen research and what can be learned from such genomes. |
| |
| Laboratory protocol | Comparison of decontamination and aDNA extraction protocols for ancient dental calculus. |
|
| Comparison of decontamination protocols for ancient mammalian dental calculus. |
| |
| Comparison of decontamination protocols for ancient human dental calculus. |
| |
| Comparison of different protocols for simultaneous extraction of different ancient biomolecules from dental calculus. |
| |
| Comparison of DNA extraction methods for paleofeces, including commonly used modern DNA extraction kits. |
| |
| Comparison of microbial genome enrichment techniques for ancient pathogens. |
| |
| Development of techniques for aDNA retrieval from cave sediments and mammalian DNA capture techniques. |
| |
| Extraction method for retrieval of eukaryotic aDNA from marine sediment. |
| |
| Capture protocol for enrichment of marine eukaryotic aDNA. |
| |
| Computational tool | gargammel: synthetic ancient DNA data set generation with a damage and contamination modeling tool. |
|
| MALT: includes a description of the ultrafast BLAST-like metagenomic aligner MALT, which includes adaptations to account for ancient DNA damage. Often used for taxonomic profiling or pathogen detection. |
| |
| PIA: a taxonomic read binner used to identify the likely host source of typically sedimentary aDNA (sedaDNA) reads, with a focus on accounting for extinct taxa not present in modern databases. |
| |
| SNP_Evaluation: a tool allowing for evaluating and visualizing confidence in variant calling of low-coverage pathogen genomes. |
| |
| MEx-IPA: an interactive viewer of output from the HOPS metagenomic authentication pipeline. |
| |
| cuperdec: an R package for the estimation and visualization of the endogenous taxonomic content of ancient microbiomes. |
| |
| PyDamage: a tool for separating ancient and modern contigs from |
| |
| Pipeline | Holi: a pipeline for taxonomic profiling of ancient metagenomic reads based on competitive mapping to large databases. Often used to profile ancient environmental samples (e.g., sedaDNA) for reads assigned to animal or plant taxa. |
|
| metaBIT: the first pipeline with configurations for high-throughput ancient metagenomic screening with MetaPhlAn and a range of taxonomic profile comparison analyses. |
| |
| coproID: a pipeline for the prediction of the host organism of ancient fecal material, including taxonomic profiling of the endogenous content of both host and microbial DNAs. |
| |
| HOPS: a pipeline integrating MALT alignment with postalignment ancient DNA characteristic authentication. Includes damage profiling, fragment length, and visualization of possible contamination. |
| |
| nf-core/eager: a general ancient DNA genomics pipeline with taxonomic profiling for pathogen screening (and microbiome reconstruction) and authentication components, as well as steps allowing for analysis of microbial genomes. |
| |
| Benchmarking | Comparison of different metagenomic taxonomic classifiers applied to ancient DNA data, with description of effects of aDNA damage and short fragment lengths. |
|
| Comparison of the effects of nucleotide-to-nucleotide vs nucleotide-to-protein taxonomic classification for short fragment lengths. |
|
FIG 1Historical label for a museum specimen at the Zoological Museum of Copenhagen, Denmark. Museum documentation enabled curators to decipher and translate the Danish inscription into “Mytilus edulis L. Drift Ice in the Kara Sea. The Dijmphna Expedition.” This documentation, however, leaves scarce contextual information useful for downstream analyses and interpretations, such as precise collection date and geographical coordinates, environmental conditions of the find, and postcollection processing. (Photo courtesy of Clio Der Sarkissian.)
FIG 2Main characteristics and challenges of ancient metagenomic DNA. The original “endogenous” aDNA (i.e., the original DNA in the sample) of a wide range of sample types often makes up only a small fraction of the total genetic content of a sample. Contaminating environmental DNA from close relatives will often reduce confidence in taxonomic profiling and variant calling due to alignment of reads with variants from organisms other than the true ancient organism under investigation. True aDNA has a greater frequency of characteristic C-to-T deamination miscoding lesions at 5′ molecule termini (and a corresponding G-to-A mutation on the complementary 3′ strand of double-stranded libraries). True aDNA fragment distributions typically peak at short lengths, often around 30 to 70 bp, compared to the long lengths of modern DNA.