Christoph Schwörer1, Maria Leunda1,2, Nadir Alvarez3,4, Felix Gugerli2, Christoph Sperisen2. 1. Institute of Plant Sciences & Oeschger Centre for Climate Change Research, University of Bern, 3013, Bern, Switzerland. 2. WSL Swiss Federal Research Institute, 8903, Birmensdorf, Switzerland. 3. Natural History Museum of Geneva, 1208, Geneva, Switzerland. 4. Department of Genetics and Evolution, University of Geneva, 1205, Geneva, Switzerland.
Abstract
The rapid development of ancient DNA analysis in the last decades has induced a paradigm shift in ecology and evolution. Driven by a combination of breakthroughs in DNA isolation techniques, high-throughput sequencing, and bioinformatics, ancient genome-scale data for a rapidly growing variety of taxa are now available, allowing researchers to directly observe demographic and evolutionary processes over time. However, the vast majority of paleogenomic studies still focus on human or animal remains. In this article, we make the case for a vast untapped resource of ancient plant material that is ideally suited for paleogenomic analyses: plant remains, such as needles, leaves, wood, seeds, or fruits, that are deposited in natural archives, such as lake sediments, permafrost, or even ice caves. Such plant remains are commonly found in large numbers and in stratigraphic sequence through time and have so far been used primarily to reconstruct past local species presences and abundances. However, they are also unique repositories of genetic information with the potential to revolutionize the fields of ecology and evolution by directly studying microevolutionary processes over time. Here, we give an overview of the current state-of-the-art, address important challenges, and highlight new research avenues to inspire future research.
The rapid development of ancient DNA analysis in the last decades has induced a paradigm shift in ecology and evolution. Driven by a combination of breakthroughs in DNA isolation techniques, high-throughput sequencing, and bioinformatics, ancient genome-scale data for a rapidly growing variety of taxa are now available, allowing researchers to directly observe demographic and evolutionary processes over time. However, the vast majority of paleogenomic studies still focus on human or animal remains. In this article, we make the case for a vast untapped resource of ancient plant material that is ideally suited for paleogenomic analyses: plant remains, such as needles, leaves, wood, seeds, or fruits, that are deposited in natural archives, such as lake sediments, permafrost, or even ice caves. Such plant remains are commonly found in large numbers and in stratigraphic sequence through time and have so far been used primarily to reconstruct past local species presences and abundances. However, they are also unique repositories of genetic information with the potential to revolutionize the fields of ecology and evolution by directly studying microevolutionary processes over time. Here, we give an overview of the current state-of-the-art, address important challenges, and highlight new research avenues to inspire future research.
Over the last decades, the analysis of ancient DNA (aDNA) has evolved from the recovery of a few hundred base pairs (bp) of mitochondrial DNA from century‐old historical samples (Higuchi et al., 1984) to the sequencing of whole genomes at high coverage (Meyer et al., 2012) and up to a million years old (van der Valk et al., 2021). Both the number of aDNA studies and the number of taxa for which ancient genomic information is available have increased exponentially (Orlando et al., 2021). This tremendous development has mainly been driven by the introduction of high‐throughput sequencing (HTS), also referred to as next‐generation sequencing (Goodwin et al., 2016), in combination with breakthroughs in DNA isolation techniques (Meyer et al., 2008; Dabney et al., 2013; Schmid et al., 2017; Lendvay et al., 2018b; Rohland et al., 2018). Paleogenomic data allow researchers to directly observe demographic and evolutionary processes over time. This includes population expansions and declines (Lorenzen et al., 2011), range shifts and migrations (Lipson et al., 2017; Moreno‐Mayar et al., 2018), adaptation to environmental stressors (Marciniak & Perry, 2017; Sandoval‐Castellanos et al., 2017; Dehasque et al., 2020), domestication processes (da Fonseca et al., 2015; Scott et al., 2019; Librado et al., 2021), gene flow and hybridization (Der Sarkissian et al., 2013; Schaefer et al., 2016; van der Valk et al., 2021), species extinctions (Lorenzen et al., 2011; Dehasque et al., 2021), and speciation (van der Valk et al., 2021). The groundbreaking results of many aDNA studies have thus led to a paradigm shift in such different fields as archaeology, anthropology, ecology, and evolution.aDNA studies are based on two different approaches: extracting DNA from either ancient, preserved tissues as starting material (i.e. referred to as aDNA), or from ancient source material, such as lake sediment, soil, permafrost, or ice (Box 1), which contains a mixture of tissue, cells, or extracellular DNA from a wide range of organisms (i.e. referred to as environmental DNA, or in the case of sediment as sedaDNA). Currently, the vast majority of paleogenomic studies still focus on human (Rasmussen et al., 2010; Meyer et al., 2012; Moreno‐Mayar et al., 2018) or animal remains (Librado et al., 2021; van der Valk et al., 2021), where mineralized tissues, such as bones and teeth, may provide well‐preserved aDNA. Those tissues mainly stem from archaeological sites or natural archives (e.g. permafrost; see Box 1) and allow an individual‐based approach. By contrast, most studies focusing on plants so far have rather applied a metabarcoding approach from sedaDNA to reconstruct past floristic communities (Alsos et al., 2016; Parducci et al., 2017; Wang et al., 2021). SedaDNA studies can provide important information about the past presence, diversity, and possibly also abundance of species that cannot be resolved by paleoecological methods such as pollen analysis alone because of the latter's lower taxonomic resolution (Parducci et al., 2017). A metabarcoding approach uses specific primers (e.g. targeting the chloroplast trnL P6 loop in plants; Taberlet et al., 2007) that amplify short DNA fragments known to harbor a high sequence variability among species (but low variation at the within‐species level) that can then be sequenced using HTS and compared with a reference database (Alsos et al., 2016; Parducci et al., 2017; Wang et al., 2021). The advantage here is obviously that a multitude of taxa can be identified from a single sediment sample. Another option for the analysis of sedaDNA samples is to use shotgun metagenomics; that is, sequencing billions of reads from a single sediment sample and then using dedicated bioinformatic analyses to align them with publicly available sequence databases and/or newly established reference libraries to identify the species present (Pedersen et al., 2016; Parducci et al., 2017; Armbrecht et al., 2021). However, the efficiency of the method is limited by the fraction of the reads that can be assigned to the target species due to the lack of reference genomes available and the high amounts of microbial non‐target DNA. An advantage of shotgun sequencing compared with metabarcoding is that aDNA can be authenticated based on characteristic patterns of postmortem DNA damage, particularly deaminated cytosine residues (Hofreiter et al., 2001) – although these patterns must be corrected bioinformatically to avoid accounting for artefactual variation. However, it is at least theoretically possible, albeit complex, to sequence the genetic information from individuals or reconstruct entire genomes – see Kurland et al. (2019) and Guirao‐Rico & González (2021) for a discussion on Pool‐seq approaches. Nevertheless, so far, there are no studies that have reconstructed the entire genome of ancient plants from environmental DNA, and studies that have applied an individual‐based approach to ancient plant material are rare. Whereas a few have, for instance, used archaeobotanical remains to study domestication processes (da Fonseca et al., 2015; Estrada et al., 2018; Scott et al., 2019), others used plant remains or pollen from natural archives (Parducci et al., 2005; Schmid et al., 2017; Lendvay et al., 2018a; Wagner et al., 2018) or herbarium collections (Bieker & Martin, 2018; Kistler et al., 2020) to reconstruct population dynamics or genetic diversity. One reason for the scarcity of individual‐based plant aDNA studies focusing on natural populations is that endogenous DNA content in soft tissue, such as plant remains, is relatively low due to high amounts of contaminant microbial DNA and/or high levels of DNA degradation (Green & Speller, 2017). Also, many plant remains found in an archaeological context are charred remains that rarely contain exploitable amounts of endogenous DNA (Nistelberger et al., 2016).Natural archives, such as lake or mire sediments, fluvial and landslide deposits, ice caves, and permafrost can conserve biological remains for millennia, due to either anoxic conditions or low temperatures that prevent the decomposition of tissues (Birks, 2003; Jørgensen et al., 2012; Leunda et al., 2019; Fig. 1). Indeed, lake and mire archives have been regularly used in paleoecology for a century to reconstruct past species abundances and vegetation composition, mostly based on the analysis of pollen (Birks, 2019). Besides pollen that is ubiquitous in such archives, larger plant remains, such as leaves, needles, bud scales, wood, seeds, or fruits, also occur regularly and can often be determined to species level using morphological and anatomical characteristics. In contrast to pollen, which is readily dispersed by wind over large distances, such macrofossils commonly originate from the local vegetation around a site, thus more reliably indicating local species presence compared with pollen. The same processes that prevent the decomposition of these macrofossils should also minimize the degradation of the DNA within the plant cells, making macrofossils a viable source for past genetic information. There are also other advantages: plant remains are commonly deposited in stratigraphic order and, depending on size, can be directly dated using radiocarbon dating, instead of relying on the dating of surrounding organic material. In many cases, there are numerous remains available from different individuals within a certain time period, which allows the quantitative reconstruction of past local species abundance. This also makes the genetic analysis of entire populations possible and allows comparing the past genetic composition with present‐day and/or nonlocal populations. However, even though the potential of macrofossils as a source for aDNA has been recognized for more than a decade (Parducci & Petit, 2004; Gugerli et al., 2005, 2013), only a handful of paleogenetic studies based on macrofossils exist so far. One of the earliest such studies is from the Carpathians, where the authors analyzed chloroplast aDNA from subfossil Picea abies (Norway spruce) seeds and cone scales, as well as pollen, and found the same genetic haplotypes as in extant populations, indicating strong demographic stasis over millennial timescales (Magyari et al., 2011). The authors could also show a decrease in genetic variability since the beginning of the Holocene, which could be associated with the repeated bottlenecks inferred from paleoecological data. The results were recently confirmed by a follow‐up study at the same site based on macrofossils alone, including needles, concluding that such remains are an invaluable repository for information on past population genetic dynamics (Lendvay et al., 2018a). In the Southern Alps, genome‐scale aDNA data extracted from subfossil Abies alba (silver fir) needles were used to infer changes in genetic variation between 7.2–5.8 ka cal. bp (calibrated years before present; Schmid et al., 2017), when anthropogenic disturbance led to a drastic decrease in population size (Tinner et al., 1999). The aDNA analysis revealed a lowered observed heterozygosity during the palynologically inferred population decrease, which confirms the paleoecological interpretation of population fragmentation in response to disturbance (Fig. 2). With a recovery of the estimated population size after 6.5 ka cal. bp, genetic variation also returned to predisturbance levels. The lack of genetic differentiation between the populations growing before and after the population decline indicates reexpansion of local trees in the study area (Schmid et al., 2017). Besides macrofossils preserved in lake sediments, plant aDNA has also been extracted from waterlogged, subfossil wood remains found in archaeological or sedimentological contexts (Lendvay et al., 2018b; Wagner et al., 2018). Such remains are widely used in dendroclimatology and chronology to reconstruct past climatic conditions or precisely date wood remains based on tree‐ring patterns (Büntgen et al., 2011; Hafner et al., 2021). In the most comprehensive study to date, 167 waterlogged wood remains from European white oaks have been analyzed using high throughput sequencing (HTS) (Wagner et al., 2018). Even though endogenous DNA content was mostly low (< 1% of total DNA reads), the comparison of ancient and extant chloroplast haplotypes indicates a continuous presence of local populations with limited changes in haplotype composition over millennia. Another recent study was able to taxonomically identify 13 000‐yr‐old pine trunks buried in clay as Pinus sylvestris (Scots pine) using amplicon sequencing of chloroplast aDNA (Lendvay et al., 2018b). Similarly, a study from Lithuania, based on mitochondrial DNA and nuclear microsatellites, could link ancient haplotypes from 11 000‐yr‐old submerged Scots pine stumps in the Baltic Sea with extant populations and refugia in the Balkan Peninsula (Danusevičius et al., 2021).
Fig. 1
(a) Lakes are natural archives that can conserve plant material, such as leaves (e.g. Dryas octopetala; left inset), seeds (e.g. winged Betula fruits and catkin scale; middle inset), and needles (e.g. Abies alba; right inset), as well as endogenous genetic information for millennia. Lake sediment archives are especially suited for plant ancient DNA (aDNA) studies, because they can contain numerous macrofossils that are deposited in stratigraphic order over time (b). Other natural archives that represent valuable sources for plant aDNA studies are (c) ice caves, (d) fluvial or landslide deposits, and (e) permafrost soils.
Fig. 2
Past changes in the genetic diversity of Abies alba populations in response to the decline and subsequent recovery of population size around Lago di Origlio, southern Switzerland (Tinner et al., 1999; Schmid et al., 2017). (a) Anthropogenic disturbance caused a drastic decline in A. alba populations during the period 6.5–6.2 ka cal. bp, as reflected in pollen percentages. Individual A. alba needles from selected time periods (orange squares; n, number of needles) were used for ancient DNA analysis. (b) The genetic analysis revealed a significantly lower observed heterozygosity during the period of population decline at 6.5–6.2 ka cal. bp (letters above bars refer to statistically significant differences). However, the absence of significant changes in allelic richness after population recovery at 6.2–5.8 ka cal. bp indicates that genetic diversity was able to recover. (c) Since there was no genetic differentiation between the populations growing before (7.2–6.6 ka cal. bp) and after population recovery (6.2–5.8 ka cal. bp), this process was most likely driven by internal recruitment (H1) and not external recruitment (H2; Schmid et al., 2017).
(a) Lakes are natural archives that can conserve plant material, such as leaves (e.g. Dryas octopetala; left inset), seeds (e.g. winged Betula fruits and catkin scale; middle inset), and needles (e.g. Abies alba; right inset), as well as endogenous genetic information for millennia. Lake sediment archives are especially suited for plant ancient DNA (aDNA) studies, because they can contain numerous macrofossils that are deposited in stratigraphic order over time (b). Other natural archives that represent valuable sources for plant aDNA studies are (c) ice caves, (d) fluvial or landslide deposits, and (e) permafrost soils.Past changes in the genetic diversity of Abies alba populations in response to the decline and subsequent recovery of population size around Lago di Origlio, southern Switzerland (Tinner et al., 1999; Schmid et al., 2017). (a) Anthropogenic disturbance caused a drastic decline in A. alba populations during the period 6.5–6.2 ka cal. bp, as reflected in pollen percentages. Individual A. alba needles from selected time periods (orange squares; n, number of needles) were used for ancient DNA analysis. (b) The genetic analysis revealed a significantly lower observed heterozygosity during the period of population decline at 6.5–6.2 ka cal. bp (letters above bars refer to statistically significant differences). However, the absence of significant changes in allelic richness after population recovery at 6.2–5.8 ka cal. bp indicates that genetic diversity was able to recover. (c) Since there was no genetic differentiation between the populations growing before (7.2–6.6 ka cal. bp) and after population recovery (6.2–5.8 ka cal. bp), this process was most likely driven by internal recruitment (H1) and not external recruitment (H2; Schmid et al., 2017).
Survival of plant ancient DNA in waterlogged sediments
The preservation of biological remains depends on an array of processes and conditions that involve physical, chemical, and biological agents (Behrensmeyer et al., 2000). Such processes can affect the DNA of waterlogged tissues at different steps of taphonomy: during the transport to and within the aquatic environment, at the sediment–water interface, and after burial in the sediment. In the preburial environment, DNA may be degraded primarily by microbes and intracellular nucleases, whereas these processes may stabilize after burial in the sediment due to anoxic conditions. In these instances, hydrolytic processes, particularly DNA depurination leading to single‐strand breaks (Lindahl, 1993), may further limit the time over which DNA remains intact in a tissue. However, it has been suggested that depurination of DNA preserved in waterlogged sediments proceeds at slower rates than theoretically predicted or estimated for terrestrial environments (Corinaldesi et al., 2008).An increasing number of studies indicate that waterlogged plant remains represent a rich source of aDNA sequences. Waterlogged seeds (Kistler et al., 2015; Wales et al., 2016; Ramos‐Madrigal et al., 2019), fruit fragments (Kistler et al., 2014), needles (Schmid et al., 2017), and wood (Wagner et al., 2018) have all been shown to yield aDNA suitable for chloroplast or nuclear genome‐scale analyses (Table 1). As expected for aDNA, the recovered DNA was degraded to small average size (< 95 bp) and, when analyzed, characterized by an increased occurrence of purines (adenine and guanosine residues) before strand breaks, putatively due to DNA depurination (Briggs et al., 2007). Moreover, an increased frequency of cytosine‐to‐thymine misincorporations close to the ends of the DNA fragments was observed, due to deamination of cytosine residues that occur primarily in the single‐stranded DNA overhangs (Brotherton et al., 2007; Table 1). Such characteristic damage patterns can in turn also be used to authenticate aDNA (Hofreiter et al., 2001; Jónsson et al., 2013). The most detailed information about plant aDNA preservation is available for waterlogged wood of European white oaks (Wagner et al., 2018). Using wood retrieved from lake sediment, marine silt, clay, and peat, it was found that the DNA fragment size was linearly correlated with thermal age, a measure combining the age of the specimen with average temperatures since deposition (Smith et al., 2003). However, the data indicated that other factors in addition to depurination may contribute to DNA fragmentation (Wagner et al., 2018). Furthermore, it was observed that all millennia‐old wood samples with moderate‐to‐high endogenous oak DNA contents (> 1%–16.5% of total DNA reads), with the remaining DNA originating from microbes, were retrieved from wood samples embedded in calcareous lake sediments, suggesting that such sediments could represent particularly promising environments for the preservation of aDNA in wood.
Table 1
Recovery of aDNA using high throughput sequencing (HTS) from waterlogged plant remains.
Study
Species
Tissue
Age range (ka cal. bp)
No. of sites
Endogenous DNA content (%)
Average DNA fragment length (bp)
Method
Target
DNA damage
Kistler et al. (2014)
Lagenaria siceraria
Fruit fragment (gourd rind)
10.2–9.8
1
na
63
Target enrichment
Large single‐copy region of the plastid genome
na
Wales et al. (2014)
Vitis vinifera
Seed
1.35–1.25
1
na
na
PCR
Plant rbcL marker (138 bp)
na
Kistler et al. (2015)
Cucurbita spp.
Seed
Holocene
1
0.08–0.761
na
Target enrichment
Plastid genome
Deamination
Wales et al. (2016)
Vitis vinifera
Seed
3.2–0.45
21
1.42
59.1–86.31
Target enrichment and shotgun
Plastid genome
Deamination
Lendvay et al. (2018b)
Pinus sylvestris
Wood
13.9–13.0
1
na
na
PCR
Plastid trnL region (84 bp) and trnF region (109 bp)
Deamination
Schmid et al. (2017)
Abies alba
Needle
7.2–5.8
1
0.01–0.33
na
Target enrichment
Nuclear exome, complexity reduced
Deamination
Wagner et al. (2018)
Quercus robur/petraea
Wood
9.8–0.55
26
0–16.5
≤ 95
Shotgun
Part of plastid genome
Deamination and depurination
Ramos‐Madrigal et al. (2019)
Vitis vinifera
Seed
2.46–0.75
9
0–33.5
58.1–77.31
Target enrichment and shotgun
Set of nuclear genes
Deamination
na, not analyzed.
Values after target enrichment.
Value for one sample.
Recovery of aDNA using high throughput sequencing (HTS) from waterlogged plant remains.na, not analyzed.Values after target enrichment.Value for one sample.
Challenges of plant ancient DNA analysis
Despite indications about favorable environmental conditions for the preservation of aDNA in plant macrofossils, it is currently not possible to predict the suitability of samples for genome‐scale aDNA analyses. In paleogenomic studies, it is common to initially screen samples to select the most promising material for further analyses; for instance, using PCR‐based assays (Wales et al., 2012; Lendvay et al., 2018b). However, if all DNA of a sample is fragmented to the point that none of the markers will amplify, the assay cannot provide any guidance, because such samples are not necessarily devoid of endogenous DNA. Additional challenges of plant paleogenomics lie in practical aspects, such as the often small size of specimens, plant compounds that inhibit downstream enzymatic reactions, and contamination with modern DNA. Fortunately, there is a fast‐growing number of tools available to address these challenges, such as pretreatment of plant remains, dedicated extraction protocols for ultrashort aDNA fragments, targeted sequencing of endogenous aDNA using hybridization‐capture approaches, postmortem deamination correction, or mapping shotgun‐sequenced libraries to ever‐increasing numbers of reference genomes.The fragmented, low‐quality, and fragile nature of plant aDNA requires strict protocols to recover generally low quantities of degraded DNA and minimize contamination from modern sources (Kistler et al., 2020; Latorre et al., 2020). Thus, all extraction and preamplification steps should be performed in dedicated aDNA laboratories to avoid and/or identify sources of contamination (Kistler et al., 2020). Different methods have been developed that optimize the recovery and processing of the short and damaged plant aDNA fragments (Latorre et al., 2020). However, there is currently no standard protocol for DNA extraction from ancient plant remains, largely due to the diversity of plant taxa and tissue types recovered in ancient deposits (Lendvay et al., 2018a). Therefore, pilot studies are often required to identify the best practice for a given set of samples (Kistler et al., 2020). Most plant tissues are rich in polysaccharides and polyphenols, and waterlogged plant remains can also contain humic acids derived from sediment (Kistler et al., 2020). All these molecules tend to coextract with DNA and can act as inhibitors for downstream enzymatic reactions (Kistler, 2012). Several methods have been developed for extracting DNA from modern plant material, aiming to maximize the DNA yield and simultaneously reduce inhibitors. Most of these protocols include either sodium dodecyl sulfate (SDS) or cetyl‐trimethyl ammonium bromide (CTAB) as detergents in the extraction buffer. The anionic SDS is used to precipitate polysaccharides and proteins (Dellaporta et al., 1983). Likewise, the cationic CTAB is capable of precipitating polysaccharides (Doyle & Doyle, 1987). In a comparative study, it was shown that extraction with SDS yields higher DNA amounts from ancient and historical plant remains than extraction with CTAB does (Wales et al., 2014). Indeed, the majority of plant aDNA studies conducted so far have used SDS‐based extraction buffers (see Pont et al., 2019, table 2), in some cases (e.g. Gutaker et al., 2017; Schmid et al., 2017) in combination with N‐phenacylthiazolium bromide (PTB), an agent that cleaves glucose‐derived protein crosslinks and thus can help to release DNA from protein–DNA complexes (Poinar et al., 1998). Extraction of aDNA from herbarium specimens with PTB and SDS was found to decrease the average DNA fragment length when compared with CTAB (Gutaker et al., 2017). Additionally, silica‐based DNA purification techniques allow the efficient recovery of short DNA fragments (Rohland et al., 2018), and by adjusting chaotropic salt concentrations of the binding buffer, fragments as short as 35 bp (Dabney et al., 2013) or even shorter (≥ 25 bp; Glocke & Meyer, 2017) can be retained.After extraction and purification, aDNA molecules must be converted into sequencing libraries, which requires the addition of individual barcodes and platform‐specific sequencing adapters to each DNA molecule (Goodwin et al., 2016). Library preparation should also be optimized for degraded DNA. In comparison with double‐stranded, single‐stranded library preparation techniques minimize the loss of DNA molecules with single‐strand breaks on both strands and/or DNA molecules with end modifications located on one of the two strands (Gansauge & Meyer, 2013), thereby increasing the number of library molecules that can be retrieved from highly degraded DNA. Recent studies show fast and inexpensive, single‐stranded library preparation methods (Troll et al., 2019) even optimized for aDNA (Tin et al., 2014; Kapp et al., 2021).Ancient DNA libraries often contain < 1% endogenous DNA, with the majority of sequencing capacity taken up by DNA from other sources, such as microorganisms. A way to overcome this limitation is to enrich the libraries using hybridization probes before sequencing (Carpenter et al., 2013). These methods use either commercially synthesized probes (e.g. Ali et al., 2016; Ramos‐Madrigal et al., 2019), which can be costly, or benchtop‐produced hybridization probes (Suchan et al., 2016; Schmid et al., 2017). Plant genomes are generally complex, containing 10–80% noncoding repeated elements (Metcalfe & Casane, 2013), and can thus be very large, such as in conifers (Nystedt et al., 2013; Mosca et al., 2019). Libraries can be enriched for chloroplast genomes (Meucci et al., 2021; Schulte et al., 2021) or exome (protein‐coding) sequences, using probes generated from messenger RNA (Schmid et al., 2017; Toussaint et al., 2021), which may significantly reduce sequencing costs. Last but not least, bioinformatic suites have made it possible to apply the whole set of post‐sequencing analytical steps in a glance (Schubert et al., 2014; Fellows Yates et al., 2021), including corrections for postmortem damage (Jónsson et al., 2013) and incorporating uncertainty in the genotype calling for low‐coverage sequence data (Nielsen et al., 2011). For a whole review on the downstream aDNA bioinformatic analyses, see Orlando et al. (2021).
Applications of ancient DNA analyses based on plant macrofossils
Reconstructing postglacial range shifts using a multisite approach
Ongoing and future climate change is expected to lead to widespread range shifts of plant species that are tracking their current climatic niche (Parmesan & Yohe, 2003; Steinbauer et al., 2018). The unprecedented rate of change is raising the question of whether the dispersal capacity of plants is sufficient to keep up with the rising temperatures. Some scientists have even argued that species might need ‘assisted migration’ to prevent their local extinction (McLachlan et al., 2007; Aitken & Bemmels, 2016; Dauphin et al., 2021). Species migration rates have either been inferred by estimating the species’ dispersal capacity (e.g. by directly or indirectly determining seed dispersal distances) or by tracking the first establishment of a species at different sites in response to past climatic changes using pollen and macrofossil analyses (Pearson, 2006; Feurdean et al., 2013; Birks, 2019). Paleoecological techniques have also been applied to estimate expansion pathways from refugial locations during the last Ice Age, sometimes in combination with ecological niche modeling and phylogenetic data that provide information about past geographic isolation and location of refugia (Gavin et al., 2014). However, these approaches commonly rely on the location of source populations from the main refugia and may ignore secondary or cryptic refugia, which could significantly alter effective species dispersal rates (Birks, 2019). Additionally, paleoecological approaches alone are not able to resolve population‐level dynamics due to intrinsic taxonomic constraints that do not allow the identification of within‐species lineages. This makes it impossible to track species range shifts in detail. Phylogeographic approaches, on the other hand, rely on present‐day genetic variation only and, therefore, are not able to identify cryptic lineages that became extinct in the past. It is, however, possible to infer past demographic changes and migration patterns from extant populations using demographic inference (Marchi et al., 2021).Macrofossils deposited in natural archives not only allow determining and dating the local population establishment (e.g. in response to past climate warming), but the genetic information preserved within such remains also provides crucial information about the relationship among populations. By analyzing aDNA from the first populations that established around a network of sites and inferring the degree of relationship among them, the expansion of a population can be tracked with unprecedented detail (Fig. 3a). The analysis of aDNA also allows identifying populations that became extinct during the Holocene and/or might have originated from previously unknown (‘cryptic’) refugia. The identification of refugial populations is important to calculate expansion rates more precisely and to understand the processes involved in species survival under adverse climatic conditions. Indeed, a study identifying sedaDNA of Scots pine and Norway spruce in the lake sediment of an ice‐free potential refugium from northern Scandinavia during the Last Glacial Maximum, as well as the Early Holocene presence of a rare mitochondrial haplotype, point to the persistence of trees in northern Scandinavia during the last glacial period (Parducci et al., 2012), even though this interpretation immediately aroused criticism (Birks et al., 2012). Moreover, a more recent study also based on sedaDNA could not fully confirm nor reject the findings, because the low presence of spruce and pine DNA was not distinguishable from background contamination (Alsos et al., 2020). Unfortunately, the metabarcoding approach used here erases the signature of deamination patterns potentially present in the original DNA template, which could have represented evidence for ancient DNA.
Fig. 3
Applications of plant ancient DNA (aDNA) analyses at different temporal and spatial scales illustrated with hypothetical examples. (a) Postglacial recolonization patterns of plant species (red, orange, and blue arrows) from refugial locations (red, orange, and blue DNA double helices) can be reconstructed in detail, when the timing of population establishment is combined with aDNA analysis of the first macrofossils at several sites (stars). Such patterns might differ from present‐day phylogeographic analyses (filled pie charts) due to subsequent range expansions and admixtures. This approach would also allow the identification of cryptic populations or lineages that became extinct during the Holocene (pink star) and greatly improve the precision of estimated population expansion rates. (b) By analyzing aDNA from plant macrofossils deposited in stratigraphic order, changes in haplotype diversity (pink, orange, and blue bars) and in genetic diversity in response to demographic changes (as indicated by pollen percentages) can be reconstructed in detail, using population genetic indices such as allelic richness A
r (red line) or expected heterozygosity H
exp (blue line). In this hypothetical example, the extinction of the cryptic lineage (pink bars) can also be observed in more detail. (c) Comparing genetic information, studying putatively adaptive loci, from populations before (blue double helix) and after rapid past climate change (red double helix), such as the transition from the Younger Dryas cold period to the Early Holocene (as indicated by independent climate reconstructions; e.g. Heiri et al., 2014), would potentially allow to test whether plant species were able to adapt to rapid climate change and may again do so in the future.
Applications of plant ancient DNA (aDNA) analyses at different temporal and spatial scales illustrated with hypothetical examples. (a) Postglacial recolonization patterns of plant species (red, orange, and blue arrows) from refugial locations (red, orange, and blue DNA double helices) can be reconstructed in detail, when the timing of population establishment is combined with aDNA analysis of the first macrofossils at several sites (stars). Such patterns might differ from present‐day phylogeographic analyses (filled pie charts) due to subsequent range expansions and admixtures. This approach would also allow the identification of cryptic populations or lineages that became extinct during the Holocene (pink star) and greatly improve the precision of estimated population expansion rates. (b) By analyzing aDNA from plant macrofossils deposited in stratigraphic order, changes in haplotype diversity (pink, orange, and blue bars) and in genetic diversity in response to demographic changes (as indicated by pollen percentages) can be reconstructed in detail, using population genetic indices such as allelic richness A
r (red line) or expected heterozygosity H
exp (blue line). In this hypothetical example, the extinction of the cryptic lineage (pink bars) can also be observed in more detail. (c) Comparing genetic information, studying putatively adaptive loci, from populations before (blue double helix) and after rapid past climate change (red double helix), such as the transition from the Younger Dryas cold period to the Early Holocene (as indicated by independent climate reconstructions; e.g. Heiri et al., 2014), would potentially allow to test whether plant species were able to adapt to rapid climate change and may again do so in the future.
Tracking changes in genetic diversity through time
Genetic diversity is one of the fundamental components of biodiversity and an important prerequisite for adaptation to changing environmental conditions. It is therefore crucial for preserving species and maintaining ecosystem resilience. The effects of demographic processes, such as range shifts or population declines, on genetic diversity have been intensely investigated theoretically (Pauls et al., 2013; Dauphin et al., 2021), but empirical studies, especially with long‐lived organisms such as trees, are rare and are unable to resolve the impacts of climate change over several generations (Pluess, 2011; Lesser et al., 2013; Elleouet & Aitken, 2019). Recently, there has been a lot of concern about the effect of population declines on the genetic diversity of many species. However, there is virtually no baseline to compare present vs ancient levels of genetic diversity (but see Leigh et al., 2019; Gauthier et al., 2020).Plant aDNA studies based on macrofossils would allow the reconstruction of changes in the genetic diversity of a species over extended time periods as required for long‐lived organisms such as trees (Fig. 3b). In contrast to herbarium collections, which are also used as an important resource of past genetic diversity (Bieker & Martin, 2018; Lopez et al., 2020), natural archives go beyond the historical period of human‐driven impacts on ecosystems, thereby providing information from truly natural populations. In long‐lived organisms such as trees, extant populations can also be used to study allele frequency changes over a few generations (Dauphin et al., 2021). By using plant remains deposited in natural archives, such analyses can be extended over much longer time periods (e.g. Schmid et al., 2017; Fig. 3b). Neutral population genetic processes can be tracked by using aDNA, given that allelic frequencies at the population scale are directly impacted by demographic events such as population expansions and declines, gene flow from neighboring populations, or random loss of certain alleles due to genetic drift. A better understanding of the effects of demographic processes on genetic diversity would help us to make more accurate predictions about future changes in genetic diversity.
Testing the adaptive potential of plants to climate change
It is still an open question whether (or to what extent and at what speed) plants can genetically adapt to rapid climatic changes (Birks, 2019). However, this knowledge is crucial in assessing the impact of future climate change on the vegetation. It is clear that species can adapt to local environmental conditions through natural selection, resulting in distinct phenotypes, but the pace at which such processes occur is still debated. The novel research field of landscape genomics aims to identify genes that are associated with certain environmental conditions and result in the expression of respective phenotypes that convey higher fitness (Sork et al., 2013). Adaptive loci are either identified by genome‐wide association studies (GWAS) that link adaptive genes to associated phenotypic traits (Bragg et al., 2015) or by environmental association analysis (EAA; also termed genotype–environment associations), which is based on correlations between genetic variants with environmental conditions (Rellstab et al., 2015). In a recent article by Napier et al. (2020), the authors argue that both GWAS and EAA can also be applied to aDNA, thereby testing if plants were able to adapt to past climatic changes. Similarly, a recent study linking genomic information of adult and juvenile Swiss stone pine (Pinus cembra) age cohorts in the Alps with environmental data indicates that environment‐driven allele frequency changes over centuries are small, which suggests that such long‐lived species may not be able to adapt fast enough, potentially resulting in epigenetically mediated acclimation rather than adaptation, or to local extinction (Dauphin et al., 2021).Expanding the temporal scale of plant species adaptation to changing environments, one may compare the genetic information preserved within macrofossils from time periods with marked climatic changes (Fig. 3c). For example, the transition from the Younger Dryas cold period to the current Holocene interglacial c. 11 700 yr ago in Europe is considered a close analogue to the current climate warming regarding the rate of climate change, with temperatures rising 2–4°C within less than a century (Heiri et al., 2014). By comparing the allele frequencies of putatively adaptive loci between populations growing before and after the climate warming at the same site and comparing this change with the situation in extant populations, it would be possible to better estimate the adaptive potential of a species.
Conclusions and outlook
Climate change will have profound impacts on plant distribution and abundance, as well as associated ecosystem services and functioning. Analyzing plant aDNA from macrofossils deposited in natural archives has the potential to assess the effects of past rapid climate change on plant species at the genetic level. This will ultimately allow better predictions about the effects of future climate change on the abundance, distribution, adaptive potential, and genetic diversity of plants.With ever more ancient genetic information available, it will also be possible to test and validate population genetic models. Such paleo‐validated models can then be used in turn to make detailed predictions about future changes in genetic diversity. This approach of comparing model output with paleo‐data is standard procedure for climate and vegetation models, but is not very common for population genetic models, at least for long‐lived organisms over long time scales. Paleogenetic information from individual species (aDNA) could also be combined with data from multitaxon approaches based on sedaDNA (Dussex et al., 2021).Overall, we believe that the proposed framework has the potential to fundamentally improve our understanding of population genetic processes, by opening a window into the past and allowing us to retrospectively track genetic changes over time.
Author contributions
C Schwörer, NA, FG and C Sperisen designed the research; C Schwörer and ML wrote a first draft and all authors contributed to the final manuscript.
Authors: Adrian W Briggs; Udo Stenzel; Philip L F Johnson; Richard E Green; Janet Kelso; Kay Prüfer; Matthias Meyer; Johannes Krause; Michael T Ronan; Michael Lachmann; Svante Pääbo Journal: Proc Natl Acad Sci U S A Date: 2007-08-21 Impact factor: 11.205
Authors: Matthias Meyer; Martin Kircher; Marie-Theres Gansauge; Heng Li; Fernando Racimo; Swapan Mallick; Joshua G Schraiber; Flora Jay; Kay Prüfer; Cesare de Filippo; Peter H Sudmant; Can Alkan; Qiaomei Fu; Ron Do; Nadin Rohland; Arti Tandon; Michael Siebauer; Richard E Green; Katarzyna Bryc; Adrian W Briggs; Udo Stenzel; Jesse Dabney; Jay Shendure; Jacob Kitzman; Michael F Hammer; Michael V Shunkov; Anatoli P Derevianko; Nick Patterson; Aida M Andrés; Evan E Eichler; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo Journal: Science Date: 2012-08-30 Impact factor: 47.728
Authors: Rute R da Fonseca; Bruce D Smith; Nathan Wales; Enrico Cappellini; Pontus Skoglund; Matteo Fumagalli; José Alfredo Samaniego; Christian Carøe; María C Ávila-Arcos; David E Hufnagel; Thorfinn Sand Korneliussen; Filipe Garrett Vieira; Mattias Jakobsson; Bernardo Arriaza; Eske Willerslev; Rasmus Nielsen; Matthew B Hufford; Anders Albrechtsen; Jeffrey Ross-Ibarra; M Thomas P Gilbert Journal: Nat Plants Date: 2015-01-08 Impact factor: 15.793
Authors: Tomasz Suchan; Camille Pitteloud; Nadezhda S Gerasimova; Anna Kostikova; Sarah Schmid; Nils Arrigo; Mila Pajkovic; Michał Ronikier; Nadir Alvarez Journal: PLoS One Date: 2016-03-21 Impact factor: 3.240