| Literature DB >> 31283764 |
Camilla Fløjgaard1, Tobias Guldberg Frøslev2, Ane Kirstine Brunbjerg1, Hans Henrik Bruun3, Jesper Moeslund1, Anders Johannes Hansen2, Rasmus Ejrnæs1.
Abstract
Environmental DNA (eDNA) is increasingly applied in ecological studies, including studies with the primary purpose of criminal investigation, in which eDNA from soil can be used to pair samples or reveal sample provenance. We collected soil eDNA samples as part of a large national biodiversity research project across 130 sites in Denmark. We investigated the potential for soil eDNA metabarcoding in predicting provenance in terms of environmental conditions, habitat type and geographic regions. We used linear regression for predicting environmental gradients of light, soil moisture, pH and nutrient status (represented by Ellenberg Indicator Values, EIVs) and Quadratic Discriminant Analysis (QDA) to predict habitat type and geographic region. eDNA data performed relatively well as a predictor of environmental gradients (R2 > 0.81). Its ability to discriminate between habitat types was variable, with high accuracy for certain forest types and low accuracy for heathland, which was poorly predicted. Geographic region was also less accurately predicted by eDNA. We demonstrated the application of provenance prediction in forensic science by evaluating and discussing two mock crime scenes. Here, we listed the plant species from annotated sequences, which can further aid in identifying the likely habitat or, in case of rare species, a geographic region. Predictions of environmental gradients and habitat types together give an overall accurate description of a crime scene, but care should be taken when interpreting annotated sequences, e.g. due to erroneous assignments in GenBank. Our approach demonstrates that important habitat properties can be derived from soil eDNA, and exemplifies a range of potential applications of eDNA in forensic ecology.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31283764 PMCID: PMC6613677 DOI: 10.1371/journal.pone.0202844
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Interpreting Ellenberg Indicator Values.
Ellipses showing the multivariate normal distribution of vegetation types plotted along Ellenberg indicator values for light, soil moisture, pH and nutrient status. Agricultural includes rotational fields, old fields and lays. The colors of the ellipses correspond to characteristic sites depicted on the right.
Habitat types and geographic origins and their description.
| Explanation | |
| High forest | High forest, characterized by tall trunks and with an average canopy height ≥ 9 m (n = 23) |
| Forest | Forests including woodlands and tall shrubs with and average canopy height ≥ 2 m (n = 53) |
| Agriculture | Rotational field, grass ley and fallow field (n = 14) |
| Dwarf shrub | Dominance of dwarf shrubs ( |
| Beech | Forest with |
| Oak | Forest with deciduous |
| Willow | Tall shrubland or low forest on moist to wet soils dominated by |
| Coniferous | Forest and tall shrubland dominated by coniferous trees and shrubs ( |
| Heathland | Northern Atlantic wet heath with |
| Alder | Swamp forest with |
| Reed swamp | Wetlands dominated by |
| Explanation | |
| Atlantic | Located in the Atlantic biogeographic region of Denmark (as opposed to the Continental region) (n = 35) |
| Jutland | Located in Jutland, the mainland of Denmark (as opposed to the Islands, i.e., Funen, Zealand, Lolland and Møn) (n = 78) |
The habitat types are binary, i.e., the characteristic is either present or absent at a site.
Predicting environmental gradients of light, soil moisture, pH and nutrient status from variation in soil eDNA.
| EIV | Fungi | mt16S-‘Insects’ | Eukaryotes | Plants |
|---|---|---|---|---|
| Light | 0.76 | 0.59 | 0.55 | 0.65 |
| Light (GAM) | - | - | - | |
| Moisture | 0.80 | 0.84 | 0.78 | |
| pH | 0.75 | 0.79 | 0.72 | |
| Nutrient | 0.82 | 0.64 | 0.71 | |
| N | 130 | 130 | 130 | 130 |
Model R2 values of linear models with EIVs of light, soil moisture, pH and nutrient status as response variable and NMS ordination axes of soil OTU community composition of fungi, insects, eukaryotes and plants as explanatory variables. Text in bold indicates the best model for each environmental gradient used for prediction.
Predicting habitat types from variation in soil eDNA.
| Habitat type | Fungi | mt16S ‘insect’ | Eukaryotes | Plants | With sequences |
|---|---|---|---|---|---|
| HighForest | 0.90 (0.78) | 0.82 (0) | 0.89 (0.74) | - | |
| Forest | 0.85 (0.81) | 0.94 (0.92) | 0.92 (0.83) | - | |
| Agriculture | 0.92 (0.64) | 0.95 (0.5) | 0.92 (0.71) | ||
| Coniferous | 0.93 (0.00) | 0.93 (0.00) | 0.93 (0.00) | 0.93 (0.00) | 0.96 (0.44) |
| Beech | 0.92 (0.71) | 0.87 (0) | 0.95 (0.88) | 0.95 (0.76) | |
| Oak | 0.92 (0.00) | 0.92 (0.00) | 0.92 (0.00) | 0.95 (0.45) | |
| Willow | 0.92 (0.00) | 0.92 (0.00) | 0.93 (0.30) | 0.95 (0.60) | |
| Heathland | 0.95 (0.00) | 0.95 (0.00) | 0.95 (0.00) | 0.95 (0.00) | 0.92 (0.50) |
| Dwarf shrubs | 0.78 (0.00) | 0.89 (0.71) | 0.87 (0.75) | - | |
| Alder | 0.88 (0.00) | 0.9 (0.47) | 0.9 (0.20) | 0.93 (0.42) | |
| Reed swamp | 0.88 (0.50) | 0.89 (0.50) | 0.9 (0.39) | 0.88 (0.18) | |
| Atlantic | 0.73 (0.00) | 0.77 (0.26) | 0.73 (0.00) | 0.73 (0.00) | - |
| Jutland | 0.65 (0.90) | 0.66 (0.91) | 0.62 (0.86) | - |
Fig 2Mock crime scene 1 and the provenance derived from soil eDNA.
Predicted location in environmental space (Ellenberg Indicator Values) based on the best linear models and GAM for light (top left). The cross marks the predicted EIV value and the length of the arms show the 95% confidence intervals. Blue dots show the actual EIVs at the sample site. The predicted probability of binary habitat types based on Quadratic Discriminant Analysis are shown as red lines on a box plot of the distribution of predicted values for each characteristic (middle part). A priori classification membership is indicated by asterisk of the 0/1 variable. The probabilities for heathland, coniferous and Atlantic are not shown. A list of plant species sequences, their frequency of sequences (FS) in the sample, and if applicable, the geographic region of their distribution in Denmark is listed bottom left. For evaluation, we provide a picture of the sample location (bottom right).
Fig 3Mock crime scene 2 and the provenance derived from soil eDNA.
Predicted location in environmental space (Ellenberg Indicator Values) based on the best linear models and GAM for light (top left). The cross marks the predicted EIV value and the length of the arms show the 95% confidence intervals. Blue dots show the actual EIVs at the sample site. The predicted probability of binary habitat types based on Quadratic Discriminant Analysis are shown as red lines on a box plot of the distribution of predicted values for each characteristic (middle part). A priori classification membership is indicated by asterisk of the 0/1 variable. The probabilities for heathland, coniferous and Atlantic are not shown. A list of plant species sequences, their frequency of sequences (FS) in the sample, and if applicable, the geographic region of their distribution in Denmark is listed bottom left. For evaluation, we provide a picture of the sample location (bottom right).