| Literature DB >> 32191699 |
Beverly McClenaghan1, Zacchaeus G Compson1, Mehrdad Hajibabaei1,2,3.
Abstract
Environmental DNA (eDNA) metabarcoding is an increasingly popular method for rapid biodiversity assessment. As with any ecological survey, false negatives can arise during sampling and, if unaccounted for, lead to biased results and potentially misdiagnosed environmental assessments. We developed a multi-scale, multi-species occupancy model for the analysis of community biodiversity data resulting from eDNA metabarcoding; this model accounts for imperfect detection and additional sources of environmental and experimental variation. We present methods for model assessment and model comparison and demonstrate how these tools improve the inferential power of eDNA metabarcoding data using a case study in a coastal, marine environment. Using occupancy models to account for factors often overlooked in the analysis of eDNA metabarcoding data will dramatically improve ecological inference, sampling design, and methodologies, empowering practitioners with an approach to wield the high-resolution biodiversity data of next-generation sequencing platforms.Entities:
Year: 2020 PMID: 32191699 PMCID: PMC7082047 DOI: 10.1371/journal.pone.0224119
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic illustration of the three stochastic levels included in the multi-scale, multi-species occupancy model.
Primer pairs used to amplify five target amplicons in the COI region of the mitochondrial genome from water samples collected in Conception Bay, Newfoundland, Canada.
| Marker | Target Length (bp) | Forward Primer | Reverse Primer | Reference |
|---|---|---|---|---|
| Fishe (Mini_SH-E) | 226 | [ | ||
| Fishc (Mini_SH-C) | 127 | [ | ||
| F230 | 235 | [ | ||
| Leray | 330 | [ | ||
| BR5 | 310 | [ |
Model comparison between multi-scale, multi-species occupancy models using four methods (WAIC, CPO, Brier score and Log Score).
The covariates (water depth at the sampling site, sequencing depth for each technical replicate, and amplicon sequenced for each technical replicate) included at each level of the model (occupancy: ψ, capture: ϴ, detection: p) are listed on the left. Bolded values indicate the best model for each method of model comparison.
| MODELS | WAIC | CPO | Brier Score | Log Score |
|---|---|---|---|---|
| 13,340 | 5,621,109 | 1842 | ||
| 33,626 | 14,217,497 | 217 | 2034 | |
| 12,993 | 4,061,736 | 1834 | ||
| 35,894 | 20,793,808 | 135 | 2823 | |
| 13,135 | 2,748,236 | 143 | 1759 | |
| 35,649 | 17,897,824 | 224 | 2732 | |
| 142 | ||||
| 33,499 | 16,555,815 | 133 | 2405 |
Fig 2(A) Community mean occupancy by water depth (m) and (C) community mean probability of detection by sequencing depth predicted using a multi-species, multi-scale community occupancy model. The dashed lines represent the 95% confidence interval. Parameter estimate for each species for (B) the effect of water depth on occupancy and (D) the effect of sequencing depth on detection in a multi-species, multi-scale community occupancy model. Solid red line indicates the community mean and dashed red lines indicate the upper and lower limits of the 95% confidence intervals of the community mean parameter estimate. Blue lines indicate 95% confidence intervals of individual species parameter estimates that do not overlap with 0. Grey lines indicate 95% confidence intervals of individual species parameter estimates that do overlap with 0.
Fig 3Mean detection probability estimated from occupancy model 9 (ψ(.) ϴ(.) p(amplicon)) for each species plotted by amplicon.
The band in the middle of the box represents the median and the upper and lower edges of the box represent the upper and lower quartiles. The whiskers represent 1.5 times the inter-quartile range.
Fig 4Mean detection probability estimated from occupancy model 9 (ψ(.) ϴ(.) p(amplicon)) for each species plotted by amplicon and phylum for metazoan phyla only.
The band in the middle of the box represents the median and the upper and lower edges of the box represent the upper and lower quartiles. The whiskers represent 1.5 times the inter-quartile range.