| Literature DB >> 28406159 |
Chelsea Hatzenbuhler1,2, John R Kelly1, John Martinson3, Sara Okum2,4, Erik Pilgrim4.
Abstract
High-throughput DNA metabarcoding has gained recognition as a potentially powerful tool for biomonitoring, including early detection of aquatic invasive species (AIS). DNA based techniques are advancing, but our understanding of the limits to detection for metabarcoding complex samples is inadequate. For detecting AIS at an early stage of invasion when the species is rare, accuracy at low detection limits is key. To evaluate the utility of metabarcoding in future fish community monitoring programs, we conducted several experiments to determine the sensitivity and accuracy of routine metabarcoding methods. Experimental mixes used larval fish tissue from multiple "common" species spiked with varying proportions of tissue from an additional "rare" species. Pyrosequencing of genetic marker, COI (cytochrome c oxidase subunit I) and subsequent sequence data analysis provided experimental evidence of low-level detection of the target "rare" species at biomass percentages as low as 0.02% of total sample biomass. Limits to detection varied interspecifically and were susceptible to amplification bias. Moreover, results showed some data processing methods can skew sequence-based biodiversity measurements from corresponding relative biomass abundances and increase false absences. We suggest caution in interpreting presence/absence and relative abundance in larval fish assemblages until metabarcoding methods are optimized for accuracy and precision.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28406159 PMCID: PMC5390320 DOI: 10.1038/srep46393
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of experimental design for Trial A.
| Treatment | Target | Non-target | |||
|---|---|---|---|---|---|
| T1 | n/a | 25.00 | 25.00 | 25.00 | 25.00 |
| T2 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| T3 | 0.10 | 24.98 | 24.98 | 24.98 | 24.98 |
| T4 | 0.04 | 24.99 | 24.99 | 24.99 | 24.99 |
| T5 | 0.02 | 24.995 | 24.995 | 24.995 | 24.995 |
Tissue homogenates for each species were mixed prior to DNA extractions to achieve the following biomass ratios. Approximate relative biomass abundance per taxon as a percent of total sample biomass for single species controls (not listed) and treatment replicates (single species control, n = 1 per species; T1, nreplicate = 1; T2–T5 nreplicate = 4; ntotal = 22). Common names for taxa from left to right; Tubenose Goby, Spottail Shiner, Rock Bass, Northern Pike, Three Spine Stickleback.
Summary of experimental design.
| Richness treatment | Target | Non-target | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| T1 | n/a | n/a | |||||||||
| T2 | 50 | 50 | |||||||||
| T3 | 1 | 99 | |||||||||
| T4 | 0.33 | 99.67 | |||||||||
| T5 | 0.167 | 99.83 | |||||||||
| T6 | 0.125 | 99.88 | |||||||||
| T1 | n/a | 25 | 25 | 25 | 25 | ||||||
| T2 | 20 | 20 | 20 | 20 | 20 | ||||||
| T3 | 1 | 24.75 | 24.75 | 24.75 | 24.75 | ||||||
| T4 | 0.33 | 24.92 | 24.92 | 24.92 | 24.92 | ||||||
| T5 | 0.167 | 24.96 | 24.96 | 24.96 | 24.96 | ||||||
| T6 | 0.125 | 24.97 | 24.97 | 24.97 | 24.97 | ||||||
| T7 | 0.05 | 24.99 | 24.99 | 24.99 | 24.99 | ||||||
| T1 | n/a | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 |
| T2 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 | 9.09 |
| T3 | 1 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 | 9.9 |
| T4 | 0.33 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 | 9.97 |
| T5 | 0.167 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 | 9.98 |
| T6 | 0.125 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 | 9.99 |
Trial B richness (S) subsets with low (S = 2) intermediate (S = 5) and high (S = 11) species richness. Approximate relative biomass abundance per taxon as a percent of total sample biomass for single species controls (not listed) and treatment replicates (single species control, n = 1 per species; T1, nreplicate = 2; T2–T7 nreplicate = 4; ntotal = 79). Common names for taxa from left to right; Troutperch, White & Longnose Suckers, Yellow Perch, Johnny Darter, Tubenose Goby, Largemouth Bass, Rock Bass, Logperch, Rainbow Smelt, Golden Shiner, Northern Pike.
Equations (eq.) used to calculate (eq. 1) tissue homogenate (H, ) concentrations and (eq. 2, 3) homogenate volumes (V μL) for non-target (nt) and target (t), respectively.
For eq. 1, T is the total mass of cryogenic tissue homogenate used from a single species. For eq. 2 total sample mass (T) and number of non-target taxa (S) and for eq. 3 (T) and probability of detection for target species P(D) at the corresponding ratio of target mass to total sample mass.
Figure 1Metabarcoding results from larval fish tissue samples constructed for Trial A before setting false presence filtering thresholds.
The observed distribution of genetic signals as the percent of total sequences (y-axis) recovered for species in Treatments 1–5 (x –axis; T1, n = 1; T2–T5, nreplicate = 4; ntotal = 17) constructed with equal proportions of biomass per non-target species (T1–T5) and spiked with decreasing amounts of target (P. semilunaris) tissue (T2–T5). The genetic signal for the target taxon was observed in two replicates in T3, one replicate in T5 and was not present in T4 replicates.
Figure 2Distribution of genetic signals as the percent of total sequences recovered for each taxon (pie chart values) obtained from metabarcoded larval fish tissue samples constructed for Treatment 2 (T2) in Trial B subsets before setting false presence filtering thresholds.
Panel (a) Low richness subset (nreplicate = 4). Panel (b) Intermediate richness subset (nreplicate = 4). Panel (c) High richness subset (nreplicate = 4). T2 replicates were constructed with equal proportions of biomass between the target (P. omiscomaycus) and all non-target species in each particular subset.
Figure 3Metabarcoding results from larval fish tissue samples constructed for Trial B subsets before setting false presence filtering thresholds.
Panel (a) Intermediate richness subset (S = 5). Panel (b) High richness subset (S = 11). The genetic signal distribution as the percent of total sequences recovered for species in each treatment (T1, nreplicate = 2, T3–T7, nreplicate = 4; S = 5, ntotal = 22; S = 11, ntotal = 18) constructed with equal proportions of biomass per non-target species (T1, T3–T7) and spiked with decreasing amounts of target (P. omiscomaycus) tissue (T3–T7).