| Literature DB >> 28542313 |
Katy E Klymus1, Nathaniel T Marshall1, Carol A Stepien1.
Abstract
Describing and monitoring biodiversity comprise integral parts of ecosystem management. Recent research coupling metabarcoding and environmental DNA (eDNA) demonstrate that these methods can serve as important tools for surveying biodiversity, while significantly decreasing the time, expense and resources spent on traditional survey methods. The literature emphasizes the importance of genetic marker development, as the markers dictate the applicability, sensitivity and resolution ability of an eDNA assay. The present study developed two metabarcoding eDNA assays using the mtDNA 16S RNA gene with Illumina MiSeq platform to detect invertebrate fauna in the Laurentian Great Lakes and surrounding waterways, with a focus for use on invasive bivalve and gastropod species monitoring. We employed careful primer design and in vitro testing with mock communities to assess ability of the markers to amplify and sequence targeted species DNA, while retaining rank abundance information. In our mock communities, read abundances reflected the initial input abundance, with regressions having significant slopes (p<0.05) and high coefficients of determination (R2) for all comparisons. Tests on field environmental samples revealed similar ability of our markers to measure relative abundance. Due to the limited reference sequence data available for these invertebrate species, care must be taken when analyzing results and identifying sequence reads to species level. These markers extend eDNA metabarcoding research for molluscs and appear relevant to other invertebrate taxa, such as rotifers and bryozoans. Furthermore, the sphaeriid mussel assay is group-specific, exclusively amplifying bivalves in the Sphaeridae family and providing species-level identification. Our assays provide useful tools for managers and conservation scientists, facilitating early detection of invasive species as well as improving resolution of mollusc diversity.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28542313 PMCID: PMC5436814 DOI: 10.1371/journal.pone.0177643
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of targeted species for the MOL16S assay.
| Species | GenBank accession numbers | ||
|---|---|---|---|
| golden mussel | (1) JQ267790 | ||
| (3) AF507051–52, EF414448 (1) AF038998 (1) KY426891 | |||
| (2) AF038996, KY426895 (9) DQ333745–46, AF507047–48, AY302247, JX099457, KY426892-94 (1) JQ348913 (1) KY426896 | |||
| (14) GBXKY426897-902, DQ333747–48, EF414464–66, AF038997, JX099458, AF507049 | |||
| (7) AB522656, AF038999, AF152024, DQ280039, KC429294, KY426903-04 | |||
| (1) GU128616 (18) GU128617–35 (2) KY426905-06 | |||
| (1) KY426907 | |||
| greater European peaclam | (7) EU559086–89, AY093572, DQ062609–10 | ||
| henslow peaclam | (1) DQ062644 (8) EU559115–22, DQ062620–22 (1) KF483297 | ||
| pygmy peaclam | (3) DQ062626, DQ06262628–29 (1) DQ06262627 | ||
| humpbacked peaclam | (7) DQ062646–50, EU559148, AY093569 | ||
| (1) AY093560 (2) AF152029, KY426908 (1) AY957810 (1) AY957812 | |||
| (1) AY634079 (1) AY955392 (2) AY9553886–87 (3) AY955388–89, AY634106 (1) AY955377 (1) AY955378 (1) AY955381 (1) AY955391 (8) AY955379–80, AY955382–85, AY634104, AY634107 (5) JN639013–14, KY426909, JQ346706, AY955393 (1) AY634109 (10) JQ346702–05, JQ346708–09, AY634080, AY955376, AY314009, EU573989 | |||
| (2) AF445344, JX970531 (1) FJ160288 | |||
| (1) KY426912 | |||
| (1) AF101006 (1) KP284119 (1) KP284124 (5) KP284118, KP284120, KP284122, KP2841227, AY010517 (9) KP774656–58, KP774669–72, AY791930–31 (23) AY791911, AY791915, KP774640, KP774644–45, KP774647–55, KP774659, KP774661–62, KP774664–68, KY426914 (1) KP284121 (1) AY456618 (5) AY791910, KP774660, KP774663, AY456616–17 (1) KP284126 (1) AY791914 (7) KP774633–39 (3) KP284125, KP284128–29 | |||
| island apple snail | |||
| European ear snail | |||
| (1) FJ917248 (1) KY426915 | |||
| banded mystery snail | (1) AY377626 |
Parentheses denote unique OTUs for the MOL16S amplicon, with the number inside representing the number of sequences belonging to that OTU. Accession numbers are for 16S sequence data for each species.
* indicates invasive species whose extractions were used to test primers in vitro.
Ŧ indicates native or non-invasive species also used to test primers. All other species are invasive in the Great Lakes.
Composition of the five mock communities showing the target amplicon copy number per extraction.
| Species | Mock Communities | ||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | |
| 9090 | 1818 | 363 | 72 | 14 | |
| 1818 | 363 | 72 | 14 | 9090 | |
| 363 | 72 | 14 | 9090 | 1818 | |
| 72 | 14 | 9090 | 1818 | 363 | |
| 14 | 9090 | 1818 | 363 | 72 | |
| 2272 | 1136 | 568 | 284 | 142 | |
| 1136 | 568 | 284 | 142 | 2272 | |
| 568 | 284 | 142 | 2272 | 1136 | |
| 284 | 142 | 2272 | 1136 | 568 | |
| 142 | 2272 | 1136 | 568 | 284 | |
| 18 | 18 | 18 | 18 | 18 | |
Number of individuals from each species placed into two aquaria for later eDNA sampling.
| Species | Approximate | Approximate |
|---|---|---|
| 40 | 40 | |
| 10 | 0 | |
| 10 | 10 | |
| 4 | 1 | |
| Pleurocerid snails | 3 | 3 |
Primers used for library preparation.
| Assay | Primers | Sequence 5’– 3’ |
|---|---|---|
| MOL16S | MOL16S_E_F | |
| MOL16S_F_F | ||
| MOL16S_G_F | ||
| MOL16S_H_F | ||
| MOL16S_E_R | ||
| MOL16S_F_R | ||
| MOL16S_G_R | ||
| MOL16S_H_R | ||
| SPH16S | SPH16S_E_F | |
| SPH16S_F_F | ||
| SPH16S_G_F | ||
| SPH16S_H_F | ||
| SPH16S_E_R | ||
| SPH16S_F_R | ||
| SPH16S_G_R | ||
| SPH16S_H_R |
Assay specific primers (in bold) with a 7–17 nucleotide spacer (underlined) and the sequencing primer region (italicized). 5’ to 3’ direction for all primers, the last character in the primer name indicates direction (F = forward, R = reverse).
Number of reads, percentage observed and percentage expected for each of the five mock community samples run with the MOL16S assay and fish blocking primer.
| 138075 | 43.11 | 57.62 | 3745 | 2.15 | 11.52 | 885 | 0.53 | 2.30 | |
| 85010 | 26.54 | 11.52 | 2174 | 1.25 | 2.30 | 341 | 0.20 | 0.46 | |
| 13 | 0.00 | 2.30 | 0 | 0.00 | 0.46 | 0 | 0.00 | 0.09 | |
| 1385 | 0.43 | 0.46 | 42 | 0.02 | 0.09 | 112573 | 67.28 | 57.62 | |
| 195 | 0.06 | 0.09 | 149359 | 85.89 | 57.62 | 15359 | 9.18 | 11.52 | |
| 23002 | 7.18 | 14.40 | 2536 | 1.46 | 7.20 | 800 | 0.48 | 3.60 | |
| 59973 | 18.72 | 7.20 | 5166 | 2.97 | 3.60 | 2686 | 1.61 | 1.80 | |
| 5594 | 1.75 | 3.60 | 565 | 0.32 | 1.80 | 209 | 0.12 | 0.90 | |
| 6235 | 1.95 | 1.80 | 783 | 0.45 | 0.90 | 31795 | 19.00 | 14.40 | |
| 711 | 0.22 | 0.90 | 9513 | 5.47 | 14.40 | 2654 | 1.59 | 7.20 | |
| 99 | 0.03 | 0.11 | 12 | 0.01 | 0.11 | 13 | 0.01 | 0.11 | |
| 773 | 0.29 | 0.46 | 30 | 0.01 | 0.09 | ||||
| 244 | 0.09 | 0.09 | 163825 | 78.40 | 57.62 | ||||
| 2679 | 1.02 | 57.62 | 63 | 0.03 | 11.52 | ||||
| 116994 | 44.64 | 11.52 | 2199 | 1.05 | 2.30 | ||||
| 17620 | 6.72 | 2.30 | 331 | 0.16 | 0.46 | ||||
| 1794 | 0.68 | 1.80 | 246 | 0.12 | 0.90 | ||||
| 4872 | 1.86 | 0.90 | 33459 | 16.01 | 14.40 | ||||
| 43807 | 16.71 | 14.40 | 3752 | 1.80 | 7.20 | ||||
| 67309 | 25.68 | 7.20 | 4494 | 2.15 | 3.60 | ||||
| 5921 | 2.26 | 3.60 | 536 | 0.28 | 1.80 | ||||
| 84 | 0.03 | 0.11 | 26 | 0.01 | 0.11 | ||||
Fig 1Regressions of log 10 transformed observed read percentages versus log 10 transformed expected read percentages for each of the five mock communities run with the MOL16S assay.
Fig 2Regressions of log 10 transformed observed read percentages versus log 10 transformed expected read percentages for each of the species in the mock communities run with the MOL16S assay.
Number of reads, percentage observed and percentage expected for each of the mock community samples run with and without the fish blocking primer and the MOL16S assay.
| 885 | 0.529 | 718 | 0.496 | 773 | 0.295 | 314 | 0.229 | |
| 341 | 0.204 | 284 | 0.196 | 244 | 0.093 | 257 | 0.187 | |
| 0 | 0.000 | 2 | 0.001 | 2679 | 1.022 | 11956 | 8.717 | |
| 112573 | 67.282 | 100862 | 69.734 | 116994 | 44.638 | 54766 | 39.931 | |
| 15359 | 9.180 | 12823 | 8.866 | 17620 | 6.723 | 7461 | 5.440 | |
| 800 | 0.478 | 701 | 0.485 | 1794 | 0.684 | 1055 | 0.769 | |
| 2686 | 1.605 | 1924 | 1.330 | 4872 | 1.859 | 2276 | 1.659 | |
| 209 | 0.125 | 144 | 0.100 | 43807 | 16.714 | 23583 | 17.195 | |
| 31795 | 19.003 | 25147 | 17.386 | 67309 | 25.681 | 32549 | 23.732 | |
| 2654 | 1.586 | 2024 | 1.399 | 5921 | 2.259 | 2876 | 2.097 | |
| 13 | 0.008 | 9 | 0.006 | 84 | 0.032 | 57 | 0.042 | |
| 49883 | 64.312 | 29488 | 44.726 | |||||
| 2253 | 2.905 | 1449 | 2.198 | |||||
| 121 | 0.156 | 31 | 0.047 | |||||
| 0 | 0.000 | 0 | 0.000 | |||||
| 11929 | 15.380 | 27159 | 41.194 | |||||
| 12078 | 15.572 | 7164 | 10.867 | |||||
| 19 | 0.024 | 5 | 0.008 | |||||
| 992 | 1.279 | 205 | 0.311 | |||||
| 18 | 0.023 | 6 | 0.009 | |||||
| 0 | 0.000 | 0 | 0.000 | |||||
Number of reads, percentage observed and percentage expected for each of the mock community samples run with the SPH16S assay.
| Mock Community 1 | Mock Community 2 | Mock Community 4 | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Species | Read Numbers | % Observed | % Expected | Read Numbers | % Observed | % Expected | Read Numbers | % Observed | % Expected |
| 108760 | 99.480 | 99.063 | 1461 | 1.437 | 16.645 | 500 | 0.006 | 3.196 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
| 551 | 0.504 | 0.785 | 16 | 0.016 | 0.128 | 82990 | 92.219 | 80.692 | |
| 17 | 0.016 | 0.153 | 100190 | 98.547 | 83.227 | 6502 | 7.225 | 16.112 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0,000 | 0 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
| 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | 0 | 0.000 | 0.000 | |
Fig 3Bar charts denoting the proportion of reads for each assay (MOL16S and SPH16S) that were recovered from sequencing of water samples from tanks A and B. The middle bar shows the original composition of the tanks.
Number of reads per OTU identified at 97% similarity via BLAST search of aquaria samples.
| Assay and Sample | Sequence ID Accession Number | Species | Number of Reads | Identity (%) | Coverage (%) | Percentage of total reads (%) | Percentage of just the 97% ID reads (%) |
|---|---|---|---|---|---|---|---|
| MOL16S Tank A | |gb|AF152045 | 25707 | 98–99 | 100 | 8.989 | 86.737 | |
| |gb|KP052744 | 3037 | 100 | 100 | 1.062 | 10.247 | ||
| |gb|AF038996 | 235 | 100 | 100 | 0.082 | 0.793 | ||
| |gb|KF889403 | 218 | 100 | 100 | 0.076 | 0.736 | ||
| |gb|AY957830 | 132 | 99 | 100 | 0.046 | 0.445 | ||
| |dbj|AB365626 | 117 | 100 | 100 | 0.041 | 0.395 | ||
| |gb|DQ459934 | 74 | 97–99 | 100 | 0.026 | 0.250 | ||
| |gb|JN681057 | 56 | 100 | 100 | 0.020 | 0.189 | ||
| |gb|AY957811 | 52 | 99 | 100 | 0.018 | 0.175 | ||
| |gb|AF152044 | 10 | 98 | 100 | 0.003 | 0.034 | ||
| MOL16STank B | |gb|AF152045 | 21806 | 99 | 100 | 7.922 | 93.128 | |
| |gb|KP052744 | 1286 | 100 | 100 | 0.467 | 5.492 | ||
| |gb|JX099457 | 222 | 100 | 100 | 0.081 | 0.948 | ||
| |gb|KC429295 | 35 | 100 | 100 | 0.013 | 0.149 | ||
| |gb|JN681057 | 29 | 100 | 100 | 0.011 | 0.124 | ||
| |gb|GQ355403 | 25 | 99 | 100 | 0.009 | 0.107 | ||
| |gb|KX594326 | 7 | 100 | 100 | 0.003 | 0.030 | ||
| |dbj|AB365626 | 5 | 100 | 100 | 0.002 | 0.021 | ||
| SPH16STank A | |gb|AF152045 | 240165 | 99 | 100 | 99.520 | 99.959 | |
| |gb|AF152044 | 416 | 98 | 98 | 0.172 | 0.173 | ||
| |gb|AY957830 | 215 | 99 | 100 | 0.089 | 0.089 | ||
| |gb|KC429295 | 68 | 98 | 100 | 0.028 | 0.028 | ||
| SPH16STank B | |gb|AF152045 | 217725 | 99 | 100 | 99.783 | 99.978 | |
| ]gb|KC429295 | 48 | 100 | 100 | 0.022 | 0.022 |
*Denotes species from NCBI GenBank (https://www.ncbi.nlm.nih.gov/genbank/) not recognized on the World Register of Marine Species (WoRMS; http://www.marinespecies.org/).
** Pisidium casertanum is accepted as Euglesa casertana
Number of reads per OTU identified at 97% similarity via BLAST search of Maumee River water samples.
| Sample | Sequence ID Accession Number | Species | Number of Reads | Identity(%) | Coverage (%) | Percentage of total reads (%) | Percentage of the 97% ID reads (%) |
|---|---|---|---|---|---|---|---|
| Maumee River RM 26.7 | |dbj|AB365626 | 476 | 98 | 100 | 0.064 | 49.739 | |
| |gb|FJ426631 | 124 | 99 | 100 | 0.017 | 12.957 | ||
| |gb|HQ691208 | 78 | 99 | 100 | 0.011 | 8.150 | ||
| |gb|GQ343273 | 47 | 100 | 100 | 0.006 | 4.911 | ||
| |gb|KX372733 | 43 | 100 | 100 | 0.006 | 4.493 | ||
| |gb|EU038309 | 38 | 100 | 100 | 0.005 | 3.971 | ||
| |gb|CP012504 | 37 | 98 | 100 | 0.005 | 3.866 | ||
| |gb|AY093554 | 31 | 100 | 100 | 0.004 | 3.239 | ||
| |gb|FJ426630 | 18 | 99 | 100 | 0.002 | 1.881 | ||
| |gb|KP306894 | 14 | 100 | 100 | 0.002 | 1.463 | ||
| |dbj|AB365638 | 11 | 99 | 100 | 0.001 | 1.149 | ||
| |gb|JF275060 | 8 | 100 | 100 | 0.001 | 0.836 | ||
| |gb|DQ459934 | 7 | 99 | 100 | 0.001 | 0.731 | ||
| |gb|GQ466406 | 7 | 98 | 91 | 0.001 | 0.731 | ||
| |dbj|AB365640 | 5 | 100 | 100 | 0.001 | 0.522 | ||
| |gb|AY520093 | 4 | 100 | 100 | 0.001 | 0.418 | ||
| |gb|GQ343273 | 3 | 98 | 80 | 0.000 | 0.313 | ||
| |dbj|AB466325 | 2 | 100 | 100 | 0.000 | 0.209 | ||
| |dbj|AB365642 | 2 | 100 | 100 | 0.000 | 0.209 | ||
| |gb|AF038996 | 2 | 99 | 100 | 0.000 | 0.209 | ||
| Maumee River RM 58.1 | |dbj|AB365626 | 125320 | 97–100 | 98–100 | 14.204 | 65.371 | |
| |dbj|AB365631 | 28959 | 97–98 | 100 | 3.282 | 15.106 | ||
| |dbj|AB365642 | 16787 | 98–100 | 100 | 1.903 | 8.757 | ||
| |gb|KJ186046 | 8101 | 100 | 100 | 0.918 | 4.226 | ||
| |dbj|AB365623 | 3236 | 100 | 100 | 0.367 | 1.688 | ||
| |gb|KP306894 | 2744 | 100 | 100 | 0.311 | 1.431 | ||
| |gb|AF038996 | 1760 | 98–99 | 100 | 0.199 | 0.918 | ||
| |dbj|AB365640 | 1581 | 100 | 100 | 0.179 | 0.825 | ||
| |gb|DQ311116 | 1160 | 100 | 100 | 0.131 | 0.605 | ||
| |gb|KX372733 | 795 | 100 | 100 | 0.090 | 0.415 | ||
| |gb|AY216556 | 391 | 97–100 | 94–100 | 0.044 | 0.204 | ||
| |gb|KM051966 | 380 | 99 | 100 | 0.043 | 0.198 | ||
| |gb|GQ343301 | 140 | 97 | 100 | 0.016 | 0.073 | ||
| |dbj|AB126083 | 132 | 100 | 100 | 0.015 | 0.069 | ||
| |gb|AF038486 | 106 | 99 | 100 | 0.012 | 0.055 | ||
| |dbj|AB466325 | 96 | 100 | 100 | 0.011 | 0.050 | ||
| |gb|GQ343296 | 7 | 98 | 99 | 0.001 | 0.004 | ||
| |dbj|AB365622 | 5 | 97 | 100 | 0.001 | 0.003 | ||
| |gb|KF162319 | 4 | 97 | 93 | 0.000 | 0.002 | ||
| |gb|FJ426630 | 3 | 98 | 96 | 0.000 | 0.002 | ||
| Maumee River RM 76.1 | |gb|KP279638 | 903 | 100 | 100 | 0.255 | 24.048 | |
| |dbj|AB365623 | 601 | 100 | 100 | 0.170 | 16.005 | ||
| |gb|AY093554 | 592 | 100 | 100 | 0.167 | 15.770 | ||
| |gb|DQ311116 | 427 | 100 | 100 | 0.121 | 11.372 | ||
| |gb|AY885589 | 341 | 99 | 100 | 0.096 | 9.081 | ||
| |gb|AY216556 | 251 | 100 | 100 | 0.071 | 6.68 | ||
| |gb|FJ372631 | 140 | 98 | 100 | 0.040 | 3.73 | ||
| |gb|DQ536422 | 118 | 100 | 100 | 0.033 | 3.142 | ||
| |gb|KX353761 | 91 | 100 | 100 | 0.026 | 2.423 | ||
| |gb|KP013098 | 88 | 99 | 100 | 0.025 | 2.343 | ||
| |dbj|AB365628 | 69 | 100 | 100 | 0.019 | 1.838 | ||
| |gb|KR476977 | 55 | 100 | 100 | 0.016 | 1.464 | ||
| |gb|DQ912062 | 34 | 100 | 100 | 0.010 | 0.905 | ||
| |gb|AF152045 | 29 | 99 | 100 | 0.008 | 0.772 | ||
| |gb|KP013087 | 16 | 100 | 100 | 0.005 | 0.426 | ||
| Maumee River Blank | |gb|AF152044 | 63 | 99 | 100 | |||
| |dbj|AB365642 | 1 | 99 | 100 | ||||
| |dbj|AB365626 | 1 | 99 | 100 | ||||
| |gb|GQ343301 | 1 | 95 | 100 | ||||
| |gb|AF152044 | 1 | 94 | 100 | ||||
| |dbj|AB365641 | 1 | 94 | 97 | ||||
| |gb|AF499051 | 1 | 89 | 75 | ||||
| |gb|AF325131 | 1 | 87 | 79 | ||||
| |gb|AF499051 | 1 | 87 | 75 |
* Denotes species from NCBI GenBank (https://www.ncbi.nlm.nih.gov/genbank/) not recognized in the World Register of Marine Species (WoRMS; http://www.marinespecies.org/).
Fig 4Pie charts comparing the molecular and morphological/ visual species identification methods from three different water samples taken on the Maumee River.