| Literature DB >> 29020743 |
Alfred J Arulandhu1,2, Martijn Staats1, Rico Hagelaar1, Marleen M Voorhuijzen1, Theo W Prins1, Ingrid Scholtens1, Adalberto Costessi3, Danny Duijsings3, François Rechenmann4, Frédéric B Gaspar5, Maria Teresa Barreto Crespo5, Arne Holst-Jensen6, Matthew Birck7, Malcolm Burns8, Edward Haynes9, Rupert Hochegger10, Alexander Klingl11, Lisa Lundberg12, Chiara Natale13, Hauke Niekamp14, Elena Perri15, Alessandra Barbante15, Jean-Philippe Rosec16, Ralf Seyfarth17, Tereza Sovová18, Christoff Van Moorleghem19, Saskia van Ruth1,2, Tamara Peelen20, Esther Kok1.
Abstract
DNA metabarcoding provides great potential for species identification in complex samples such as food supplements and traditional medicines. Such a method would aid Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) enforcement officers to combat wildlife crime by preventing illegal trade of endangered plant and animal species. The objective of this research was to develop a multi-locus DNA metabarcoding method for forensic wildlife species identification and to evaluate the applicability and reproducibility of this approach across different laboratories. A DNA metabarcoding method was developed that makes use of 12 DNA barcode markers that have demonstrated universal applicability across a wide range of plant and animal taxa and that facilitate the identification of species in samples containing degraded DNA. The DNA metabarcoding method was developed based on Illumina MiSeq amplicon sequencing of well-defined experimental mixtures, for which a bioinformatics pipeline with user-friendly web-interface was developed. The performance of the DNA metabarcoding method was assessed in an international validation trial by 16 laboratories, in which the method was found to be highly reproducible and sensitive enough to identify species present in a mixture at 1% dry weight content. The advanced multi-locus DNA metabarcoding method assessed in this study provides reliable and detailed data on the composition of complex food products, including information on the presence of CITES-listed species. The method can provide improved resolution for species identification, while verifying species with multiple DNA barcodes contributes to an enhanced quality assurance.Entities:
Keywords: CITES; COI; Customs agencies; DNA metabarcoding; Endangered species; Traditional medicines; cyt b; matK; mini-barcodes; rbcL
Mesh:
Substances:
Year: 2017 PMID: 29020743 PMCID: PMC5632295 DOI: 10.1093/gigascience/gix080
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Overview of the PCR primer sets used in this study for amplifying plant and animal DNA barcodes and mini-barcodes.
| DNA marker | Primer name | Primer sequence 5΄-3΄ | Amplicon length (nt) | Reference |
|---|---|---|---|---|
| Universal animal DNA barcodes and mini-barcodes | ||||
| 16S | 16sar-L | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCGCCTGTTTATCAAAAACAT | 500–600 | Palumbi [ |
| 16sar-H | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCGGTCTGAACTCAGATCACGT | |||
| Mini-16S | 16S-forward | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGAYAAGACGAGAAGACCC | 250 | Sarri et al. [ |
| 16S-reverse | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGATTGCGCTGTTATTCC | |||
| COI[ | LepF1_t1 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGATTCAACCAATCATAAAGATATTGG | 648c | Modified from Ivanova et al. [ |
| VF1_t1 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTTCTCAACCAACCACAAAGACATTGG | |||
| VF1d_t1 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTTCTCAACCAACCACAARGAYATYGG | |||
| VF1i_t1 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTTCTCAACCAACCAIAAIGAIATIGG | |||
| LepR1_t1 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTAAACTTCTGGATGTCCAAAAAATCA | |||
| VR1d_t1 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTAGACTTCTGGGTGGCCRAARAAYCA | |||
| VR1_t1 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTAGACTTCTGGGTGGCCAAAGAATCA | |||
| VR1i_t1 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTAGACTTCTGGGTGICCIAAIAAICA | |||
| Mini-COI | mlCOIintF | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGGWACWGGWTGAACWGTWTAYCCYCC | 313 | Leray et al. [ |
| jgHCO2198 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTAIACYTCIGGRTGICCRAARAAYCA | |||
| cyt | L14816 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCATCCAACATCTCAGCATGATGAAA | 743 | Palumbi [ |
| CB3-H | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGGCAAATAGGAARTATCATTC | |||
| Mini-cyt | L14816 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCATCCAACATCTCAGCATGATGAAA | 357 | Parson et al. [ |
| H15173 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCCTCAGAATGATATTTGTCCTCA | |||
| Universal plant DNA barcodes and mini-barcodes | ||||
|
| matK-KIM1R | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGACCCAGTCCATCTGGAAATCTTGGTTC | 656–889 | Fazekas et al. [ |
| matK-KIM3F | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCGTACAGTACTTTTGTGTTTACGAG | |||
|
| matK-390f | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCGATCTATTCATTCAATATTTC | 656–889 | Cuénoud et al. [ |
| matK-1326r | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTCTAGCACACGAAAGTCGAAGT | |||
|
| rbcLa-F | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGATGTCACCACAAACAGAGACTAAAGC | 654 | Levin et al. [ |
| rbcLa-R | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGTAAAATCAAGTCCACCRCG | Kress and Erickson [ | ||
|
| rbcL a-F | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGATGTCACCACAAACAGAGACTAAAGC | 607 | Levin et al. [ |
| rbcLajf634R | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGAAACGGTCTCTCCAACGCAT | Fazekas et al. [ | ||
| Mini- | F52 | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTTGGATTCAAAGCTGGTGTTA | 140c | Little [ |
| R193 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCVGTCCAMACAGTWGTCCATGT | |||
|
| c | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCGAAATCGGTAGACGCTACG | 767 | Taberlet et al. [ |
| d | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGGGGATAGAGGGACTTGAAC | |||
|
| g | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGGGCAATCCTGAGCCAA | 10–143c | Taberlet et al. [ |
| h | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCATTGAGTCTCTGCACCTATC | |||
| ITS2 | S2F | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGATGCGATACTTGGTGTGAAT | 160–320c | Chen et al. [ |
| S3R | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACGCTTCTCCAGACTACAAT | |||
|
| psbAf | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTTATGCATGAACGTAATGCTC | 264–792 | Sang et al. [ |
| trnH2 | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCGCGCATGGTGGATTCACAATCC | |||
The shaded text represents the sequence of the Illumina overhang adapters.
aModified COI cocktail primers without M13-tails were used [21].
bThe primers were not included in the final panel of DNA barcodes.
cAmplicon length excluding primers.
Figure 1:Schematic representation of the CITESspeciesDetect pipeline.
Pilot study: composition of the experimental mixtures and taxa identified using the default setting of the CITESspeciesDetect pipeline.
| Experimental mixtures | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Species/genus | Common name | EM1 | EM2 | EM3 | EM4 | EM5 | EM6 | EM7 | EM8 | EM9 | EM10 | EM10R | EM11 | EM11R | EM12 | EM13 | EM14 | EM15 |
|
| Cattle | 99% (S) | 90% (S) | 1% (S) | 10% (S) | 99% (S) | 95% (S) | 85% (S) | 10% (S) | 10% (S) | 46% (S) | 46% (S) | 95% (S) | 85% (S) | ||||
|
| Shrimp | 1% | 3% | 10% | 10% | 1% | 1% | 1% | 3% | |||||||||
|
| European eel | 1% | 3% |
|
|
|
|
|
| |||||||||
|
| Nile crocodile |
|
|
|
| |||||||||||||
|
| Domestic chicken | 1% (S) | 3% (S) | 10% (S) | 10% (S) | 1% (S) | 1% (S) | 1% (S) | 3% (S) | |||||||||
|
| Large white (caterpillar) | 1% (S) | 3% (S) | 10% (S) | 10% (S) | 1% (S) | 1% (S) | 1% (S) | 3% (S) | |||||||||
|
| Barrel cactus |
|
|
|
|
|
|
|
| |||||||||
|
| Spurge |
|
|
|
|
|
|
|
| |||||||||
|
| Tiger aloe | 1% (F) | 1% (F) | 3% (F) | 10% (F) | 10% (F) | 1% (F) | 1% (F) | 1% (F) | 3% (F) | ||||||||
|
| Dendrobium (orchid) |
|
|
|
| |||||||||||||
|
| Sago palm | 1% | 3% |
|
| 1% |
|
|
| |||||||||
|
| Lettuce | 1% (S) | 10% (S) | 99% (S) | 90% (S) | 95% (S) | 85% (S) | 10% (S) | 10% (G) | 46% (S) | 46% (S) | 95% (S) | 85% (S) | |||||
Taxa were identified at the species level unless otherwise indicated in parentheses. Cells highlighted in grey indicate that taxa that were not identified. Identified taxa listed by CITES are highlighted in bold.
The symbol next to percentage indicates the taxonomic resolution of the identified taxon: (F): family level; (G): genus level; (S): species level.
aSpecies listed by CITES.
b Aloe variegata (synonym Gonialoe variegata) was recently assigned to the genus Gonialoe [55].
Pilot study: average number of Illumina MiSeq reads, the average number of (pseudo-)reads that passed QC, and the percentage of QC (pseudo-)reads that were assigned to DNA barcodes and OTUs generated per sample.
| Experimental | Number of raw | Percentage of QC | Percentage DNA barcode | Percentage OTU clustered |
|---|---|---|---|---|
| mixture | reads | (pseudo-)reads[ | assigned (pseudo-)reads[ | (pseudo-)reads[ |
| EM1 | 466 108 | 88.07 | 95.68 | 83.86 |
| EM2 | 448 428 | 86.04 | 97.24 | 84.04 |
| EM3 | 496 328 | 87.46 | 96.61 | 84.34 |
| EM4 | 273 104 | 77.38 | 95.74 | 80.54 |
| EM5 | 582 254 | 96.26 | 97.84 | 90.63 |
| EM6 | 442 574 | 92.81 | 97.54 | 81.48 |
| EM7 | 394 354 | 93.04 | 97.14 | 80.70 |
| EM8 | 455 172 | 79.62 | 95.66 | 82.35 |
| EM9 | 434 326 | 86.23 | 97.30 | 83.60 |
| EM10 | 387 816 | 87.73 | 97.00 | 75.11 |
| EM10R | 723 130 | 95.59 | 98.02 | 87.39 |
| EM11 | 363 374 | 84.44 | 96.74 | 78.63 |
| EM11R | 635 304 | 91.11 | 98.21 | 87.01 |
| EM12 | 355 634 | 92.55 | 97.54 | 76.54 |
| EM13 | 405 742 | 89.46 | 96.49 | 77.31 |
| EM14 | 480 772 | 85.74 | 95.98 | 81.91 |
| EM15 | 554 602 | 87.05 | 88.78 | 82.98 |
| Average[ | 464 648 | 88.27 | 96.44 | 82.26 |
a(Pseudo-)reads are the combined QC pseudo-reads and the QC processed unmerged forward and reverse reads.
bAveraged across the 17 Illumina MiSeq datasets.
Taxonomic resolution provided by each DNA barcode marker for EM10 and EM10R.
| Species/genus | Species | Genus | Family |
|---|---|---|---|
|
| cyt |
| |
|
| |||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
|
|
|
Highlighted in bold are DNA barcodes with the same taxonomic resolution in both samples.
Taxonomic resolution provided by each DNA barcode marker for EM11 and EM11R.
| Species/genus | Species | Genus | Family |
|---|---|---|---|
|
|
| ||
|
| |||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
| mini- | ||
|
|
|
|
|
Highlighted in bold are DNA barcodes with the same taxonomic resolution in both samples.
Figure 2:The percentage of QC reads assigned to DNA barcodes for samples EM10, EM10R, EM11, and EM11R of the pilot study.
Laboratories participating in the interlaboratory trial.
| Laboratory | City and country |
|---|---|
| Agenzia delle Dogane E dei Monopoli | Genoa, Italy |
| AGES | Vienna, Austria |
| BaseClear BV | Leiden, The Netherlands |
| Biolytix AG | Witterswil, Switzerland |
| CREA-SCS sede di Tavazzano—Laboratorio | Tavazzano, Italy |
| Crop Research Institute | Prague, Czech Republic |
| Dutch Customs Laboratory | Amsterdam, The Netherlands |
| Eurofins GeneScan GmbH | Freiburg, Germany |
| Fera | Sand Hutton, United Kingdom |
| Generalzolldirektion | Hamburg, Germany |
| Laboratoire de Montpellier | Montpellier, France |
| Laboratorium Douane Accijnzen | Leuven, Belgium |
| LGC | Middlesex, United Kingdom |
| Livsmedelsverket | Uppsala, Sweden |
| RIKILT Wageningen University & Research | Wageningen, The Netherlands |
| US Customs and Border Protection Laboratory | Newark, United States |
Interlaboratory trial study: composition of the complex mixtures and taxa identified using the default setting of the CITESspeciesDetect pipeline.
| Homogenized mixtures | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Species/genus | Common name | S1 | S2 | S4 | S5 | S6 | S7 | S9 | S10 |
|
| Maize | 1% (13) Poaceae | 1% (14) Poaceae | 1% (14) Poaceae | 1% (15) Poaceae | 1% (16) Poaceae | 1% (15) Poaceae | 1% (15) Poaceae | 1% (14) Poaceae |
|
| Soy bean | 1% (16) | |||||||
|
| Cotton | 1% (16) | |||||||
|
| Canola | 1% (16) | |||||||
|
| Wheat | 1% (15) Poaceae | |||||||
|
| Sugar beet | 1% (4) | |||||||
|
| Turkey | 1% (16) | |||||||
|
| Papaya | 1% (16) | |||||||
|
| Tomato | 1% (16) | |||||||
|
| Tiger | 1% | 2% | 3% | 4% | 1% | 2% | 3% | 4% |
|
[ | aloe | (16) | (16) | (16) | (16) | (16) | (16) | (16) | (16) |
| Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | Xanthorrhoeaceae | ||
|
| Dendrobium |
|
|
|
|
|
|
|
|
| sp.[ | orchid |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| Sturgeon/Kaluga |
|
|
|
|
|
|
|
|
|
| Nile crocodile |
|
|
|
|
|
|
|
|
|
| Lettuce | 10% (16) | 10% (16) | 10% (16) | 10% (16) | ||||
|
| White cabbage | 47% (16) | 45% (16) | 43% (16) | 41% (16) | 32% (16) | 30% (16) | 28% (16) | 26% (16) |
|
| Pig | 10% (16) | 10% (16) | 10% (16) | 10% (16) | ||||
|
| Cattle | 47% (16) | 45% (16) | 43% (16) | 41% (16) | 32% (16) | 30% (16) | 28% (16) | 26% (16) |
|
| European plaice | 10% (16) | 10% (16) | 10% (16) | 10% (16) | ||||
Taxa were identified at the species level unless otherwise indicated. The number of laboratories that have identified a taxon at the species level or higher is provided in parentheses. Identified taxa listed by CITES are highlighted in bold.
aSpecies listed by CITES.
b Aloe variegata (synonym Gonialoe variegata) was recently assigned to the genus Gonialoe [55].
Sample S3 ingredients list and taxa (species, genus, family, order) identified.
| Ingredients label | Common name | Species/genus | Family | (Infra)order |
|---|---|---|---|---|
| Herba Cistanches | Cistanche extract |
| Orobanchaceae | Lamiales |
| Cauda cervi | Mature deer tail |
| Cervidae | Pecora |
| Radix Rehmanniae praeparata | Processed |
| Rehmanniaceae | Lamiales |
| Radix Ginseng | Dried root of |
| Araliaceae | Apiales (8) |
| Radix morindae Officinalis | Morinda root |
| Rubiaceae | Gentianales |
| Semen Cuscutae | Chinese dodder seed |
| Convolvulaceae (2) | Solanales |
| Radix Achyranthis bidentatae | Dried root of |
| Amaranthaceae | Caryophyllales |
| Rhizoma Cibotii | Root of |
| Cibotiaceae | Cyatheales |
| Semen Platycladi | Dry ripe kernel of |
| Cupressaceae | Cupressales |
| Cortex Eucommiae | Bark of |
| Eucommiaceae | Garryales |
| Radix Astragali | Astragalus root |
| Fabaceae (16) | Fabales |
| Fructus Schisandrae chinensis | Chinese magnolia-vine fruit |
| Schisandraceae | Austrobaileyales |
| Cortex Cinnamomi | Dried inner bark of |
| Lauraceae | Laurales |
| Cornu Cervi Pantotrichum | Antler of |
| Cervidae | Pecora |
| Undeclared identified taxa[ |
| |||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
|
The number of laboratories that have identified a taxon is provided in parentheses. Species marked in grey are listed by CITES.
aSpecies identified by at least 14 laboratories that were not mentioned in the ingredients list.
Sample S8 ingredients list and taxa (species, genus, family, order) identified.
| Ingredients label | Common name | Species/genus | Family | (Infra)order |
|---|---|---|---|---|
| Kola nut | Fruit of kola nut |
| Malvaceae | Malvales |
| Siberian ginseng | Siberian ginseng |
| Araliaceae | Apiales |
| Horny goat weed | Horny goat weed |
| Berberidaceae (16) | Ranunculales |
| Catuaba | Catuaba bark |
| Calophyllaceae | Malpighiales |
| Muria puama | Marapuama, potency wood |
| Olacaceae | Santalales |
| Korean ginseng | Korean ginseng |
| Araliaceae (16) | Apiales |
| Damiana | Damiana leaves |
| Passifloraceae | Malpighiales |
| Saw palmetto | Extract of fruit the of |
| Arecaceae (16) | Arecales |
| Yohimbe | Extract from the bark of |
| Rubiaceae (16) | Gentianales |
| Magnesium stearate | ||||
| Undeclared identified taxa[ |
| Asteraceae (16) | ||
|
| Meliaceae (16) | |||
|
| ||||
|
| ||||
|
| ||||
|
|
The number of laboratories that have identified a taxon is provided in parentheses. Species marked in grey are listed by CITES.
aSpecies identified by at least 14 laboratories that were not mentioned in the ingredients list.