| Literature DB >> 32298363 |
Mikko Pentinsaari1, Sujeevan Ratnasingham1, Scott E Miller2, Paul D N Hebert1.
Abstract
Applications of biological knowledge, such as forensics, often require the determination of biological materials to a species level. As such, DNA-based approaches to identification, particularly DNA barcoding, are attracting increased interest. The capacity of DNA barcodes to assign newly encountered specimens to a species relies upon access to informatics platforms, such as BOLD and GenBank, which host libraries of reference sequences and support the comparison of new sequences to them. As parameterization of these libraries expands, DNA barcoding has the potential to make valuable contributions in diverse applied contexts. However, a recent publication called for caution after finding that both platforms performed poorly in identifying specimens of 17 common insect species. This study follows up on this concern by asking if the misidentifications reflected problems in the reference libraries or in the query sequences used to test them. Because this reanalysis revealed that missteps in acquiring and analyzing the query sequences were responsible for most misidentifications, a workflow is described to minimize such errors in future investigations. The present study also revealed the limitations imposed by the lack of a polished species-level taxonomy for many groups. In such cases, applications can be strengthened by mapping the geographic distributions of sequence-based species proxies rather than waiting for the maturation of formal taxonomic systems based on morphology.Entities:
Year: 2020 PMID: 32298363 PMCID: PMC7162515 DOI: 10.1371/journal.pone.0231814
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of query results (top matches) for 17 insect species between Meiklejohn et al. [10] and the present study.
| Meiklejohn et al. | Present study | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Query sequence | Order | Family | Genus | Species | Sequence length | Database | Top match genus | Top match species | Identity | Top match order | Top match family | Top match genus | Top match species | Identity |
| MK905407 | Coleoptera | Scarabaeidae | Phanaeus | vindex | 581 | GenBank | Phanaeus | sp. | 0.99821 | Coleoptera | Scarabaeidae | Phanaeus | sp. | 0.9982 |
| MK905407 | Coleoptera | Scarabaeidae | Phanaeus | vindex | 581 | BOLD_all | Phanaeus | sp. | 0.9982 | Coleoptera | Scarabaeidae | Phanaeus | sp. | 0.9982 |
| MK905402 | Dermaptera | Forficulidae | Forficula | auricularia | 429 | GenBank | Forficula | auricularia | 0.99299 | Dermaptera | Forficulidae | Forficula | aff. auricularia A | 0.9930 |
| MK905402 | Dermaptera | Forficulidae | Forficula | auricularia | 429 | BOLD_all | Dyscheralcis | retroflexa | 0.5 | Dermaptera | Forficulidae | Forficula | auricularia-A | 0.9976 |
| MK905402 / RC | Dermaptera | Forficulidae | Forficula | auricularia | 429 | BOLD_all | Lepidoptera | Geometridae | Dyscheralcis | retroflexa | 0.5 | |||
| MK905396 | Diptera | Calliphoridae | Chrysomya | rufifacies | 592 | GenBank | Chrysomya | rufifacies | 1 | Diptera | Calliphoridae | Chrysomya | rufifacies | 1 |
| MK905396 | Diptera | Calliphoridae | Chrysomya | rufifacies | 592 | BOLD_all | Chrysomya | rufifacies | 1 | Diptera | Calliphoridae | Chrysomya | rufifacies | 1 |
| MK905397 | Diptera | Calliphoridae | Calliphora | vicina | 553 | GenBank | Calliphora | vicina | 0.98373 | Diptera | Calliphoridae | Calliphora | vicina | 0.9837 |
| MK905397 | Diptera | Calliphoridae | Calliphora | vicina | 553 | BOLD_all | Calliphora | vicina | 1 | Diptera | Calliphoridae | Calliphora | vicina | 1 |
| MK905393 | Diptera | Culicidae | Aedes | aegypti | 658 | GenBank | Aedes | aegypti | 0.99848 | Diptera | Culicidae | Aedes | aegypti | 0.9985 |
| MK905393 | Diptera | Culicidae | Aedes | aegypti | 658 | BOLD_all | Aedes | aegypti | 1 | Diptera | Culicidae | Aedes | aegypti | 1 |
| MK905403 | Diptera | Glossinidae | Glossina | palpalis | 611 | GenBank | Glossina | brevipalpis | 0.971 | Diptera | Glossinidae | Glossina | brevipalpis | 0.9710 |
| MK905403 | Diptera | Glossinidae | Glossina | palpalis | 611 | BOLD_all | Glossina | brevipalpis | 0.9694 | Diptera | Glossinidae | Glossina | brevipalpis | 0.9694 |
| MK905404 | Diptera | Muscidae | Musca | domestica | 645 | GenBank | Cryptopygus | tricuspis | 0.996 | Entomobryomorpha | Isotomidae | Cryptopygus | tricuspis | 0.9960 |
| MK905404 | Diptera | Muscidae | Musca | domestica | 645 | BOLD_all | Amphiura | incana | 0.5571 | Entomobryomorpha | Isotomidae | Folsomia | cf. diplopthalma | 1 |
| MK905404 / RC | Diptera | Muscidae | Musca | domestica | 645 | BOLD_all | Ophiurida | Amphiuridae | Amphiura | incana | 0.5586 | |||
| MK905400 | Ephemeroptera | Ephemeridae | Hexagenia | limbata | 254 | GenBank | Glossina | brevipalpis | 0.9681 | Diptera | Glossinidae | Glossina | brevipalpis | 0.9681 |
| MK905400 | Ephemeroptera | Ephemeridae | Hexagenia | limbata | 254 | BOLD_all | Glossina | brevipalpis | 0.9675 | Diptera | Glossinidae | Glossina | brevipalpis | 0.9675 |
| MK905409 | Hymenoptera | Vespidae | Vespula | squamosa | 560 | GenBank | Vespula | squamosa | 0.99643 | Hymenoptera | Vespidae | Vespula | squamosa | 0.9964 |
| MK905409 | Hymenoptera | Vespidae | Vespula | squamosa | 560 | BOLD_all | Vespula | squamosa | 1 | Hymenoptera | Vespidae | Vespula | squamosa | 1 |
| MK905395 | Lepidoptera | Saturniidae | Callosamia | promethea | 354 | GenBank | Callosamia | promethea | 0.9969 | Lepidoptera | Saturniidae | Callosamia | promethea | 0.9969 |
| MK905395 | Lepidoptera | Saturniidae | Callosamia | promethea | 354 | BOLD_all | Callosamia | promethea | 0.9938 | Lepidoptera | Saturniidae | Callosamia | promethea | 0.9940 |
| MK905401 | Lepidoptera | Nymphalidae | Danaus | plexippus | 623 | GenBank | Danaus | plexippus | 1 | Lepidoptera | Nymphalidae | Danaus | plexippus | 1 |
| MK905401 | Lepidoptera | Nymphalidae | Danaus | plexippus | 623 | BOLD_all | Danaus | plexippus | 1 | Lepidoptera | Nymphalidae | Danaus | plexippus | 1 |
| MK905405 | Mecoptera | Meropeidae | Merope | tuber | 655 | GenBank | Merope | tuber | 0.92006 | Mecoptera | Meropeidae | Merope | tuber | 0.9201 |
| MK905405 | Mecoptera | Meropeidae | Merope | tuber | 655 | BOLD_all | Craesus | alniastri | 0.5 | Mecoptera | Meropeidae | Merope | tuber | 0.9430 |
| MK905405 / RC | Mecoptera | Meropeidae | Merope | tuber | 655 | BOLD_all | Hymenoptera | Tenthredinidae | Craesus | alniastri | 0.5 | |||
| MK905408 | Neuroptera | Ascalaphidae | Ululodes | quadripunctatus | 635 | GenBank | Ululodes | quadrimaculatus | 1 | Neuroptera | Ascalaphidae | Ululodes | quadripunctatus | 1 |
| MK905408 | Neuroptera | Ascalaphidae | Ululodes | quadripunctatus | 635 | BOLD_all | Xanthopimpla | sp. | 0.5152 | Neuroptera | Ascalaphidae | Ululodes | quadripunctatus | 1 |
| MK905408 / RC | Neuroptera | Ascalaphidae | Ululodes | quadripunctatus | 635 | BOLD_all | Hymenoptera | Ichneumonidae | Xanthopimpla | sp. | 0.5152 | |||
| MK905399 | Odonata | Gomphidae | Gomphus | exilis | 612 | GenBank | Cecidomyiidae | sp. | 0.9934 | Diptera | Cecidomyiidae | Cecidomyiidae | sp. | 0.9935 |
| MK905399 | Odonata | Gomphidae | Gomphus | exilis | 612 | BOLD_all | Dolichophis | schmidti | 0.5283 | Diptera | Cecidomyiidae | Cecidomyiidae | sp. | 0.9967 |
| MK905399 / RC | Odonata | Gomphidae | Gomphus | exilis | 612 | BOLD_all | Squamata | Colubridae | Dolichophis | schmidti | 0.5283 | |||
| MK905398 | Orthoptera | Gryllidae | Gryllus | assimilis | 278 | GenBank | Gryllus | pennsylvanicus | 0.9964 | Orthoptera | Gryllidae | Gryllus | pennsylvanicus | 0.9964 |
| MK905398 | Orthoptera | Gryllidae | Gryllus | assimilis | 278 | BOLD_all | Gryllus | pennsylvanicus | 0.9964 | Orthoptera | Gryllidae | Gryllus | pennsylvanicus | 0.9964 |
| MK905394 | Siphonaptera | Pulicidae | Ctenocephalides | felis | 643 | GenBank | Pulex | irritans | 0.9642 | Siphonaptera | Pulicidae | Pulex | irritans | 0.9642 |
| MK905394 | Siphonaptera | Pulicidae | Ctenocephalides | felis | 643 | BOLD_all | Natrix | tessellata | 0.6176 | Siphonaptera | Pulicidae | Pulex | irritans | 0.9642 |
| MK905394 / RC | Siphonaptera | Pulicidae | Ctenocephalides | felis | 643 | BOLD_all | Squamata | Colubridae | Natrix | tessellata | 0.6176 | |||
| MK905406 | Phthiraptera | Pediculidae | Pediculus | humanus capitis | 384 | GenBank | Stylops | sp. | 1 | Strepsiptera | Stylopidae | Stylops | sp. | 1 |
| MK905406 | Phthiraptera | Pediculidae | Pediculus | humanus capitis | 384 | BOLD_all | Akapala | rudis | 0.596 | Strepsiptera | Stylopidae | Stylops | sp. | 0.9013 |
| MK905406 / RC | Phthiraptera | Pediculidae | Pediculus | humanus capitis | 384 | BOLD_all | Hymenoptera | Eucharitidae | Akapala | rudis | 0.5960 | |||
RC = reverse complement. Blue and red shading indicate correct or inaccurate identification, respectively, at each taxonomic rank.
Fig 1Geographic distributions and sequence clustering of the three barcode lineages of Forficula auricularia in North America.
Three categories of operational errors which compromised efforts by Meiklejohn et al. [10] to test the effectiveness of the BOLD and GenBank reference libraries in identifying 17 insect species.
| Specimen # | ID | Reverse Complement | Contamination | Incorrect ID |
|---|---|---|---|---|
| — | — | Yes | ||
| Yes | — | — | ||
| — | — | — | ||
| — | — | — | ||
| — | — | — | ||
| — | — | Yes | ||
| Yes | Yes | N.D. | ||
| — | Yes | N.D. | ||
| — | — | — | ||
| — | — | — | ||
| — | — | — | ||
| Yes | — | — | ||
| Yes | — | — | ||
| Yes | Yes | N.D. | ||
| — | — | Yes | ||
| Yes | — | Yes | ||
| Yes | Yes | N.D. |
N.D. = not determined.
Fig 2Five key workflow features to maximize the chance of recovering reliable sequence records.