| Literature DB >> 22276096 |
Rupert A Collins1, Karen F Armstrong, Rudolf Meier, Youguang Yi, Samuel D J Brown, Robert H Cruickshank, Suzanne Keeling, Colin Johnston.
Abstract
BACKGROUND: Poorly regulated international trade in ornamental fishes poses risks to both biodiversity and economic activity via invasive alien species and exotic pathogens. Border security officials need robust tools to confirm identifications, often requiring hard-to-obtain taxonomic literature and expertise. DNA barcoding offers a potentially attractive tool for quarantine inspection, but has yet to be scrutinised for aquarium fishes. Here, we present a barcoding approach for ornamental cyprinid fishes by: (1) expanding current barcode reference libraries; (2) assessing barcode congruence with morphological identifications under numerous scenarios (e.g. inclusion of GenBank data, presence of singleton species, choice of analytical method); and (3) providing supplementary information to identify difficult species. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22276096 PMCID: PMC3262790 DOI: 10.1371/journal.pone.0028381
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of descriptive barcode statistics for the three data partitions analysed in the study.
| Statistic | This study | GenBank/Bold | Combined |
| Individuals | 678 | 562 | 1240 |
| Species (no. unique sp.) | 172 (91) | 238 (157) | 329 |
| Mean individuals per sp. (range) | 3.9 (1–12) | 2.4 (1–42) | 3.8 |
| Singletons | 20 | 125 | 97 |
| Genera | 45 | 63 | 65 |
| Mean sampling events per sp. (range) | 2.32 (1–8) | - | - |
| Mean seq. length bp (range) | 645 (378–651) | 639 (441–651) | 643 (378–651) |
| No. barcodes | 5 | 1 | 6 |
| Mean haplotypes per species | 1.97 (1–7) | 1.61 (1–8) | 2.07 (1–10) |
| Mean intraspecific dist. (range) | 0.90% (0–14.7%) | 0.86% (0–24.1%) | 1.13% (0–24.1%) |
| Mean smallest interspecific dist. (range) | 9.11% (0–23.2%) | 8.40% (0–26.0%) | 8.06% (0–26.0%) |
| 95% intraspecific var. | 5.48% | 2.13% | 6.85% |
| 95% smallest interspecific dist. | 1.72% | 0.00% | 0.15% |
| Prop. intraspecific dist. | 19.0% | 32.2% | 28.3% |
| Prop. intraspecific dist. | 13.5% | 5.90% | 12.7% |
Ranges or subsets are presented in parentheses. Abbreviations: dist. = distance(s); no. = number; prop. = proportion; seq. = sequence; sp. = species; tot. = total; var. = variation. “Combined” refers to data generated in this study combined with collected GenBank/Bold data.
Figure 1Illustrating the utility of DNA barcodes in biosecurity.
Puntius filamentosus (A) and P. assimilis (B) are two species strikingly similar in appearance; morphological differences are especially difficult to discern when these are exported as juveniles. Here, we demonstrate they can be readily separated by DNA barcodes, with the two specimens pictured here differing by a 17.6% divergence in K2P distance for COI.
Figure 2Cumulative error and threshold optimisation.
False positive (orange) and false negative (blue) identification error rates summed across a range of distance thresholds from 0–10% in 0.2% increments (combined data). Definition of errors follows Meyer and Paulay [82]. Optimum threshold is 0.8%.
Identification percent success rates for each of the five primary analytical methods across three data partitions (with singletons both included and excluded from results), plus optimum threshold values from cumulative error estimation.
| Measure | Singletons | This study (%) | GenBank/Bold (%) | Combined (%) |
| NJ mono. | excl. | 96.7 (3.3) | 83.5 (16.5) | 84.7 (15.3) |
| incl. | 93.8 (6.2) | 64.9 (35.1) | 78.1 (21.9) | |
| NJ mono. boot. | excl. | 89.7 (10.3) | 78.7 (21.3) | 74.7 (25.3) |
| incl. | 87.0 (13.0) | 61.2 (38.8) | 68.9 (31.1) | |
|
| excl. | 98.9 (1.1) | 93.6 (6.4) | 94.8 (5.2) |
| incl. | 96.0 (3.9) | 72.8 (27.2) | 87.4 (12.6) | |
| GMYC | excl. | 94.2 (3.6, 2.1) | 72.1 (17.3, 10.5) | 82.2 (12.5, 5.3) |
| incl. | 91.4 (3.5, 5.0) | 58.5 (14.1, 27.4) | 77.0 (11.7, 11.3) | |
| Bold: 1% thresh. | excl. | 93.2 (0.0, 3.2, 3.6) | 75.3 (2.5, 12.8, 9.4) | 82.9 (1.5, 6.6, 8.9) |
| incl. | 90.4 (0.0, 6.0, 3.6) | 58.5 (5.3, 28.8, 7.3) | 76.5 (2.8, 12.5, 8.2) | |
| Bold: opt. thresh. | excl. | 93.9 (0.0, 2.4, 3.6) | 75.3 (2.5, 12.8, 9.4) | 83.4 (1.7, 6.9, 8.0) |
| incl. | 91.2 (0.0, 5.3, 3.5) | 58.5 (5.3, 28.8, 7.3) | 76.9 (2.9, 12.0, 7.3) | |
| BCM: 1% thresh. | excl. | 94.8 (0.2, 3.2, 1.8) | 77.6 (3.4, 12.8, 6.2) | 86.7 (2.4, 6.6, 4.2) |
| incl. | 92.0 (0.1, 6.0, 1.8) | 60.3 (6.0, 28.8, 4.8) | 79.9 (3.7, 12.5, 3.9) | |
| BCM: opt. thresh. | excl. | 95.6 (0.2, 2.4, 1.8) | 77.6 (3.4, 12.8, 6.2) | 86.5 (2.4, 6.9, 4.2) |
| incl. | 92.8 (0.1, 5.3, 1.8) | 60.3 (6.0, 28.8, 4.8) | 79.8 (3.5, 12.9, 3.9) | |
| Opt. thresh. value | 1.4 | 1.0 | 0.8 |
Values in parentheses show failure rate broken down into “misidentification”, “no identification” and “ambiguous” (BCM and Bold only) respectively. “Combined” refers to data generated in this study combined with collected GenBank/Bold data. Abbreviations: BCM = “best close match”; boot. = bootstrap (); excl. = excluded; incl. = included; mono. = monophyly; opt. = optimum; thresh. = threshold.
Figure 3Incongruences and inconsistencies in barcode data.
This reduced-taxon NJ phylogram highlights cases of haplotype sharing and paraphyly/polyphyly between nominal species. Data generated in this study are prefixed “RC0”, “YGN” and “EUN” (otherwise GenBank), with anomalous individuals represented in red.
Figure 4Cryptic and unrecognised species.
An NJ phylogram showing deep COI barcode divergences in selected ornamental species. Taxa of interest are highlighted in blue.
Exploring unrecognised diversity: undescribed and putative cryptic species were assessed with COI and nuclear RHO data in the context of their closest known congener or conspecifics.
| Putative cryptic or unrecognised taxon | Taxon comparison |
| Mean K2P % COI/RHO | No. CAs COI/RHO | Bootstrap % COI/RHO | Rosenberg's |
|
|
| 6 | 7.4/0.5 | 23/2 | 100/92.7* | Y/N* |
|
|
| 7 | 9.0/1.3 | 21/10 | 100/89.9 | Y/Y |
|
|
| 6 | 7.0/1.1 | 40/7 | 100/100 | Y/Y |
|
|
| 6 | 8.6/0.6 | 48/5 | 100/97.1 | Y/Y |
|
|
| 6 | 8.1/0.6 | 47/5 | 100/99.8 | Y/Y |
|
|
| 5 | 3.7/0.5 | 23/3 | 100/95.3 | N/N |
|
|
| 7 | 17.2/4.1 | 76/27 | 100/100 | Y/Y |
|
| intraspecific | 5 | 7.8/0.4 | 40/3 | 100/95.7 | N |
|
|
| 6 | 10.9/1.5 | 46/8 | 100/82.5 | Y/Y |
|
|
| 7 | 2.2/0.2 | 11/1 | 100/18.1 | Y/N |
|
| intraspecific | 10 | 13.8/2.3 | 42/9 | 100/99.6 | Y/Y |
Notes: (*) renders Danio choprae paraphyletic; () P monophyly significant to the level with combined COI data (15 specimens); () species likely described during manuscript preparation as Brevibora cheeya [99]. Abbreviations: CA = pure, simple characteristic attribute (i.e., discrete diagnostic character state); Y = Rosenberg's P, significant to ; N = not significant.