| Literature DB >> 25255319 |
Mikko Pentinsaari1, Paul D N Hebert2, Marko Mutanen1.
Abstract
With 400 K described species, beetles (Insecta: Coleoptera) represent the most diverse order in the animal kingdom. Although the study of their diversity currently represents a major challenge, DNA barcodes may provide a functional, standardized tool for their identification. To evaluate this possibility, we performed the first comprehensive test of the effectiveness of DNA barcodes as a tool for beetle identification by sequencing the COI barcode region from 1872 North European species. We examined intraspecific divergences, identification success and the effects of sample size on variation observed within and between species. A high proportion (98.3%) of these species possessed distinctive barcode sequence arrays. Moreover, the sequence divergences between nearest neighbor species were considerably higher than those reported for the only other insect order, Lepidoptera, which has seen intensive analysis (11.99% vs up to 5.80% mean NN divergence). Although maximum intraspecific divergence increased and average divergence between nearest neighbors decreased with increasing sampling effort, these trends rarely hampered identification by DNA barcodes due to deep sequence divergences between most species. The Barcode Index Number system in BOLD coincided strongly with known species boundaries with perfect matches between species and BINs in 92.1% of all cases. In addition, DNA barcode analysis revealed the likely occurrence of about 20 overlooked species. The current results indicate that DNA barcodes distinguish species of beetles remarkably well, establishing their potential to provide an effective identification tool for this order and to accelerate the discovery of new beetle species.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25255319 PMCID: PMC4177932 DOI: 10.1371/journal.pone.0108651
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Nucleotide composition of the >500 bp sequences used in all the analyses.
| min | mean | max | SE | |
| G % | 10.33 | 16.32 | 20.52 | 0.01 |
| C % | 12.92 | 17.66 | 26.29 | 0.03 |
| A % | 22.19 | 29.58 | 35.41 | 0.02 |
| T % | 26.44 | 36.42 | 42.86 | 0.03 |
| GC % | 26.29 | 33.98 | 44.83 | 0.04 |
| GC % codon pos 1 | 31.91 | 44.86 | 54.59 | 0.04 |
| GC % codon pos 2 | 35.11 | 42.5 | 45.89 | 0.01 |
| GC % codon pos 3 | 1.82 | 14.51 | 39.21 | 0.08 |
Figure 1Distributions of Kimura 2-parameter distances within (black) and between species (white).
Sampled individuals and observed haplotypes per species.
| individuals | haplotypes | |||||
| number of ind./hapl. | species count | % | cumul. % | species count | % | cumul. % |
| 1 | 562 | 30.02 | 30.02 | 911 | 48.66 | 48.66 |
| 2 | 427 | 22.81 | 52.83 | 510 | 27.24 | 75.9 |
| 3 | 331 | 17.68 | 70.51 | 239 | 12.77 | 88.67 |
| 4 | 219 | 11.7 | 82.21 | 140 | 7.48 | 96.15 |
| 5 | 171 | 9.13 | 91.34 | 49 | 2.62 | 98.77 |
| 6 | 98 | 5.24 | 96.58 | 14 | 0.75 | 99.52 |
| 7 | 35 | 1.87 | 98.45 | 6 | 0.32 | 99.84 |
| 8 | 15 | 0.8 | 99.25 | 2 | 0.11 | 99.95 |
| 9 | 2 | 0.11 | 99.36 | 0 | 0 | 99.95 |
| 10 | 2 | 0.11 | 99.47 | 0 | 0 | 99.95 |
| >10 | 10 | 0.53 | 100 | 1 | 0.05 | 100 |
Figure 2Sample size versus maximum sequence divergence observed within species.
The shaded area represents the 95% confidence interval for the fitted LOESS curve.
Figure 3Number of species of Carabidae sampled versus the average sequence divergence (K2P) between nearest neighbor species.
The shaded area represents the 95% confidence interval for the fitted LOESS curve.