| Literature DB >> 35621761 |
Haiguang Zhang1,2, Wenjun Bu2.
Abstract
The genetic variation in the COI gene has had a great effect on the final results of species delimitation studies. However, little research has comprehensively investigated the genetic divergence in COI among Insecta. The fast-growing COI data in BOLD provide an opportunity for the comprehensive appraisal of the genetic variation in COI among Insecta. We calculated the K2P distance of 64,414 insect species downloaded from BOLD. The match ratios of the clustering analysis, based on different thresholds, were also compared among 4288 genera (35,068 species). The results indicate that approximately one-quarter of the species of Insecta showed high intraspecific genetic variation (>3%), and a conservative estimate of this proportion ranges from 12.05% to 22.58%. The application of empirical thresholds (e.g., 2% and 3%) in the clustering analysis may result in the overestimation of the species diversity. If the minimum interspecific genetic distance of the congeneric species is greater than or equal to 2%, it is possible to avoid overestimating the species diversity on the basis of the empirical thresholds. In comparison to the fixed thresholds, the "threshOpt" and "localMinima" algorithms are recommended for the provision of a reference threshold for threshold-based species delimitation studies.Entities:
Keywords: COI; DNA barcoding; Insecta; species delimitation
Year: 2022 PMID: 35621761 PMCID: PMC9147995 DOI: 10.3390/insects13050425
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 3.139
Figure 1Frequency density of maximal intraspecific genetic distances of different orders.
Figure 2Intraspecific and interspecific genetic distances of 4288 genera. maxintra: the maximum intraspecific genetic distance of congeneric species; mininter: the minimum interspecific genetic distance of congeneric species.
The result of clustering analysis based on different thresholds.
| Mininter | SN | M_0.01 | M_0.02 | M_0.022 | M_0.03 | M_mininter |
|---|---|---|---|---|---|---|
| ≥0 | 35,068 | 57,950 (65%) | 44,367 (27%) | 42,603 (21%) | 37,698 (7%) | 78,353 (123%) |
| ≥0.01 | 20,714 | 35,243 (70%) | 27,473 (33%) | 26,541 (28%) | 23,976 (16%) | 24,967 (21%) |
| ≥0.02 | 16,037 | 27,653 (72%) | 21,988 (37%) | 21,298 (33%) | 19,476 (21%) | 18,171 (13%) |
| ≥0.03 | 13,034 | 22,909 (76%) | 18,139 (39%) | 17,651 (35%) | 16,474 (26%) | 14,484 (11%) |
| ≥0.04 | 10,454 | 18,584 (78%) | 14,646 (40%) | 14,237 (36%) | 13,278 (27%) | 11,366 (9%) |
| ≥0.05 | 8395 | 15,029 (79%) | 11,803 (41%) | 11,466 (37%) | 10,708 (28%) | 8997 (7%) |
| ≥0.06 | 6706 | 12,196 (82%) | 9521 (42%) | 9256 (38%) | 8627 (29%) | 7093 (6%) |
| ≥0.07 | 5317 | 9810 (85%) | 7633 (44%) | 7406 (39%) | 6908 (30%) | 5569 (5%) |
| ≥0.08 | 4177 | 7706 (84%) | 6015 (44%) | 5842 (40%) | 5457 (31%) | 4326 (4%) |
| ≥0.09 | 3266 | 6093 (87%) | 4724 (45%) | 4589 (41%) | 4273 (31%) | 3365 (3%) |
| ≥0.1 | 2395 | 4456 (86%) | 3456 (44%) | 3356 (40%) | 3131 (31%) | 2447 (2%) |
Mininter: minimum interspecific genetic distances of congeneric species; SN: species number; M_*: the number of MOTUs based on the threshold of *. The values in parentheses represent overestimated ratios.
Figure 3Match ratios of different methods. Match_ratio_*: the match ratio of 4288 genera in clustering analysis on the basis of the threshold of *. mininter: the minimum interspecific genetic distance of congeneric species; localMinima: the possible thresholds from the distance matrix; opt: the optimal thresholds.
Figure 4Numbers of “MATCH”, “MERGE”, “MIXTURE”, and “SPLIT” with different methods. T_*: the fixed thresholds of *. mininter: the minimum interspecific genetic distance of congeneric species; localMinima: the possible thresholds from the distance matrix; opt: the optimal thresholds.