| Literature DB >> 27171471 |
Per Sundberg1, Sebastian Kvist2,3, Malin Strand4.
Abstract
Whereas many nemerteans (ribbon worms; phylum Nemertea) can be identified from external characters if observed alive, many are still problematic. When it comes to preserved specimens (as in e.g. marine inventories), there is a particular need for specimen identifier alternatives. Here, we evaluate the utility of COI (cytochrome c oxidase subunit I) as a single-locus barcoding gene. We sequenced, data mined, and compared gene fragments of COI for 915 individuals representing 161 unique taxonomic labels for 71 genera, and subjected different constellations of these to both distance-based and character-based DNA barcoding approaches, as well as species delimitation analyses. We searched for the presence or absence of a barcoding gap at different taxonomic levels (phylum, subclass, family and genus) in an attempt to understand at what level a putative barcoding gap presents itself. This was performed both using the taxonomic labels as species predictors and using objectively inferred species boundaries recovered from our species delimitation analyses. Our data suggest that COI works as a species identifier for most groups within the phylum, but also that COI data are obscured by misidentifications in sequence databases. Further, our results suggest that the number of predicted species within the dataset is (in some cases substantially) higher than the number of unique taxonomic labels-this highlights the presence of several cryptic lineages within well-established taxa and underscores the urgency of an updated taxonomic backbone for the phylum.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27171471 PMCID: PMC4865114 DOI: 10.1371/journal.pone.0155541
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Result from the manual calculations of intraspecific (red) and interspecific (blue) COI distances.
Note the absences of a disjunction between intraspecific and interspecific distances (the lack of a barcoding gap), which is further discussed in the text. A, Nemertea, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; B, Nemertea, enlarged view of the barcoding gap region with the x-axis set to a maximum of 500 comparisons; C, Heteronemertea, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; D, Heteronemertea, enlarged view of the barcoding gap region with the x-axis set to a maximum of 100 comparisons; E, Hoplonemertea, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; F, Hoplonemertea, enlarged view of the barcoding gap region with the x-axis set to a maximum of 100 comparisons; G, Palaeonemertea, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; H, Palaeonemertea, enlarged view of the barcoding gap region with the x-axis set to a maximum of 100 comparisons.
Fig 2Result from the manual calculations of intraspecific (red) and interspecific (blue) COI distances.
A, Lineidae, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; B, Lineidae, enlarged view of the barcoding gap region with the x-axis set to a maximum of 40 comparisons; C, Cephalotrichidae, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; D, Cephalotrichidae, enlarged view of the barcoding gap region with the x-axis set to a maximum of 100 comparisons; E, Oerstediidae, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; F, Oerstediidae, enlarged view of the barcoding gap region with the x-axis set to a maximum of 200 comparisons.
Fig 3Result from the manual calculations of intraspecific (red) and interspecific (blue) COI distances.
Note the absences of a disjunction between intraspecific and interspecific distances (the lack of a barcoding gap), which is further discussed in the text. A, Cerebratulus, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; B, Cerebratulus, enlarged view of the barcoding gap region with the x-axis set to a maximum of 10 comparisons; C, Lineus, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; D, Lineus, enlarged view of the barcoding gap region with the x-axis set to a maximum of 40 comparisons; E, Micrura, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; F, Micrura, enlarged view of the barcoding gap region with the x-axis set to a maximum of 100 comparisons.
Summary statistics for the CAOS analyses of the problematic groups at the species level.
Each of the investigated species showed intraspecific variation above 2%, suggesting that DNA barcoding may be hampered by the lack of a distinct barcoding gap. However, each of the different lineages within these species groups possesses diagnostic characters that can aid in the future identification of the lineages (no diagnostic characters where found when comparing the sequences to the entire pool of nemertean taxa; see text for discussion).
| Taxon | Number of sequences analyzed | Global, private nucleotides exist | State: position | Local, private nucleotides exist for any clade (for each clade) | Clade (state: position) |
|---|---|---|---|---|---|
| 110 | No | N/A | Yes (Yes) | 1 (A:66, A:396); 2 (T:243, A:300); 3 (C:57, G:117); 4 (A:54, G:102, A:342, G:504); 5 (G:126, G:291, C:312); 6 (A:36, C:150, G:276, C:474, G:501); 7 (A:12, C:30, A:48, T:117, C:124, T:126, C:193, C:195, T:209, A:261, C:274, T:276, T:282, A:483, G:495); 8 (G:16, A:39, G:87, G:93, G:96, G:147, A:150, C:159, C:186, T:190, A:192, G:213, C:225, A:234, C:243, G:249, G:261, G:288, G:303, A:306, C:309, C:369, C:396, A:456, C:469, T:471, T:475, A:477, T:484, A:486, C:508) | |
| 7 | No | N/A | Yes (Yes) | 1 (C:12, G:15, A:27, C:33, A:60, G:72, A:75, C:78, T:111, A:156, A:180, A:192, C:198, A:201, A:210, C:217, A:234, A:237, A:238, G:243, T:261, A:276, A:280, T:303, T:318, C:344, T:363, C:426, A:429, C:435, A:438, A:444, T:477, A:483, T:492, A:507); 2 (A:12, T:21, C:39, A:54, G:60, C:85, T:87, A:96, G:111, T:126, C:133, G:135, C:147, C:162, G:201, C:231, C:243, G:246, G:252, A:261, G:273, T:277, G:306, G:348, A:360, T:396, C:408, A:441, C:466, T:468, A:471, A:495, G:507, G:513); 3 (G:3, T:6, C:22, C:60, C:69, G:87, G:99, A:11, C:114, G:117, T:144, G:147, G:171, C:193, T:195, T:201, C:205, G:207, C:210, G:228, T:234, A:243, T:245, C:249, A:252, T:255, G:258, C:279, C:282, C:294, A:306, G:309, A:315, C:318, C:355, G:363, A:387, A:426, T:435, C:460, T:462, C:474, G:482, T:483, C:508, T:510); 4 (C:15, A:21, A:24, T:27, C:30, G:36, A:39, T:60, G:66, A:72, G:90, T:96, T:99, C:111, T:117, G:126, G:132, G:159, A:168, T:190, G:192, C:198, C:201, A:219, A:220, T:221, A:225, A:228, A:231, T:243, T:249, C:274, T:276, T:280, C:281, A:282, A:286, G:330, G:342, A:345, C:372, A:382, T:384, G:387, A:399, T:400, T:401, T:405, T:406, A:408, G:429, G:448, G:450, G:474, T:475, C:484, T:486, A:492, C:498, C:501) | |
| 41 | No | N/A | Yes (Yes | 1 (G:9, A:24, T:27, T:54, C:57, G:60, T:81, C:85, G:111, G:117, T:120, A:123, T:124, G:126, T:150, C:153, A:168, C:174, A:198, T:210, G:231, G:270, T:279, C:294, G:303, A:315, T:328, A:330, C:369, T:381, A:411, T:426, G:510); 2 (A:9, T:15, G:24, C:27, T:39, C:54, A:60, C:81, A:111, A:117, G:123, C:124, T:126. G:150, T:153, T:168, T:174, G:198, C:210, T:231, A:270, C:279, T:294, T:303, G:315, C:328, T:330, T:369, G:381, G:411, G:436, A:510) | |
| 6 | No | N/A | Yes (Yes) | 1 (A:21, T:22, A:24, A:36, A:48, G:54, A:96, A:126, C:132, G:141, C:150, G:156, T:190, A:192, G:195, C:198, G:207, C:209, G:213, T:214, G:216, G:225, T:228, G:231, T:234, A:240, A:243, G:255, T:261, A:264, G:270, T:274, G:276, G:295, A:300, A:303, G:309, G:315, T:328, A:330, C:333, G:360, T:405, A:429, T:441, T:453, A:456, C:460, T:469, A:471, C:475, T:480, T:501, A:510); 2 (G:6, A:12, A:87, C:90, T:108, G:132, G:144, G:168, A:183, G:208, G:237, G:243, G:273, T:282, C:291, A:363, G:381, G:396, G:405, A:420, G:447, G:462, C:466, T:468, A:477); 3 (G:15, A:18, C:37, T:39, C:45, T:72, A:108, A:111, G:123, T:132, A:144, G:159, T:177, A:189, G:201, C:210, C:217, T:243, A:258, C:270, C:294, A:399, A:405, G:408, G:423, A:438, A:457, A:504, T:513) | |
| 6 | No | N/A | Yes (Yes) | 1 (C:1, T:3, T:6, G:18, T:48, A:60, T:72, G:87, G:90, T:99, A:102, G:111, G:117, G:132, A:141, C:159, T:168, A:171, G:189, C:193, T:195, T:201, T:210, C:217, T:219, A:234, A:237, A:240, T:243, T:255, A:258, A:261, T:264, T:270, A:273, T:279, A:282, A:291, A:300, A:309, A:318, T:327, G:342, A:345, C:348, T:363, G:385, A:396, A:399, C:405, G:408, G:420, G:423, G:435, A:438, A:444, A:447, A:453, A:462, C:463, A:468, C:469, T:471, G:483, G:492, A:513); 2 (T:1, G:3, A:6, T:18, A:48, T:60, A:72, A:87, T:90, A:99, G:102, T:111, A:117, T:132, G:141, T:159, G:168, G:171, A:189, T:193, G:201, A:210, T:217, A:219, G:234, G:237, T:240, G:243, G:255, G:258, T:261, A:264, C:270, T:273, A:279, T:282, T:291, G:300, T:309, T:318, A:327, T:342, T:345, A:348, A:363, A:385, G:396, G:399, A:408, A:423, T:435, G:438, T:444, G:447, T:453, G:462, T:463, G:468, T:469, A:471, A:483, T:492, G:513) |
* Result based on comparison of the entire pool of sequences for the taxon listed versus all sequences from the 915-taxon dataset.
† Relative position of the nucleotides based on the alignment used in the present study.
‡ Result based on comparison of each clade found in the NJ analysis versus the remainder of sequences with identical taxonomic labels.
§ Clade number refers to the numbers in the NJ tree (S1 Fig).
Fig 4Result from the manual calculations of intraspecific (red) and interspecific (blue) COI distances using species affiliation determined by the ML solution in PTP.
Note the presence of a short barcoding gap at ~3–5%, which is further discussed in the text. A, Nemertea, full view of the chart with the x-axis set above the upper limit of the number of comparisons within the dataset; B, Nemertea, enlarged view of the barcoding gap region with the x-axis set to a maximum of 500 comparisons.