Thomas W A Braukmann, Maria L Kuzmina, Jesse Sills, Evgeny V Zakharov, Paul D N Hebert.
Abstract
The relatively slow rates of molecular evolution in vascular plants, together with their frequent exposure to hybridization and introgression, often make it difficult to discriminate species with the standard barcode markers (rbcL, matK, ITS2). Previous studies have examined these constraints in narrow geographic or taxonomic contexts; the present investigation extends the analysis to the performance of these gene regions in discriminating the species in local floras at sites across Canada. To test identification success, we employed a DNA barcode reference library with sequence records for 96% of the 5108 vascular plant species known from Canada, although coverage varied from 94% for rbcL to 60% for ITS2 and 39% for matK. Using plant lists from 27 national parks and one scientific reserve, we tested the efficacy of DNA barcodes in identifying the plants in simulated species assemblages from six biogeographic regions of Canada using BLAST and mothur. Mean pairwise distance (MPD) and mean nearest taxon distance (MNTD) were strong predictors of barcode performance for different plant families and genera, and both metrics supported ITS2 as possessing the highest genetic diversity. All three genes performed strongly in assigning the taxa present in local floras to the correct genus, with values ranging from 91% for rbcL to 97% for ITS2 and 98% for matK. However, matK delivered the highest species discrimination (~81%), followed by ITS2 (~72%) and rbcL (~44%). Despite the low number of plant taxa in the Canadian Arctic, DNA barcodes had the least success in discriminating species from this biogeographic region, with resolution ranging from 36% with rbcL to 69% with matK. Species resolution was higher in the other settings, peaking in the Woodland region at 52% for rbcL and 87% for matK. Our results indicate that DNA barcoding is very effective in identifying Canadian plants to a genus, and that it performs well in discriminating species in regions where floristic diversity is highest.
Year: 2017 PMID: 28072819 PMCID: PMC5224991 DOI: 10.1371/journal.pone.0169515
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of localities, corresponding terrestrial ecozones and biogeographic regions used to test the taxonomic resolution of rbcL, matK, and ITS2 libraries for the vascular plants of Canada.
The number of species at each locale is in parentheses.
| Park (species) | Terrestrial Ecozones | Region |
|---|---|---|
| Ellesmere Island NP (135) | Northern Arctic | Arctic |
| Ivvavik NP (402) | Southern Arctic | Arctic |
| Nahanni NP (652) | Taiga Plains | Arctic |
| Torngat Mountains NP (368) | Arctic Cordillera | Arctic |
| Ukkusiksalik NP (150) | Northern Arctic | Arctic |
| Wapusk NP (272) | Hudson Plains | Arctic |
| Forillon NP (622) | Atlantic Maritime | Atlantic |
| Fundy NP (732) | Atlantic Maritime | Atlantic |
| Kejimkujik NP (578) | Atlantic Maritime | Atlantic |
| Prince Edward Island NP (648) | Atlantic Maritime | Atlantic |
| La Mauricie NP (461) | Boreal Shield | Boreal |
| Mingan Archipelago NP (441) | Boreal Shield | Boreal |
| Pukaskwa NP (545) | Boreal Shield | Boreal |
| Terra Nova NP (506) | Boreal Shield | Boreal |
| Banff NP (910) | Montane Cordillera | Pacific |
| Glacier NP (634) | Montane Cordillera | Pacific |
| Mount Revelstoke NP (435) | Montane Cordillera | Pacific |
| Pacific Rim NP (436) | Pacific Maritime | Pacific |
| Yoho NP (658) | Montane Cordillera | Pacific |
| Elk Island NP (482) | Boreal Plains | Prairies |
| Grasslands NP (427) | Prairies | Prairies |
| Prince Albert NP (643) | Boreal Plains | Prairies |
| Riding Mountain NP (714) | Prairies | Prairies |
| Waterton Lakes NP (976) | Prairies | Prairies |
| 1000 Islands NP (1631) | Mixedwood Plains | Woodland |
| Bruce Peninsula NP (877) | Mixedwood Plains | Woodland |
| Koffler Scientific Reserve (621) | Mixedwood Plains | Woodland |
| Point Pelee NP (858) | Mixedwood Plains | Woodland |
Fig 1. Coverage by barcode locus for the plant communities at 28 Canadian localities.
The number of plant species present at each site is indicated in parentheses.
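Coverage per locus can be read as the share of a site's species list that has at least one reference record for that marker. The following is a minimal sketch under that assumption; the function name `locus_coverage`, the toy species list, and the toy library contents are illustrative and not taken from the study.

```python
def locus_coverage(park_species, library):
    """Percent of a site's species with a reference record, per locus.

    park_species: list of species names on the site's plant list.
    library: dict mapping locus name -> set of species with a record.
    Both structures are hypothetical stand-ins for the real data.
    """
    n = len(park_species)
    coverage = {}
    for locus, barcoded in library.items():
        covered = sum(1 for sp in park_species if sp in barcoded)
        coverage[locus] = 100.0 * covered / n if n else 0.0
    return coverage


# Toy data (invented for illustration only).
park = ["Salix arctica", "Carex aquatilis", "Dryas integrifolia", "Poa glauca"]
library = {
    "rbcL": {"Salix arctica", "Carex aquatilis", "Dryas integrifolia", "Poa glauca"},
    "matK": {"Salix arctica", "Poa glauca"},
    "ITS2": {"Salix arctica", "Carex aquatilis", "Dryas integrifolia"},
}
coverage = locus_coverage(park, library)
print(coverage)
```

With real inputs, `park` would be one site's plant list and each set in `library` would hold the species represented for that marker in the reference library.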
Fig 2. Boxplots of MPD and MNTD for rbcL, matK, and ITS2.
Boxplots comparing MPD and MNTD for the vascular plant families of Canada for rbcL, matK, and ITS2. Significance (adjusted p < 0.005) is indicated with asterisks.
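For readers unfamiliar with the two metrics, the sketch below computes them from a symmetric pairwise distance matrix using their usual definitions: MPD averages the distances over all pairs of taxa, and MNTD averages each taxon's distance to its nearest other taxon. How the study derived its distances is not stated here, so the matrix `toy` and the function names are assumptions made purely for illustration.

```python
from itertools import combinations


def mpd(dist):
    """Mean pairwise distance over all unordered pairs of taxa."""
    n = len(dist)
    if n < 2:
        raise ValueError("need at least two taxa")
    pairs = list(combinations(range(n), 2))
    return sum(dist[i][j] for i, j in pairs) / len(pairs)


def mntd(dist):
    """Mean, over taxa, of the distance to the nearest other taxon."""
    n = len(dist)
    if n < 2:
        raise ValueError("need at least two taxa")
    nearest = [min(dist[i][j] for j in range(n) if j != i) for i in range(n)]
    return sum(nearest) / n


# Toy symmetric distance matrix for three taxa (invented values).
toy = [
    [0.00, 0.02, 0.10],
    [0.02, 0.00, 0.08],
    [0.10, 0.08, 0.00],
]
print(round(mpd(toy), 4), round(mntd(toy), 4))
```

A family with many closely related species tends to show a low MNTD even when its MPD is moderate, because MNTD looks only at each taxon's nearest neighbour.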
The mean MPD and MNTD for 25 species-rich families with the number of sampled species.
| Family | rbcL: No. of species | rbcL: MPD | rbcL: MNTD | matK: No. of species | matK: MPD | matK: MNTD | ITS2: No. of species | ITS2: MPD | ITS2: MNTD |
|---|---|---|---|---|---|---|---|---|---|
| | 85 | 0.0889 | 0.0060 | 36 | 0.1395 | 0.0227 | 75 | 0.2660 | 0.0201 |
| | 98 | 0.0315 | 0.0037 | 59 | 0.0621 | 0.0106 | 66 | 0.3154 | 0.0469 |
| | 569 | 0.0729 | 0.0022 | 239 | 0.0699 | 0.0059 | 481 | 0.3732 | 0.0209 |
| | 74 | 0.0579 | 0.0047 | 21 | 0.1065 | 0.0115 | 42 | 0.5789 | 0.0538 |
| | 233 | 0.0320 | 0.0025 | 82 | 0.0700 | 0.0153 | 208 | 0.1887 | 0.0184 |
| | 48 | 0.0756 | 0.0030 | 23 | 0.1138 | 0.0118 | 7 | 0.8123 | 0.3753 |
| | 134 | 0.0805 | 0.0075 | 41 | 0.1841 | 0.0287 | 121 | 0.3109 | 0.0325 |
| | 397 | 0.0329 | 0.0040 | 119 | 0.0552 | 0.0063 | 112 | 0.2724 | 0.0574 |
| | 71 | 0.0992 | 0.0120 | 54 | 0.1071 | 0.0135 | 61 | 0.1939 | 0.0236 |
| | 211 | 0.1602 | 0.0092 | 104 | 0.1291 | 0.0114 | 156 | 0.3646 | 0.0377 |
| | 69 | 0.0952 | 0.0040 | 39 | 0.1742 | 0.0092 | 19 | 0.2826 | 0.0081 |
| | 92 | 0.0616 | 0.0036 | 22 | 0.0844 | 0.0203 | 36 | 0.5781 | 0.1954 |
| | 59 | 0.0558 | 0.0036 | 22 | 0.0543 | 0.0150 | 54 | 0.1015 | 0.0078 |
| | 61 | 0.0690 | 0.0125 | 24 | 0.1148 | 0.0234 | 50 | 0.6388 | 0.0612 |
| | 79 | 0.0700 | 0.0077 | 24 | 0.0927 | 0.0274 | 49 | 0.3724 | 0.0516 |
| | 95 | 0.0942 | 0.0099 | 37 | 0.1654 | 0.0326 | 77 | 0.4472 | 0.0388 |
| | 438 | 0.1185 | 0.0043 | 242 | 0.0963 | 0.0083 | 235 | 0.3941 | 0.0391 |
| | 43 | 0.0341 | 0.0044 | 7 | 0.0885 | 0.0288 | 39 | 0.1060 | 0.0165 |
| | 85 | 0.0638 | 0.0053 | 25 | 0.1537 | 0.0521 | 41 | 0.5349 | 0.1005 |
| | 40 | 0.0906 | 0.0116 | 16 | 0.1063 | 0.0196 | 37 | 0.2703 | 0.0496 |
| | 120 | 0.0943 | 0.0057 | 39 | 0.1539 | 0.0181 | 93 | 0.3356 | 0.0341 |
| | 276 | 0.1021 | 0.0030 | 128 | 0.1183 | 0.0059 | 200 | 0.3677 | 0.0340 |
| | 100 | 0.0172 | 0.0005 | 82 | 0.0093 | 0.0007 | 46 | 0.0362 | 0.0128 |
| | 72 | 0.0635 | 0.0057 | 42 | 0.1318 | 0.0158 | 63 | 0.3226 | 0.0355 |
| | 35 | 0.0127 | 0.0026 | 12 | 0.0310 | 0.0138 | 29 | 0.1021 | 0.0262 |
| Mean | 143.36 | 0.0710 | 0.0056 | 61.56 | 0.1045 | 0.0171 | 95.88 | 0.3427 | 0.0559 |
| | 28.51 | 0.0552 | 0.0180 | 16.95 | 0.0770 | 0.0249 | 24.54 | 0.2823 | 0.0987 |
Level of species resolution (%) for each barcode for BLAST and mothur.
For mothur, species resolution is reported for both a posterior probability cut-off (0.95) and the true level of resolution.
| Region | Park | Blast | Mothur PP | Mothur Actual | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Arctic | Ellesmere Island NP (135, 107, 84) | 31.85 | 59.81 | 42.96 | 48.60 | 17.04 | 45.79 | |||
| Arctic | Ivvavik NP (399, 292, 298) | 34.59 | 66.78 | 36.09 | 55.14 | 24.81 | 51.71 | |||
| Arctic | Nahanni NP (643, 456, 450) | 36.86 | 69.11 | 36.39 | 54.82 | 25.51 | 55.92 | |||
| Arctic | Torngat Mountains NP (364, 260, 252) | 39.84 | 66.67 | 38.19 | 53.46 | 25.27 | 55.38 | |||
| Arctic | Ukkusiksalik NP (149, 125, 111) | 38.00 | 67.20 | 36.00 | 56.00 | 24.00 | 53.60 | |||
| Arctic | Wapusk NP (269, 224, 225) | 35.69 | 68.89 | 37.55 | 57.59 | 27.14 | 58.48 | |||
| Atlantic | Forillon NP (611, 420, 355) | 47.46 | 70.70 | 37.97 | 58.81 | 32.90 | 64.76 | |||
| Atlantic | Fundy NP (714, 442, 406) | 49.02 | 74.63 | 38.80 | 59.05 | 35.15 | 65.16 | |||
| Atlantic | Kejimkujik NP (561, 332, 319) | 52.76 | 75.55 | 41.35 | 62.35 | 40.64 | 65.83 | |||
| Atlantic | Prince Edward Island NP (633, 414, 365) | 51.82 | 76.16 | 39.97 | 61.84 | 39.18 | 67.40 | |||
| Boreal | La Mauricie NP (450, 295, 220) | 51.33 | 70.91 | 42.00 | 56.95 | 38.00 | 63.39 | |||
| Boreal | Mingan Archipelago NP (432, 299, 285) | 46.99 | 73.33 | 37.50 | 61.87 | 31.94 | 65.55 | |||
| Boreal | Pukaskwa NP (532, 379, 296) | 46.05 | 70.95 | 37.59 | 56.46 | 31.58 | 59.89 | |||
| Boreal | Terra Nova NP (495, 297, 281) | 49.29 | 71.89 | 37.98 | 64.98 | 33.94 | 68.01 | |||
| Pacific | Banff NP (886, 561, 606) | 38.37 | 70.30 | 33.97 | 55.44 | 25.96 | 58.29 | |||
| Pacific | Glacier NP (618, 418, 404) | 41.68 | 70.30 | 35.54 | 54.78 | 26.82 | 61.72 | |||
| Pacific | Mount Revelstoke NP (427, 296, 271) | 40.98 | 70.85 | 35.83 | 55.41 | 29.98 | 62.50 | |||
| Pacific | Pacific Rim NP (423, 285, 258) | 54.61 | 82.56 | 35.93 | 60.35 | 35.46 | 69.12 | |||
| Pacific | Yoho NP (643, 440, 436) | 39.50 | 72.02 | 33.59 | 56.92 | 27.84 | 59.86 | |||
| Prairies | Elk Island NP (476, 345, 299) | 41.81 | 73.58 | 32.77 | 53.62 | 29.41 | 57.68 | |||
| Prairies | Grasslands NP (408, 233, 278) | 41.91 | 70.14 | 31.37 | 52.79 | 29.66 | 58.80 | |||
| Prairies | Prince Albert NP (623, 434, 410) | 43.02 | 72.44 | 33.07 | 55.76 | 30.02 | 60.14 | |||
| Prairies | Riding Mountain NP (696, 473, 443) | 45.55 | 75.62 | 33.62 | 57.72 | 30.32 | 60.47 | |||
| Prairies | Waterton Lakes NP (949, 568, 632) | 40.67 | 72.94 | 34.25 | 54.23 | 26.98 | 59.86 | |||
| Woodland | 1000 Islands NP (1583, 880, 896) | 52.18 | 79.02 | 38.66 | 60.68 | 36.39 | 67.05 | |||
| Woodland | Bruce Peninsula NP (859, 580, 481) | 49.71 | 75.88 | 39.12 | 60.17 | 36.55 | 67.76 | |||
| Woodland | Koffler Scientific Reserve (612, 469, 320) | 53.92 | 73.13 | 39.05 | 62.26 | 40.85 | 67.59 | |||
| Woodland | Point Pelee NP (832, 513, 477) | 53.13 | 76.52 | 39.90 | 63.16 | 37.38 | 69.01 | |||
| 44.59 | 80.20 | 72.48 | 37.04 | 57.54 | 64.14 | 31.10 | 61.58 | 68.54 | ||
Fig 3. Species resolution for the three DNA barcodes (rbcL, matK, and ITS2).
Species resolution for the three DNA barcodes (rbcL, matK, and ITS2) based on A) BLAST, B) mothur with a posterior probability cut-off of 0.95, or C) the actual species resolution of mothur. Species resolution in the six biogeographic regions obtained with D) rbcL, E) matK, and F) ITS2.
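The distinction between resolution under a posterior probability cut-off and actual resolution can be illustrated with a small calculation. The sketch below assumes one reading of the two measures: the cut-off figure counts queries whose species-level assignment reaches the threshold, while the actual figure counts queries whose assigned species matches the known species regardless of the reported probability. The record layout (`predicted`, `true`, `pp`) and the toy values are hypothetical and do not reflect mothur's output format.

```python
def resolution(assignments, cutoff=0.95):
    """Return (percent passing the cut-off, percent actually correct).

    assignments: list of dicts with hypothetical keys
      'predicted' (assigned species), 'true' (known species),
      'pp' (posterior probability reported for the assignment).
    """
    n = len(assignments)
    if n == 0:
        raise ValueError("no assignments supplied")
    confident = sum(1 for a in assignments if a["pp"] >= cutoff)
    correct = sum(1 for a in assignments if a["predicted"] == a["true"])
    return 100.0 * confident / n, 100.0 * correct / n


# Toy assignments (invented for illustration only).
toy_assignments = [
    {"predicted": "sp_A", "true": "sp_A", "pp": 0.99},
    {"predicted": "sp_B", "true": "sp_C", "pp": 0.97},
    {"predicted": "sp_D", "true": "sp_D", "pp": 0.60},
    {"predicted": "sp_E", "true": "sp_E", "pp": 0.50},
]
pp_resolution, actual_resolution = resolution(toy_assignments)
print(pp_resolution, actual_resolution)
```

Under this reading the two figures can diverge in either direction: a confident assignment may be wrong, and a correct assignment may fall below the threshold.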
Percentage of taxonomic resolution by BLAST to family, genus and species.
Taxonomic resolution for rbcL, matK, and ITS2 for 25 species-rich families.
| Family | rbcL: family | rbcL: genus | rbcL: species | matK: family | matK: genus | matK: species | ITS2: family | ITS2: genus | ITS2: species |
|---|---|---|---|---|---|---|---|---|---|
| Amaranthaceae | 100.00% | 96.48% | 50.70% | 100.00% | 100.00% | 91.09% | 100.00% | 95.65% | 54.35% |
| Apiaceae | 100.00% | 87.34% | 56.54% | 100.00% | 97.86% | 94.12% | 100.00% | 97.62% | 86.19% |
| Asteraceae | 100.00% | 77.97% | 26.39% | 99.76% | 96.76% | 67.17% | 100.00% | 91.50% | 58.37% |
| Boraginaceae | 100.00% | 88.89% | 52.59% | 100.00% | 100.00% | 100.00% | 100.00% | 100.00% | 83.56% |
| Brassicaceae | 100.00% | 90.29% | 33.44% | 100.00% | 100.00% | 86.60% | 100.00% | 91.81% | 49.67% |
| Caprifoliaceae | 100.00% | 85.23% | 53.69% | 100.00% | 100.00% | 100.00% | 100.00% | 100.00% | 89.66% |
| Caryophyllaceae | 100.00% | 81.49% | 33.63% | 100.00% | 100.00% | 92.83% | 100.00% | 95.61% | 67.80% |
| Cyperaceae | 99.94% | 95.15% | 30.08% | 100.00% | 97.29% | 85.67% | 100.00% | 100.00% | 89.19% |
| Ericaceae | 100.00% | 99.62% | 64.82% | 100.00% | 100.00% | 93.22% | 100.00% | 94.68% | 72.22% |
| Fabaceae | 95.66% | 81.07% | 41.42% | 100.00% | 92.48% | 72.09% | 100.00% | 89.39% | 70.88% |
| Juncaceae | 100.00% | 99.08% | 54.29% | 100.00% | 100.00% | 86.05% | 100.00% | 100.00% | 88.81% |
| Lamiaceae | 100.00% | 88.93% | 56.15% | 100.00% | 100.00% | 88.04% | 100.00% | 99.04% | 98.08% |
| Onagraceae | 100.00% | 99.55% | 49.77% | 100.00% | 100.00% | 100.00% | 100.00% | 100.00% | 61.75% |
| Orchidaceae | 100.00% | 89.08% | 77.59% | 100.00% | 100.00% | 83.70% | 100.00% | 96.12% | 92.24% |
| Orobanchaceae | 100.00% | 99.44% | 52.54% | 100.00% | 100.00% | 76.40% | 100.00% | 100.00% | 79.35% |
| Plantaginaceae | 100.00% | 98.64% | 64.07% | 100.00% | 100.00% | 91.88% | 100.00% | 100.00% | 83.54% |
| Poaceae | 100.00% | 81.48% | 39.89% | 100.00% | 95.00% | 75.25% | 100.00% | 95.23% | 66.84% |
| Polemoniaceae | 100.00% | 96.77% | 29.03% | 100.00% | 100.00% | 100.00% | 100.00% | 100.00% | 75.86% |
| Polygonaceae | 92.61% | 89.79% | 48.59% | 100.00% | 100.00% | 94.71% | 100.00% | 100.00% | 88.67% |
| Primulaceae | 100.00% | 99.17% | 71.07% | 100.00% | 100.00% | 88.16% | 100.00% | 100.00% | 63.11% |
| Ranunculaceae | 100.00% | 97.97% | 54.07% | 100.00% | 100.00% | 85.37% | 100.00% | 100.00% | 81.75% |
| Rosaceae | 100.00% | 88.78% | 39.86% | 100.00% | 100.00% | 80.37% | 100.00% | 99.84% | 85.02% |
| Salicaceae | 100.00% | 99.76% | 12.24% | 100.00% | 100.00% | 26.47% | 100.00% | 100.00% | 30.96% |
| Saxifragaceae | 100.00% | 95.06% | 68.72% | 100.00% | 97.69% | 91.33% | 100.00% | 100.00% | 89.24% |
| Violaceae | 100.00% | 100.00% | 30.41% | 100.00% | 100.00% | 90.00% | 100.00% | 100.00% | 69.78% |
Fig 4. Level of taxonomic resolution provided by rbcL, matK or ITS2 for 25 families.
Level of taxonomic resolution provided by rbcL, matK or ITS2 for 25 families of vascular plants that are species-rich in Canada. The three colours show the proportion of species identified to the family (blue), genus (orange) or species (green) level.
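One way to picture resolution to family, genus, or species is to ask at which rank the best-scoring reference hits for a query still agree. The sketch below follows that rule as an assumption; the exact criterion used with BLAST in the study is not described in this record, and the function `resolved_rank` and the hit tuples are hypothetical.

```python
RANKS = ("family", "genus", "species")


def resolved_rank(top_hits):
    """Lowest rank at which all top-scoring hits agree, or None.

    top_hits: list of (family, genus, species) tuples for the reference
    records tied for the best score (a hypothetical input format).
    """
    if not top_hits:
        return None
    resolved = None
    for idx, rank in enumerate(RANKS):
        # Stop at the first rank where the tied hits disagree.
        if len({hit[idx] for hit in top_hits}) == 1:
            resolved = rank
        else:
            break
    return resolved


# Toy hits (invented): two tied congeners resolve only to genus.
tied = [
    ("Salicaceae", "Salix", "Salix arctica"),
    ("Salicaceae", "Salix", "Salix glauca"),
]
print(resolved_rank(tied))
```

Counting how many queries in a family stop at each rank gives the kind of family, genus, and species proportions the figure summarises.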