| Literature DB >> 35448806 |
Joanna Sokołowska1, Hanna Fuchs2, Konrad Celiński1.
Abstract
The internal transcribed spacer 2 (ITS2) is one of the best-known universal DNA barcode regions. This short nuclear region is commonly used not only to discriminate taxa, but also to reconstruct phylogenetic relationships. However, the efficiency of using ITS2 in these applications depends on many factors, including the family under study. Pinaceae represents the largest family of extant gymnosperms, with many species of great ecological, economic, and medical importance. Moreover, many members of this family are representatives of rare, protected, or endangered species. A simple method for the identification of Pinaceae species based on DNA is necessary for their effective protection, authentication of products containing Pinaceae representatives, or phylogenetic inference. In this study, for the first time, we conducted a comprehensive study summarizing the legitimacy of using the ITS2 region for these purposes. A total of 368 sequences representing 71 closely and distantly related taxa of the seven genera and three subfamilies of Pinaceae were characterized for genetic variability and divergence. Intra- and interspecies distances of ITS2 sequences as well as rates of sequence identification and taxa discrimination among Pinaceae at various taxonomic levels, i.e., the species complex, genus, subfamily, and family, were also determined. Our study provides a critical assessment of the suitability of the ITS2 nuclear DNA region for taxa discrimination among Pinaceae. The obtained results clearly show that its usefulness for this purpose is limited.Entities:
Keywords: DNA barcoding; ITS2; Pinaceae; internal transcribed spacer; taxa identification
Year: 2022 PMID: 35448806 PMCID: PMC9029031 DOI: 10.3390/plants11081078
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Values of basic parameters characterizing genetic variation in ITS2 sequences in Pinaceae. AL—alignment length; CS—conserved sites; VS-variable sites; PIS—parsimony informative sites; SS—singleton sites; OMD—overall mean distance. Values in brackets are given as percentages.
| Taxonomic Group/Level | Name | AL | CS | VS | PIS | SS | OMD |
|---|---|---|---|---|---|---|---|
| Species complex |
| 243 | 243 (100.00) | 0 (0.00) | 0 (0.00) | 0 (0.00) | 0.000 |
| Genera |
| 246 | 169 (68.70) | 75 (30.49) | 27 (10.98) | 48 (19.51) | 0.016 |
|
| 249 | 118 (47.39) | 122 (48.99) | 0 (0.00) | 121 (48.5) | 0.236 | |
|
| 247 | 237 (95.95) | 10 (4.05) | 7 (2.83) | 3 (1.21) | 0.026 | |
|
| 233 | 172 (73.82) | 61 (26.18) | 15 (6.44) | 46 (19.74) | 0.013 | |
|
| 232 | 232 (100.00) | 0 (0.00) | 0 (0.00) | 0 (0.00) | 0.000 | |
|
| 236 | 232 (98.31) | 4 (1.69) | 3 (1.27) | 1 (0.42) | 0.004 | |
|
| 252 | 70 (27.78) | 177 (70.24) | 73 (28.97) | 101 (40.0) | 0.077 | |
| Subfamily | Abietoideae | 254 | 66 (25.98) | 186 (73.23) | 88 (34.65) | 97 (38.19) | 0.074 |
| Laricoideae | 233 | 148 (63.52) | 85 (36.48) | 48 (20.60) | 37 (15.88) | 0.023 | |
| Pinoideae | 252 | 61 (24.21) | 187 (74.21) | 107 (42.4) | 77 (30.56) | 0.149 | |
| Family | Pinaceae | 270 | 17 (6.30) | 250 (92.59) | 196 (72.5) | 49 (18.15) | 0.342 |
Values of genetic divergence indices in ITS2 sequences in Pinaceae.
| Taxonomic Group/Level | Name | All Interspecific | Minimum Interspecific Distance | All Intraspecific Distance | Theta | Coalescent Depth |
|---|---|---|---|---|---|---|
| Species complex |
| 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 |
| Genera |
| 0.0175 ± 0.0006 | 0.0000 ± 0.0000 | 0.0085 ± 0.0012 | 0.0176 ± 0.0039 | 0.0190 ± 0.0046 |
|
| 0.1998 ± 0.0973 | 0.0000 ± 0.0000 | 0.1972 ± 0.1919 | 0.1995 ± 0.0250 | 0.2958 ± 0.2874 | |
|
| 0.0356 ± 0.0119 | 0.0291 ± 0.0109 | 0.0061 ± 0.0061 | 0.0258 ± 00820 | 0.0062±0.0061 | |
|
| 0.0137 ± 0.0006 | 0.0000 ± 0.0000 | 0.0067 ± 0.0010 | 0.0130 ± 0.0033 | 0.0271 ± 0.0074 | |
|
| 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | |
|
| 0.0048 ± 0.0009 | 0.0000 ± 0.0000 | 0.0017 ± 0.0012 | 0.0043 ± 0.0023 | 0.0024 ± 0.0018 | |
|
| 0.0392 ± 0.0257 | 0.0000 ± 0.0000 | 0.0959 ± 0.0050 | 0.0768 ± 0.0102 | 0.0486 ± 0.0259 | |
| Subfamily | Abietoideae | 0.1089±0.0082 | 0.0000 ± 0.0000 | 0.0370 ± 0.0291 | 0.0665 ± 0.0065 | 0.0562 ± 0.0389 |
| Laricoideae | 0.0436 ± 0.0083 | 0.0000 ± 0.0000 | 0.0063 ± 0.0011 | 0.0222 ± 0.0037 | 0.0249 ± 0.0071 | |
| Pinoideae | 0.1702 ± 0.0059 | 0.0000 ± 0.0000 | 0.0342 ± 0.0229 | 0.1492 ± 0.0157 | 0.0396 ± 0.0210 | |
| Family | Pinaceae | 0.3586 ± 0.0041 | 0.0000 ± 0.0000 | 0.0294 ± 0.0141 | 0.3417 ± 0.0287 | 0.0423 ± 0.0162 |
Figure 1Abundance of all intraspecific and all interspecific K2P pairwise distance in Pinaceae.
Figure 2Species discrimination rates based on BLAST1 method.
Percentage of correct, incorrect and ambiguous sequence identifications based on the ‘best match’ and ‘best close match’ with TaxonDNA software.
| Taxonomic Group/Level | Name | Best Match (BM) | Best Close Match (BCM) | |||||
|---|---|---|---|---|---|---|---|---|
| Correct | Incorrect | Ambiguous | Correct | Incorrect | Ambiguous | No Match | ||
| Species complex |
| 0.00 | 0.00 | 100.00 | 0.00 | 0.00 | 100.00 | 0.00 |
| Genera |
| 16.19 | 8.57 | 75.24 | 16.19 | 8.57 | 75.24 | 0.00 |
|
| 11.11 | 0.00 | 88.89 | 11.10 | 0.00 | 88.89 | 0.00 | |
|
| 100.00 | 0.00 | 0.00 | 100.00 | 0.00 | 0.00 | 0.00 | |
|
| 21.43 | 3.57 | 75.00 | 21.43 | 2.68 | 75.00 | 0.89 | |
|
| 100.00 | 0.00 | 0.00 | 100.00 | 0.00 | 0.00 | 0.00 | |
|
| 20.83 | 0.00 | 79.17 | 20.83 | 0.00 | 79.17 | 0.00 | |
|
| 16.22 | 9.01 | 74.77 | 15.32 | 7.21 | 72.07 | 5.40 | |
| Subfamily | Abietoideae | 17.80 | 8.47 | 73.73 | 17.80 | 7.63 | 73.73 | 0.84 |
| Laricoideae | 23.48 | 3.48 | 73.04 | 23.40 | 2.61 | 75.00 | 0.87 | |
| Pinoideae | 17.04 | 7.40 | 75.50 | 16.30 | 5.93 | 73.33 | 4.44 | |
| Family | Pinaceae | 19.20 | 6.52 | 74.10 | 19.00 | 5.43 | 73.38 | 2.17 |