| Literature DB >> 31624576 |
Zhi-Peng Zhang1, Xiao-Yue Wang1, Zhao Zhang1, Hui Yao1, Xiao-Mei Zhang2, Yang Zhang1, Ben-Gang Zhang1.
Abstract
DNA barcoding is widely used in species identification, but there is considerable controversy regarding the extent of sampling in research methods. Some scholars have proposed that this small sample size underestimates the intraspecific genetic diversity, which would impact on the accuracy of DNA barcoding to identify species. In study, we selected all Phellodendron species (including P. amurense Rupr., P. chinense Schneid., and P. chinense var. glabriusculum Schneid.) as the materials, collected 59 P. amurense samples from 35 populations greatly to represent the genetic diversity, and analyzed the haplotype, genetic distance, barcoding gap, and Neighbor-Joining (NJ) trees based on psbA-trnH and internal transcribed spacer gene sequences. Additionally, a sampling simulation was conducted to assess the correlation between genetic diversity and the number of populations. Finally, analysis of critical geographical populations was performed. Based on analysis of haplotype, genetic distance, barcoding gap, and NJ trees, we found that eight P. amurense samples impacted on the effectiveness of DNA barcoding, which genetic information were very important to identify Phellodendron species. Moreover, the result of the NJ tree analysis performed the small-scale P. amurense sample size did not completely match the objective phylogenetic relationship in Phellodendron. In simulation sampling analysis, the data showed the genetic diversity indexes at the same population level gradually decreased and stabilized as the number of simulation sampling populations increased. We found that 1-2 samples from over 24 populations based on uniform geographical distribution could represent 80% of the genetic diversity of P. amurense and ensure authenticity and reliability of DNA barcoding. Thus, we proposed it is particularly important adequately samples to cover infraspecific genetic diversity in order to ensure identification accuracy of DNA barcoding.Entities:
Keywords: DNA barcoding; Phellodendron; accuracy; genetic diversity; identify species
Year: 2019 PMID: 31624576 PMCID: PMC6787823 DOI: 10.1002/ece3.5590
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Geographical distribution of 41 sample populations (red circles mark critical populations)
GenBank accession No. for the samples in this study
| Codes | Scientific name | Family | GenBank accession No. | |
|---|---|---|---|---|
| ITS |
| |||
| A1‐A59 |
| Rutaceae | MK419239–MK419297 | MK419162–MK41922 |
| B1‐B6 |
| Rutaceae | MK419298–MK419303 | MK419221–MK419226 |
| C1‐C8 |
| Rutaceae | MK419304–MK419311 | MK419227–MK419234 |
| D1‐D4 |
| Rutaceae | MK419312–MK419315 | MK419235–MK419238 |
ITS sequence haplotypes and variation sites in Phellodendron species
| Scientific name | Haplotype | Number of samples | Percentage | Sites/bp | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 1 | 1 | 2 | 3 | 4 | 4 | 4 | 4 | 5 | 6 | ||||
| 5 | 7 | 7 | 9 | 0 | 9 | 1 | 3 | 4 | 8 | 9 | 2 | ||||
| 7 | 3 | 7 | 0 | 8 | 1 | 9 | 5 | 3 | 6 | 9 | 3 | ||||
|
| A1 | 50 | 84.7 | C | C | G | G | C | C | C | C | C | C | G | A |
| A2 | 3 | 5.1 | . | . | . | . | . | T | . | . | . | . | . | . | |
| A3 | 1 | 1.7 | . | . | . | A | . | . | . | . | . | . | . | . | |
| A4 | 1 | 1.7 | . | . | . | . | . | . | G | . | . | . | . | . | |
| A5 | 1 | 1.7 | . | . | . | . | . | . | . | . | . | . | . | C | |
| A6 | 3 | 5.1 | . | . | A | . | . | . | . | . | . | . | . | . | |
|
| A7 | 2 | 33.3 | . | T | . | . | . | . | . | . | . | . | A | . |
| A8 | 2 | 33.3 | . | T | . | . | . | . | . | . | . | . | . | . | |
| A9 | 1 | 16.7 | . | T | . | . | . | . | . | . | . | T | . | . | |
| A10 | 1 | 16.7 | T | T | . | . | . | . | . | . | . | . | . | . | |
|
| A11 | 6 | 75 | . | . | . | . | T | . | . | . | . | . | . | . |
| A12 | 1 | 12.5 | . | . | . | . | . | . | . | . | A | . | . | . | |
| A13 | 1 | 12.5 | . | . | . | . | . | . | . | T | A | . | . | . | |
(.) indicated the same base as the first row.
psbA‐trnH sequence haplotypes and variation sites in Phellodendron species
| Scientific name | Haplotype | Number of samples | Percentage | Sites/ bp | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 3 | 1 | 5 | 2 | 3 | 3 | 3 | 3 | 3 | 4 | ||||
| 0 | 7 | 8 | 8 | 8 | 9 | 4 | ||||||||
| 4 | 9 | 6 | 9 | 0 | 1 | 2 | 7 | 5 | ||||||
|
| B1 | 32 | 54.1 | G | C | C | A | T | T | T | G | C | A | T |
| B2 | 5 | 8.5 | . | . | . | . | . | G | C | A | A | . | . | |
| B3 | 3 | 5.1 | . | . | T | . | . | G | C | A | A | . | . | |
| B4 | 13 | 22.1 | . | . | T | . | . | . | . | . | . | . | . | |
| B5 | 1 | 1.7 | C | . | T | . | . | . | . | . | . | . | . | |
| B6 | 2 | 3.4 | C | . | . | C | . | . | . | . | . | . | . | |
| B7 | 1 | 1.7 | . | A | . | . | . | . | . | . | . | . | . | |
| B8 | 1 | 1.7 | ‐ | G | T | . | . | . | . | . | . | . | . | |
| B9 | 1 | 1.7 | . | . | . | . | . | . | . | . | . | . | C | |
|
| B10 | 6 | 100 | . | . | . | . | G | G | C | A | A | G | . |
|
| B10 | 8 | 100 | . | . | . | . | G | G | C | A | A | G | . |
(.) indicated the same base as the first row.
Analysis of the intergenus‐specific divergence and intraspecific variation with the two barcodes
| Markers | ITS |
|
|---|---|---|
| Theta(avg_intra_avg) | 0.0018 ± 0.0020 | 0.0018 ± 0.0026 |
| coalescent depth(avg_intra_max) | 0.0048 ± 0.0023 | 0.0081 ± 0.0115 |
| All intraspecific distance(avg_between_intraspecies) | 0.0005 ± 0.0010 | 0.0035 ± 0.0041 |
| Theta prime(avg_interbyG_avg) | 0.0024 ± 0.0010 | 0.0137 ± 0.0041 |
| minimum interspecific distance(avg_interbyG_min) | 0.0016 ± 0.0010 | 0.0046 ± 0.0041 |
| all interspecific distance | 0.0024 ± 0.0009 | 0.0137 ± 0.0036 |
Figure 2Relative distribution of the interspecific and intraspecific variation using the two barcodes based on the K2P genetic distance
Figure 3Phellodendron species NJ tree with the psbA‐trnH sequence (The bootstrap scores [1,000 replicates] are shown for each branch)
Figure 4Typical sample NJ tree pattern for reciprocal monophyly based on small‐scale P. amurense with the psbA‐trnH sequence (The bootstrap scores [1,000 replicates] is shown for each branch)
Figure 5Typical NJ tree pattern of paraphyly based on small‐scale P. amurense samples with the psbA‐trnH sequence (The bootstrap scores [1,000 replicates] are shown for each branch)
Figure 6Haplotype number versus population number curve
Figure 7Scatter plot of haplotype diversity
Figure 8Scatter plot of nucleotide diversity