| Literature DB >> 31892714 |
Yun Song1, Yongjiang Zhang1, Jin Xu1, Weimin Li1, MingFu Li2.
Abstract
The pantropical plant genus Dalbergia comprises approximately 250 species, most of which have a high economic and ecological value. However, these species are among the most threatened due to illegal logging and the timber trade. To enforce protective legislation and ensure effective conservation of Dalbergia species, the identity of wood being traded must be accurately validated. For the rapid and accurate identification of Dalbergia species and assessment of phylogenetic relationships, it would be highly desirable to develop more effective DNA barcodes for these species. In this study, we sequenced and compared the chloroplast genomes of nine species of Dalbergia. We found that these chloroplast genomes were conserved with respect to genome size, structure, and gene content and showed low sequence divergence. We identified eight mutation hotspots, namely, six intergenic spacer regions (trnL-trnT, atpA-trnG, rps16-accD, petG-psaJ, ndhF-trnL, and ndhG-ndhI) and two coding regions (ycf1a and ycf1b), as candidate DNA barcodes for Dalbergia. Phylogenetic analyses based on whole chloroplast genome data provided the best resolution of Dalbergia, and phylogenetic analysis of the Fabaceae showed that Dalbergia was sister to Arachis. Based on comparison of chloroplast genomes, we identified a set of highly variable markers that can be developed as specific DNA barcodes.Entities:
Mesh:
Year: 2019 PMID: 31892714 PMCID: PMC6938520 DOI: 10.1038/s41598-019-56727-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene map of the Dalbergia chloroplast genome. Genes on the inner circle are transcribed in the clockwise direction and those on the outer circle are transcribed in the counterclockwise direction. Genes in different functional groups are shown in different colors. The inner circle with thick lines indicates the extent of the inverted repeats (IRa and IRb) that separate the genomes into small single-copy (SSC) and large single-copy (LSC) regions.
Summary statistics for the assembly of the chloroplast genomes of nine Dalbergia species.
| Species | Total | LSC | IR | SSC | Total | Protein coding genes | tRNA | rRNA | GC% | Accession number in Genbank |
|---|---|---|---|---|---|---|---|---|---|---|
| 156,580 | 85,888 | 25,683 | 19,326 | 110 | 76 | 30 | 4 | 36.1 | MN251247 | |
| 156,568 | 85,895 | 25,683 | 19,307 | 110 | 76 | 30 | 4 | 36.1 | MN251242 | |
| 156,211 | 85,614 | 25,665 | 19,267 | 110 | 76 | 30 | 4 | 36.1 | MN251246 | |
| 155,726 | 85,343 | 25,673 | 19,037 | 110 | 76 | 30 | 4 | 36.1 | MN251249 | |
| 156,069 | 85,809 | 25,702 | 18,856 | 110 | 76 | 30 | 4 | 36.1 | MN251244 | |
| 156,546 | 85,765 | 25,677 | 19,427 | 110 | 76 | 30 | 4 | 35.9 | MN251248 | |
| 156,698 | 86,036 | 25,692 | 19,278 | 110 | 76 | 30 | 4 | 36.0 | MN251243 | |
| 156,055 | 85,765 | 25,702 | 18,886 | 110 | 76 | 30 | 4 | 36.1 | MN251241 | |
| 156,586 | 85,894 | 25,683 | 19,326 | 110 | 76 | 30 | 4 | 36.1 | MN251245 |
Figure 2Analysis of perfect simple sequence repeats (SSRs) in nine Dalbergia chloroplast genomes. (A) The number of SSRs detected in the nine chloroplast genomes; (B) The frequency of identified SSRs in large single-copy (LSC), inverted repeat (IR,) and small single-copy (SSC) regions; (C) The number of SSR types detected in the nine sequenced chloroplast genomes; (D) The frequency of identified SSR motifs in different repeat class types.
Figure 3Analysis of repeated sequences in nine sequenced Dalbergia chloroplast genomes.
Figure 4Sliding window analysis of the Dalbergia chloroplast genomes (window length: 600 bp; step size: 50 bp). X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity in each window.
Variability of eight novel markers and the three universal chloroplast DNA barcodes in Dalbergia.
| Markers | Length (bp) | Variable sites | Information sites | Mean distance | Number of Haplotypes | Nucleotide diversity | ||
|---|---|---|---|---|---|---|---|---|
| Numbers | % | Numbers | % | |||||
| 954 | 75 | 7.86% | 46 | 4.82% | 0.034 | 8 | 0.0296 | |
| 985 | 64 | 6.50% | 44 | 4.47% | 0.031 | 8 | 0.02813 | |
| 747 | 58 | 7.76% | 39 | 5.22% | 0.032 | 6 | 0.02974 | |
| 1,155 | 77 | 6.67% | 50 | 4.33% | 0.028 | 7 | 0.02699 | |
| 2,246 | 183 | 8.15% | 119 | 5.30% | 0.037 | 7 | 0.02889 | |
| 1,652 | 121 | 7.32% | 67 | 4.06% | 0.031 | 8 | 0.02582 | |
| 1,230 | 98 | 7.97% | 68 | 5.53% | 0.032 | 7 | 0.03118 | |
| 1,180 | 92 | 7.80% | 63 | 5.34% | 0.031 | 8 | 0.02902 | |
| 305 | 10 | 3.28% | 2 | 0.66% | 0.01 | 6 | 0.01121 | |
| 1,427 | 23 | 1.61% | 16 | 1.12% | 0.006 | 7 | 0.00642 | |
| 1,556 | 54 | 3.47% | 36 | 2.31% | 0.013 | 6 | 0.01283 | |
Figure 5Phylogenetic tree reconstruction of 81 taxa using maximum likelihood (ML) and Bayesian inference (BI) methods based on 81 genes in the chloroplast genome sequences. ML topology shown with ML bootstrap support value/Bayesian posterior probability presented at each node. Nodes with 100 BP/1.0 PP are not marked.