Literature DB >> 30505665

Accurate authentication of Dendrobium officinale and its closely related species by comparative analysis of complete plastomes.

Shuying Zhu1,2, Zhitao Niu1, Qingyun Xue1, Hui Wang1, Xuezhu Xie1, Xiaoyu Ding1.   

Abstract

Owing to its great medicinal and ornamental values, Dendrobium officinale is frequently adulterated with other Dendrobium species on the market. Unfortunately, the utilization of the common DNA markers ITS, ITS2, and matK+rbcL is unable to distinguish D. officinale from 5 closely related species of it (D. tosaense, D. shixingense, D. flexicaule, D. scoriarum and D. aduncum). Here, we compared 63 Dendrobium plastomes comprising 40 newly sequenced plastomes of the 6 species and 23 previously published plastomes. The plastomes of D. officinale and its closely related species were shown to have conserved genome structure and gene content. Comparative analyses revealed that small single copy region contained higher variation than large single copy and inverted repeat regions, which was mainly attributed to the loss/retention of ndh genes. Furthermore, the intraspecific sequence variability among different Dendrobium species was shown to be diversified, which necessitates a cautious evaluation of genetic markers specific for different Dendrobium species. By evaluating the maximum likelihood trees inferred from different datasets, we found that the complete plastome sequence dataset had the highest discriminatory power for D. officinale and its closely related species, indicating that complete plastome sequences can be used to accurately authenticate Dendrobium species.

Entities:  

Keywords:  Authentication; BS, bootstrap value; CE, capillary electrophoresis; Complete plastome sequence; Dendrobium officinale; Genetic marker; HPLC, high-pressure liquid chromatography; IR, inverted repeat region; Indels, insertions/deletions; LSC, large single copy region; ML, maximum likelihood; NGS, next-generation sequencing; Plastomic comparison; SNPs, single nucleotide polymorphisms; SSC, small single copy region; SV, sequence variability.

Year:  2018        PMID: 30505665      PMCID: PMC6251949          DOI: 10.1016/j.apsb.2018.05.009

Source DB:  PubMed          Journal:  Acta Pharm Sin B        ISSN: 2211-3835            Impact factor:   11.413


Introduction

The genus Dendrobium, one of the largest genera in the family Orchidaceae, comprises approximately 1200–1500 species, mainly distributed in tropical and subtropical Asia and eastern Australia1, 2. There are about 80 species of this genus in China. Dendrobium orchids are one of the most well known orchids in global horticultural trade due to their beautiful flowers and ideal characteristics as houseplants. Moreover, many species in this genus have been extensively used as traditional herbal medicine in many Asian countries for hundreds of years. Dendrobium officinale Kimura et Migo is a rare and endangered species endemic to China, mainly distributed in southern provinces such as Yunnan, Guizhou, Guangxi, Fujian and Zhejiang5, 6. However, because of its high medicinal and ornamental values, D. officinale has often been adulterated with other Dendrobium species on the market7, 8. Dendrobium species are notoriously difficult to identify due to their similar appearance and tissue structure9, 10, 11. Usually, their identification relies heavily on morphological methods as well as phytochemical approaches such as capillary electrophoresis (CE) and high-pressure liquid chromatography (HPLC). Unfortunately, these methods are unable to distinguish D. officinale from several Dendrobium species, especially 5 closely related species of it (D. tosaense Makino; D. shixingense Z. L. Chen, S. J. Zeng et J. Duan; D. flexicaule Z. H. Tsi, S. C. Sun et L. G. Xu; D. scoriarum W. W. Smith; D. aduncum Lindl.)14, 15 because of their close affinities. Recently, molecular techniques have been used in the authentication studies of Dendrobium, and a great number of molecular markers have been developed to identify D. officinale and its closely related species3, 16, 17, 18. Moreover, a single or a combination of DNA barcodes has also been adopted to infer their species relationships14, 19, 20, 21, 22, 23. However, the identification of D. officinale and its close relatives remains difficult because of the high similarities in their genetic backgrounds. For instance, by using 5 DNA markers (ITS, rbcL, matK, psbA-trnH and trnL intron) to construct the relationships among 109 Dendrobium species, the relationships among most of these Dendrobium species were clearly settled, yet D. officinale and its closely related species were still nested with each other. Xu et al. employed the sequence combination of ITS+matK to identify Dendrobium species, which showed high specificity for most of the species, but failed to distinguish D. officinale and its closely related species. Therefore, it is desirable to develop an effective method for authenticating D. officinale and its closely related species. Chloroplasts, the photosynthetic plastids, are usually uniparentally inherited and have their own genomes called plastomes. In general, plastomes of seed plants have relatively small sizes, conserved gene contents, and dense coding regions as compared with nuclear and mitochondrial genomes. With these unique features, plastomes can be easily assembled and compared. With the advance of next-generation sequencing (NGS), the number of complete plastome sequences has increased rapidly, which provided the opportunity to identify species using the complete plastome sequences. For example, Nock et al. employed complete plastome sequences for the species identification studies of grasses. Using the similar approach, Hu et al. established a phylogeny for the species of Orychophragmus and discriminated 6 close relatives in this genus, and Zhang et al. successfully distinguished 9 species of Echinacea. The positive results in these studies implied that the complete plastome sequences had the potential ability to clarify the relationships of D. officinale and its closely related species. In addition, the mutational hotspots among Dendrobium species were assessed by Niu et al., with 10 loci being put forth as mutational hotspots for Dendrobium plastomes. However, it remains to be determined whether these loci are still maintaining the highest degree of sequence variability within species. In this study, 63 Dendrobium plastomes were compared, comprising 40 newly sequenced plastomes of D. officinale and its closely related species as well as 23 previously published plastomes. Our aims were: (1) to characterize the plastomes of D. officinale and 5 close relatives of it regarding genome structure, gene content, and sequence divergence; (2) to detect the intraspecific plastomic mutational hotspots that are useful for population studies and conservation genetic research; (3) to assess the species discriminatory power of the complete plastome sequences for D. officinale and its closely related species. Based on the comparison of different sequence datasets, our results showed that complete plastome sequences could be used for the authentication study of D. officinale and its closely related species.

Materials and methods

Plant materials and DNA extraction

A total of 40 representative individuals were collected from the main distribution areas of D. officinale and 5 closely related species of it: D. tosaense, D. shixingense, D. flexicaule, D. scoriarum (D. guangxiense), and D. aduncum (Table 130, 31). All the plant samples were identified by Prof. Xiaoyu Ding and grown in a greenhouse at Nanjing Normal University, China. Total genomic DNA of each sample was extracted from 2 g fresh leaves using Dneasy Plant Mini Kits (QIAGEN, Germany). DNA samples that met the quality requirements (A260/280 ratio=1.8–2.0, A260/230 ratio >1.7, and DNA concentration >100 ng/µL) were used for sequencing.
Table 1

Sampling information and plastome characteristics of D. officinale and its closely related species.

No.SpeciesCodeLocationPlastome length (bp)LSC length (bp)SSC length (bp)IR length (bp)AT content (%)VoucherAccession No.
1D. officinaleDoWSWenshan, Yunnan Province15215685016145222630962.52ZSY01005LC331062
2D. officinaleDoGNGuangnan, Yunnan Province15201884910145142629762.50ZSY01011LC348520
3D. officinaleDoSPShiping, Yunnan Province15224685106145102631562.54ZSY01017LC348521
4D. officinaleDoXYXingyi, Guizhou Province15216585025145222630962.56ZSY01213LC348522
5D. officinaleDoSDSandu, Guizhou Province15202984919145162629762.46ZSY01808LC348523
6D. officinaleDoTETian׳e, Guangxi Province15204284920145222630062.51ZSY01710LC348524
7D. officinaleDoGLGuilin, Guangxi Province15216785040145092630962.52ZSY01709LC348525
8D. officinaleDoSGShaoguan, Guangdong Province15222485094145142630862.53ZSY01622LC348526
9D. officinaleDoHSHuoshan, Anhui Province15216385033145142630862.52ZSY01521LC348527
10D. officinaleDoSYShaoyang, Hunan Province15202384931144762630862.51ZSY01423LC348528
11D. officinaleDoLNLongnan, Jiangxi Province15222685096145142630862.53ZSY01314LC348529
12D. officinaleDoJGJinggangshan, Jiangxi Province15206684936145142630862.51ZSY01315LC348530
13D. officinaleDoLCLiancheng, Fujian Province15216385031145142630962.52ZSY01807LC348725
14D. officinaleDoLSLishui, Zhejiang Province15221985104145152630062.61ZSY01918LC348531
15D. officinaleDoYDYandang, Zhejiang Province15222185109145162629862.53Luo et al.30
16D. officinaleDoZJZhejiang Province15201884944145062628462.43Yang et al.31
17D. tosaenseDtTDTaidong, Taiwan15225385094145072632162.63ZSY03221LC348532
18D. tosaenseDtHLHualian, Taiwan15225085091145172632162.53ZSY03226LC348720
19D. tosaenseDtLNLongnan, Jiangxi Province15225685099145082632062.53ZSY03325LC348721
20D. shixingenseDsDADu’an, Guangxi Province15212385063145022627962.50ZSY09212LC348726
21D. shixingenseDsGLGuilin, Guangxi Province15219585076145012630962.56ZSY09331LC348722
22D. shixingenseDsLZLiuzhou, Guangxi Province15217585056145012630962.52ZSY09330LC348723
23D. shixingenseDsCZChenzhou, Hunan Province15218385062145032630962.50ZSY09688LC348724
24D. shixingenseDsNXNanxiong, Guangdong Province15218485063145032630962.52ZSY09008LC348861
25D. shixingenseDsSXShixing, Guangdong Province15217885057145032630962.52ZSY09001LC348860
26D. shixingenseDsLNLongnan, Jiangxi Province15217885058145022630962.52ZSY09335LC348863
27D. shixingenseDsQNQuannan, Jiangxi Province15218185061145022630962.52ZSY09213LC348862
28D. flexicauleDfGZGanzi, Sichuan Province15219185030144832633962.55ZSY05441LC348965
29D. flexicauleDfGLGanluo, Sichuan Province15224585045144922635462.56ZSY05225LC348856
30D. flexicauleDfSNShennongjia, Hubei Province15225285049144952635462.54ZSY05002LC348854
31D. flexicauleDfNZNanzhao, Henan Province15224285039144952635462.53ZSY05008LC348855
32D. scoriarumDgDBDebao, Guangxi Province15198284929144492630262.52ZSY02011LC348847
33D. scoriarumDgHCHechi, Guangxi Province15199284933144552630262.52ZSY02022LC348848
34D. scoriarumDgXLXilin, Guangxi Province15199884941144552630162.53ZSY02035LC348849
35D. scoriarumDgFNFuning, Yunnan Province15199584936144552630262.52ZSY02215LC348852
36D. scoriarumDgWSWenshan, Yunnan Province15199484938144542630162.53ZSY02205LC348851
37D. scoriarumDgXCXichou, Yunnan Province15197884921144552630162.52ZSY02238LC348853
38D. scoriarumDgXYXingyi, Guizhou Province15199084934144422630762.42ZSY02106LC348864
39D. scoriarumDgALAnlong, Guizhou Province15198584951144502629262.53ZSY02118LC348850
40D. aduncumDaLFLuofushan, Guangdong Province15211284952145222631962.52ZSY06111LC348858
41D. aduncumDaTYTaoyuan, Hunan Province15210484944145222631962.46ZSY06116LC348859
42D. aduncumDaXYXingyi, Guizhou Province15212384961145242631962.48ZSY06088LC348857
Sampling information and plastome characteristics of D. officinale and its closely related species.

DNA sequencing, plastome assembly, annotation, and PCR-based validation

Paired-end sequencing of 150 bp was conducted on an Illumina Hiseq. 4000 platform, and >6 Gb of sequence data for each sample was obtained. The raw reads were trimmed with an error probability <0.05 and by removing one nucleotide at both terminal ends, and then assembled on CLC Genomics Workbench 8.5.1 (CLC Bio, Aarhus, Denmark) by using the de novo assembling method coupled with reference-guided assembling method as described by Niu et al. The plastome of D. officinale NC_024019 was used as reference. The gaps and 4 junctions between inverted repeat (IR) regions and single copy (SC) regions were confirmed by PCR amplification and Sanger sequencing with specific primers. The complete plastome sequences were annotated by using the online program DOGMA. The tRNA genes were detected with tRNAscan-SE 1.2. The exact boundaries of each gene were manually checked by comparing them with homologous genes of other plastomes in the genus of Dendrobium.

The analysis of plastomic sequence divergence of D. officinale and its closely related species

In addition to 40 plastomes newly sequenced in this study, two published plastomes of D. officinale with the GenBank accession numbers NC_024019 and KJ862886 were also retrieved for comparative analyses. Full alignments of 42 these plastomes were performed using mVISTA program, with D. officinale (NC_024019) used as the reference. To estimate the sequence divergence of different regions of the plastomes, sequences of coding regions and non-coding regions (introns, intergenic spacer regions, and pseudogenes) were retrieved from these 42 plastomes. The syntenic loci were aligned using MUSCLE 3.8.31 implemented in MEGA 5.2. The sequences of non-coding regions were firstly aligned with the default parameters and then realigned with the “Refining” option. The sequences of protein-coding genes were aligned with the Align Codons option using the default parameters. The gaps located at the 5′- and 3′-ends of alignments were excluded. Then, single nucleotide polymorphisms (SNPs) and insertions/deletions (Indels) were identified by DnaSP v5.

Estimates of sequence variability

The intraspecific-level sequence variability (SV) for each intergenic and intronic locus with the length of more than 150 bp was estimated according to the formula described by Niu et al. as follows: SV (%)=(The number of SNPs+the number of Indel events) / (The number of conserved sites+the number of SNPs+the number of Indel events)×100. For each locus, there were 120, 3, 28, 12, 28, and 3 pairwise alignments for D. officinale, D. tosaense, D. shixingense, D. flexicaule, D. scoriarum, and D. aduncum, respectively. Finally, we determined the average SV of each syntenic locus for each species.

Species authentication analyses

To determine whether the complete plastome sequences or commonly used DNA markers have higher discriminatory power for D. officinale and its closely related species, a total of 10 DNA regions (ITS, ITS2, matK, rbcL, psbA-trnH, trnT-trnL, rpl32-trnL, clpP-psbB, trnL intron, and rps16-trnQ) and the complete plastome sequences of 72 samples were employed for the authentication studies (Supplementary Information Table S1). The sequences of DNA markers were aligned with MEGA 5.2. The complete plastome sequences were aligned with MAFFT v7.221 and then adjusted manually in MEGA 5.2. All of the gaps and ambiguous sites were removed. Subsequently, these sequences were categorized into 9 datasets: (1) ITS, (2) ITS2, (3) ITS+matK, (4) ITS+psbA-trnH, (5) ITS2+rbcL, (6) matK+rbcL, (7) ITS+matK+rbcL, (8) trnT-trnL+rpl32-trnL+clpP-psbB+trnL intron+rps16-trnQ, and (9) the complete plastome sequences. Maximum likelihood (ML) trees for the 9 datasets were reconstructed using RAxML 8.0.2 with the GTRGAMMA model. A thousand bootstrap replicates were executed to estimate the robustness of the ML trees. Moreover, given that different regions of plastomes vary in molecular evolutionary rates, we also reconstructed ML trees using the following datasets: (1) the large single copy (LSC) regions, (2) the small single copy (SSC) regions, (3) the IRs, (4) protein-coding genes, and (5) non-coding regions.

Statistical analyses

Statistical analyses were performed by using SPSS Statistics 22.0.

Results

Plastome features

A total of 40 complete plastomes from D. officinale and 5 closely related species of it were sequenced, which came from 14, 3, 8, 4, 8, and 3 individual plants of D. officinale, D. tosaense, D.shixingense, D. flexicaule, D. scoriarum, and D. aduncum, respectively (Table 130, 31). Additionally, two published plastomes of D. officinale (NC_024019 and KJ862886) were also included in comparative analyses. The plastome length of D. officinale ranged from 152,018 to 152,246 bp. All of the 16 plastomes displayed the typical quadripartite structure comprising a pair of IRs (26,284–26,315 bp) separated by the LSC (84,910–85,109 bp) and SSC (14,476–14,522 bp) regions. The overall AT content ranged from 62.43% to 62.61%. Each of these plastomes contained 103 unique genes (Fig. 1) consisting of 69 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Nine pseudogenes were detected, comprising 7 ndh remnants (ψndhA, D, E, F, G, H and J) and 2 incompletely duplicated genes at the IR/SC boundaries (ψrpl22 and ψycf1). The coding regions occupied 53.88–53.97% of the complete plastome. Non-coding regions that were composed of pseudogenes, introns, and intergenic spacers occupied 3.33–3.36%, 11.70–11.74%, and 30.95–31.04% of the plastome sequences, respectively. On the other hand, the plastomes of other 5 species had the lengths of 151,978–152,256 bp, and also possessed highly conserved structure, gene content and order.
Figure 1

Plastome map of D. officinale and its closely related species. The genes outside and inside the circle are transcribed clockwise and counterclockwise, respectively. The length of the plastomes among the six species ranges from 151,978 to 152,256 bp.

Plastome map of D. officinale and its closely related species. The genes outside and inside the circle are transcribed clockwise and counterclockwise, respectively. The length of the plastomes among the six species ranges from 151,978 to 152,256 bp. The sequences flanking IR/SC junctions were compared between D. officinale and its closely related species (Fig. 2). The boundaries of IR/LSC rarely changed among the 6 species. The junctions of IRa/SSC were located in the 5′ end of ycf1, which resulted in a duplicated ψycf1 in the IRb regions. The length of ψycf1 was 309 bp consistently in D. officinale, D. tosaense, and D. shixingense, while varing slightly among D. flexicaule, D. scoriarum, and D. aduncum (being 342, 326, and 318 bp, respectively). The IRb/SSC junctions of D. officinale, D. tosaense, and D. shixingense were located upstream of ψndhF by 3 bp; whereas the junctions of D. flexicaule, D. scoriarum, and D. aduncum were expanded into 3′ end of ψndhF, resulting in an overlap of ψycf1 and ψndhF by approximately 10 bp. These results revealed that the expansion/contraction of IRs was conversed among D. officinale and its closely related species.
Figure 2

Comparison of the regions flanking IR/SC junctions among D. officinale and its closely related species. The plastome of Phalaenopsis equestris is used as the reference.

Comparison of the regions flanking IR/SC junctions among D. officinale and its closely related species. The plastome of Phalaenopsis equestris is used as the reference.

Plastomic sequence divergence

The sequence divergence of 42 plastomes of D. officinale and its closely related species were estimated. As expected, the non-coding regions exhibited higher divergence levels than the coding regions (Fig. 3). Based on the plastome-wide investigation, totally 1168 SNPs and 452 Indels were detected among these plastomes (Table 2 and Supplementary information Table S2), with the average densities of 7.6 SNPs per kb and 3.0 Indels per kb across the complete plastome. Most of the variants (585 SNPs and 355 Indels) were located in intergenic spacers. In addition, the SNP and Indel distribution were also compared among the LSC, SSC, and IR regions. The remarkably higher SNP and Indel densities in SSC region indicated that the variation of SSC was higher than that of LSC and IR regions. The drastic variation of SSC region might be related to the independent loss/retention of ndh genes. Therefore, in order to determine the effects of the loss/retention of ndh genes on the variation of SSC, we divided the SSC sequences into 3 parts: ndh pseudogenes, intergenic spacer regions adjacent to ndh pseudogenes, and the other regions (Table 3). Our comparison results showed a higher density and number of SNPs and Indels in ndh pseudogenes and their adjacent intergnic spacer regions, suggesting that the loss/retention of ndh genes played an important role in causing the high variation of SSC.
Figure 3

Sequence identity plots among the plastomes of D. officinale and its closely related species with D. officinale (NC_024019) sequence as the reference by using mVISTA. Each species is represented by one accession.

Table 2

The SNPs and Indels in 42 complete plastomes of D. officinale and its closely related species.

RegionCoding region
Intergenic spacer
Intron
Pseudogene
Summary
SNPIndelSNPIndelSNPIndelSNPIndelSNPIndel
LSC174 (4.0)3 (0.07)425 (13.6)280 (9.0)112 (10.6)42 (4.0)7 (15.9)2 (4.6)718 (8.4)327 (3.8)
SSC105 (15.1)6 (0.9)104 (31.7)33 (10.1)91 (20.9)24 (5.6)300 (20.6)63 (4.3)
IRs66 (2.1)10 (0.3)56 (4.2)42 (3.1)26 (3.5)10 (1.4)2 (5.8)0150 (2.8)62 (1.2)
Total345 (4.2)19 (0.2)585 (12.2)355 (7.4)138 (7.7)52 (2.9)100 (19.5)26 (5.1)1168 (7.6)452 (3.0)

The figures in the bracket indicate the density (per kb) of SNPs or Indels in a target region.

Table 3

The SNPs and Indels in SSC regions of D. officinale and its closely related species.

RegionSNPIndel
Ndh pseudogene91 (21.1)24 (5.6)
Adjacent regions of ndh pseudogene77 (38.3)20 (10.0)
Other regions132 (16.0)19 (2.3)
Ndh pseudogene + adjacent regions of ndh pseudogene168 (26.6)44 (7.0)

The figures in the bracket indicate the density (per kb) of SNPs or Indels in a target region.

Sequence identity plots among the plastomes of D. officinale and its closely related species with D. officinale (NC_024019) sequence as the reference by using mVISTA. Each species is represented by one accession. The SNPs and Indels in 42 complete plastomes of D. officinale and its closely related species. The figures in the bracket indicate the density (per kb) of SNPs or Indels in a target region. The SNPs and Indels in SSC regions of D. officinale and its closely related species. The figures in the bracket indicate the density (per kb) of SNPs or Indels in a target region.

Diversified evolution of plastomic sequence variability among Dendrobium species

Recent studies showed diversified evolution of the plastome sequence among different orchid genera. In this study, to determine whether the evolution of SV was conserved within the genus of Dendrobium, we estimated the pairwised intraspecific SV of 90 syntenic non-coding loci including intergenic spacers and introns of D. officinale and its closely related species (Supplementary Information Table S3). Moreover, we also performed correlation tests for the 90 non-coding loci between the 5 species. Although they are closely related to each other, our results showed that the SV values were statistically uncorrelated (P>0.05) or weakly correlated (Spearman׳s r=0.208 to 0.458, P<0.05) between the 5 species. These results suggested that the evolution of SV among different Dendrobium species was variable. The top-12 mutational hotspots that contain the highest intraspecific variability (SV>0.1%) and the lowest interspecific variability (SV<3%) of D. officinale were shown in Fig. 4. However, with the comparison of the SV values of other 4 species, our results have shown that: (1) none of top-12 hotspots is common to all the 5 species; (2) only 3 of them (rps2-rpoC2, accD-psaI, and matK-5′trnK) are present in more than 3 species. This pointed to the fact that the plastomic mutational hotspots for the intraspecific-level studies of Dendrobium species were diversified.
Figure 4

The pairwised intraspecific sequence variability (SV) of 90 syntenic non-coding loci among D. officinale and its closely related species. (A) The comparison of interspecific and intraspecific SV values for the loci in the plastome of D. officinale. The interspecific SV values of these loci were obtained from Niu et al. (B) The twelve intergenic and intronic loci with the highest SV values in the plastome of D. officinale. (C) The comparison of interspecific and intraspecific SV values for the loci in the plastome of D. shixingense, D. flexicaule, D. scoriarum and D. aduncum, respectively. The intraspecific SV values of D. tosaense were excluded because of the limited variable sites. Only three loci of rps2-rpoC2, accD-psaI and matK-5′trnK (in red) are present in more than three species.

The pairwised intraspecific sequence variability (SV) of 90 syntenic non-coding loci among D. officinale and its closely related species. (A) The comparison of interspecific and intraspecific SV values for the loci in the plastome of D. officinale. The interspecific SV values of these loci were obtained from Niu et al. (B) The twelve intergenic and intronic loci with the highest SV values in the plastome of D. officinale. (C) The comparison of interspecific and intraspecific SV values for the loci in the plastome of D. shixingense, D. flexicaule, D. scoriarum and D. aduncum, respectively. The intraspecific SV values of D. tosaense were excluded because of the limited variable sites. Only three loci of rps2-rpoC2, accD-psaI and matK-5′trnK (in red) are present in more than three species. DNA barcodes or plastome mutational hotspots, such as ITS ITS2, ITS+matK, and trnT-trnL+rpl32-trnL+clpP-psbB+trnL intron+rps16-trnQ, were demonstrated to have high discriminatory power for most Dendrobium species. However, as shown in Fig. 5 and Supplementary Information Fig. S1, they all failed to distinguish D. officinale and its closely related species. For example, in the ML tree inferred from ITS2 dataset, D. officinale, D. tosaense, D. shixingense, and D. scoriarum were nested with each other. Though the species of D. flexicaule formed a monophyletic group in the ML trees based on the ITS and ITS2+rbcL datasets, the support values were below 50%.
Figure 5

ML trees of D. officinale and its closely related species inferred from the complete plastome (left) and commonly used DNA markers (right). Each of the six species is color coded. Numbers near the nodes are bootstrap support values (only values >50% are shown).

ML trees of D. officinale and its closely related species inferred from the complete plastome (left) and commonly used DNA markers (right). Each of the six species is color coded. Numbers near the nodes are bootstrap support values (only values >50% are shown). By contrast, our analyses using the complete plastome dataset achieved satisfactory results as follows: (1) the phylogenetic tree yielded high resolution (bootstrap value (BS)>85%) for all tree nodes with few exceptions within some species; (2) the individuals of each of the 6 species were resolved as a monophyletic group. Furthermore, the complete plastome dataset also showed higher resolution than other plastome-scale datasets, i.e., LSC, SSC, IRs, protein-coding genes, and non-coding sequences (Supplementary Information Fig. S2). These results indicated that the complete plastome sequences could be used to identify D. officinale and its closely related species.

Discussion

Variations in the SSC regions of D. officinale and its closely related species were mainly contributed by the loss/retention of ndh genes

Although the plastomes of land plants are conserved in terms of genome structure and gene content, some structure changes (e.g., inversions, rearrangements, and IR expansion/contraction) and the sequence variations have been detected in complete plastomes. In this study, while the plastomes of D. officinale and its closely related species were found to be highly conserved in plastome structure, and gene content and order, sequence variations (SNPs and Indels) were also observed in these plastomes. SNP loci and Indel events are very useful resources for phylogenetic analysis and species identification; SNPs have been successfully used to infer the phylogenetic relationships within the genus of Citrus, and Indels have been employed to design specific primers for authenticating buckwheat species. Recently, Niu et al. revealed a nonrandom location of Indels in Dendrobium plastomes. In line with their findings, the present study demonstrated higher densities of SNPs and Indels in SSC than in LSC and IR regions, pointing to the fact that both Indels and SNPs were nonrandomly distributed in plastomes of D. officinale and its closely related species. The remarkably higher SNP and Indel densities in SSC might be caused by 2 factors: (1) the drastic expansion/contraction of IRs and (2) the independent loss/retention of ndh genes. The expansion/contraction of IRs is known to be largely responsible for the variation of plastomes in different orchids46, 47 and many other species48, 49, 50. However, the current research showed that only slight changes occurred in the sequences flanking IR/SC boundaries among the plastomes of D. officinale and its closely related species, suggesting that the effect of expansion/contraction of IRs on the variation of SSC was negligible. This signified that the loss/retention of ndh genes was a potential determinant for the variation of SSC. Recent studies revealed an independent loss/retention of ndh genes in SSC region of Dendrobium plastomes, which led us to expect a higher variation in their located regions. Indeed, we found that ndh pseudogenes and their adjacent intergenic spacer regions possessed a higher density and a larger number of variants (SNPs and Indels) than the other regions in SSC. Therefore, we infer that the loss/retention of ndh genes mainly accounted for the variation of SSC.

Plastome-wide comparison is required for studies of orchid species at different levels

Mutational hotspots of plastome sequences are the most useful tools for the systematic and phylogeographic studies at the different taxonomic levels. However, Shaw et al. showed that the hotspot regions were diverse among different plant lineages. For orchid species, different mutational hotspots have been proposed for different orchid genera, i.e., the loci of rpl32-trnL, trnH-psbA, trnE-trnT, trnK-rps16, and trnT-trnL were employed for the species identification study of Cymbidium; and the loci of trnS-trnG, psaC-ndhE, clpP-psbB, rpl16 intron, rpoB-trnC, trnT-psbD, rbcL-accD, rpl32-trnL, ccsA-ndhD, and ndhC-trnV were listed as the top-10 mutational hotspots for the genus of Phalaenopsis. Recently, on the basis of comprehensive plastome-wide comparison, Niu et al. found that the mutational hotspots for orchid species were genus specific; they also proposed the most valuable hotspot combination for the interspecific-level studies of Dendrobium genus. However, the mutational hotspots for the intraspecific-level studies of Dendrobium species were still uncertain. For example, the loci of trnC-petN and trnE-trnT were employed to investigate the phylogeographic relationship among different population of D. moniliforme, while the loci of accD-psaI, trnC-petN, and rps15-ycf1 were used to infer the phylogeographic history of D. officinale. Thus, aiming to identify the mutational hotspots or combinations applicable for the intraspecific-level studies of Dendrobium species, we estimated the pairwised intraspecific SV of 90 syntenic non-coding loci of D. officinale and its closely related species. Nevertheless, our careful examination failed to select a common mutational hotspot for the 5 species D. officinale, D.shixingense, D. flexicaule, D. scoriarum, and D. aduncum, although they are closely related to each other. Our results indicated that the mutational hotspots for the intraspecific-level studies of Dendrobium species were diversified. Therefore, it is necessary to make a cautious evaluation of genetic markers specific for different Dendrobium species. Considering diversified mutational hotspots among different orchid genera and different species, we proposed that the plastome-wide comparison is required for studies of orchid species at different levels.

Complete plastome sequences can be used for the authentication of D. officinale and its closely related species

The accurate identification of medicinal plants is essential to their safe utilization and genetic resource conservation. However, Dendrobium species have been documented to be difficult for discrimination analysis due to their similar appearance and tissue structure9, 10, 11. Among them, D. officinale and its closely related species are a particularly difficult group for identification studies and phylogenetic analyses due to the overlapping morphological variations14, 54, 55, complex evolutionary histories, and the lack of effective molecular markers. Previous molecular identification and phylogenetic studies about this group were generally focused on one or a few DNA regions, but none of them successfully resolved relationships among the 6 species, which was mainly due to the inadequate variations provided by a limited number of DNA loci14, 19, 20, 21, 22, 23. Recently, complete plastome sequences containing massive variable sites have been successfully applied to authenticate species and resettle the phylogenetic relationships in taxonomically difficult groups, such as Orychophragmus, Cymbidium, and Schima. Thus, to overcome the disadvantage of lacking variations for DNA regions, the complete plastome was exploited to authenticate D. officinale and its closely related species and to resolve their relationships. In comparison with previous studies, we proposed that the method of discriminating D. officinale and its closely related species by using the complete plastome sequences was highly reliable, accurate and feasible, as demonstrated by 3 facts as follows. Firstly, the relationships among D. officinale and its closely related species were resolved with high support values (BS>85%). As mentioned above, the complete plastome could provide sufficient informative sites, which can help to infer robust relationships for intractable groups at low taxonomic levels52, 56, 57. Secondly, the monophyletic groups of D. officinale, D. tosaense, D. shixingense, D. flexicaule, D. scoriarum, and D. aduncum resolved in this study were based on a comprehensive taxon sampling, which included representative individuals of documented main distribution areas for the 6 species. In previous studies, the sampling of these species was relatively limited. Moreover, D. shixingense (a recently reported species) or D. tosaense was always not included. In addition, other 21 Dendrobium species, which were often used as the adulterants of D. officinale on the marker, were also sampled in this study. Thirdly, it is becoming simple and relatively inexpensive to obtain the complete plastome. With the development of NGS technologies, the sequencing cost has fallen sharply. Furthermore, the approach of assembling plastome from genomic sequencing data has led to a convenient way to generate complete plastome sequences. Therefore, the use of complete plastome is a promising way for authenticating Dendrobium species. Dendrobium species are famous for their great medical values, which has led to many adulterants sold as the Material Medica from Dendrobium. Unfortunately, the use of many effective DNA markers, such as rbcL, matK and even the sequences of ITS or ITS2, could not effectively identify the Material Medica from Dendrobium, especially for D. officinale and its closely related species because of their close genetic relationships. In this study, we successfully authenticated D. officinale and its closely related species by using their complete plastome sequences, indicating that the complete plastome could be considered an efficient super-barcode58, 59 for the authentication of Material Medica from Dendrobium. Compared to conventional barcoding approaches, using the complete plastome as super-barcode has many advantages, including higher accuracy in taxonomically difficult groups, and universality, as it does not require the use of taxon-specific primers60, 61. On the other hand, the main challenges of super-barcoding are the establishment of a rich plastome database and the improvement of processing power and analytical capacity for big data. Nevertheless, with the rapid advancement of molecular technologies and methodologies, we believe that it will soon be practical to apply the complete plastomes to the authentication studies of most land plants.

Conclusions

In conclusion, this is the first study to authenticate the taxonomically difficult group of D. officinale and its closely related species. Firstly, we investigated the relationship between plastome variations and loss/retention of ndh genes, and found that the variations in the SSC regions of D. officinale and its closely related species were mainly contributed by the loss/retention of ndh genes. Then, based on plastome-wide comparison, our analysis revealed the diversified evolution of SV among these plastomes, signifying that the plastome-wide comparison is essential for studies of Dendrobium species at different levels. Most importantly, after having carefully examined the ML trees inferred from different sequence datasets, we proposed that D. officinale and its closely related species could be unequivocally distinguished using the complete plastome sequences.
  48 in total

1.  Automatic annotation of organellar genomes with DOGMA.

Authors:  Stacia K Wyman; Robert K Jansen; Jeffrey L Boore
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

2.  Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA.

Authors:  Nolan Kane; Saemundur Sveinsson; Hannes Dempewolf; Ji Yong Yang; Dapeng Zhang; Johannes M M Engels; Quentin Cronk
Journal:  Am J Bot       Date:  2012-02-01       Impact factor: 3.844

3.  Genetic diversity analysis and conservation of the endangered Chinese endemic herb Dendrobium officinale Kimura et Migo (Orchidaceae) based on AFLP.

Authors:  Xuexia Li; Xiaoyu Ding; Bihai Chu; Qi Zhou; Ge Ding; Sun Gu
Journal:  Genetica       Date:  2007-09-06       Impact factor: 1.082

4.  Chloroplast DNA sequence utility for the lowest phylogenetic and phylogeographic inferences in angiosperms: the tortoise and the hare IV.

Authors:  Joey Shaw; Hayden L Shafer; O Rayne Leonard; Margaret J Kovach; Mark Schorr; Ashley B Morris
Journal:  Am J Bot       Date:  2014-10-30       Impact factor: 3.844

5.  Phylogeography of the endangered orchid Dendrobium moniliforme in East Asia inferred from chloroplast DNA sequences.

Authors:  Meirong Ye; Wei Liu; Qingyun Xue; Beiwei Hou; Jing Luo; Xiaoyu Ding
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2016-12-08       Impact factor: 1.514

6.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

7.  Loss of different inverted repeat copies from the chloroplast genomes of Pinaceae and cupressophytes and influence of heterotachy on the evaluation of gymnosperm phylogeny.

Authors:  Chung-Shien Wu; Ya-Nan Wang; Chi-Yao Hsu; Ching-Ping Lin; Shu-Miaw Chaw
Journal:  Genome Biol Evol       Date:  2011-09-19       Impact factor: 3.416

8.  Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus.

Authors:  Zhitao Niu; Qingyun Xue; Hui Wang; Xuezhu Xie; Shuying Zhu; Wei Liu; Xiaoyu Ding
Journal:  Int J Mol Sci       Date:  2017-11-02       Impact factor: 5.923

9.  Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

Authors:  Zhitao Niu; Jiajia Pan; Shuying Zhu; Ludan Li; Qingyun Xue; Wei Liu; Xiaoyu Ding
Journal:  Front Plant Sci       Date:  2017-10-04       Impact factor: 5.753

10.  Seven New Complete Plastome Sequences Reveal Rampant Independent Loss of the ndh Gene Family across Orchids and Associated Instability of the Inverted Repeat/Small Single-Copy Region Boundaries.

Authors:  Hyoung Tae Kim; Jung Sung Kim; Michael J Moore; Kurt M Neubig; Norris H Williams; W Mark Whitten; Joo-Hwan Kim
Journal:  PLoS One       Date:  2015-11-11       Impact factor: 3.240

View more
  17 in total

1.  Development, Identification, and Application of a Germplasm Specific SCAR Marker for Dendrobium officinale Kimura et Migo.

Authors:  Kaixin Zheng; Yuchen Cai; Weijie Chen; Yadi Gao; Jingjing Jin; Huizhong Wang; Shangguo Feng; Jiangjie Lu
Journal:  Front Plant Sci       Date:  2021-05-14       Impact factor: 5.753

2.  Comparative Plastid Genomics of Neotropical Bulbophyllum (Orchidaceae; Epidendroideae).

Authors:  Michelle Zavala-Páez; Leila do Nascimento Vieira; Valter Antônio de Baura; Eduardo Balsanelli; Emanuel Maltempi de Souza; Marco Cerna Cevallos; Mark W Chase; Eric de Camargo Smidt
Journal:  Front Plant Sci       Date:  2020-07-03       Impact factor: 5.753

3.  The complete chloroplast genome sequence of the medicinal plant Sophora tonkinensis.

Authors:  Fan Wei; Danfeng Tang; Kunhua Wei; Fang Qin; Linxuan Li; Yang Lin; Yanxia Zhu; Aziz Khan; Muhammad Haneef Kashif; Jianhua Miao
Journal:  Sci Rep       Date:  2020-07-27       Impact factor: 4.379

4.  Comparative and Phylogenetic Analyses of Ginger (Zingiber officinale) in the Family Zingiberaceae Based on the Complete Chloroplast Genome.

Authors:  Yingxian Cui; Liping Nie; Wei Sun; Zhichao Xu; Yu Wang; Jing Yu; Jingyuan Song; Hui Yao
Journal:  Plants (Basel)       Date:  2019-08-12

5.  Microfluidic Enrichment Barcoding (MEBarcoding): a new method for high throughput plant DNA barcoding.

Authors:  Morgan R Gostel; Jose D Zúñiga; W John Kress; Vicki A Funk; Caroline Puente-Lelievre
Journal:  Sci Rep       Date:  2020-05-26       Impact factor: 4.379

6.  The complete chloroplast genome sequence of Liparis vivipara (Orchidaceae).

Authors:  Diyang Zhang; Ding-Kun Liu; Yang Hao; Si-Ren Lan; Zhong-Jian Liu
Journal:  Mitochondrial DNA B Resour       Date:  2019-07-11       Impact factor: 0.658

7.  Plastome phylogenomics of Cephalotaxus (Cephalotaxaceae) and allied genera.

Authors:  Yunheng Ji; Changkun Liu; Jacob B Landis; Min Deng; Jiahui Chen
Journal:  Ann Bot       Date:  2021-04-17       Impact factor: 4.357

8.  Identification and phylogenetic analysis of five Crataegus species (Rosaceae) based on complete chloroplast genomes.

Authors:  Liwei Wu; Yingxian Cui; Qing Wang; Zhichao Xu; Yu Wang; Yulin Lin; Jingyuan Song; Hui Yao
Journal:  Planta       Date:  2021-06-28       Impact factor: 4.116

9.  Genome skimming and exploration of DNA barcodes for Taiwan endemic cypresses.

Authors:  Chung-Shien Wu; Edi Sudianto; Yu-Mei Hung; Bo-Cyun Wang; Chiun-Jr Huang; Chi-Tsong Chen; Shu-Miaw Chaw
Journal:  Sci Rep       Date:  2020-11-26       Impact factor: 4.379

10.  Comparative and Phylogenetic Analysis of the Complete Chloroplast Genomes of Three Paeonia Section Moutan Species (Paeoniaceae).

Authors:  Liwei Wu; Liping Nie; Zhichao Xu; Pei Li; Yu Wang; Chunnian He; Jingyuan Song; Hui Yao
Journal:  Front Genet       Date:  2020-09-18       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.