Literature DB >> 30847083

DNA barcoding of Corydalis, the most taxonomically complicated genus of Papaveraceae.

Feng-Ming Ren1,2, Ying-Wei Wang3, Zhi-Chao Xu1, Ying Li1, Tian-Yi Xin1, Jian-Guo Zhou1, Yao-Dong Qi1, Xue-Ping Wei1, Hui Yao1, Jing-Yuan Song1.   

Abstract

The genus Corydalis is recognized as one of the most taxonomically challenging plant taxa. It is mainly distributed in the Himalaya-Hengduan Mountains, a global biodiversity hotspot. To date, no effective solution for species discrimination and taxonomic assignment in Corydalis has been developed. In this study, five nuclear and chloroplast DNA regions, ITS, ITS2, matK, rbcL, and psbA-trnH, were preliminarily assessed based on their ability to discriminate Corydalis to eliminate inefficient regions, and the three regions showing good performance (ITS, ITS2 and matK) were then evaluated in 131 samples representing 28 species of 11 sections of four subgenera in Corydalis using three analytical methods (NJ, ML, MP tree; K2P-distance and BLAST). The results showed that the various approaches exhibit different species identification power and that BLAST shows the best performance among the tested approaches. A comparison of different barcodes indicated that among the single barcodes, ITS (65.2%) exhibited the highest identification success rate and that the combination of ITS + matK (69.6%) provided the highest species resolution among all single barcodes and their combinations. Three Pharmacopoeia-recorded medicinal plants and their materia medica were identified successfully based on the ITS and ITS2 regions. In the phylogenetic analysis, the sections Thalictrifoliae, Sophorocapnos, Racemosae, Aulacostigma, and Corydalis formed well-supported separate lineages. We thus hypothesize that the five sections should be classified as an independent subgenus and that the genus should be divided into three subgenera. In this study, DNA barcoding provided relatively high species discrimination power, indicating that it can be used for species discrimination in this taxonomically complicated genus and as a potential tool for the authentication of materia medica belonging to Corydalis.

Entities:  

Keywords:  Corydalis; DNA barcoding; biodiversity hotspot; matK; species identification; the internal transcribed spacer

Year:  2019        PMID: 30847083      PMCID: PMC6392370          DOI: 10.1002/ece3.4886

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

The Himalaya–Hengduan Mountains represent a global biodiversity hotspot with high levels of biodiversity and endemism and has recently become a priority conservation area due to the negative effects of climate change and intensive human activities in this region (Yan et al., 2014). Corydalis DC., the largest genus of Papaveraceae (Zhang, Su, & Liden, 2008), is an important component of the biodiversity in the Himalaya–Hengduan Mountains. This genus originated from the Hengduan Mountains and was recently distributed from this region to the Qinghai–Tibet Plateau (Linden, Fukuhara, & Axberg, 1995; Wu, 1996). Due to the complicated geological history of this region and the dramatic variations in its local climates and topography (Yan et al., 2014), as well as the reticulate evolution and intensive differentiation in the phylogenesis of Corydalis (Wang, 2006; Wu, 1996), this genus exhibits high levels of morphological and habitat diversity. Some species grow in specialized habitats, such as dry limestone cliffs (Figure 1) and alpine hillsides (Zhang et al., 2008), which are inaccessible. Most species exhibit complicated morphological characteristics. The leaves, subterranean organs, fruits, seeds, and particularly the floral structures of Corydalis species are very complex and show high variability, which seriously hampers accurate species discrimination and taxonomic assignment. Species identification is a precondition of biodiversity conservation and is also fundamental to almost all disciplines of botany (Chen et al., 2016). However, due to its complicated morphological characteristics, arduous procedures for sample collection, the absence of seasoned specialists, and the limitations of traditional morphology‐based taxonomy, Corydalis is recognized as one of the most taxonomically complicated plant taxa.
Figure 1

The specialized habitats of Corydalis saxicola in dry limestone cliffs

The specialized habitats of Corydalis saxicola in dry limestone cliffs DNA barcoding, which uses one or several short standardized DNA regions for species identification (CBOL Plant Working Group, 2009; Hebert, Cywinska, Ball, & deWaard, 2003; Kress, Wurdack, Zimmer, Weigt, & Janzen, 2005), exhibits the outstanding advantage of not being affected by morphological characteristics. This technique has been widely applied for species identification (Barco, Raupach, Laakmann, Neumann, & Knebelsberger, 2016; Liu, Provan, Gao, & Li, 2012; Yan et al., 2014), the discovery of new or cryptic species (Huemer, Karsholt, & Mutanen, 2014; Liu, Möller, Gao, Zhang, & Li, 2011; Zemlak, Ward, Connell, Holmes, & Hebert, 2009), the assessment of biodiversity (Ji et al., 2013; Lahaye et al., 2008; Taberlet, Coissac, Pompanon, Brochmann, & Willerslev, 2012), the characterization of genetic diversity (Sucher & Carles, 2008; Zhang, Niu, Guo, Wang, & Eaton, 2015), and the identification of species used in traditional Chinese medicine (Chen et al., 2010; Yuan et al., 2015). DNA barcoding has been shown to be an effective method for a variety of applications that involve species identification (Yan et al., 2014), but its performance in this taxonomically complicated genus has not been evaluated. Various regions in the nuclear and chloroplast genomes have been proposed as DNA barcodes for plants. The psbA‐trnH intergenic spacer region has been proposed as a DNA barcode for land plants (Kress et al., 2005). Portions of the plastid coding genes rbcL and matK have been suggested as the core barcodes to establish a barcoding database for plant species (CBOL Plant Working Group, 2009). The nuclear internal transcribed spacer (ITS), which has a high rate of nucleotide substitution and thus relatively high discrimination power, has been proposed to be incorporated into a core barcode for seed plants (China Plant BOL Group, 2011). The ITS2 region, a subregion of ITS, has been selected as a valuable sequence tag for the identification of medicinal plants and materia medica (Chen et al., 2010; Han et al., 2013; Yao et al., 2010). However, no single barcode for plants can perform as well as COI does in animals (Hollingsworth, Graham, & Little, 2011). In fact, individual barcodes usually exhibit unequal species discriminatory ability in different plant groups, and therefore, it is necessary to select appropriate barcodes for Corydalis. Furthermore, several different analytical methods, such as tree‐based, distance‐based, sequence similarity‐based, and character‐based methods, have been used for the assessment of species discrimination ability (Austerlitz et al., 2009; Frezal & Leblois, 2008; Li et al., 2011; Sandionigi et al., 2012; Yan et al., 2014). Different analytical methods typically show dissimilar species discrimination power on the same datasets (Kool et al., 2012; Li et al., 2011; van Velzen, Weitschek, Felici, & Bakker, 2012; Yan et al., 2014), but the discrimination ability of different analytical methods in Corydalis remains unknown. An ideal DNA barcode should have a highly universal single primer pair, provide high‐quality bidirectional sequences, have a high discriminatory power among species (CBOL Plant Working Group, 2009; Kress et al., 2005; Lahaye et al., 2008), and exhibit a “barcode gap” between intraspecific and interspecific genetic divergences (Lahaye et al., 2008; Meyer & Paulay, 2005). We preliminarily evaluated the discrimination ability of the five most commonly used regions, ITS, ITS2, matK, rbcL, and psbA‐trnH, for Corydalis. The sequence data for this preliminary evaluation were downloaded from NCBI and have been used in previous molecular phylogenetic studies of Papaveraceae and Corydalis (Linden, Fukuhara, Rylander, & Oxelman, 1997; Zhang et al., 2015; Zhang, Wang, & Yang, 2016). The results showed that the rbcL and psbA‐trnH regions were too well conserved to be able to sufficiently discriminate among species of Corydalis (Supporting Information Figures S2 and S3), whereas ITS, ITS2, and matK exhibited high rates of nucleotide substitution and a relatively high species discrimination rate. In this study, we chose three regions (ITS, ITS2, and matK) as barcodes for Corydalis and performed a systematic comparison of different regions and analytical methods to evaluate primer universality, the DNA barcoding gap and species discrimination efficiency. Our objectives were to (a) determine the performance of DNA barcoding in this taxonomically complex genus, (b) evaluate the species discrimination power of different barcodes, and (c) compare the species resolution rates of different analytical methods. Based on the results obtained in this study, we also discuss the subdivision of Corydalis and the potential implications of DNA barcoding in this genus.

MATERIALS AND METHODS

Sampling strategy

A total of 131 individuals representing 28 Corydalis species, including four Pharmacopoeia‐recorded medicinal plants (C. yanhusuo, C. decumbens, C. saxicola, and C. bungeana) and their crude drugs (nine leaf specimens of C. saxicola and three tuber specimens of each of the other three species), and two outgroups (Lamprocapnos spectabilis and Papaver somniferum) were used in this study (Table S1). Sequences for 118 individuals of 22 species were obtained de novo, and sequences for 13 individuals were obtained from GenBank. The GenBank sequences are derived from published articles, and we rechecked the sequences through BLAST with conspecifics or closely related species to ensure their correctness. All specimens were identified simultaneously by Professor YingWei Wang (Institute of Botany, Chinese Academy of Sciences), Doctor YaoDong Qi (Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences), and Professor ZhengYu Liu (Chongqing Institute of Medicinal Plant Cultivation). Voucher specimens were deposited at the Institute of Medicinal Plant Development in the Chinese Academy of Medical Sciences. Healthy and fresh leaves of wild plants were collected and dried immediately in silica gel for DNA extraction.

DNA extraction, PCR amplification, and sequencing

Genomic DNA was extracted from silica gel‐dried leaves using a Plant Genomic DNA Kit (Tiangen Biotech, Beijing, China) according to the manufacturer's recommended protocol, which was optimized with slight modifications. These modifications included the addition of more lysis buffer GP1 and cleaning the products one to three times using a nucleus separation liquid until the supernatant layer became lightly colored or colorless. The PCR amplification of the ITS and matK regions was conducted using a Peltier Thermal Cycler PTC2000 (Bio‐Rad) with approximately 30 ng of genomic DNA as the template in 25 μl of 2× Taq PCR MasterMix (Aidlab Biotechnologies, Beijing, China), 1 μl of each primer (2.5 μM, synthesized by Sangon, Shanghai, China), and distilled–deionized water. The primers used for PCR and sequencing and the PCR cycling conditions used in this study are provided in Table 1. For ITS, the primer pair ITSa and ITSb was used for the samples for which the initial PCR amplification or sequencing failed (Table 1, Supporting Information Figure S1). The PCR products were run on a 1.0% agarose gel in 0.5× TBE buffer to assess the success of the amplification and purified with a 1.0% agarose gel using the AXYGEN Gel Recovery Kit (AXYGEN, Hangzhou City, China). The purified PCR products were then sequenced in both directions using the DYEnamic ET Terminator Cycle Sequencing Kit (Amersham Pharmacia Biotech) according to the manufacturer's instructions and analyzed with an ABI 3730XL DNA Sequencer (Applied Biosystems, Foster City, CA, USA).
Table 1

Primers, their sequences, and the PCR amplification conditions used in this study

RegionsPrimer pairsSequence 5ʹ−3ʹThermocycling conditionsSource
ITSITS‐5FGGAAGTAAAAGTCGTAACAAGG94°C 3 min; [35 cycles: 94°C 60 s, 53°C 90 s, 72°C 90 s]; 72°C 10 minWhite, Bruns, Lee, & Taylor, (1990)
ITS‐4RCCTTATCATTTAGAGGAAGGAG
ITSaITS‐5FGGAAGTAAAAGTCGTAACAAGG94°C 3 min; [35 cycles: 94°C 60 s, 53°C 90 s, 72°C 90 s]; 72°C 10 minChen et al. (2010) and by Authors
ITS‐300RATTCACACCAAGTATCGCAT
ITSbITS2‐2FATTCACACCAAGTATCGCAT
ITS‐4RCCTTATCATTTAGAGGAAGGAG
matK 3F‐KIMCGTACAGTACTTTTGTGTTTACGAG94°C 3 min; [35 cycles: 94°C 60 s, 52°C 60 s, 72°C 90 s]; 72°C 10 minCBOL Plant Working Group (2009)
1R‐KIMACCCAGTCCATCTGGAAATCTTGGTTC
Primers, their sequences, and the PCR amplification conditions used in this study

Data analysis

All raw sequences, excluding primer regions, were assembled and edited with Condon Code Aligner V 5.1.5 (Condon Code Co., USA). The ITS2 sequences were obtained by removing the conserved 5.8S rRNA and ITS1 sequences of ITS using HMMer, which is based on a hidden Markov model. Sequence alignments for each region were performed using MEGA6.06 (Center for Evolutionary Medicine and Information, USA). Pairwise interspecific and intraspecific genetic distances were calculated based on the Kimura 2‐parameter (K2P) mode. Multiple sequence alignments were concatenated into a single file using Geneious Pro version 4.8.5 (Drummond et al., 2009). To evaluate the success of species discrimination, the three markers and their possible combinations were analyzed using three widely used methods, namely, tree‐based, similarity‐based, and distance‐based methods. For the tree‐based methods, three different phylogenetic trees, namely, neighbor‐joining (NJ) tree, maximum parsimony (MP) tree, and maximum likelihood (ML) tree, were evaluated to select the most suitable tree. The three trees were constructed with MEGA6.06 according to published protocols for species‐level discrimination within closely related groups (Liu et al., 2012; Tamura, Dudley, Nei, & Kumar, 2007; Yuan et al., 2015). Species discrimination was considered successful if all conspecific individuals formed a single clade. For the similarity‐based method (BLAST), NCBI BLAST 2.2.29+ (Tao, 2010; Yan et al., 2014) was used to build local reference databases, and all the sequences were then queried using the blastn command. Species discrimination was considered successful if all individuals of a species had a top matching hit of only a conspecific individual (Yan et al., 2014). For the distance‐based analysis, we used the K2P‐distance method (Little & Stevenson, 2007). Species discrimination was considered successful if the minimum interspecific K2Pdistance involving a species was larger than the maximum intraspecific distance for that species (CBOL Plant Working Group, 2009; China Plant BOL Group, 2011).

RESULTS

Barcode universality and sequence characteristics

The separate evaluations of the success rates of PCR amplification and sequencing revealed that the proportions of ITS and matK regions were both 100%. With regard to primer universality, ITS exhibited high amplification success with the commonly used primer pair ITS5F/4R, but high‐quality bidirectional sequences from 14 samples (11.0%) could not be generated using this primer pair. Nevertheless, these samples were successfully amplified and sequenced using the primer pair ITSa and ITSb (Tables 1 and 2). Thus, the ITS barcode could be successfully amplified and sequenced from all the samples, and ITS2 sequences were then obtained from the ITS sequences. All matK sequences obtained in this study were generated by direct sequencing.
Table 2

Evaluation of three DNA barcoding regions

DNA regionsITS matK ITS2
Percentage PCR success (%)100100/
Sequencing using a single primer pair (%)89.0100/
No. accessions/total121121121
Sequence length (bp)519–581842–858222–246
Aligned sequence length (bp)680883275
No. variable sites (%)258 (37.9)219 (26.3)123 (44.7)
No. informative sites (%)194 (28.5)167 (18.9)94 (34.2)
No. indel (length in bp)26 (1–22)10 (1–12)9 (1–22)
Intraspecific distance mean (range)0.0103 (0–0.0317)0.0016 (0–0.0062)0.0159 (0–0.0579)
Interspecific distance mean (range)0.0705 (0.0024–0.1185)0.0474 (0–0.0878)0.1026 (0.0014–0.1981)
Evaluation of three DNA barcoding regions A total of 393 sequences (ITS, matK, and ITS2) for 131 individuals, including 354 sequences from 118 individuals that were newly generated in this study, were available for further analysis. The sequence characteristics of the three regions are summarized in Table 2. The five species with only a single individual and two outgroups (a total of 10 sequences) were excluded from this analysis. Among the three regions, ITS showed the greatest variability in length (519–581 bp) and the highest number of indels (26 indels within a 680‐bp aligned sequence). In contrast, matK showed the longest aligned sequence length, the least variability in length (842–858 bp), and the lowest numbers of variable sites (219, 26.3%), informative sites (167, 18.9%) and indels (10 indels within an 883‐bp aligned sequence). ITS2 exhibited the highest numbers of variable sites (123, 44.7%) and informative sites (94, 34.2%; Table 2).

Genetic distance and DNA barcoding gap assessment

Of the three regions, ITS2 exhibited the greatest intraspecific and interspecific distances, followed by ITS, and matK showed the lowest values (Table 2). The relative distribution of K2P distances based on single barcodes and barcode combinations is shown in Figure 2.
Figure 2

Relative distributions of intraspecific and interspecific K2P distances

Relative distributions of intraspecific and interspecific K2P distances In general, the mean interspecific distances were greater than the mean intraspecific distances for all three barcodes (Table 2). The rank of the three sequences in terms of mean sequence divergences in Corydalis was ITS2>ITS>matK. The barcoding gaps obtained with the single barcodes and barcode combinations are shown in Figure 2.

Comparison of species resolution with different barcodes and their combinations

Among the single barcodes, ITS exhibited the highest success rates for the identification of Corydalis species, ITS2 showed a lower identification success rate, and matK provided the lowest identification success rate (Table 3). Three of the four Pharmacopoeia‐recorded medicinal plants (C. yanhusuo, C. decumbens, and C. bungeana) and their crude drugs were identified successfully using the ITS and ITS2 regions (Figure 4). The Pharmacopoeia‐recorded species C. saxicola and its close relative C. tomentella was discriminated using the QR code (two‐dimensional code) of the ITS2 region (Figure 3).
Table 3

The comparative analysis of different analytical methods for species resolution

Species resolution (%) of the potential barcodes
MethodITS matKITS2ITS + matKITS2 + matK
BLAST65.256.560.969.660.9
K2P‐distance56.552.252.265.265.2
NJ tree65.247.860.965.265.2
ML tree60.952.256.560.960.9
MP tree52.247.852.260.960.9
Figure 4

The maximum likelihood tree of 28 Corydalis species and two outgroup species of Papaveraceae based on ITS + matK regions

Figure 3

Two closely related medicinal plants Corydalis tomentella and Corydalis saxicola

The comparative analysis of different analytical methods for species resolution Two closely related medicinal plants Corydalis tomentella and Corydalis saxicola Among the barcode combinations, ITS + matK provided a higher discrimination success rate than ITS2 + matK. In addition, the barcode combinations exhibited a higher discrimination success rate than any single barcode (Table 3). Among all single barcodes and their combinations used in this study, ITS + matK provided the highest identification success rate, and matK alone provided the lowest identification success rate (Table 3).

Comparison of different analytical methods for species resolution

The species discrimination ability depends on the analytical method used. A comparison of different analytical methods using data obtained with a single barcode revealed that the BLAST method yielded the highest discrimination success rate (Table 3 and S2‐S6). The barcode combinations yielded different results. When applied to data obtained from the combination ITS + matK, BLAST provided the highest identification success rate, but when applied to data obtained with ITS2 + matK, the NJ tree‐ and K2P distance‐based methods showed the highest identification success rate (Table 3). The comparisons of different analytical methods using data obtained with all the single barcodes and their combinations used in this study showed that BLAST tended to provide the highest discrimination success for all barcodes with the exception of ITS2 + matK. The results obtained using the NJ tree‐based method were similar to but slightly better than those obtained using the K2P distance‐based method (Table 3). Overall, regardless of the method used, the barcode combinations nearly always resulted in improved species resolution, and in this study, the barcode combination ITS + matK (69.6%) with the BLAST method provided the highest species resolution.

Phylogenetic analyses of chloroplast and nuclear DNA regions

The ML tree constructed using the data obtained with ITS + matK recovered Corydalis as a monophyletic group. C. rupestris was strongly supported as the basal taxon that diverged first, and two major clades were then recognized (Figure 4). The first clade included two subclades: four species of sect. Thalictrifoliae, one species of sect. Aulacostigma, three species of sect. Sophorocapnos, and one species of sect. Corydalis formed one subclade, and two species of sect. Racemosae and one species of sect. Thalictrifoliae formed the other subclade. The second clade included three subclades: one of these subclades consisted of five species of sect. Pes‐gallinaceus Irmisch and one species of sect. Chinenses; the second subclade consisted of one species of sect. Capnogorium; and the third subclade included two species of sect. Asterostigma, one species of sect. Duplotuber and one species of sect. Ramoso‐sibiricae (Figure 4). The maximum likelihood tree of 28 Corydalis species and two outgroup species of Papaveraceae based on ITS + matK regions The ML tree constructed using the data obtained with ITS was slightly distinct from that constructed using the data obtained with ITS + matK. The ITS‐based ML tree recovered Corydalis as a monophyletic group and revealed two well‐supported clades. The division of the two clades into two subclades was weakly supported. Furthermore, compared with the ITS + matK‐based tree, the phylogenetic positions of some species were changed in the ITS‐based tree. Specifically, C. rupestris and C. capnoides formed a subclade that diverged from the first clade, and C. decumbens and C. ochotensis formed a clade that diverged from the second subclade (Supporting Information Figure S4). The ML tree constructed using the data obtained with matK was similar to the ITS + matK‐based tree. It also recovered Corydalis as a monophyletic group, C. rupestris was strongly supported as the basal taxon and diverged first, and the remaining species formed two well‐supported clades. The species composition of the first clade was consistent with that found with the ITS + matK‐based tree. However, the second clade did not divide into three subclades as in the ITS + matK‐based tree; instead, C. nobilis of sect. Capnogorium formed its own subclade that diverged first as a sister subclade to the rest of the clade, and the second subclade was then further divided into two clades (Supporting Information Figure S5).

DISCUSSION

Evaluation of DNA barcodes for Corydalis

Corydalis is one of the most taxonomically complicated plant genera, and the discrimination of species within this genus has always been recognized as a great challenge. Jiang et al. (2018) used DNA barcoding to identify two herbal species of Corydalis in the Pharmacopoeia of China based on 57 samples of 14 species, and their results showed that DNA barcoding can be used as an effective method for the identification of medicinal species belonging to Corydalis. In this study, a total of 131 individuals of 28 Corydalis species and two outgroups were assessed, including 21 species that were not previously evaluated. The specimens included representatives of all three subgenera (Linden et al., 1997) and all four herbal species of Corydalis in the Pharmacopoeia of China. In view of the widespread existence of infraspecific polymorphisms in Corydalis (Wu, 1996), we analyzed a large number of samples of each species with high morphological variation (up to 15 samples of a single species), which contributed to the determination of authentic phylogenetic relationships. Based on these data, we performed a systematic comparison of different loci and analytical methods to select appropriate barcodes and analytical methods for Corydalis. The evaluation of different analytical methods showed that BLAST exhibited the highest species discrimination ability. The superior performance of the BLAST approach compared with other methods has been observed in several previous studies (Chen et al., 2010; Kool et al., 2012; Li et al., 2011; Van Velzen et al., 2012; Yan et al., 2014). The species discrimination power of an approach is related to the theory and algorithm used by the approach. BLAST often shows significantly higher identification rates than other approaches (Chen et al., 2010; Li et al., 2011; Sandionigi et al., 2012), and it appears to be the best choice for the identification of Corydalis species. The NJ tree usually shows low species resolution, which significantly reduces its usefulness (Liu et al., 2012; Van Velzen et al., 2012; Yan et al., 2014), and in this study, the NJ tree provided a lower species resolution than BLAST. However, because of its advantages of faster speed and a more intuitive display of genetic relationships, which facilitates understanding and analysis, we still advocate that the NJ tree‐based method is useful for the discrimination of Corydalis species. The comparison of different barcodes indicated that ITS exhibited the highest species resolution (65.2%) among all three single barcodes evaluated in this study. The resolution level obtained in this study is equivalent to the average level (67.2%) observed in a large dataset of 5,583 samples representing 1,349 species in 141 genera (Li et al., 2011). Compared with the species resolution obtained with ITS for other large genera, such as Rhododendron (12.2%, 15.3%), Angelica (73.9%), Pedicularis (86.2%), and Primula (88.2%) (Li et al., 2011; Yan et al., 2014; Yuan et al., 2015), that obtained in this study corresponds to a medium level of identification efficiency. However, a previous study showed that the ITS region is not suitable for the molecular analysis of Corydalis due to a low PCR amplification and sequencing success rates (61.9% and 28.6%, respectively) and that matK provides the highest species resolution (100%) and can thus be considered an ideal barcode for Corydalis (Jiang et al., 2018). In this study, we initially used the universal primer pair ITS5F/4R and obtained low PCR amplification and sequencing efficiency. Therefore, we designed two pairs of primers to perform a fractional amplification of the full‐length sequence of ITS and thus obtain a complete ITS sequence (Table 1, Supporting Information Figure S1), and our resulting PCR amplification and sequencing success rates were both 100%. The ITS and matK regions were then evaluated using 131 samples representing 28 species of Corydalis, which included 23 species with more than three samples and 16 species that were not included in previous studies (two species are herbal species in the Pharmacopoeia of China) (Jiang et al., 2018). Our results showed that ITS exhibited a higher species resolution (65.2%) than matK (56.5%),15 of the 23 species could be successfully identified by ITS, 13 of the 23 species could be successfully identified by matK, and one herbal species in the Pharmacopoeia of China could not be successfully identified by matK. The high species resolution of DNA barcoding is likely due to the relatively small sample size and wide taxonomic sampling employed (Li et al., 2011; Yan et al., 2014), and the species identification resolution usually decreases with increases in the sample size. Thus, sufficient sampling for a taxon‐based DNA barcoding study is a pivotal issue that should be considered (Yan et al., 2014). Based on the highest species resolution obtained with a single barcode in this study and the successful optimization of PCR amplification and sequencing methods, we considered ITS to be the most appropriate barcode for the discrimination of species belonging to the genus Corydalis. The combination of DNA barcodes usually improves species identification (CBOL Plant Working Group, 2009; Li et al., 2011; Yan et al., 2014; Yuan et al., 2015). In this study, any combination of the barcodes yielded higher discrimination rates. Although matK provided the lowest species identification ability when used alone, combinations including matK exhibited significantly increased discrimination power. Thus, matK can be used as an additional barcode for Corydalis, and ITS + matK was identified as the best barcode combination for Corydalis.

Phylogenetic analysis of Corydalis based on DNA barcoding

Wu (1996) first classified this genus into 40 sections and two subgenera, subg. Corydalis and subg. Pistolochia, based on morphological characteristics and geographical distributions. Linden (1997) claimed that this genus should be divided into 25 sections and three subgenera, namely, subg. Chremnocapnos, subg. Sophorocapnos, and subg. Corydalis, based on morphological characteristics and the rps16 locus. In recent years, with the continuous uncovering of new evidence and the accumulation of new taxa, the section division proposed by Linden has tended to be consistent with the structure proposed by Wu. Based on molecular systematics (rps16 and matK) and palynology, Wang (2006) advocated that the most appropriate division of Corydalis is into five subgenera, namely, subg. Chremnocapnos, subg. Sophorocapnos, subg. Corydalis, subg. Rapiferae, and subg. Fasciculatae. In all the trees constructed in this study, the genus Corydalis tended to be divided into two major clades. The first major clade included sect. Thalictrifoliae, sect. Sophorocapnos, sect. Racemosae, sect. Aulacostigma, and sect. Corydalis, and the first four of these sections were classified into Wu's subg. Corydalis (Wu, 1996), Linden's subg. Sophorocapnos (Linden et al., 1995), and Wang's subg. Sophorocapnos (Wang, 2006), respectively. However, subg. Sophorocapnos (in Linden's and Wang's classifications) can be included in subg. Corydalis (in Wu's classification). Thus, all the evidence supports the classification of sect. Thalictrifoliae, sect. Sophorocapnos, sect. Racemosae, and sect. Aulacostigma into one subgenus. In addition, this study constitutes the first evaluation of sect. Corydalis, and all the phylogenetic trees constructed in this study support the inclusion of sect. Corydalis in this subgenus. In the second major clade, the further divisions created by the ML tree using different loci exhibited a number of discrepancies. The matK‐ and ITS + matK‐based trees were strongly supported, whereas the ITS‐based tree was weakly supported; therefore, we will mainly refer to the ITS + matK‐based tree. The second major clade tended to be divided into two larger subclades and one smaller subclade. Sections Pes‐gallinaceus Irmisch and Chinenses formed the first larger subclade; sections Asterostigma, Duplotuber, and Ramoso‐sibiricae formed the second larger subclade; and sect. Capnogorium alone formed the smaller subclade. In Wang's division of subgenera (Wang, 2006), subg. Sophorocapnos was recognized in the first major clade, but the remaining subgenera in the second major clade could not be recognized, and our results, therefore, do not support Wang's division. In the second larger subclade, sect. Ramoso‐sibiricae and sect. Asterostigma are divided into subg. Corydalis, as in Wu's classification (Wu, 1996). Although our results support the division of this genus into two major subgenera, we do not agree with the species coverage of subgenera in Wu's division. Overall, Wu (1996) classified Corydalis into two subgenera based on morphological characteristics and geographical distributions. Linden (1997) claimed that this genus was divided into three subgenera based on morphological characteristics and the rps16 locus. Using molecular systematics (rps16 and matK) and palynology, Wang (2006) advocated that the most appropriate division of Corydalis was five subgenera. Our molecular evidence is consistent with Linden's proposal, and we tended to divide this genus into three subgenera. In the subgenus, four sections, namely, sect. Thalictrifoliae, sect. Sophorocapnos, sect. Racemosae, and sect. Aulacostigma, which are recognized in the division proposed by Linden (1997) and Wang (2006), have been classified within subg. Sophorocapnos, and our molecular results support this view. Furthermore, this study provides the first molecular analysis of sect. Corydalis, and its inclusion in subg. Sophorocapnos is strongly supported by our phylogenetic analysis. Therefore, we suggest that sect. Thalictrifoliae, sect. Sophorocapnos, sect. Racemosae, sect. Aulacostigma, and sect. Corydalis should be classified as one subgenus. Of course, more molecular and morphological evidence will be required to test this finding. Our current results based on molecular data are expected to provide some reference for future research.

Potential utility of DNA barcoding in Corydalis

The genus Corydalis is an important group of Papaveraceae. Corydalis plants contain a variety of alkaloid components that possess potent pharmacological effects, including antinociceptive (Wang et al., 2016), anticardiovascular (Hung & Wu, 2016), antitumor (Gao, He, Li, & Wang, 2009), antidepressive (Du et al., 2018), hepatoprotective (Wu et al., 2017), acetylcholinesterase inhibitory, and butyrylcholinesterase inhibitory effects (Orhan, Şener, Choudhary, & Khalid, 2004). More than 30 species are used in folk medicine or are recorded in the Pharmacopoeia (Pharmacopoeia Commission of People's Republic of China, 2015; Sang, 2002), and these species are often closely related. To ensure the safety, efficacy, and legality of these medicines, accurate identification is essential. The authentication of materia medica is time‐consuming and knowledge‐intensive because they are always presented as crude drugs or decoctions, which are air‐dried or processed via various methods and thus exhibit modified morphological and anatomical features (Yuan et al., 2015). DNA technology exhibits an outstanding advantage in that it is not affected by morphological characteristics, developmental stage, environmental factors, or harvesting period (Heubl, 2010). Since Chen et al. (2010) proposed ITS2 as a standard DNA barcode for the identification of medicinal plants, DNA barcoding has been increasingly used in the authentication of medicinal plants and materia medica (Xin et al., 2012; Yuan et al., 2015). In this study, three Pharmacopoeia‐recorded medicinal plants and their crude drugs were identified successfully, and another Pharmacopoeia‐recorded species was discriminated from closely related plants. Based on the good performance of DNA barcoding in Corydalis, it can be used as a potential tool for the authentication of the medicinal plants and materia medica belonging to this genus. The short‐region DNA barcodes showed a relatively high species discrimination power for Corydalis species in this study; however, 30.4% of the species, most of which are closely related, could not be identified successfully. Recent barcoding studies have placed high emphasis on the use of whole‐chloroplast genome sequences, which are now more readily available due to improvements in sequencing technologies (Li et al., 2015). The whole‐chloroplast genome is termed a “super‐barcode” and provides more abundant informative sites for species identification. These super‐barcodes exhibit higher discrimination power for closely related species and have been applied for the identification of species in various taxa, such as the genera Fritillaria (Li, Zhang, Yang, & Lv, 2018), Epipremnum (Tian, Han, Chen, & Wang, 2018), and Papaver (Zhou et al., 2017). The use of a super‐barcodes is a good option for the identification of closely related Corydalis species and obtaining improved species discrimination power. The use of data inferred from DNA barcodes and whole‐chloroplast genomes, along with data on morphological characteristics and geographical distributions, might result in more precise species discrimination and resolve phylogenetic disputes, which will aid the reconstruction of a more integrated taxonomic system of this taxonomically complicated genus.

CONFLICT OF INTEREST

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

AUTHOR CONTRIBUTIONS

The study was conceived by FR, JS, sample collection was performed by FR, YW, YQ, data generation was carried out by YW, data were analyzed by ZX, YL, JZ, and TX, the manuscript was written by FR, JS. Click here for additional data file. Click here for additional data file.
  39 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

Review 2.  Utility of low-copy nuclear gene sequences in plant phylogenetics.

Authors:  Tao Sang
Journal:  Crit Rev Biochem Mol Biol       Date:  2002       Impact factor: 8.250

3.  DNA barcoding the floras of biodiversity hotspots.

Authors:  Renaud Lahaye; Michelle van der Bank; Diego Bogarin; Jorge Warner; Franco Pupulin; Guillaume Gigot; Olivier Maurin; Sylvie Duthoit; Timothy G Barraclough; Vincent Savolainen
Journal:  Proc Natl Acad Sci U S A       Date:  2008-02-07       Impact factor: 11.205

4.  MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.

Authors:  Koichiro Tamura; Joel Dudley; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2007-05-07       Impact factor: 16.240

Review 5.  Four years of DNA barcoding: current advances and prospects.

Authors:  Lise Frézal; Raphael Leblois
Journal:  Infect Genet Evol       Date:  2008-06-03       Impact factor: 3.342

6.  A DNA barcode for land plants.

Authors: 
Journal:  Proc Natl Acad Sci U S A       Date:  2009-07-30       Impact factor: 11.205

Review 7.  Genome-based approaches to the authentication of medicinal plants.

Authors:  Nikolaus J Sucher; Maria C Carles
Journal:  Planta Med       Date:  2008-04-30       Impact factor: 3.352

8.  Use of DNA barcodes to identify flowering plants.

Authors:  W John Kress; Kenneth J Wurdack; Elizabeth A Zimmer; Lee A Weigt; Daniel H Janzen
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-31       Impact factor: 11.205

9.  Acetylcholinesterase and butyrylcholinesterase inhibitory activity of some Turkish medicinal plants.

Authors:  I Orhan; B Sener; M I Choudhary; A Khalid
Journal:  J Ethnopharmacol       Date:  2004-03       Impact factor: 4.360

10.  DNA barcoding: error rates based on comprehensive sampling.

Authors:  Christopher P Meyer; Gustav Paulay
Journal:  PLoS Biol       Date:  2005-11-29       Impact factor: 8.029

View more
  8 in total

1.  Characterization of the complete chloroplast genome of Corydalis bungeana Turcz.

Authors:  Qi Wang; Zhixian Lei; Lirong Zhou; Biwei Mai; Naiyun Zhu; Xiaoli Zhao; Wenting Xu
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-14       Impact factor: 0.658

2.  The low-copy nuclear gene Agt1 as a novel DNA barcoding marker for Bromeliaceae.

Authors:  Fabian Bratzel; Sascha Heller; Nadine Cyrannek; Juraj Paule; Elton M C Leme; Anna Loreth; Annika Nowotny; Markus Kiefer; Walter Till; Michael H J Barfuss; Christian Lexer; Marcus A Koch; Georg Zizka
Journal:  BMC Plant Biol       Date:  2020-03-12       Impact factor: 4.215

3.  Comparative Chloroplast Genomics of Corydalis Species (Papaveraceae): Evolutionary Perspectives on Their Unusual Large Scale Rearrangements.

Authors:  Xiaodong Xu; Dong Wang
Journal:  Front Plant Sci       Date:  2021-01-27       Impact factor: 5.753

4.  Highly variable chloroplast genome from two endangered Papaveraceae lithophytes Corydalis tomentella and Corydalis saxicola.

Authors:  Fengming Ren; Liqiang Wang; Ying Li; Wei Zhuo; Zhichao Xu; Haojie Guo; Yan Liu; Ranran Gao; Jingyuan Song
Journal:  Ecol Evol       Date:  2021-03-19       Impact factor: 2.912

5.  Diversity investigation by application of DNA barcoding: A case study of lepidopteran insects in Xinjiang wild fruit forests, China.

Authors:  Jinyu Zhan; Yufeng Zheng; Qing Xia; Jin Wang; Sibo Liu; Zhaofu Yang
Journal:  Ecol Evol       Date:  2022-03-07       Impact factor: 2.912

6.  New Insights Into the Backbone Phylogeny and Character Evolution of Corydalis (Papaveraceae) Based on Plastome Data.

Authors:  Xiaodong Xu; Xuexiu Li; Dong Wang
Journal:  Front Plant Sci       Date:  2022-08-05       Impact factor: 6.627

7.  Phylogenetic analysis based on single-copy orthologous proteins in highly variable chloroplast genomes of Corydalis.

Authors:  Xianmei Yin; Feng Huang; Xiaofen Liu; Jiachen Guo; Ning Cui; Conglian Liang; Yan Lian; Jingjing Deng; Hao Wu; Hongxiang Yin; Guihua Jiang
Journal:  Sci Rep       Date:  2022-08-20       Impact factor: 4.996

8.  Genetic relationships and low diversity among the tea-oil Camellia species in Sect. Oleifera, a bulk woody oil crop in China.

Authors:  Huasha Qi; Xiuxiu Sun; Wuping Yan; Hang Ye; Jiali Chen; Jing Yu; Dai Jun; Chunmei Wang; Tengfei Xia; Xuan Chen; Dongliang Li; Daojun Zheng
Journal:  Front Plant Sci       Date:  2020-09-30       Impact factor: 6.627

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.