Literature DB >> 35631799

DNA Barcoding Medicinal Plant Species from Indonesia.

Ria Cahyaningsih1,2, Lindsey Jane Compton1, Sri Rahayu2, Joana Magos Brehm1, Nigel Maxted1.   

Abstract

Over the past decade, plant DNA barcoding has emerged as a scientific breakthrough and is often used to help with species identification or as a taxonomical tool. DNA barcoding is very important in medicinal plant use, not only for identification purposes but also for the authentication of medicinal products. Here, a total of 61 Indonesian medicinal plant species from 30 families and a pair of ITS2, matK, rbcL, and trnL primers were used for a DNA barcoding study consisting of molecular and sequence analyses. This study aimed to analyze how the four identified DNA barcoding regions (ITS2, matK, rbcL, and trnL) aid identification and conservation and to investigate their effectiveness for DNA barcoding for the studied species. This study resulted in 212 DNA barcoding sequences and identified new ones for the studied medicinal plant species. Though there is no ideal or perfect region for DNA barcoding of the target species, we recommend matK as the main region for Indonesian medicinal plant identification, with ITS2 and rbcL as alternative or complementary regions. These findings will be useful for forensic studies that support the conservation of medicinal plants and their national and global use.

Entities:  

Keywords:  DNA barcoding; Indonesia; conservation; forensic; medicinal plants

Year:  2022        PMID: 35631799      PMCID: PMC9147630          DOI: 10.3390/plants11101375

Source DB:  PubMed          Journal:  Plants (Basel)        ISSN: 2223-7747


1. Introduction

Plant identification has formerly been formed using morphological characteristics that could be observed visually. Currently, DNA is also used to help species identification and to build bioinventories [1]. DNA barcoding was introduced by Hebert and colleagues in 2003 and involves the identification of species through universal, short, and standardized DNA regions [2]. DNA material for the barcoding can be obtained from living plants, herbarium specimens [3], and market products [4,5]. In plants, plastid DNA (rbcL, matK, trnL, and trnH-psbA regions) and nuclear DNA (ITS and ITS2 regions) are often used in DNA barcoding [6,7,8]. The rbcL and matK regions are recommended by the Consortium for the Barcode of Life (CBOL) as a standard two-locus barcode for global plant databases because of their species discrimination ability [8]. The process entails registering the DNA of identified species into a barcoding library and matching the DNA of unidentified species against the genetic data available in the library [6,9]. The library or the database can be accessed online for species identification and taxonomic clarification [10], namely through the NCBI GenBank (https://www.ncbi.nlm.nih.gov; accessed on 1 February 2022) [10] and the Barcode of Life Data (BOLD) (http://www.boldsystems.org; accessed on 1 February 2022) [11]. DNA barcoding has become an important taxonomic tool because of its accuracy, repeatability, and rapidity. It can also be used to identify species under legislative protection, or under threat of extinction, and to check the authenticity of biological products [6,9]. It is particularly powerful as identification is not influenced by the morphological diversity of species, growth phases, and environmental factors [12,13,14,15]. In the forensic field, even an inexperienced user is able to assign a taxonomic name to an unidentified plant specimen with relative ease [16,17]. It is an effective conservation tool as it is able to prevent substitution of important commercial species, protect species from theft [6,18], and help to define species richness in underexplored areas [6]. DNA barcoding is valuable in terms of medicinal plant (MP) species identification compared to traditional morphological identification for conservation and use, as it is able to identify species and ensure a genuine product rather than a substitute [6,18]. Identifying the plant correctly protects consumer rights [19], even with respect to small and damaged plant parts used in botanical forensics [10,20,21,22]. Several studies conducted on DNA barcoding of medicinal plants have indicated the effectiveness of ITS2 and matK. For example, these regions are able to distinguish Rauvolfia serpentina (L.) Benth. Ex Kurz, of which root extracts act as an antihypertensive drug from other species in the genus [5,23] and are able to authenticate Eurycoma longifolia Jack, of which all plant extracts (particularly roots) are a useful drug for cough, anticancer, and aphrodisiac activities [24]. MatK is also known to give the best identification for Philippine ethnomedicinal Apocynaceae [25]. However, DNA barcoding from only one specific sequence region has been applied for most medicinal plants. For example, the ITS2 region has been used as a DNA barcode for authenticating many medicinal plants, their relatives, and broader species [14,26], although it was found that this region could not authenticate all Chinese medicinal Bupleurum L. (Apiaceae) species [27]. For Indian medicinal plants (Ayurveda), the rbcL region has been used for DNA barcoding [19], while for medicinal plants of the Philippines, rbcL, matK, and trnL-F regions have been used based on their efficiency [28]. Indonesia is famous for its plant diversity and richness, particularly in medicinal plants and their uses [29,30,31]. Different forms of medicinal plants are used, regardless of being fresh or dried, for curing illness and disease [32]. Thus, the primary purpose of undergoing the barcoding process, apart from enriching the DNA barcoding database, is determining the identity of medicinal plants. DNA barcoding is an advanced technology for plant diversity inventories, and its high cost makes it both an issue and challenge for biodiversity conservation in Indonesia [33]. Nevertheless, DNA barcodes are useful for conservation and even for commercial purposes, and they will be widely used in the future as DNA sequencing technologies become simpler and cheaper [6]. This study aimed to assess how four different DNA barcoding regions (ITS2, matK, rbcL, and trnL) can aid 61 species identifications and conservation efforts, and investigate their effectiveness for DNA barcoding of Indonesian medicinal plants. The finding will allow for broader and more comprehensive use in the future with respect to medicinal plant conservation both nationally and globally.

2. Results and Discussions

2.1. Understanding the Use of DNA Barcoding for Indonesian Medicinal Plants

Of the 61 sampled Indonesian medicinal plants, 55 species are native to Indonesia (of which 29 are endemics), and six are introduced [34]. Some of the medicinal plants may need to be prioritized in terms of conservation, namely those assessed as threatened (VU, EN, CR) or near threatened (NT) according to the IUCN Red List [35], the 19 species listed in the CITES Appendices I, II, or III (UNEP-WCMC database) [36], and the 12 rare medicinal plants [37]. Two species were assessed as VU, namely Aquilaria hirta Ridl. [38] and Etlingera solaris (Blume) R.M.Sm. [39] and are considered to be facing a high extinction risk in the wild in the near future [40]. The 19 species listed in CITES II may become extinct if their trade is not controlled because they are collected from the wild and there is no sufficient data with respect to artificial propagation for commercial purposes [36]. Of the 61 sequence target species, 13 sequences were not found in BOLD, although their DNA sequence data was available in NCBI; a further 10 species did not have DNA sequences stored in either NCBI or BOLD. Detailed information for each of the 61 species is presented in Table 1.
Table 1

The Indonesian medicinal plants (n = 61) used in this study with related information from literature study.

No.SpeciesAuthorFamily.N/IImportant Sp.Sp. No. per GenusBOLD (NCBI)Database
1 Justicia gendarussa Burm.f.AcanthaceaeNNo921yes
2 Staurogyne elongata (Nees) KuntzeAcanthaceaeNNo148yes
3 Pangium edule Reinw.AchariaceaeNYes (P)1yes
4 Spondias malayana Kosterm.AnacardiaceaeNNo19no (yes)
5 Toxicodendron succedaneum (L.) KuntzeAnacardiaceaeINo27yes
6 Ancistrocladus tectorius (Lour.) Merr.AncistrocladaceaeNNo21yes
7 Anaxagorea javanica BlumeAnnonaceaeNYes (P)25no (yes)
8 Dasymaschalon dasymaschalum (Blume) I.M.TurnerAnnonaceaeNNo27yes
9 Alstonia macrophylla Wall. Ex. G.DonApocynaceaeNNo44yes
10 Alstonia scholaris (L.) R. Br.ApocynaceaeNYes (P) yes
11 Alyxia reinwardtii BlumeApocynaceaeNYes (P)106yes
12 Hoya diversifolia BlumeApocynaceaeNNo521no (yes)
13 Rauvolfia serpentina (L.) Benth. ex KurzApocynaceaeNYes (II)74yes
14 Aglaonema commutatum SchottAraceaeNNo22no (yes)
15 Trevesia burckii R.Br.AraliaceaeNNo8yes (yes)
16 Cibotium barometz (L.) J.Sm.CibotiaceaeNYes (II)10yes
17 Decalobanthus mammosus (Lour.) A.R.Simoes & StaplesConvolvulaceaeINo13no (yes)
18 Erycibe malaccensis C.B. ClarkeConvolvulaceaeNNo70no (no)
19 Rhododendron macgregoriae F. Muell.EricaceaeNYes (E)1057no (no)
20 Acalypha grandis Benth.EuphorbiaceaeNNo428no (no)
21 Euphorbia tirucalli L.EuphorbiaceaeIYes (II)1976yes
22 Millettia sericea (Vent.) Benth.FabaceaeNNo187yes
23 Parkia timoriana (DC.) Merr.FabaceaeNNo40yes
24 Phanera fulva (Korth.) Benth.FabaceaeNYes (E)90no (no)
25 Orthosiphon aristatus (Blume) Miq.LamiaceaeNNo44yes
26 Premna serratifolia L.LamiaceaeNNo131yes
27 Vitex glabrata Gaertn.LamiaceaeNNo203yes
28 Cinnamomum rhynchophyllum Miq.LauraceaeNNo241no (yes)
29 Ficus deltoidea JackMoraceaeNYes (P)874yes
30 Myristica succedanea BlumeMyristicaceaeNYes (E)175no (no)
31 Nepenthes ampullaria JackNepenthaceaeNYes (P, II)165yes
32 Nepenthes gracilis Korth.NepenthaceaeNYes (P, II) yes
33 Nepenthes mirabilis (Lour.) DruceNepenthaceaeNYes (P, II) yes
34 Nepenthes reinwardtiana Miq.NepenthaceaeNYes (P, E, II) yes
35Acriopsis liliifolia var. liliifolia(J.Koenig) OrmerodOrchidaceaeNYes (P, II)10no (yes)
36 Cymbidium aloifolium (L.) Sw.OrchidaceaeNYes (P, II)74yes
37 Cymbidium ensifolium (L.) Sw.OrchidaceaeIYes (II) yes
38 Dendrobium crumenatum Sw.OrchidaceaeNYes (P, II)1547yes
39 Dendrobium purpureum Roxb.OrchidaceaeNYes (P, E, II) no (no)
40 Dendrobium salaccense (Blume) Lindl.OrchidaceaeNYes (P, II) yes
41 Grammatophyllum speciosum BlumeOrchidaceaeNYes (P, II)13yes
42 Nervilia concolor (Blume) Schltr.OrchidaceaeNYes (P, II)77yes
43 Nervilia plicata (Andrews) Schltr.OrchidaceaeNYes (P, II) yes
44 Oberonia lycopodioides (J.Koenig) OrmerodOrchidaceaeNYes (P, II)305no (no)
45 Strongyleria pannea (Lindl.) Schuit., Y.P.Ng & H.A.PedersenOrchidaceaeNYes (P, II)4no (yes)
46 Galearia filiformis (Blume) Boerl.PandaceaeNYes (E)5yes
47 Benstonea affinis (Kurz) Callm. & BuerkiPandanaceaeNNo61yes
48 Phyllanthus oxyphyllus Miq.PhyllanthaceaeNNo1016yes
49 Ardisia complanata Wall.PrimulaceaeNNo719no (no)
50 Ardisia crenata SimsPrimulaceaeINo yes
51 Ventilago madraspatana Boerl.RhamnaceaeNNo41no (yes)
52 Psychotria montana BlumeRubiaceaeNNo1531no (yes)
53 Lunasia amara BlancoRutaceaeNYes (P)1yes
54 Melicope lunu-ankenda (Gaertn.) T.G. HartleyRutaceaeNNo241no (yes)
55 Kadsura scandens (Blume) BlumeSchisandraceaeNYes (P)17yes
56 Smilax calophylla Wall. ex A.DC.SmilacaceaeNNo262yes
57 Smilax zeylanica L.SmilacaceaeNYes (P) yes
58 Aquilaria hirta Ridl.ThymelaeaceaeNYes (P, VU)21no (yes)
59 Amomum hochreutineri ValetonZingiberaceaeNYes (E)102no (no)
60 Etlingera solaris (Blume) R.M.Sm.ZingiberaceaeNYes (E, VU)143no (no)
61 Meistera aculeata (Roxb.) Skornick. & M.F. NewmanZingiberaceaeNNo41no (yes)

Note: Scientific names (1st and 2nd columns were collected from POWO (2022); Species: R for rare medicinal plant (MP), E for endemic to Indonesia, VU for Vulnerable (IUCN Red List), P for Priority, and II for CITES Appendix II; N = Native, I = Introduced.

The contribution of the DNA barcoding information from each species to DNA banks and to the correct identification of medicinal plants with high conservation status was classified using categories A–M, where category A denotes the contribution of new data to DNA banks and DNA barcoding information that can strongly assist MP conservation; at the opposite end of the spectrum, letter M denotes the least substantial contribution, where DNA barcoding needs to be clarified further before using it directly for identification. Figure 1 indicates how the four DNA barcodes are useful for the conservation and use of Indonesian medicinal plants with respect to the availability of their data in the DNA bank. The number of medicinal plant species per criteria are provided in Table A1. Sequences grouped in categories A-D can be of direct use to conservation efforts due to the correct identification of related medicinal plants. The A-B categories can be used in botanic forensics (in cases of medicinal plant adulteration and illegal trading) for medicinal plant identification [10,21,22,23,24], as the plants are listed as species that need to be prioritized in terms of conservation.There are 19 families of Indonesian medicinal plants consisting of 31 species, that could be identified accurately to the family level by DNA barcoding. Two major families of Indonesian medicinal plants that were successfully sequenced and correctly identified were Orchidaceae (13 sequences) and Apocynaceae (10 sequences). It is highlighted that correct identification was defined after the validation step, which is cross-checked to morphological identification result by taxonomists (indicated in the species identity card).
Figure 1

Summary of DNA barcoding use for medicinal plant (MP) conservation in Indonesia. Letters represent the DNA barcoding contribution of a species to the DNA bank data and its importance in conservation in the following order; A = new DNA barcoding and can strongly assist MP conservation; B = can strongly assist MP conservation; C = new DNA barcoding and can assist MP conservation; D = can assist MP conservation; E = new DNA bank data and new DNA barcoding and may strongly assist MP conservation; F = new DNA barcoding and may strongly assist MP conservation; G = may strongly assist MP conservation; H = new DNA bank data and new DNA barcoding and may assist MP conservation; I = new DNA barcoding and may assist MP conservation; J = may assist MP conservation; K = new DNA bank data and new DNA barcoding but sequences need to be clarified further; L = new DNA barcoding, but sequences need to be clarified further; M = sequences need to be clarified further.

Table A1

DNA barcoding regions used for medicinal plant (MP) conservation in Indonesia.

DNA Barcoding Use for MP Conservation in IndonesiaITS2 matK rbcL trnL
A. new DNA barcoding and can strongly assist MP conservation 1 1 2 1
Anaxagorea javanica 1
Aquilaria hirta 1
Strongyleria pannea 111
B. can strongly assist MP conservation 11 12 8 6
Alstonia scholaris 1111
Alyxia reinwardtii 111
Cymbidium aloifolium 111
Dendrobium crumenatum 11
Dendrobium salaccense 111
Euphorbia tirucalli 1
Ficus deltoidea 1
Galearia filiformis 111
Kadsura scandens 1
Lunasia amara 11 1
Nepenthes gracilis 1
Nepenthes reinwardtiana 11
Nervilia plicata 111
Pangium edule 11
Parkia timoriana 1
Rauvolfia serpentina 1111
C. new DNA barcoding and can assist MP conservation 1 1
Aglaonema commutatum 1
Meistera aculeata 1
D. can assist MP conservation 5 6 7 3
Alstonia macrophylla 11
Ancistrocladus tectorius 1 11
Ardisia crenata 11
Dasymaschalon dasymaschalum 1
Justicia gendarussa 1111
Orthosiphon aristatus 1
Phyllanthus oxyphyllus 11
Premna serratifolia 1
Toxicodendron succedaneum 1111
Vitex glabrata 1
E. new to DNA bank data and new DNA barcoding and may strongly assist MP conservation 6 4 6 7
Amomum hochreutineri 1 11
Dendrobium purpureum 1111
Etlingera solaris 1 11
Myristica succedanea 111
Oberonia lycopodioides 1111
Phanera fulva 1 1
Rhododendron macgregoriae 1111
F. new DNA barcoding and may strongly assist MP conservation 2 3 2 2
Acriopsis liliifolia var.  liliifolia1111
Anaxagorea javanica 11
Aquilaria hirta 11 1
G. may strongly assist MP conservation 3 8 12 12
Alyxia reinwardtii 1
Cibotium barometz 1
Cymbidium aloifolium 1
Cymbidium ensifolium 11
Dendrobium crumenatum 1
Dendrobium salaccense 1
Euphorbia tirucalli 1
Ficus deltoidea 111
Grammatophyllum speciosum 111
Kadsura scandens 111
Lunasia amara 1
Nepenthes ampullaria 111
Nepenthes gracilis 11
Nepenthes mirabilis 1111
Nepenthes reinwardtiana 11
Nervilia concolor 1
Pangium edule 1
Parkia timoriana 1 1
Smilax zeylanica 11
H. new to DNA bank data and new DNA barcoding and may assist MP conservation 2 2 3 3
Acalypha grandis 11
Ardisia complanata 1111
Erycibe malaccensis 1111
I. new DNA barcoding and may assist MP conservation 4 6 7 6
Aglaonema commutatum 1 1
Cinnamomum rhynchophyllum 111
Decalobanthus mammosus 1
Hoya diversifolia 1111
Meistera aculeata 1
Melicope lunu-ankenda 1111
Psychotria montana 1111
Spondias malayana 1
Ventilago madraspatana 111
J. may assist MP conservation 7 6 8 9
Alstonia macrophylla 1 1
Ancistrocladus tectorius 1
Ardisia crenata 1 1
Benstonea affinis 111
Dasymaschalon dasymaschalum 1 1
Millettia sericea 1111
Orthosiphon aristatus 1
Phyllanthus oxyphyllus 11
Premna serratifolia 1
Smilax calophylla 1
Staurogyne elongata 1111
Trevesia burckii 1111
Vitex glabrata 1 11
K. new to DNA bank data and new DNA barcoding, but sequences need to clarify further (K) 2 1
Acalypha grandis 1
Myristica succedanea 1
Phanera fulva 1
L. new DNA barcoding, but sequences need to clarify further 2
Aglaonema commutatum 1
Ventilago madraspatana 1
M. new DNA barcoding and may strongly assist MP conservation 10 2
Benstonea affinis 1
Cibotium barometz 1
Dasymaschalon dasymaschalum 1
Galearia filiformis 1
Grammatophyllum speciosum 1
Nervilia concolor 1 1
Nervilia plicata 1
Pangium edule 1
Parkia timoriana 1
Smilax calophylla 1
Smilax zeylanica 1

2.2. Understanding the Effectiveness of Each DNA Barcoding Region Used for Indonesian Medicinal Plants Identification

A total of 61 studied species were analyzed for DNA barcoding of four regions (ITS2, matK, rbcL, and trnL). There were some failures in DNA amplification and sequencing assembly, with the results of each step presented in Table 2.
Table 2

Success or failure in each DNA barcoding step.

Observed ParameterITS2 (%)matK * (%)rbcL (%)trnL (%)
No PCR amplicon obtained1.6427.871.6416.39
Mixed sequences—no use8.2001.643.28
Sequence provided90.1672.1396.7280.33
Assembled consensus sequence88.5265.5796.7273.77
Unidirectional sequence1.646.5606.56

* 4 matK regions with the second primer excluded.

The sequence quality is based on the easily done assembly of both the forward and reverse regions into a single consensus sequence (Table 2). When both forward and reverse sequences were available and were of good quality, obtaining the assembled consensus sequence was straightforward. If one direction of the sequence was mixed, then no assembly could occur and only the unidirectional sequence could be used. The matK region, which is known to have the lowest amplification success among the regions used for DNA barcoding [3,41], could only be amplified in 72% samples, compared with successful amplification in 83–98% samples for the other regions (Table 2). This is consistent with previous work indicating matK has a lower PCR success rate than rbcL for DNA amplification of Indonesian plants [42]. The PCR amplification failure likely occurred due to a high level of sequence variation within the matK regions complementary to the primers [43]. There were only 212 sequences of ITS2, matK, rbcL, and trnL obtained from 61 Indonesian medicinal plants instead of the expected 244 sequences resulting from the sequencing (Table A2). Each species was annotated with its key information, such as whether it is native, how the species became important to be conserved, and all generated sequences from ITS2, matK, rbcL, and trnL regions with identification results from BLAST, whether correct, ambiguous, correct at genus or family level, or incorrect.
Table A2

Summary of DNA barcoding result per species.

No.Species [38]Author Fam.RegionMax ScoreTotal ScoreQuery CoverE ValuePer. IdentBest Matched SpeciesSum.Notes
1 Justicia gendarussa Burm.f.Acanth.ITS25625620.735.00E-1560.9968 Justicia gendarussa c
matK 133013300.9600.9986 Justicia gendarussa c
rbcL 105510550.9701 Justicia gendarussa c
trnL 148714870.9200.9975 Justicia gendarussa c
2 Staurogyne elongata (Nees) KuntzeAcanth.ITS25975970.891.00E-1660.9526 Ophiorrhiziphyllon macrobotryum a **
matK 127312730.9700.9821 Staurogyne concinnula a *
rbcL 9399390.9100.9923 Staurogyne concinnula a *
trnL 101314270.9900.9732 Staurogyne trinitensis a *
3 Pangium edule Reinw.Achari.ITS21631630.151.00E-350.9286Celastraceae sp. i
matK 13871387100.9974 Pangium edule c
rbcL 9729720.9101 Pangium edule c
trnL 115817410.9800.982 Ryparosa kurrangii a *
4 Spondias malayana Kosterm.Anacardi.ITS263663613.00E-1780.9332 Spondias tuberosa a *
5 Toxicodendron succedaneum (L.) KuntzeAnacardi.ITS26606600.7501 Toxicodendron succedaneum c
matK 145214520.9901 Toxicodendron succedaneum c
rbcL 103810380.9701 Toxicodendron succedaneum c
trnL 15981598101 Toxicodendron succedaneum c1/7 is a *
6 Ancistrocladus tectorius (Lour.) Merr.Ancistroclad.ITS2774774100.9953 Ancistrocladus benomensis c1/3 is a *
matK 13871387100.9987 Ancistrocladus heyneanus a *
rbcL 10531053101 Ancistrocladus tectorius c
trnL 16631663100.9903 Ancistrocladus tectorius c
7 Anaxagorea javanica BlumeAnnon. matK 150215020.9700.9928 Anaxagorea luzonensis a *
rbcL 101310130.9401 Anaxagorea luzonensis a *
trnL 14231423101 Anaxagorea javanica c
8 Dasymaschalon dasymaschalum (Blume) I.M.TurnerAnnon.ITS22372370.383.00E-580.9474 Acer palmatum i
matK 13821382100.9947 Dasymaschalon clusiflorum a *
rbcL 102010200.9701 Desmos dasymaschalus c
trnL 156515650.9500.9965 Dasymaschalon megalanthum a *
9 Alstonia macrophylla Wall. Ex. G.DonApocyn.ITS27637630.9800.9976 Alstonia scholaris a *
matK 13861386100.9987 Alstonia macrophylla c
rbcL 857857100.9876 Alstonia scholaris c13/14 is a * with the same coverage
trnL 15571557100.9908 Alstonia scholaris a *
10 Alstonia scholaris (L.) R. Br.Apocyn.ITS24574570.623.00E-1240.9772 Alstonia scholaris c
matK 13801380100.9987 Alstonia yunnanensis c1/9 a is a * with same coverage
rbcL 10511051100.9983 Alstonia scholaris c
trnL 15891589100.9977 Alstonia scholaris c1/2 is a *
11 Alyxia reinwardtii BlumeApocyn.ITS26146140.81.00E-1710.9912 Alyxia reinwardtii c
matK 131713170.9500.9972 Alyxia reinwardtii c
rbcL 102010200.9601 Alyxia reinwardtii c1/2 is a * with higher coverage
trnL 152415240.9800.9929 Alyxia grandis a *
12 Hoya diversifolia BlumeApocyn.ITS25075070.633.00E-1391 Hoya glabra a *
matK 13471347101 Hoya vitellinoides a *
rbcL 105110510.9901 Hoya pottsii a *
trnL 153915390.9800.9988Hoya sp. a *
13 Rauvolfia serpentina (L.) Benth. ex KurzApocyn.ITS26176170.731.00E-1721 Rauvolfia serpentina c
matK 138013800.9901 Rauvolfia serpentina c
rbcL 105710570.9901 Rauvolfia serpentina c
trnL 139513950.8900.9873 Rauvolfia serpentina c
14 Aglaonema commutatum SchottAr.ITS25018050.592.00E-1370.9964 Thunbergia coccinea i
matK 13841384100.9974 Aglaonema crispum a *
rbcL 102210220.9701 Aglaonema commutatum c
trnL 16501650100.9989 Aglaonema crispum a *
15 Trevesia burckii R.Br.Arali.ITS27457450.9500.988 Trevesia palmata a *
matK 13931393101 Trevesia palmata a *
rbcL 104810480.9800.9982 Brassaiopsis gracilis a *
trnL 166816680.9900.9989 Brassaiopsis ciliata a *
16 Cibotium barometz (L.) J.Sm.Ciboti.ITS23488580.753.00E-910.9896 Cucumis sativus i
rbcL 9659650.9400.9872 Cyathea chinensis a **
17 Decalobanthus mammosus (Lour.) A.R.Simoes & StaplesConvolvul. rbcL 103110310.9700.9982 Merremia peltata a *
18 Erycibe malaccensis C.B.ClarkeConvolvul.ITS24664660.955.00E-1270.8631 Erycibe obtusifolia a *
matK 13891389101 Erycibe cochinchinensis a *
rbcL 103310330.9601Erycibe sp. a *
trnL 134713470.9300.9881 Erycibe coccinea a *
19 Rhododendron macgregoriae F.Muell.Eric.ITS2723723100.9658 Rhododendron groenlandicum a *
matK 13691369100.9908 Rhododendron javanicum a *
rbcL 102710270.9800.9912 Rhododendron simsii a *
trnL 162916290.9600.9955 Rhododendron javanicum a *
20 Acalypha grandis Benth.Euphorbi.ITS22722720.351.00E-680.9808Acer tataricum subsp. theiferumi
rbcL 106210620.9901 Acalypha grisebachiana a *
trnL 17291729100.9886 Acalypha hispida a *
21 Euphorbia tirucalli L.Euphorbi.ITS26176170.711.00E-1721 Euphorbia tirucalli c1/12 I with higher coverage
rbcL 104610460.9801 Euphorbia rauhii a *
22 Millettia sericea (Vent.) Benth.Fab.ITS27127120.9400.9571 Millettia pulchra a *
matK 133213320.9700.988 Millettia pulchra a *
rbcL 104210420.9700.9982 Dahlstedtia pinnata a *
trnL 15431543100.9819 Millettia pinnata a *
23 Parkia timoriana (DC.) Merr.Fab.ITS25935930.712.00E-1650.9909 Parkia timoriana c
matK 137613760.9800.996 Parkia biglandulosa a *
rbcL 100010000.9500.9927Magnoliophyta sp. i
trnL 181418140.9900.999 Parkia biglandulosa a *
24 Phanera fulva (Korth.) Benth.Fab.ITS24754750.687.00E-1300.9477Bauhinia sp. a *
rbcL 101610160.9600.9982 Embryophyte environmental i
trnL 140414040.7800.9974 Phanera vahlii a **
25 Orthosiphon aristatus (Blume) Miq.Lami.ITS25625620.695.00E-1561 Orthosiphon aristatus c
rbcL 104210420.9801 Clerodendranthus spicatus a **
26 Premna serratifolia L.Lami.ITS24224220.999.00E-1140.8495 Premna microphylla a *
rbcL 104010400.9701 Premna serratifolia c2/3 is a * with higher and lower coverage
27 Vitex glabrata Gaertn.Lami.ITS26516510.9100.9558 Vitex carvalhoi a *
matK 15871587100.9988 Vitex glabrata c
rbcL 10501050100.9982 Vitex doniana a *
trnL 141114110.9400.9923 Vitex triflora a *
28 Cinnamomum rhynchophyllum Miq.Laur. matK 137513750.9900.9987 Cinnamomum camphora a *
rbcL 10551055101Cinnamomum dubiuma *
trnL 15871587101Cinnamomum pittosporoidesa *
29 Ficus deltoidea JackMor.ITS26166160.784.00E-1721 Ficus deltoidea c
matK 13801380100.996Ficus cf. a *
rbcL 105110510.9800.9983 Ficus benjamina a *
trnL 166416640.9900.9967 Ficus carica a *
30 Myristica succedanea BlumeMyristic.ITS21851850.172.00E-420.9231 Rhodohypoxis milloides i
matK 147614760.9200.9988 Myristica fragrans a *
rbcL 10571057101 Horsfieldia amygdalina a *4/11 is a **
trnL 137113710.8300.9987 Myristica iners a *
31 Nepenthes ampullaria JackNepenth. matK 137513750.9900.9973 Nepenthes mapuluensis a *
rbcL 10421042101 Nepenthes mirabilis a *
trnL 16481648100.9956 Nepenthes mirabilis a *
32 Nepenthes gracilis Korth.Nepenth. matK 13711371100.9973 Nepenthes gracilis c
rbcL 10461046101 Nepenthes mirabilis a *
trnL 9619610.5700.9962 Nepenthes ampullaria a *
33 Nepenthes mirabilis (Lour.) DruceNepenth.ITS2857857100.9979 Nepenthes reinwardtiana a *
matK 13711371100.9973 Nepenthes mapuluensis a *
rbcL 10381038100.9965 Nepenthes graciliflora a *
trnL 9599590.5700.9943 Nepenthes sanguinea a *
34 Nepenthes reinwardtiana Miq.Nepenth.ITS2861861100.9979 Nepenthes reinwardtiana c
matK 13761376100.996 Nepenthes reinwardtiana c
rbcL 104210420.9800.9965 Nepenthes mirabilis a *
trnL 9489480.5700.9924 Nepenthes alba a *
35Acriopsis liliifolia var. liliifolia(J.Koenig) OrmerodOrchid.ITS23943940.942.00E-1050.8428 Cymbidium ensifolium a **
matK 14081408100.9987Acriopsis sp. a *
rbcL 911911100.9824Acriopsis sp. a *
trnL 82415910.9100.9265 Cymbidium erythraeum a **
36 Cymbidium aloifolium (L.) Sw.Orchid.ITS24684680.611.00E-1270.9884 Cymbidium aloifolium c
matK 13861386100.9987 Cymbidium aloifolium c1/5 is a *
rbcL 104810480.9800.9982 Cymbidium aloifolium c1/4 is a *
trnL 9899890.7900.953 Cymbidium wadae a *
37 Cymbidium ensifolium (L.) Sw.Orchid.ITS23873870.664.00E-1030.9072 Cymbidium goeringii a *
matK 129312930.9900.9889 Cymbidium longibracteatum a *
38 Dendrobium crumenatum Sw.Orchid.ITS25775770.72.00E-1600.9968 Dendrobium crumenatum c
matK 140014000.9900.9961 Dendrobium crumenatum c
rbcL 103810380.9700.9982 Dendrobium pseudotenellum a *
39 Dendrobium purpureum Roxb.Orchid.ITS24815370.862.00E-1310.9005 Dendrobium calcaratum a *
matK 13601360100.9947 Dendrobium faciferum a *
rbcL 104210420.9800.9965 Dendrobium aggregatum a *
trnL 5629980.988.00E-1560.9814 Dendrobium chrysanthum a *
40 Dendrobium salaccense (Blume) Lindl.Orchid.ITS26276270.792.00E-1750.9914Dendrobium haemoglossuma *
matK 138213820.9900.9987 Dendrobium salaccense c
rbcL 10311031101 Dendrobium salaccense c2/3 is a *
trnL 132813280.8100.9959 Dendrobium salaccense c
41 Grammatophyllum speciosum BlumeOrchid.ITS280938152101Raphanus raphanistrum subsp. landrai
matK 137813780.9900.996 Grammatophyllum papuanum a *
rbcL 103710370.9700.9947 Cymbidium faberi a **
trnL 56811030.932.00E-1570.9905 Cymbidium serratum a **
42 Nervilia concolor (Blume) Schltr.Orchid.ITS2828828101 Cucumis sativus i
rbcL 106210620.9901 Nepenthes mirabilis i
trnL 15851585100.9834 Nervilia mekongensis a *
43 Nervilia plicata (Andrews) Schltr.Orchid.ITS27217210.8800.9741 Syzygium megacarpum i
matK 141314130.9700.9987 Nervilia plicata c
rbcL 100510050.9401 Nervilia plicata c1/4 is a * with higher coverage
trnL 166316630.9900.9967 Nervilia plicata c
44 Oberonia lycopodioides (J.Koenig) OrmerodOrchid.ITS23983980.881.00E-1060.8765 Oberonia caulescens a *
matK 120512050.9300.9732 Oberonia mucronata a *
rbcL 922922100.9921Ancistrochilus sp. a **
trnL 59210780.912.00E-1640.8734 Liparis loeselii a **
45 Strongyleria pannea (Lindl.) Schuit., Y.P.Ng & H.A.PedersenOrchid.ITS24314310.592.00E-1160.959 Mycaranthes pannea c
matK 13751375100.996 Mycaranthes pannea c
rbcL 10551055100.9965 Mycaranthes pannea c
46 Galearia filiformis (Blume) Boerl.Pand.ITS24334330.994.00E-1170.8552 Populus nigra i
matK 13931393101 Galearia filiformis c
rbcL 104210420.9801 Galearia filiformis c
trnL 17441744100.9969 Galearia filiformis c
47 Benstonea affinis (Kurz) Callm. & BuerkiPandan.ITS21241240.246.00E-240.8611 Magnolia henryi i
matK 139713970.9100.9935 Pandanus oblatus a *
rbcL 10571057101 Pandanus adinobotrys a *
trnL 17051705100.9989 Pandanus baptistii a *
48 Phyllanthus oxyphyllus Miq.Phyllanth.ITS26216210.749.00E-1740.9971 Phyllanthus oxyphyllus c1/2 is a * with higher coverage
matK 13751375100.9973 Phyllanthus oxyphyllus c
rbcL 10591059101 Phyllanthus emblica a *
trnL 9899890.5800.9945 Phyllanthus emblica a *
49 Ardisia complanata Wall.Primul.ITS26676670.7800.9973 Ardisia dasyrhizomatica a *
matK 15741574100.9931 Ardisia mamillata a *
rbcL 103110310.9900.9965 Ardisia crenata a *
trnL 14831483100.9951 Ardisia dasyrhizomatica a *
50 Ardisia crenata SimsPrimul.ITS26176170.741.00E-1720.997 Ardisia villosa a *
matK 140414040.8800.9987 Ardisia crenata c
rbcL 10481048101Ardisia cornudentata subsp. morrisonensisc1/2 is a *
trnL 147614760.9900.9988 Ardisia affinis a *
51 Ventilago madraspatana Boerl.Rhamn.ITS22063160.451.00E-480.9444 Hibiscus panduriformis i
matK 134713470.9600.9973 Ventilago leiocarpa a *
rbcL 102210220.9600.9947 Ventilago leiocarpa a *
trnL 15741574100.9722 Ventilago kurzii a *
52 Psychotria montana BlumeRubi.ITS239839818.00E-1070.9744 Psychotria camerunensis a *
matK 137613760.9900.996 Psychotria asiatica a *
rbcL 102910290.9601 Psychotria adenophylla a *
trnL 150415040.9600.9826 Psychotria asiatica a *
53 Lunasia amara BlancoRut.ITS25795790.746.00E-1610.9654 Lunasia amara c
matK 124312430.8800.9971 Lunasia amara c
rbcL 102610260.9700.9947 Flindersia brayleyana a **
trnL 166816680.9500.9946 Lunasia amara c
54 Melicope lunu-ankenda (Gaertn.) T.G. HartleyRut.ITS2787787100.9823 Melicope pteleifolia a *
matK 14081408100.9987 Melicope pteleifolia a *
rbcL 103110310.9800.9965 Melicope pteleifolia a *
trnL 11681168100.9953 Melicope grisea a *
55 Kadsura scandens (Blume) BlumeSchisandr.ITS25585580.697.00E-1550.9967 Kadsura scandens c
matK 13761376100.9947 Kadsura philippinensis a *
rbcL 105010500.9901Kadsura cf. a *
trnL 163516350.9900.986 Kadsura matsudae a *
56 Smilax calophylla Wall. ex A.DC.Smilac.ITS2821821100.9933 Phaseolus vulgaris I
rbcL 104810480.9800.9982 Smilax cocculoides a *
57 Smilax zeylanica L.Smilac.ITS22742740.353.00E-690.9809Acer tataricum subsp. theiferumi
matK 13711371101 Smilax ovalifolia a *
rbcL 104410440.9801 Smilax ocreata a *
58 Aquilaria hirta Ridl.Thymelae.ITS27027020.8200.9948 Aquilaria microcarpa a *
matK 14021402100.9974 Aquilaria malaccensis a *
rbcL 105710570.9901 Rauvolfia serpentina c
trnL 9879870.6700.9945 Aquilaria microcarpa a *
59 Amomum hochreutineri ValetonZingiber.ITS26166160.794.00E-1720.9884 Sundamomum hastilabium a **
rbcL 104410440.9801Amomum villosum var. xanthioidesa *
trnL 156815680.9800.9931 Amomum fulviceps a *
60 Etlingera solaris (Blume) R.M.Sm.Zingiber.ITS26566560.8900.9764 Hornstedtia conica a **
rbcL 105310530.9901 Alpinia arundelliana a **
trnL 162216220.9900.9955 Etlingera yunnanensis a **
61 Meistera aculeata (Roxb.) Skornick. & M.F. NewmanZingiber.ITS25925920.727.00E-1651 Amomum aculeatum c
rbcL 102010200.9601 Amomum dallachyi a *

Note: Result summary: c = correct, a *: ambiguous or correct in genus level, a **: ambiguous or correct in family level, i = incorrect.

2.3. Description of ITS2, matK, rbcL, and trnL Regions of Indonesian Medicinal Plants

The descriptive statistics of the sequence regions ITS2, matK, rbcL, and trnL are portrayed in Figure 2. The minimum and maximum lengths (bp) of ITS2, matK, rbcL, and trnL regions varied between 233–984, 384–1142, 382–1122, and 416–962, respectively, for all studied species; the average lengths of each region were 591.2, 676.9, 636.1, and 735.8, respectively. The range of GC Content (%) for ITS2, matK, rbcL, and trnL regions varied between 30.94–66.83, 27.86–65.43, 27.72–63.64, and 29.26–67.74, respectively, for all Indonesian medicinal plant species, whilst the average GC contents were 48.34, 41.64, 43.52, and 39.10, respectively.
Figure 2

Box plots of the sequence length (upper) and GC content (lower) of ITS2, matK, rbcL, and trnL of Indonesian medicinal plants.

The relationships between MP species identification accuracy and sequence length (bp), GC content (%), species number per genus, and percentage of identity are presented in Figure 3. With respect to sequence length, the longer the ITS2 and rbcL sequence regions, the lower the identification accuracy, while the other regions indicated no such relationship. With respect to GC content (%), all regions except ITS2 tended to be less accurate for identification when the GC content increased. In terms of species number per genus, matK, rbcL, and trnL regions all tended to have no correlation with the species number per genus, but the ITS2 sequence region was more accurate in identification when the species number per genus was higher. However, this result will depend on the available DNA information in the data bank. All regions indicated a positive relationship of percentage identity (through a BLASTN search) with identification accuracy.
Figure 3

Scatterplot of identification accuracy vs. sequence length (bp), GC Content (%), species number per genus, and percentage of identity. Scale 0–3 represents the identification accuracy (0 = incorrect, 1 = correct at the family level, 2 = correct at the genus level, 3 = correct at the species level).

Among the sequence regions produced for Indonesian medicinal plants, ITS2 generally had the lowest minimum length, smallest average sequence, and highest GC content (Figure 1 and Figure 2) and hence gives the highest efficiency of identification, with only a short DNA sequence needed for correct identification. After ITS2, matK follows second with respect to having the smallest average sequence length. A short DNA sequence may make the process of DNA barcoding technically easier and more economical from extraction to sequencing, as Kress and colleagues suggested [44]. Meanwhile, in terms of GC content (%), only ITS2 had higher identification accuracy when the GC content increased. In some plant DNA sequences, GC content has a positive correlation with exon sites, i.e., the coding regions [45]. This might mean that the longer the exons, the higher the GC content; thus, DNA regions with high GC content are expected to have more accurate identification.

2.4. Identification of Indonesian Medicinal Plants Using Sequences of Their ITS2, matK, rbcL, and trnL Regions

Identification of the sequence regions resulting from the BLAST method that have been matched with samples morphologically identified are presented in Table 3. The highest correct identification in the set of medicinal plant species was reached by the matK region, followed by ITS2 and rbcL, although the percentage values among them were not significantly different (i.e., 31.15% compared to 29.51%). In contrast, trnL had the lowest correct identification, approximately 15% lower than that of matK. The highest incorrect identification was reached by the ITS2 region, followed by the rbcL region. Overall, the most accurate of the four regions was matK because it has the highest identification rate at the species level, lowest at the family level, and resulted in no incorrect identifications [3,41,42].
Table 3

Identification success rates of each region through the BLAST method after validating with the species name from morphologicy identification.

Identification MeasureRegion
ITS2 (%)matK * (%)rbcL (%)trnL (%)
Correct identification at species level29.5131.1529.5116.39
Correct identification at genus level32.7947.5452.4655.74
Correct identification at family level6.5609.848.20
Incorrect identification22.9504.920

* 4 matK regions with the second primer excluded.

Some ambiguous (correct at the genus and family level) and incorrect identification of Indonesian medicinal plants occurred. This might have happened because the world plant data has more than 1.2 million species names [34], while the DNA barcoding data for plants contains only 234,692 barcodes and only 5942 plants are recorded from Indonesia (http://www.boldsystems.org; accessed on 6 February 2020). As such, the available DNA bank to be cross-checked with the samples is far from complete. The correct identification of unique species by singular regions and by combinations of regions can be visualized in the Venn diagrams (Figure 4). ITS2 was the most accurate region with unique correct identification, followed by rbcL, matK, and trnL. A combination of three regions gave the same number of unique correct identifications, and a combination of all gave the highest correct identification. With respect to unique correct identification at the genus level, rbcL gave the most accurate identification, followed by ITS2, trnL, and finally matK. A combination of matK, rbcL, and trnL gave the best unique accurate identification compared to the other three combinations, and the combination of all gave the largest number of unique species among all possibilities. The highest unique correct species at the family level were obtained by using rbcL, then ITS2, and finally trnL.
Figure 4

Venn diagrams for correct identification of species at different taxonomic levels. From left to right: at the species level, at the genus level, and at the family level.

As presented in Table 4, the overall averages of the barcoding regions describing the genetic distance between the two compared species were very similar to one another, i.e., above 1.1% and below 1.2%, except for ITS2, which indicated an average of 1.29%. The lower the taxon unit relation, the lower the percentage, while the higher the taxon unit relation, the higher the percentage. Only the minimum distance of the matK region could describe species in the same genera. Nevertheless, the maximum distance of each region describes the highest level of the different species in a family. In principle, the genetic distance of interspecific related species (within the genus level and above) should be greater than that of the intraspecific species (within species level). It can be stated that more genetic distance lies between two different species with a different family than two different species with the same family. Species within the same genus have the least genetic distance.
Table 4

K2P pairwise genetic distances (%) of each region at different species levels.

RegionObservationValue (%)Related Species
ITS2Overall average1.29503
Minimum distance0.00440Nepenthes reinwardtiana and Nervilia concolor ***
Maximum distance2.70903Erycibe malaccensis and Acalypha grandis ***
matK Overall average1.12567
Minimum distance0.00615Nepenthes mirabilis and N. ampullaria *
Maximum distance2.62368Nepenthes reinwardtiana and Parkia timoriana ***
rbcL Overall average1.19148
Minimum distance0.00350Amomum hochreutineri and Etlingera solaris **
Maximum distance2.62587Phyllanthus oxyphyllus and Galearia filiformis ***
trnL Overall average1.11310
Minimum distance0.02887 Alstonia scholaris and Rauvolfia serpentina **
Maximum distance2.59858 Millettia sericea and Cymbidium aloifolium ***

Notes: *: MP species in the same genera; **: MP species in the same family; ***: MP species in the different family.

The percentage of the sequences identified for each of the regions (ITS2, matK, rbcL, and trnL) was directly proportional to the accuracy of the identification. The higher the percentage, the more accurate the identification. MatK could correctly lead to identification of species with the highest percentages, followed by rbcL and ITS2 (Table 2). Only the matK region could differentiate species at the same genus level and species in different families compared to other regions. In contrast, ITS2 could not differentiate all species distances appropriately (Table 4). In addition, it should be considered that using BLAST in a DNA barcoding study with any regions/primers is a basic step in identifying species [25,26,27,28,42]. BLAST analysis is the approach to the most similar species, and it depends on the species information stored in DNA bank. Therefore, the validation step to confirm the most accurate or most possible species is still required. When the used samples were clear species [42] like in this study, morphological identification by the experts was used, but when the samples were unable to be identified morphologically due to damage or derivate form or/and lack of taxonomic expert, generating a phylogenetic tree amongst medicinal plant groups such as a neighbor-joining (NJ) tree [23,25,26,42], maximum parsimony (MP), and maximum likelihood (ML) [42], and even analyzing chemical compound products [24] might be needed. Considering Hollingsworth and colleagues’ findings with respect to DNA barcoding, it could serve two purposes. The first would be to provide information into the species-level taxon unit, and the second would be to help identify an unknown specimen to a known species. Thus, all the regions tested are valuable, depending on the purpose [43]. We emphasize that having a phylogenetic tree in the barcoding study would be beneficial, particularly when experts assume the medicinal plants are unidentified or a cryptic species. Thus, identification, authentication, and even conservation plan and action can be properly defined and applied.

3. Materials and Methods

3.1. Plant Samples and Literature Survey

This study used 61 different species of medicinal plants belonging to 30 families and 50 genera (Table 1). Plant samples were collected from a living collection with written permission from botanic gardens, including Bogor Botanic Gardens and Cibodas Botanic Gardens in Indonesia, and Hortus Botanicus Leiden in the Netherlands. All species had been taxonomically identified using morphological features as viewed on their identity card. Their scientific names were cross-checked online using POWO (2022) [34]. A leaf sample was collected from each species, except for Alstonia scholaris (L.) R. Br. and Spondias malayana Kosterm, from which bark samples were taken. This was due to A. scholaris and S. malayana Kosterm being high trees with unreachable leaves. Each sample (approximately 25 g) was collected and stored in a teabag with silica gel [46,47,48]. A literature study was conducted to collect all scientific information with respect to each of the sampled plant species, which can help the conservation status of every species. Information about available DNA data—i.e., whether the species already had DNA barcoding or genetic information that could be accessed from DNA banks—was identified using BOLD [11] and NCBI [10]. Data on species origin, including whether the species are native or introduced to Indonesia, and, if native, whether they are endemic, were collected from POWO (http://www.plantsoftheworldonline.org; accessed on 1 February 2022) [34]. Threatened species status was collected from the IUCN Red List of Threatened Species (https://www.iucnredlist.org; accessed on 6 February 2022), with species classified as Vulnerable (VU), Endangered (EN), Critically Endangered (CR), Extinct in The Wild (EW), or Extinct (EX) [35]. Global legislation regulating trade, i.e., based on whether the species is included in CITES Appendices I, II, or III, was collected from the UNEP-WCMC Checklist of CITES species (https://checklist.cites.org; accessed on 1 February 2022) [36]. The information on rare medicinal plants, was compiled from the Indonesian Biodiversity Strategy and Action Plan (IBSAP) National Document [37]. Endemic plants or plants mentioned in the IUCN Red List, CITES Appendices I, II, or III, endemic, and priority lists were considered to be important species that need to be prioritized for conservation [49].

3.2. Molecular Analysis

Molecular analysis was performed at the University of Guelph laboratory, Canada. The barcoding method involves genomic DNA extraction, DNA amplification, and DNA sequencing, and taxonomic identification against available DNA banks. For DNA extraction, genomic DNA was extracted from plant samples using the Maxwell® RSC Purefood GMO and Authentication Kit and the Maxwell® RSC Instrument (Promega). For DNA amplification, primers targeting the ITS2, matK, rbcL, and trnL genes of plants were used to amplify the DNA (Table 5). Each PCR reaction mix (25 μL) contained 1x HotStarTaq master mix (Qiagen), 0.4 μM of each (forward and reverse) primers, 0.15 μg of BSA and 2 μL of template DNA. PCR thermal cycling was conducted by using a GeneAmpTM PCR System 9700 (Applied Biosystems, Waltham, MA, USA). The PCR cycling conditions were as follows: 95 °C for 10 min for DNA denaturation, 45 cycles of 95 °C for 15 sec for DNA annealing with the primer, followed by 55 °C for 30 sec and 72 °C for 1 min for DNA extension, and finally 72 °C for 7 min.
Table 5

Primers used for amplification of DNA regions of ITS2, matK, rbcL, and trnL.

GeneRegionNameSequenceReference
rbcL rbcLa-FATGTCACCACAAACAGAGACTAAAGC[50]
rbcLa-RGTAAAATCAAGTCCACCRCG
matK matK472FCCCRTYCATCTGGAAATCTTGGTTC[41]
matK1248RGCTRTRATAATGAGAAAGATTTCTGC
matK a matKxFTAATTTACGATCAATTCATTC[23]
matK5RGTTCTAGCACAAGAAAGTCG
ITS2ITS2FATGCGATACTTGGTGTGAAT[51]
ITS3RGACGCTTCTCCAGACTACAAT
trnL trnL-F ATTTGAACTGGTGACACGAG[7]
trnL-cCGAAATCGGTAGACGCTACG

Note: matK a is an alternative to matK that is used when the PCR reaction fails to have an amplificon. F denotes the forward primer sequence and R is the reverse primer sequence.

PCR products were visualized on 2% agarose gels to check whether DNA amplification was successful. PCR products were then purified using a NucleoFast® 96 PCR clean-up kit (Macherey-Nagel). The purified PCR fragments were sequenced bidirectionally, using the same primers as for the PCR, with the help of an ABI 3730 Genetic Analyzer (Applied Biosystems). The retrieved sequences were analyzed using ABI PrismTM Sequencing Analysis software (Applied Biosystems) to obtain a consensus sequence (Q > 20) for each sample.

3.3. Sequence Analyses and Data Interpretation

For each sample, the consensus sequence was compared with the nucleotide sequences in the BOLD species ID engine and the NCBI GenBank using BLASTN (https://blast.ncbi.nlm.nih.gov; accessed on 7 January 2022) [52] with the program selection as “Highly Similar Sequences (Megablast)” [53] for taxonomic identification. When no result was obtained from Megablast due to the sequence being too short, the sequence was queried with the program selection as, “Somewhat similar sequences (nBlast) for an alternative”. PCR amplification, sequencing, and identification success rates were calculated as percentages. Only one best-matched species was selected from the BLASTN identification that is approached from the most similar sequence species recorded in DNA bank. Where there was more than a single match, the best-matched species was selected as the one with the lowest E value and the highest coverage; otherwise, any species was the closest-related species to the query (species). The results were then validated with studied medicinal species’ ID from botanical gardens where they have been morphologically identified by taxonomic expert. The BLAST identification results were the initial step to identify species with DNA barcoding [25,26,27,28,42]. It was considered to be the correct species if the highest percentage of identification referred to the right species, i.e., when the species name from sequence identification matched the morphologically identified species. Otherwise, when the sequence was identified as a different species within a genus or a different species within a family, the result was considered to be an ambiguous species or genus. Ambiguous identifications were counted as correct identification, as per the study by Amandita et al. [42]. Sequences with an identification percentage of 99% or more were included in the novel sequence data for specific DNA barcoding for a species. Novel sequence data will be deposited in the GenBank database to assist in future identification. Descriptive, statistical, and scatter plot analyses were used to gain understanding of the ITS2, matK, rbcL, and trnL regions and the relationship between factors in the BLAST analysis, with the identification being completed using the MINITAB Statistical Software. In addition, Venn diagrams generated by Bioinformatics and Evolutionary Genomics (http://bioinformatics.psb.ugent.be/cgi-bin/liste/Venn/calculate_venn.htpl; accessed on 2 January 2022) were used to depict how many species were correctly identified by singular regions and by multiple combinations of regions, whether or now there was a correct identification within species, genus, or family level. Information about the species number per genus was obtained from POWO [34]. Sequence alignments were performed using the Muscle program. The nucleotide composition of all sequences obtained from the ITS2, matK, rbcL, and trnL regions were computed, and their genetic distances were calculated with Kimura 2 parameters (K2P) [54]. The K2P pairwise genetic distance is the percentage of nucleotide sequence divergence that was used by Hebert and colleagues [2]. All analyses were performed with the Molecular Evolutionary Genetics Analysis (MEGA X) software [55]. All the medicinal plant species information collected was analyzed and interpreted according to the use of the data in DNA barcoding with respect to conservation. Any correct identification can be used for DNA barcoding for related species and can be subsequently helpful for medicinal plant conservation, although the DNA barcoding can only be used for identification at species level and cannot estimate variation within species [56]. Any ambiguous identification can be used as an approach to species identification and thus may also be valuable for medicinal plant conservation. Any new sequence or new DNA barcoding that is not available in NCBI or BOLD constitutes novel data. Species included in at least one of the following categories: IUCN Red List [40], CITES Appendixes I, II, or III [36], rare medicinal plants species [37], or Native and Endemic species [34] would require DNA barcoding more urgently than the non-listed species. Therefore the species were categorized in priority order A-M as follows: new DNA barcoding and can strongly assist medicinal plant (MP) conservation (A), can strongly assist MP conservation (B), new DNA barcoding and can assist MP conservation (C), can assist MP conservation (D), new to DNA bank data and new DNA barcoding and may strongly assist MP conservation (E), new DNA barcoding and may strongly assist MP conservation (F), may strongly assist MP conservation (G), new to DNA bank data and new DNA barcoding and may assist MP conservation (H), new DNA barcoding and may assist MP conservation (I), may assist MP conservation (J), new to DNA bank data and new DNA barcoding but sequences need to be clarified further (K), new DNA barcoding but sequences need to be clarified further (L) and sequences need to be clarified further (M).

4. Conclusions

Based on the results of this study, we conclude that no single region is perfectly ideal for DNA barcoding. Nonetheless, according to the observed criteria, we recommend matK as the core DNA barcoding region for Indonesian medicinal plant identification. In addition, due to its unique correct species identification, we recommended the ITS2 and rbcL regions as alternative or complementary regions to the core barcoding DNA using matK. DNA barcoding for 33 Indonesian medicinal plant species was provided; of these 33 species, 21 species were newly DNA barcoded; of these 21 species, three contributed novel DNA barcoding data to DNA bank. In the future, this guide and associated data will facilitate a means to identify Indonesian medicinal plants, particularly those that need to be conserved strongly, to assure a valid species rather than a substitute in herbal medicines and to prevent illegal trade.
  36 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

2.  DNA barcoding a useful tool for taxonomists.

Authors:  David E Schindel; Scott E Miller
Journal:  Nature       Date:  2005-05-05       Impact factor: 49.962

3.  DNA barcoding: a six-question tour to improve users' awareness about the method.

Authors:  Maurizio Casiraghi; Massimo Labra; Emanuele Ferri; Andrea Galimberti; Fabrizio De Mattia
Journal:  Brief Bioinform       Date:  2010-02-15       Impact factor: 11.622

4.  Forensic botany II, DNA barcode for land plants: Which markers after the international agreement?

Authors:  G Ferri; B Corradini; F Ferrari; A L Santunione; F Palazzoli; M Alu'
Journal:  Forensic Sci Int Genet       Date:  2014-10-16       Impact factor: 4.882

5.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

6.  Forensic identification of Indian snakeroot (Rauvolfia serpentina Benth. ex Kurz) using DNA barcoding.

Authors:  Marcel C M Eurlings; Frederic Lens; Csilla Pakusza; Tamara Peelen; Jan J Wieringa; Barbara Gravendeel
Journal:  J Forensic Sci       Date:  2013-03-04       Impact factor: 1.832

7.  Plant DNA barcodes can accurately estimate species richness in poorly known floras.

Authors:  Craig Costion; Andrew Ford; Hugh Cross; Darren Crayn; Mark Harrington; Andrew Lowe
Journal:  PLoS One       Date:  2011-11-11       Impact factor: 3.240

8.  DNA barcoding detects contamination and substitution in North American herbal products.

Authors:  Steven G Newmaster; Meghan Grguric; Dhivya Shanmughanandhan; Sathishkumar Ramalingam; Subramanyam Ragupathy
Journal:  BMC Med       Date:  2013-10-11       Impact factor: 8.775

9.  Assessing product adulteration of Eurycoma longifolia (Tongkat Ali) herbal medicinal product using DNA barcoding and HPLC analysis.

Authors:  Bashir Mohammed Abubakar; Faezah Mohd Salleh; Mohd Shahir Shamsir Omar; Alina Wagiran
Journal:  Pharm Biol       Date:  2018-12       Impact factor: 3.503

10.  Identification of ethnomedicinal plants (Rauvolfioideae: Apocynaceae) through DNA barcoding from northeast India.

Authors:  Pradosh Mahadani; Gouri Dutta Sharma; Sankar Kumar Ghosh
Journal:  Pharmacogn Mag       Date:  2013-07       Impact factor: 1.085

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.