Muhammad Jefte C Arshed1, Marcos B Valdez2, Grecebio Jonathan D Alejandro3. 1. The Graduate School, University of Santo Tomas, España Blvd., 1015 Manila, Philippines. 2. Department of Biological Sciences, Institute of Arts and Sciences, Far Eastern University, Nicanor Reyes Street, Sampaloc, 1008 Manila, Philippines. 3. The Graduate School and Research Center for the Natural and Applied Sciences, University of Santo Tomas, España Blvd, 1015 Manila, Philippines.
Abstract
INTRODUCTION: The pantropical genus Lasianthus Jack is identified for high phenotypic plasticity making traditional taxonomic identification difficult. Having some members with important medicinal properties, a precise complimentary identification through DNA barcoding is needed for species delineation. MATERIALS AND METHODS: In this study, 12 samples representing six Philippine Lasianthus species were used to determine the most efficient barcoding loci among the cpDNA markers (matK, rbcL, rps16, and trnT-F) and nrDNA (ITS) based on the criteria of universality, discriminatory power, and resolution of species. RESULTS: The results revealed that ITS has the recommended primer universality, greatest interspecific divergences, and average resolution of species. Among the cpDNA markers, matK and rbcL are recommended but with minimal resolution of species. While trnT-F showed moderate interspecific variations and resolution of Lasianthus species, rps16 has the lowest interspecific divergence and resolution of species. CONCLUSION: Consequently, ITS is the potential ideal DNA barcode for Lasianthus species. SUMMARY: ITS, matK, and rps16 markers have the excellent amplification and sequence qualityITS marker has the highest interspecific divergence with the maximum values, followed by matK, rbcL, trnT-F, and rps16, respectivelyAll markers except rps16 yielded average resolution to Lasianthus speciesITS marker is the most ideal locus in terms of excellent universality, high interspecific discriminatory ability, and average species resolution. Abbreviations used: ITS: Internal Transcribe Spacer, matK: maturase K, rbcL: ribulose-1,5-biphospahte-carboxylase, rps16: ribosomal protein 16 small subunit gene.
INTRODUCTION: The pantropical genus Lasianthus Jack is identified for high phenotypic plasticity making traditional taxonomic identification difficult. Having some members with important medicinal properties, a precise complimentary identification through DNA barcoding is needed for species delineation. MATERIALS AND METHODS: In this study, 12 samples representing six Philippine Lasianthus species were used to determine the most efficient barcoding loci among the cpDNA markers (matK, rbcL, rps16, and trnT-F) and nrDNA (ITS) based on the criteria of universality, discriminatory power, and resolution of species. RESULTS: The results revealed that ITS has the recommended primer universality, greatest interspecific divergences, and average resolution of species. Among the cpDNA markers, matK and rbcL are recommended but with minimal resolution of species. While trnT-F showed moderate interspecific variations and resolution of Lasianthus species, rps16 has the lowest interspecific divergence and resolution of species. CONCLUSION: Consequently, ITS is the potential ideal DNA barcode for Lasianthus species. SUMMARY: ITS, matK, and rps16 markers have the excellent amplification and sequence qualityITS marker has the highest interspecific divergence with the maximum values, followed by matK, rbcL, trnT-F, and rps16, respectivelyAll markers except rps16 yielded average resolution to Lasianthus speciesITS marker is the most ideal locus in terms of excellent universality, high interspecific discriminatory ability, and average species resolution. Abbreviations used: ITS: Internal Transcribe Spacer, matK: maturase K, rbcL: ribulose-1,5-biphospahte-carboxylase, rps16: ribosomal protein 16 small subunit gene.
Entities:
Keywords:
DNA barcoding; Lasianthus; Philippines; cpDNA; nrDNA
Lasianthus Jack is the largest genus of the four genera comprising the tribe Lasiantheae of family Rubiaceae. The genus consists of about 225 species with the highest diversity in tropical and subtropical Asia.[1] Lasianthus is characterized as drupes with thick wall that develop from the ovaries with 3–9 locules.[2] It represents an ecologically important element specifically in its distribution pattern, which is significant in the field of biogeography and speciation patterns in the assemblage of tropical rainforest.[3] Moreover, Lasianthus exhibits medicinal uses such as Lasianthus lucidus Blume that is used to ease fever, blood loss and has hepatoprotective potential;[4] Lasianthus verticillatus (Lour.) Merr. is traditionally used by the Onges tribe as antidote;[5] Lasianthus oblongus King and Gamble is applied orally to hasten constriction of the organs for postpartum mothers;[6] and other several species of the genus are with known active chemical constituents such as alkaloids, terpenoids, and glycosides (e.g., L. attenuatus Jack, L. fordii Hance, and L. lucidus Blume).[78] Close analysis of literature, protologs, and herbarium specimens reveals uncertainties and difficulties in discriminating Lasianthus species based on morphology. The genus is identified for high phenotypic plasticity making traditional taxonomic identification difficult. Knowing some Lasianthus species exhibits medicinal and pharmaceutical importance; accurate species identification is necessary.Modern molecular biology tools offer excellent approaches for rapid characterization and precise identification of species. Using short sequences as molecular markers for species-level identification is known as DNA barcoding.[9101112] Applications of DNA barcoding are enormous especially in scenario where morphological approaches cannot resolve identification in species having sexual dimorphism and phenotypic plasticity within species of the same genus.[13] Several genomic regions were proposed for the plant DNA barcoding and the plant working group of the Consortium for the Barcode of Life (CBOL)[14] recommended using matK and rbcL as the standard barcodes. Aside from using these two markers, additional three markers were utilized in this study, namely (1) rps16, an intron in the single large copy region of the plastid genome[15] that can provide good resolution[16] and has higher divergence than other cpDNA markers;[17] (2) trnT-F, a noncoding chloroplast gene that has high variability and useful for species and genus level resolutions for phylogenetic studies (e.g., family Arecaceae and Rhamnaceae);[18] and (3) ITS, a nuclear locus that has ability to infer closely related genera[19] due to its high repeating units that promote good amplification and sequencing.[20] Moreover, these markers have been utilized in molecular analyses of Lasianthus species.[117] In this paper, five barcoding loci (matK, rbcL, rps16, trnT-F, and ITS) were evaluated for Philippine Lasianthus species to identify the ideal DNA barcode of the genus based on universality, discriminatory ability, and resolution of species.
MATERIALS AND METHODS
Sampling of plant materials
Collections of Philippine Lasianthus species [Table 1] from the provinces of Antique, Camiguin, Cebu, Davao, Mindoro, and Quezon, Philippines, by the Thomasian Angiosperm Phylogeny and Barcoding Group (TAPBG) of the University of Santo Tomas (UST), Manila, were used in this study. Field images of Lasianthus [Figure 1] and voucher specimens were deposited at the UST Herbarium (USTH) provided with accession numbers [Table 1]. Leaf samples from two different populations were collected and stored in a zip-lock with silica gel.[21] A total of 12 samples representing six Philippine Lasianthus species were used in this study. Seven additional sequences of three Lasianthus species retrieved in the GenBank were used in the analysis [Table 2].
Table 1
Thomasian Angiosperm Phylogeny and Barcoding Group Lasianthus collection used in the study
Figure 1
Field images of some Lasianthus species. Lasianthus fordii Hance: (a) leaves; (b) infructescence; (C) flowers; Lasianthus clementis Merr.: (d) habit; (e) infructescence; (f) fruits
Table 2
Accession numbers of Lasianthus species obtained from National Center for Biotechnology Information-GenBank
Thomasian Angiosperm Phylogeny and Barcoding Group Lasianthus collection used in the studyField images of some Lasianthus species. Lasianthus fordii Hance: (a) leaves; (b) infructescence; (C) flowers; Lasianthus clementis Merr.: (d) habit; (e) infructescence; (f) fruitsAccession numbers of Lasianthus species obtained from National Center for Biotechnology Information-GenBank
DNA extraction, polymerase chain reaction amplification, and sequencing
Silica gel-dried leaf samples were used for the extraction of genomic DNA using the DNeasy Plant Mini Kit (Qiagen®, Germany) following the manufacturer's protocol. The Biometra T-gradient (Germany) was used for the polymerase chain reaction (PCR) amplification. DNA was amplified using KAPA Taq PCR kit (USA). The universal primers and amplification protocol used are listed in [Table 3]. The PCR cocktail of 25 μL reaction for the chloroplast markers (rps16, trnT-F, matK, and rbcL) was as follows: 17.35 μL nuclease free water, 2.5 μL × 10 PCR buffer, 1.0 μL 25 MgCl2, 2.0 μL deoxynucleotide triphosphates (dNTP), 1.0 μL of 10 μM forward and reverse primers, 0.15 μL Taq DNA polymerase, and 0.5 μL DNA template. For ITS marker, the PCR cocktail of 25 μL reaction was mixed as follows: 15.3 μL nuclease free water, 2.5 μL × 10 PCR buffer, 2.0 μL MgCl2, 1.5 μL dNTP, 1.0 μL of 10 μM forward and reverse primers, 0.2 μL Taq DNA polymerase, and 1.5 μL DNA. The presence of amplified DNA bands was confirmed using 1% concentration of agarose gel with ×1 tris-borate-ethylenediaminetetraacetic acid buffer [Figure 2]. Amplified DNA was purified using the QIA-quick Purification Kit (Qiagen®, Germany) and were sent to Macrogen, South Korea, for bidirectional sequencing. DNA sequences were assembled and edited using the Codon Code Aligner v. 4.1.1. (CodonCode Co., USA).
Table 3
Primers used for the amplification of DNA barcodes
Figure 2
Sample of gel autoradiograph showing polymerase chain reaction products (ITS marker)
Primers used for the amplification of DNA barcodesSample of gel autoradiograph showing polymerase chain reaction products (ITS marker)
Sequence analyses
For determining the most effective barcode marker for the discrimination of Lasianthus species, the following conventional barcoding parameters such as mean length of base pair (bp), PCR success rate (%), intra- and inter-specific divergences (%), and the mean sequence divergence in each marker and between the different markers were analyzed using MEGA v. 7.0.14 (Pennsylvania State University), (K2P, Kimura-2-Parameter with pairwise deletion). This was followed by the Wilcoxon Mann–Whitney test to establish if the mean sequence divergence is statistically significant using SPSS 15.0 software (SPSS Inc., Chicago, IL, USA). To assess the resolution of species, percentage was calculated base from the neighbor-joining (NJ) tree that was constructed for further evaluation of markers.
RESULTS
From the five markers, a total of sixty newly sequences of Lasianthus were produced [Appendix 1]. Sequence characteristics for the five barcode loci are presented in Table 4 with their overall results. The longest mean length was from trnT-F with 2101 bp followed by rps16, matK, ITS, and rbcL. As for the most parsimonious informative sites, the trnT-F marker was the highest with 164 informative bp from 270 variable sites, followed by ITS, rbcL, and matK. Interestingly, rps16 with the second highest mean bp still fall short for having the least informative characters of 11 from 54 variable sites. Primer universality was determined using the PCR amplification efficiency and sequence quality. PCR amplification was generally 100% successful to all the candidate barcodes. For the sequence quality, ITS, matK, and rps16 exhibited 100% success rates, followed by rbcL with 92% sequencing success and trnT-F with 75% which was the least efficient.
Table 4
Characteristics of the different barcoding loci
Characteristics of the different barcoding lociPairwise divergence analyses for each candidate barcodes using the two parameters to characterize the inter- and intra-specific divergences are presented in Table 5. The ITS has the highest interspecific divergence (0.1623 ± 0.0810), followed by matK (0.0951 ± 0.0982), trnT-F (0.0621 ± 0.0356), rbcL (0.0563 ± 0.0232), and rps16 (0.0238 ± 0.0376). Results for the intraspecific variations revealed that trnT-F (0.0121 ± 0.0122) has the lowest average in all the parameters, followed by rbcL (0.0155 ± 0.0161), matK (0.0207 ± 0.0172), rps16 (0.0243 ± 0.0469), and ITS (0.0999 ± 0.0613).
Table 5
Inter- and intra-specific divergences among loci
Inter- and intra-specific divergences among lociNJ tree was used to generate the topology of Lasianthus species in each candidate barcodes to determine the species resolution. Using BLAST, all of the candidate barcodes were able to classify each species as to genus Lasianthus, but the generated tree for each barcodes was unable to categorize some species to its specific resolutions [Figure 3].
Figure 3
Neighbor-joining bootstrap trees (based on Kimura-2-Parameter) illustrating the resolution of the species for the five barcoding loci: (a) ITS sequences showing 50% species resolution; (b) matK sequences showing 50% species resolution; (c) rbcL sequences showing 67% species resolution; (d) rps16 showing 33% species resolution; (e) trnT-F showing 50% species resolution
Neighbor-joining bootstrap trees (based on Kimura-2-Parameter) illustrating the resolution of the species for the five barcoding loci: (a) ITS sequences showing 50% species resolution; (b) matK sequences showing 50% species resolution; (c) rbcL sequences showing 67% species resolution; (d) rps16 showing 33% species resolution; (e) trnT-F showing 50% species resolutionNone of the markers can completely resolve taxa with closely related species (e.g., L. lucidus, L. fordii, L. verticillatus, L. trichophlebus). Nevertheless, some markers can give better resolution with higher bootstrap (BS) support than other markers used in the study. The rbcL marker followed by matK and trnT-F can resolve some of the difficult species with greater support value. For ITS, it cannot group same species fully just like rps16, but it can generate higher confidence level compared to rps16.
DISCUSSION
A suitable barcode should exhibit the following criteria: (1) high universality (PCR and sequencing success rates), (2) high discriminatory power based on the inter- and intra-specific divergences, and (3) high species resolution.[14] The results of the study were assessed and vis-a-vis against the criteria.
Universality
PCR amplification efficiency and sequence quality: Amplified and generated sequences of the five barcoding loci were evaluated based on the sequence quality that each barcodes produced. ITS, matK, and rps16 markers have the excellent amplification and sequence quality. The rbcL and trnT-F markers yielded successful amplification but less sequencing success rates. Results show that ITS, matK, and rps16 markers are the most universal in terms of quality and coverage of sequences among the barcodes utilized. This corresponds to previous studies[92728] that ITS has high amplification [Figure 2] and sequence capabilities. Likewise, the results confirmed matK exhibiting amplification and sequencing efficiency[293031] and this was one of the markers recommended by CBOL as a standard barcode in plants. Furthermore, rps16 marker also provides high amplification and sequencing success, indicating its universality as it has been used in discriminating taxonomic uncertainties in Rubiaceae.[32]
Discriminatory: Inter- versus intra-specific genetic divergence
An ideal barcode should exhibit high interspecific divergences but low intraspecific variation.[93334] Using the Wilcoxon two-sample tests, significant differences between the inter- and intra-specific divergences of the five candidate barcodes were analyzed [Table 6]. Interspecific differences were significantly higher (P < 0.05) than their related intraspecific divergences. Thus, settled differences exhibited by both specific divergences give a good lead for the discriminatory efficiency of the markers used.
Table 6
Wilcoxon two-sample test for inter- versus intra-specific divergences
Wilcoxon two-sample test for inter- versus intra-specific divergencesIn comparison of the five barcodes, ITS maker has the highest interspecific divergence with the maximum values, followed by matK, rbcL, trnT-F, and rps16, respectively [Table 5]. The ITS has the second highest number of variable and informative sites. It also yields the highest interspecific mean which corroborates in other studies.[3435] However, results for intraspecific variations revealed that ITS has the highest value, followed by rps16, matK, rbcL, and trnT-F markers. An ideal barcode should have low intraspecific variations which ITS failed to have. Thus, ITS has high discriminatory power on interspecific level as this marker is useful for identification efficiency of closely related species among numerous genera. Furthermore, ITS region is regarded as more varied than any of the chloroplast genes.[363738] Results obtained from Wilcoxon signed-rank test [Table 7] support ITS to possess the highest interspecific divergence with almost high significant differences. However, ITS is not a good marker for intraspecific identification of Lasianthus species for having the least intraspecific variations among other barcodes. Consequently, trnT-F should be the ideal barcode for discriminating species for intraspecific level in genus Lasianthus. Furthermore, this marker has the highest number of variable and informative sites. Results obtained using Wilcoxon signed-rank test of intraspecific divergence among loci [Table 8] suggest rps16 as the lowest, followed by trnT-F and matK with equal rank and then rbcL and ITS as the highest. The significant differences were exhibited by rps16 and trnT-F when compared to ITS alone, making the results inconclusive for the ideal barcode for intraspecific level. There should be a significant difference between all the markers to establish the efficiency of the particular marker to discriminate up to intraspecific level.
Table 7
Wilcoxon signed-rank tests of interspecific divergence among loci
Table 8
Wilcoxon signed-rank tests of intraspecific divergence among loci
Wilcoxon signed-rank tests of interspecific divergence among lociWilcoxon signed-rank tests of intraspecific divergence among loci
Resolution of species
Alignments for each barcodes were used to generate phylogenetic analysis using NJ tree to evaluate the species resolution if each barcode can generate taxonomic groupings per species and a monophyletic tree. In addition, the BS values were included to give partial tree reliability for each barcodes. All of the markers have insufficient conspecific groupings [Figure 3] where rbcL has the highest species resolution of only 67%. The ITS, matK, and trnT-F were able to have 50% species resolution and least was from rps16 with 33%. Thus, candidate barcodes used in the study were inadequate for species resolution; nevertheless, inference from this study suggests that most of the barcodes, except for rps16, can give average resolution to Lasianthus species.
CONCLUSION
This study provides baseline information on the potential barcodes for Philippine Lasianthus species. The ITS marker has the most feasible ideal locus for this genus, having excellent universality, high interspecific discriminatory ability, and average species resolution, which can be supplemented by rbcL and matK. It would be suitable to increase the sample size of Lasianthus species to facilitate more definite results for rapid authentication of Philippine Lasianthus.
Financial support and sponsorship
The present work is part of the graduate thesis of MJCA in which financial grant has been obtained from DOST-Science Education Institute. GJDA thanks DOST-Philippine Council for Health, Research and Development and CHED-Philippine Higher Education Research Network, for the funding.