Literature DB >> 30276026

Development of 10 single-copy nuclear DNA markers for Euchresta horsfieldii (Fabaceae), a rare medicinal plant.

Arief Priyadi1,2,3, Chao Feng2, Ming Kang2, Hongwen Huang2.   

Abstract

PREMISE OF THE STUDY: Euchresta horsfieldii (Fabaceae) is a rare and endangered medicinal plant in Indonesia with restricted distribution. Single-copy nuclear DNA (scnDNA) markers were developed for this species to facilitate further investigation of genetic diversity and population structure. METHODS AND
RESULTS: We performed RNA-Seq and de novo assembly of the transcriptome. Ten primer sets were developed for E. horsfieldii, all of which also amplified in E. japonica and E. tubulosa.
CONCLUSIONS: These scnDNA markers will be an important resource for the study of genetic diversity and population structure of E. horsfieldii and other species in the genus Euchresta.

Entities:  

Keywords:  Euchresta horsfieldii; Fabaceae; RNA‐Seq; genetic diversity; single‐copy nuclear DNA (scnDNA) markers

Year:  2018        PMID: 30276026      PMCID: PMC6159642          DOI: 10.1002/aps3.1178

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Euchresta horsfieldii (Lesch.) Benn. is a perennial shrub distributed in Bhutan, China, India, Indonesia, Laos, Nepal, the Philippines, Thailand, and Vietnam (Sun and Larsen, 2010). In Indonesia, the natural habitats of E. horsfieldii are restricted to the middle range of rainforests at 1300–2400 m above sea level in Sumatra, Java, and Bali (van Steenis, 2006). Euchresta horsfieldii is used in traditional Indonesian herbal medicine, and its pharmacological properties have been evaluated for antitumor, anti‐oxidant, and lipid‐reducing agents (Li et al., 2014). Despite its potency as an Asian medicinal plant, its rarity, and its endangered status, studies of the conservation genetics of the species are still scarce. Hama et al. (2009) developed a set of microsatellite primers for E. japonica Hook. f. ex Regel and performed cross‐amplification in E. formosana (Hayata) Ohwi. We tested these markers in E. horsfieldii, and only two out of nine primers amplified. Thus, we developed a new set of single‐copy nuclear DNA (scnDNA) markers for E. horsfieldii to allow further study of its population genetic diversity and structure. We used RNA‐Seq, transcriptome de novo assembly, and subsequent bioinformatics workflows to develop scnDNA markers. Ten primer pairs of scnDNA markers were obtained for E. horsfieldii and E. japonica. We successfully used these primers to assess polymorphism of three wild populations of E. horsfieldii from Indonesia and one population of E. tubulosa Dunn from China. Therefore, this set of scnDNA markers should be helpful in population genetic studies of E. horsfieldii and related taxa.

Methods and results

A stem cutting of E. horsfieldii was collected from Tapak Hill, Bali, Indonesia, and cultivated in the greenhouse of South China Botanical Garden, Guangdong, China. Fresh leaves were harvested from the 3‐month‐old plants grown from this cutting for RNA extraction. Library construction of cDNA and the transcriptome sequencing followed protocols by Ai et al. (2015), with some modifications. Total RNA was isolated from approximately 100 mg of fresh leaf tissue of E. horsfieldii with the TransZol Up Plus RNA Kit (TransGen Biotech Co., Beijing, China), following the manufacturer's protocols. Total mRNA was isolated using Oligo(dT) cellulose, and subsequently first‐ and second‐strand cDNA were synthesized. Sequencing adapters were ligated to short fragments. Agarose gel electrophoresis was used to separate fragments with lengths of approximately 350 bp, which were then selected as sequencing templates for PCR amplification. Transcriptome sequencing was performed on an Illumina HiSeq X Ten System (Illumina, San Diego, California, USA) that generated 150‐bp paired‐end raw reads. The library construction and sequencing was carried out by a commercial company (Novogene Bioinformatics Institute, Beijing, China). Raw reads were filtered by removing the low‐quality sequences using QC_pe pipeline (Feng et al., 2017), and clean data were used for de novo assembly using Trinity (Grabherr et al., 2011). Transcript quantification was performed using the software package RNA‐Seq by Expectation Maximization (RSEM; Li and Dewey, 2011), and only assembled genes with fragments per kilobase of transcript per million mapped reads (FPKM) values greater than 1 were selected for subsequent analysis. Coding regions within these unigenes were predicted by TransDecoder version 5.0.1 (https://github.com/TransDecoder). We performed Pfam and BLASTP searches of these protein‐coding genes against UniProtKB/Swiss‐Prot to predict their putative functions. Their ortholog groups were compared against Ricinus communis L., Arabidopsis thaliana (L.) Heynh., Oryza sativa L., and Physcomitrella patens (Hedw.) Bruch & Schimp. and identified using an online version of OrthoMCL‐DB (Chen et al., 2006; http://orthomcl.org/orthomcl/). These ortholog groups were treated as putative single‐copy genes (scnDNA). Approximately 21 million Illumina paired‐end clean reads were generated (National Center for Biotechnology Information [NCBI] Sequence Read Archive [SRA] accession no. SRP149026). Clean reads were assembled into 61,796 unigenes with an N50 length of 2160 bp. Among these, 49,804 unigenes with FPKM greater than 1 were obtained, 27,405 protein‐coding genes were predicted, and 1017 putative scnDNA were identified. We randomly selected 24 of these putative scnDNA for initial design of 72 PCR primers using Primer‐BLAST (Ye et al., 2012). To validate the scnDNA markers, genomic DNA was extracted from two individuals each of E. horsfieldii (population BLBG) and E. japonica (populations SCB1, SCB2) (Appendix 1). Validation was done separately in these two species. DNA was extracted from approximately 15–20 mg of silica gel–dried leaf samples using the Plant Genomic DNA extraction kit (BioTeke, Beijing, China), following the manufacturer's instructions. DNA amplification was performed in a 20‐μL reaction mixture containing 10 μL of 2× EasyTaq PCR SuperMix (TransGen Biotech Co.), 0.5 μL each of forward and reverse primer, 8.5 μL of ddH2O, and approximately 50 ng of template DNA. The PCR program was set as one cycle of 5 min at 95°C; 35 cycles of 30 s at 94°C, 90 s at 55°C, 60 s at 72°C; and a final extension of 10 min at 72°C. For amplicon quality and quantity check, each PCR product was run for 15 min of electrophoresis in 1% agarose gel at 120 V. Amplicons with only one clear band were sequenced using an ABI 3730xl DNA Sequencer (Tsingke Biological Technology, Guangzhou, China). Ten primer pairs showed single clear bands and good electropherogram quality from Sanger sequencing. Sequence data were then read, trimmed, and exported to FASTA in Chromas version 2.6.2 (Technelysium, South Brisbane, Queensland, Australia; http://technelysium.com.au). FASTA sequences were aligned using the MUSCLE algorithm available in MEGA 7.0 (Kumar et al., 2016) and then formatted manually to PHYLIP file format as input for further analysis. These 10 primers were assessed for polymorphism in three populations of E. horsfieldii from Indonesia and one population of E. tubulosa from China, following the same protocol described above for marker validation. In total, we collected 38 wild individuals of E. horsfieldii from Indonesia and six individuals of E. tubulosa from China. Voucher specimens of E. horsfieldii were deposited in the Herbarium Hortus Botanicus Baliense (THBB), Bali Botanic Garden, Indonesian Institute of Sciences (LIPI), Bali, Indonesia (Appendix 1). PCR primer pairs and characteristics of the 10 newly developed scnDNA markers, GenBank accessions, and BLASTN hits are presented in Table 1. Genetic diversity measures of all samples derived from pairwise number of site differences, including nucleotide diversity (π), Watterson estimator (θ w), and related measures, were calculated using DnaSP version 5.10 (Librado and Rozas, 2009) (Table 2). The average number of alleles was 5.9 (5 to 7), π was 5.03 × 10−3 (1.75 × 10−3 to 7.5 × 10−3), and θ w was 4.01 × 10−3 (1.65 × 10−3 to 7.23 × 10−3). There was no significant Tajima's D; however, EhoScn04a and EhoScn16a showed significant negative Fay and Wu's H, indicating non‐neutrality of these two loci. Most loci showed no linkage disequilibrium after 10,000 permutations in Arlequin version 3.5.2.2 (Excoffier et al., 2005), except EhoScg15b and EhoScg24b in population SCHU. For the population genetic analysis, the PHYLIP file format was first converted to STRUCTURE input using SPADS 1.0 (Dellicour and Mardulyn, 2014), followed by conversion to GENEPOP format using PGDSpider2 (Lischer and Excoffier, 2012). Allelic richness of each locus and population and number of private alleles (Appendix 2) were generated by PopGenReport version 3.0.0 (Adamack and Gruber, 2014). The average allelic richness was 1.25 (1 to 2.56) and average number of private alleles was 10.2 (3 to 21). These figures show that the genetic diversity of Euchresta spp. vary across species. Within each population, E. horsfieldii are homozygotes and show no genetic variation, but variations are detected among populations. The KTCN and MRPI populations were found to share seven common alleles and to each have three private alleles, whereas all 10 alleles from the BALI population are private.
Table 1

PCR primers and characteristics of 10 scnDNA markers developed for Euchresta horsfieldii

LocusPrimer sequences (5′–3′) T a (°C) Fragment size (bp)BLASTN top hit description  E‐value Closest speciesGenBank accession no.
EhoScn01aF: AAGTTCCGCTTCCTCGAATC52.5646AdoMet‐dependent rRNA methyltransferase SPB1 2e‐99 Cajanus cajan MH248777MH248783
R: GTAATTACCTTCGCCTGGGG
EhoScn02bF: GCGAAAAACCTCGTACTTGC51.5671Diphthamide biosynthesis protein 10 Lupinus angustifolius MH269247MH269251
R: CTCTGCAGCTACACTCACAG
EhoScn04aF: ATGGACACCCACTCTTCTCA51.5529Histone acetyltransferase GCN51e‐116 Glycine max MH269252MH269256
R: ACTCTCTTCTCTGGCAGTGT
EhoScn15bF: TCTACTTCCGACCCCTTTGT51.8591Protein ABCI7, chloroplastic 9e‐103 Lupinus angustifolius cv. Tanjil MH269257MH269263
R: CTCCAATGGCAACCTCCAAT
EhoScn16aF: AAGCAATGCCGAGACGAGAA53.5445DNA/RNA‐binding protein KIN170 Glycine max MH269264MH269268
R: CCTCGGACACTCAACCTGTG
EhoScn17aF: ATCGGCACTCTTCTGAGACTGA53.0538F‐box/LRR‐repeat protein 100 Lupinus angustifolius cv. Tanjil MH269269MH269274
R: ACCTCATCATAATCACGCGCA
EhoScn20aF: CCAGGTTAAGGGTCGCAAGT54.0553Pentatricopeptide repeat‐containing protein At3g160100 Lupinus angustifolius cv. Tanjil MH269275MH269280
R: GCCTCAGAAGGTGGAGCTTT
EhoScn21cF: ATCACAACCAGCCCAACCAA53.6709Mediator of RNA polymerase II transcription subunit 40 Lupinus angustifolius cv. Tanjil MH269281MH269285
R: TGGATTGGCATCCAAAGGCT
EhoScn23cF: CCACTTCGCCACCAAGTACA54.0439Pentatricopeptide repeat‐containing protein At1g51965, mitochondrial 2e‐108 Lupinus angustifolius cv. Tanjil MH269286MH269291
R: TCTAAATCCTGGCCGTTCCC
EhoScn24bF: AAGAACCTTCGTTATCCACAGCA52.6631Pentatricopeptide repeat‐containing protein At2g018600 Lupinus angustifolius cv. Tanjil MH269292MH269298
R: TAATAAGGGCACCACATGCCT

T a = annealing temperature.

Table 2

Genetic properties of the 10 newly developed scnDNA markers of Euchresta horsfieldii, cross‐amplified in E. japonica and E. tubulosa

Locus n Length S p n A k h π (× 10−3) θ W (× 10−3)Tajima's DFay & Wu's H
EhoScn01a96646240.037274.8480.7527.507.230.11160.3725
EhoScn02b96671100.014953.4030.6295.072.91.9226−1.4166
EhoScn04a9652990.017052.5470.7514. 873.720.7930−6.6328a
EhoScn15b96591160.027173.4940.6355.915.270.34250.2169
EhoScn16a9644570.015751.4790.6293.323.5−0.1232−3.7282a
EhoScn17a9653880.014962. 3770. 6274.422.901.2839−1.6326
EhoScn20a96553130.023563.7290.7526.744.581.28380.9755
EhoScn21c9670960.008551.2380. 6291.751.650.13470.4301
EhoScn23c9643980.018261.8570. 6294.233.550.47010.2507
EhoScn24b96631150.0237873.8540.6316.324.790.8896−0.9234

A = number of alleles; h = haplotype diversity; k = average number of nucleotide differences; n = number of sequences used (consists of four sequences of E. japonica, 12 sequences of E. tubulosa, and 80 sequences of E. horsfieldii); p n = proportion of polymorphic sites; π = nucleotide diversity; θ w = Watterson estimator per site from S; S = variable sites.

Significant (α = 0.01).

PCR primers and characteristics of 10 scnDNA markers developed for Euchresta horsfieldii T a = annealing temperature. Genetic properties of the 10 newly developed scnDNA markers of Euchresta horsfieldii, cross‐amplified in E. japonica and E. tubulosa A = number of alleles; h = haplotype diversity; k = average number of nucleotide differences; n = number of sequences used (consists of four sequences of E. japonica, 12 sequences of E. tubulosa, and 80 sequences of E. horsfieldii); p n = proportion of polymorphic sites; π = nucleotide diversity; θ w = Watterson estimator per site from S; S = variable sites. Significant (α = 0.01).

Conclusions

In this study, we demonstrated the application of RNA‐Seq to develop scnDNA markers for E. horsfieldii, and these markers were cross‐amplified to E. japonica and E. tubulosa. These scnDNA markers will provide useful resources to study the population genetic diversity and population structure of these rare medicinal plants.

Author contributions

M.K. conceived and designed the project. A.P. carried out the field and laboratory works supervised by H.H. C.F. conducted bioinformatics analyses. A.F. and C.F. wrote the manuscript. All authors read and approved the final version of the manuscript.

Data accessibility

Illumina paired‐end clean reads were deposited to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA; accession no. SRP149026). Sequence information for the developed primers has been deposited to NCBI; GenBank accession numbers are provided in Table 1.
SpeciesVoucher specimen accession no.Collection localityGeographic coordinatesPopulation codeNo. of individuals genotyped
E. horsfieldii (Lesch.) Benn.E19910118c Bali, Indonesia (living collection of Bali Botanic Garden) 8.28370S, 115.13702E (8.27681S, 115.15338E)BLBG2
AP231a Bedugul, Tabanan, Bali, Indonesia8.28370S, 115.13702EBALI14
AP233a Mt. Marapi, Kotobaru, West Sumatra, Indonesia0.39433S, 100.42735EMRPI14
AP234a Mt. Kerinci, Kutacane, Aceh, Indonesia3.35859N, 97.70061EKTCN10
E. japonica Regel20060969/N27‐0002c Lechang, Guangdong, China (living collection of SCBG)— (23.18033N, 113.35351E)SCB11
20100407/11‐0004c Sangzhi, Hunan, China (living collection of SCBG)29.11456N, 110.46506E (23.18033N, 113.35351E)SCB21
E. tubulosa DunnPE 00305297b Mt. Emei, Sichuan, China29.55253N, 103.34494ESCHU6

AP = Arief Priyadi, collector.

Vouchers deposited at the Herbarium Hortus Botanicus Baliense (THBB), Bali Botanic Garden, Indonesian Institute of Sciences (LIPI), Bali, Indonesia.

No vouchers collected during this study. Voucher information is from the Herbarium of the Institute of Botany (PE), Chinese Academy of Sciences; accessed from the Chinese Virtual Herbarium (http://www.cvh.ac.cn).

Living collections of the South China Botanical Garden (SCBG), Guangzhou, Guangdong, China, or Bali Botanic Garden, LIPI, Tabanan, Bali, Indonesia.

Population (Species) N Allelic richness per locus per populationNo. of private alleles
01a02b04a15b16a17a20a21c23c24b
SCB1, SCB2 (E. japonica)22.5571.97111.9711.9712.5571.9711.7862.5571.97121
SCHU (E. tubulosa)61111.85411.3121112.17114
KTCN (E. horsfieldii)1011111111113
MRPI (E. horsfieldii)1411111111113
BALI, BLBG (E. horsfieldii)16111111111110

N = number of individuals sampled.

  12 in total

1.  PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs.

Authors:  H E L Lischer; L Excoffier
Journal:  Bioinformatics       Date:  2011-11-21       Impact factor: 6.937

2.  spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.

Authors:  Simon Dellicour; Patrick Mardulyn
Journal:  Mol Ecol Resour       Date:  2013-12-08       Impact factor: 7.090

3.  Development of microsatellite markers for Euchresta japonica and E. formosana (Leguminosae).

Authors:  I Hama; Y Saito; C Umehara; C L Lian; Y Ide
Journal:  Mol Ecol Resour       Date:  2009-03-17       Impact factor: 7.090

4.  Comparative transcriptome resources of eleven Primulina species, a group of 'stone plants' from a biodiversity hot spot.

Authors:  Bin Ai; Yong Gao; Xiaolong Zhang; Junjie Tao; Ming Kang; Hongwen Huang
Journal:  Mol Ecol Resour       Date:  2014-10-08       Impact factor: 7.090

5.  MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets.

Authors:  Sudhir Kumar; Glen Stecher; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2016-03-22       Impact factor: 16.240

6.  Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction.

Authors:  Jian Ye; George Coulouris; Irena Zaretskaya; Ioana Cutcutache; Steve Rozen; Thomas L Madden
Journal:  BMC Bioinformatics       Date:  2012-06-18       Impact factor: 3.169

7.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

8.  Arlequin (version 3.0): an integrated software package for population genetics data analysis.

Authors:  Laurent Excoffier; Guillaume Laval; Stefan Schneider
Journal:  Evol Bioinform Online       Date:  2007-02-23       Impact factor: 1.625

9.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups.

Authors:  Feng Chen; Aaron J Mackey; Christian J Stoeckert; David S Roos
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

View more
  1 in total

1.  Corrigendum.

Authors: 
Journal:  Appl Plant Sci       Date:  2019-06-07       Impact factor: 1.936

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.