Literature DB >> 30693158

Development of novel EST-SSR markers for Ephedra sinica (Ephedraceae) by transcriptome database mining.

Si-Qian Jiao1, Yan-Qiang Sun1, Dong-Xu Zhang2, Qiong Gao1, Yuqing Jin1, Hui Liu1, Yongpeng Ma3, Yong Yang4, Ilga Porth5,6, Jian-Feng Mao1.   

Abstract

PREMISE OF THE STUDY: Ephedra sinica (Ephedraceae) is a gymnosperm shrub with a wide distribution across Central and Eastern Asia. It is widely cultivated as a medicinal plant, but its wild populations are monitored to determine whether protection is needed. METHODS AND
RESULTS: Thirty-six microsatellite markers, including 11 polymorphic markers, were developed from E. distachya RNA-Seq data deposited in the National Center for Biotechology Information dbEST database. Among 100 genotyped E. sinica individuals originating from five different population groups, the allele number ranged from three to 22 per locus. Levels of observed and expected heterozygosity ranged from 0 to 0.866 (average 0.176) and 0 to 0.876 (average 0.491), respectively. Allelic polymorphism information content ranged from 0.000 to 0.847 (average 0.333). Cross-species amplifications were successfully conducted with two related Ephedra species for all 11 di- or trinucleotide simple sequence repeats.
CONCLUSIONS: This study provides the first set of microsatellite markers for genetic monitoring and surveying of this medicinal plant.

Entities:  

Keywords:  Ephedra sinica; Ephedraceae; expressed sequence tag–simple sequence repeat (EST‐SSR) marker; genetic diversity; gymnosperm; medicinal plant

Year:  2019        PMID: 30693158      PMCID: PMC6342172          DOI: 10.1002/aps3.1212

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Ephedra sinica Stapf (also known as Chinese ephedra or ma huang; Ephedraceae), a gymnosperm shrub, is distributed across southern Siberia, Mongolia, and China, and is found in arid areas and highlands, occurring on slopes, dry river beds, sandy places, or fields in mountainous areas (Lin et al., 2002). The species is reported as dominant in some areas, but little is known about its entire population size. Ephedra sinica has been used in Chinese herbal medicine for thousands of years (Fabricant and Farnsworth, 2001). The stems of most members in the genus Ephedra L. contain the alkaloid ephedrine, which is used for treatment of asthma and other respiratory ailments (Liu, 1989; Nam et al., 2003). Recently, E. sinica has become extensively exploited in a large market developed for nutritional supplements and stimulants involving this plant. Ephedra sinica is recorded on the International Union for Conservation of Nature (IUCN) Red List of Threatened Species (Bell and Bachman, 2011). The IUCN lists the species as Least Concern; however, wild populations still need to be monitored to determine whether protection is required, as a species of Least Concern may still be critically endangered within a particular region where numbers are very small or declining. Recently, 29 polymorphic microsatellite loci were developed for a distantly related species, E. gerardiana Wall. ex C. A. Mey., by mining the whole‐genome‐skimming data from Illumina MiSeq sequencing (De et al., 2017). However, no DNA markers have been developed for E. sinica, limiting our ability to monitor its population dynamics and employ conservation genetic measures. The present study developed a crucial set of di‐ or trinucleotide microsatellite markers by mining an E. distachya expressed sequence tag (EST)–derived database. The EST–simple sequence repeat (SSR) markers developed here will enrich the genetic marker set for Ephedra species.

METHODS AND RESULTS

A total of 4981 ESTs generated from mRNA sequencing of E. distachya were retrieved from the National Center for Biotechnology Information (NCBI) Expressed Sequence Tags database (dbEST) (accessed by searching with “(Ephedra) AND “Ephedra distachya”[porgn:__txid3389]”). Microsatellites with a minimum repeat number of five were detected for 324 ESTs with a minimum length of 200 bp. We obtained 203 unique EST‐SSR loci by an all‐against‐all BLAST analysis and successfully designed primers for 171 unique EST‐SSR loci. All bioinformatic operations were performed using the microsatellite detection and development pipeline QDD version 3.1 (Meglécz et al., 2014). Finally, we selected 88 di‐ or trinucleotide loci with at least five repeats for further evaluation. We sampled five populations (100 individuals total) of E. sinica in Datong, Shanxi Province, China (Appendix 1). Voucher specimens were deposited in the Herbarium of Beijing Forestry University (BJFC). In order to test for successful amplification of the 88 EST‐SSR loci selected, we conducted PCR analysis using eight individual plants of E. sinica. These eight individuals were collected in the Beijing Botanical Garden, Chinese Academy of Sciences. The genomic DNA was extracted from dried leaves using the cetyltrimethylammonium bromide (CTAB) protocol (Doyle and Doyle, 1987). An M13 tail (FAM, HEX, TAMRA, ROX) was attached to the forward primer (Meglécz et al., 2014) for visualization. The final PCR volume was 20 μL, containing 10 μL of 2× Taq PCR Mix (Tiangen, Beijing, China), 4 μL of fluorescent dye–labeled M13 primer (4 pM), 4 μL of mixed forward and reverse primers, and 2 μL (20 ng) of DNA. The following PCR conditions were used: 94°C incubation for 5 min; 25 cycles at 94°C for 40 s, 55°C for 40 s, and 72°C for 45 s; 10 cycles at 94°C for 40 s, 53°C for 40 s, and 72°C for 45 s; and a final extension at 72°C for 10 min. Among the 88 identified di‐ or trinucleotide loci, 38 displayed the expected size bands. After final capillary electrophoresis analysis on an ABI 3730 sequencer (Applied Biosystems, Waltham, Massachusetts, USA), SSR alleles were called with GeneMarker version 2.20 (SoftGenetics, State College, Pennsylvania, USA). Of these 38 loci, 36 showed clear, single peaks for each allele as essential for confident scoring, and 11 of these loci were polymorphic among the initially screened eight individuals. Characteristics of the 25 pairs of monomorphic microsatellite loci developed for E. sinica are shown in Appendix 2. The 11 polymorphic primer pairs were subsequently used to screen five E. sinica populations (with sample sizes n = 20 per population) and two additional populations originating from E. likiangensis Florin (n = 20) and E. equisetina Bunge (n = 6) (Appendix 1). Table 1 shows the primer sequences, repeat motifs, amplification sizes, GenBank accession number of the target sequences, and functional annotations determined with the protein family database, Pfam (Finn et al., 2014). We employed GenAlEx version 6.5 (Peakall and Smouse, 2012) to calculate genetic diversity parameters. The allelic polymorphism information content (PIC) was calculated using CERVUS 3.0 (Kalinowski et al., 2007). Allele numbers ranged from three to 22, with an average of 11.55 alleles per locus. Levels of observed and expected heterozygosity ranged from 0 to 0.842 (average 0.176) and 0 to 0.883 (average 0.491), respectively. In addition, PIC values ranged from 0 to 0.847 (average 0.333). The genetic parameters calculated for the 11 polymorphic EST‐SSR loci are detailed in Table 2. The target sequences for all microsatellite loci are provided in Appendices S1 and S2.
Table 1

Characteristics of 11 polymorphic microsatellite loci developed for Ephedra sinica

LocusPrimer sequences (5′–3′) Repeat motifAllele size range (bp)Fluorescent dyeFunction annotationa GenBank accession no.
E‐2F: GAGAGAAGGCAAGTGTCATGG (AGG)6 192–231FAMPeroxidase JG722437
R: CCATCCTCGTCTCTTTCTGC
E‐18F: AGTCGAAGCAGAAGGCTGAC (AAT)6 153–228TAMRADev_Cell_Death JG719586
R: TCCTGGGAAGAGACTCCGTA
E‐20F: GATTAGGTGGAAAGCAAGCG (AAG)5 164–170HEXDUF260, Oxidored_q1 JG721857
R: ATCCAACCCGATCATGTACC
E‐33F: TTGATGATGTCTGTAGCGGC (ATC)6 186–246ROXMGS, AICARFT_IMPCHas JG720119
R: AGTGGCAGAAGTGTTGGCTT
E‐35F: GGTGTTTCAGATGCGATTCA (AAG)6 182–188FAMCK_II_beta JG720356
R: ATCGTTGATCCTCTTGCGAT
E‐49F: CCTTGAGGCGCTTTATTCAG (AGG)5 175–253TAMRAMIT JG721444
R: CGCAAGATCGAAATACCCAT
E‐58F: GCTCTGTCGAGAAGAACCGA (ATC)5 149–200HEXU‐box, zf‐RING_LisH,DOPA_dioxygen JG722187
R: GGGTGGAACTTGAGGTCCTT
E‐59F: GGATCCAAGATCTGGAAGGAG (AGG)9 174–246ROXYycI JG722338
R: AAGCCCATGTCATCATCCAT
E‐62F: TGAATAGAAGCTGGCTGGGT (AAG)5 173–248FAMNo hit JG722724
R: TTGGCTGGTTCTGTCTGATG
E‐71F: AAAGCGTGCAAGACGAGTTT (CAA)3CGAC(AAC)5A171–261ROXAAA_assoc JG723111
R: TCCTCTTCCTCTCCACCTCA
E‐83F: GTCATGTCATGCTCACCGAC (ATC)5(TTC)3 255–264HEXPkinase, Pkinase_Tyr, Kdo, APH, RIO1, YrbL‐PhoP_reg JG719186
R: GCGACTTCTCATTGCTCTCC

Pfam annotation refers to the protein functional annotation.

Table 2

Values for genetic diversity of Ephedra sinica across 11 polymorphic microsatellite loci.a

LocusMH‐1 (n = 20)MH‐2 (n = 20)MH‐3 (n = 20)MH‐4 (n = 20)MH‐5 (n = 20)Total
A H o H e PIC A H o H e PIC A H o H e PIC A H o H e PIC A H o H e PIC A
E‐220.050 0.050 0.048 30.000 0.272 0.247 20.000 0.185 0.164 20.000 0.097 0.090 30.100 0.188 0.174 5
E‐1860.000 0.813 0.757 60.083 0.750 0.686 40.000 0.598 0.531 50.176 0.727 0.657 110.250 0.883 0.847 18
E‐2030.167 0.379 0.337 20.111 0.489 0.362 20.000 0.097 0.090 20.000 0.097 0.090 30.100 0.272 0.247 4
E‐33100.278 0.521 0.495 90.278 0.608 0.564 70.150 0.500 0.465 70.250 0.786 0.739 80.471 0.832 0.783 20
E‐3530.105 0.104 0.099 20.200 0.185 0.164 20.050 0.050 0.048 20.200 0.185 0.164 20.100 0.097 0.090 3
E‐4950.263 0.290 0.271 40.842 0.597 0.502 40.800 0.581 0.512 40.600 0.483 0.433 50.750 0.564 0.503 8
E‐5840.105 0.711 0.636 40.125 0.762 0.689 30.067 0.398 0.351 70.000 0.823 0.770 50.000 0.694 0.627 10
E‐5960.471 0.770 0.707 40.050 0.594 0.497 80.235 0.820 0.772 40.105 0.545 0.454 50.200 0.645 0.558 16
E‐6280.000 0.876 0.834 80.211 0.751 0.695 100.235 0.845 0.804 100.350 0.836 0.794 100.300 0.831 0.787 21
E‐7190.278 0.708 0.669 90.263 0.875 0.835 80.250 0.866 0.810 70.125 0.730 0.672 60.300 0.590 0.547 22
E‐8320.0000.0970.09010.0000.0000.00030.0000.1900.17710.0000.0000.00030.0530.1520.1424

A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; n = number of individuals; PIC = polymorphism information content.

Voucher and locality information are provided in Appendix 1.

Characteristics of 11 polymorphic microsatellite loci developed for Ephedra sinica Pfam annotation refers to the protein functional annotation. Values for genetic diversity of Ephedra sinica across 11 polymorphic microsatellite loci.a A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; n = number of individuals; PIC = polymorphism information content. Voucher and locality information are provided in Appendix 1. Furthermore, we conducted cross‐species amplification of the 11 polymorphic primer pairs on two related species: E. likiangensis from Yulong, Yunnan Province, and E. equisetina from Datong, Shanxi Province, China (Appendix 1). All 11 primer pairs successfully amplified E. likiangensis, except for locus E‐20, which produced monomorphic bands in the species (Table 3). For E. equisetina, nine out of the 11 primers tested were polymorphic, and two loci failed to amplify. The interspecific amplification profile may be partially related to the phylogenetic relationships between species, as the relationship between E. equisetina and E. sinica is more distant (Ickert‐Bond and Wojciechowski, 2004). In terms of polymorphisms, except for primers at the E‐49 locus, the remaining primer pairs showed moderate polymorphism in E. equisetina, possibly due to the small sample size.
Table 3

Cross‐amplification of 11 polymorphic microsatellite loci developed for Ephedra sinica in E. likiangensis and E. equisetina.a

Locus Ephedra likiangensis (n = 20) Ephedra equisetina (n = 6)
A N Allele size (bp) H o H e PIC A N Allele size (bp) H o H e PIC
E‐2419189–2370.526 0.528 0.444 111820.000 0.000 0.000
E‐18420195–2460.500 0.581 0.511 161950.000 0.000 0.000
E‐201201850.000 0.000 0.000 26170–1850.833 0.530 0.368
E‐33920162–2460.150 0.826 0.781 151470.000 0.000 0.000
E‐35320182–2770.400 0.337 0.289 161880.000 0.000 0.000
E‐49620169–2291.000 0.686 0.626 36175–1840.667 0.530 0.424
E‐58519152–1970.526 0.627 0.546 161950.000 0.000 0.000
E‐59419185–2120.316 0.587 0.479 22202–2080.000 0.667 0.375
E‐62520150–2240.200 0.486 0.438 162220.000 0.000 0.000
E‐71520125–1510.450 0.619 0.559 142260.000 0.000 0.000
E‐83319247–2650.368 0.317 0.275 26189–2580.333 0.485 0.346

A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; n = number of individuals sampled; N = number of successfully amplified individuals; PIC = polymorphism information content.

Voucher and locality information are provided in Appendix 1.

Cross‐amplification of 11 polymorphic microsatellite loci developed for Ephedra sinica in E. likiangensis and E. equisetina.a A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; n = number of individuals sampled; N = number of successfully amplified individuals; PIC = polymorphism information content. Voucher and locality information are provided in Appendix 1.

CONCLUSIONS

The EST‐SSR polymorphic markers developed in this study will be potentially useful for studies of population structure and genetic diversity in E. sinica conservation genetics. These new markers will also be applicable for E. likiangensis and E. equisetina and can enrich the number of DNA markers available for Ephedra.

DATA ACCESSIBILITY

Expressed sequence tags used for primer development were downloaded from the National Center for Biotechnology Information (NCBI) Expressed Sequence Tags database (dbEST). GenBank accession numbers for target sequences of both polymorphic and monomorphic SSR loci are provided in Table 1 and Appendix 2. APPENDIX S1. Monomorphic microsatellite target sequences from microsatellite marker development in Ephedra sinica. Click here for additional data file. APPENDIX S2. Polymorphic microsatellite target sequences from microsatellite marker development in Ephedra sinica. Click here for additional data file.
SpeciesPopulation codeVoucher specimen accession no.Collection localityGeographic coordinates n
Ephedra likiangensis FlorinELF201807281b Baishui River, Jade Dragon Snow Mountain, Lijiang County, Yunnan Province27.13205°N, 100.248755°E 20
Ephedra equisetina BungeEEB201807301c Datong, Shanxi Province39.95878°N, 113.776324°E 6
Ephedra sinica Stapf MH‐1ESS201806101c Fan Yao village, Yanggao County, Datong, Shanxi Province40.28975°N, 113.648139°E 20
Ephedra sinica MH‐2ESS201806232c Nan Tuo village, Duzhuang township, Yunzhou District, Datong, Shanxi Province39.95890°N, 113.776347°E 20
Ephedra sinica MH‐3ESS201806233c Yang Lao Wa village, Xubao township, Yunzhou District, Datong, Shanxi Province40.85174°N, 113.852189°E 20
Ephedra sinica MH‐4ESS201806234c Longhun Mountain, Kang Yao village, Yanggao County, Datong, Shanxi Province40.26208°N, 113.622244°E 20
Ephedra sinica MH‐5ESS201806255c Bai Deng Mountain, Pingcheng District, Datong, Shanxi Province40.12804°N, 113.372931°E 20

n = number of individuals sampled.

All voucher specimens are deposited in the Herbarium of Beijing Forestry University (BJFC), Beijing, China.

Collector Yong‐peng Ma.

Collector Dong‐xu Zhang.

LocusPrimer sequences (5′–3′) Repeat motifAllele size (bp)Fluorescent dyeGenBank accession no.
E‐1F: CCGAATCAATCGCTCTCTTT(CT)5 151FAM JG721273
R: GCCTGGTTCTCTCCCATTT
E‐6F: CAGTCAGGTCTCTTCGCCTC(CA)9 200TAMRA JG723006
R: TGCAACCGTGATATGAGAGC
E‐12F: TAGCTTGTGGCTATTGCCCT(TAG)5 144HEX JG719000
R: ACCCTCCTCCTCCATTGTG
E‐13F: AATCAACTTGGCCCAGACAA(CAT)5 151ROX JG719115
R: CCTCTTGCTTAGCAGCGTCT
E‐19F: GAAGCAGGAGCAGAAGATGC(GCA)5 194FAM JG720107
R: TTTGGAGGTCGCTGATGG
E‐21F: TTTGTGGTGTTGCTGACAGG(AG)24 244TAMRA JG719754
R: ACTCCTCTGCCTCCACTTCC
E‐22F: AGGCTGTGCAGGAACATCTC(GGC)6 230HEX JG723316
R: GTGAGCGGGAATGAGTAGGA
E‐23F: TAAACGACGGGTTCTCTCCA(TG)5 173ROX JG719685
R: TCAAAGTCGTCGAGGAGGAG
E‐25F: GAAACAGGCACAGACACGAC(GGCACA)5 186FAM JG719706
R: GATTTCCAGATCCATTATGCG
E‐26F: TGTTCCTCTCTCTGCGGATT(TTC)5 115TAMRA JG719755
R: TCCTTTGGAAGCTGACCAGT
E‐30F: ACACCACAGGCGAAGAAACT(CTC)5 186HEX JG720051
R: GGAACGGACAGTTGGAGAAG
E‐36F: ATTGAGCACGCAGACACAGT(TTG)5 243ROX JG720438
R: GTTCTCGGACAAACTCAATGG
E‐38F: TGGTCTTGGTCTCATCCCTC(AG)3(CAC)5 228FAM JG720528
R: TCTCACCAAATTCCCACACTC
E‐39F: AAGCGAATGGCGTATAATCG(AGG)7GCA(AGG)3 249TAMRA JG720562
R: AGAGGAAGCAACCAACCCTT
E‐41F: TAGAAGGAGGCGAGAAGCAG(AGA)5 214HEX JG720763
R: TAGCTGAGTCGATCCCACG
E‐46F: GGCAAACAGAAGGAACGAGA(ATG)5 144ROX JG721163
R: TTGCTTGGGTAATAGGCATTG
E‐47F: AACTGGACATGGAGGAGGTG(TCA)5 222FAM JG721187
R: AGAGCGTCAGCCTCAGAAAC
E‐54F: TTCCTGCTTCTTCTAATGCTTTG(TGC)5 165TAMRA JG721879
R: TCGGATCAACACCAAACTCA
E‐55F: AGGCCTTTCTCCGTGTGC(GCA)6 253HEX JG721940
R: GAGCAATGGCCTTGACGTAG
E‐60F: CTTGCAAGTTGCCGAAGC(GA)3T(TTG)3(TTA)6 167ROX JG722642
R: GGTGAATCCATCAAACGCAT
E‐61F: GGATAGGACCCGGGTTAAGA(TA)10 249FAM JG722646
R: GCTGCCCATTAACAAACCAG
E‐65F: TGCATAGAACAGTTGCAGAGG(AG)5 223TAMRA JG723017
R: CAAGCATCTTTCCAACCCAT
E‐74F: CAAATCCCTTTCTTCTCAGATTG(TAT)5 193HEX JG723206
R: GGGTTTCTTACCAGTTGCAGA
E‐84F: TCACTCTCTACAATTCATTCACAGC(TC)5(TA)4 183ROX JG719254
R: GAAGCCGACGTGGATAAGAG
E‐88F: TGACCAAGCTCAAGCAAGAA(ACA)6 166TAMRA JG719585
R: GAAGCGATGATCAGTGGTGA
  7 in total

1.  Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment.

Authors:  Steven T Kalinowski; Mark L Taper; Tristan C Marshall
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

2.  QDD version 3.1: a user-friendly computer program for microsatellite selection and primer design revisited: experimental validation of variables determining genotyping success rate.

Authors:  Emese Meglécz; Nicolas Pech; André Gilles; Vincent Dubut; Pascal Hingamp; Aurélie Trilles; Rémi Grenier; Jean-François Martin
Journal:  Mol Ecol Resour       Date:  2014-05-26       Impact factor: 7.090

3.  Antiinvasive, antiangiogenic and antitumour activity of Ephedra sinica extract.

Authors:  Nguyen-Hai Nam; Chang-Woo Lee; Dong-Ho Hong; Hwan-Mook Kim; Ki-Hwan Bae; Byung-Zun Ahn
Journal:  Phytother Res       Date:  2003-01       Impact factor: 5.878

4.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

Review 5.  The value of plants used in traditional medicine for drug discovery.

Authors:  D S Fabricant; N R Farnsworth
Journal:  Environ Health Perspect       Date:  2001-03       Impact factor: 9.031

6.  Development of microsatellite markers using Illumina MiSeq sequencing to characterize Ephedra gerardiana (Ephedraceae).

Authors:  Ji De; Weidong Zhu; Tianmeng Liu; Zhe Wang; Yang Zhong
Journal:  Appl Plant Sci       Date:  2017-03-10       Impact factor: 1.936

7.  Pfam: the protein families database.

Authors:  Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-11-27       Impact factor: 16.971

  7 in total
  1 in total

Review 1.  Interpol review of controlled substances 2016-2019.

Authors:  Nicole S Jones; Jeffrey H Comparin
Journal:  Forensic Sci Int Synerg       Date:  2020-05-24
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.