Literature DB >> 26421250

Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides.

Yuqing Jin1, Quanxin Bi2, Wenbin Guan2, Jian-Feng Mao1.   

Abstract

PREMISE OF THE STUDY: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. METHODS AND
RESULTS: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively.
CONCLUSIONS: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.

Entities:  

Keywords:  454 pyrosequencing; Cupressaceae; EST-SSR; Metasequoia glyptostroboides; microsatellite; relict plant

Year:  2015        PMID: 26421250      PMCID: PMC4578375          DOI: 10.3732/apps.1500038

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Metasequoia glyptostroboides H. H. Hu & W. C. Cheng (Cupressaceae), the dawn redwood, is a relict conifer of the genus Metasequoia H. H. Hu & W. C. Cheng. Its natural population has been found only in highly restricted regions bordering Hubei, Hunan, and Chongqing provinces in China (Farjon, 2001). This tree species is valued for its essential oils and crude extracts, which have great potential for antifungal activity. Moreover, this conifer is widely used as an ornamental tree in eastern China. Given its limited natural population size, M. glyptostroboides has been listed as an endangered species by both the Chinese government and the International Union for Conservation of Nature (Li et al., 2005). Low genetic diversity was previously found in both wild and artificial populations of M. glyptostroboides using random amplified polymorphic DNAs (RAPDs) and amplified fragment length polymorphisms (AFLPs) (Chen et al., 2003). Nevertheless, dominant markers, including RAPDs and AFLPs, cannot provide reliable estimates of genetic diversity because they are unable to distinguish heterozygotes from homozygotes (Nybom, 2004). This drawback can be overcome using simple sequence repeats (SSRs). In contrast to RAPDs and AFLPs, SSRs are typically studied separately at each identified locus and can be regarded as codominant markers (Nybom, 2004). Cui et al. (2010) developed 11 polymorphic microsatellite markers with a high polymorphism information content (PIC) by sequencing a microsatellite-enriched library. In recent years, with increasing exploration of expressed sequence tags (ESTs), EST-derived SSRs (EST-SSRs) have emerged as useful tools for estimating functional variation (Andersen and Lübberstedt, 2003; Varshney et al., 2007; Zalapa et al., 2012). Additional genetic markers, especially codominant EST-SSRs, are valuable and will be critical for resolving finer genetic variation patterns and developing functional studies and conservation strategies for this endangered species. Zhao et al. (2013) generated 1,571,764 high-quality reads (assembled into 97,565 unigenes) from vegetative and female buds by transcriptome sequencing using 454 pyrosequencing technology, which provides a large amount of sequence information for microsatellite mining. In this study, we referenced the transcriptome sequences and developed the first set of EST-SSR markers for M. glyptostroboides.

METHODS AND RESULTS

The 97,565 unigene sequences were downloaded from http://www.genome.clemson.edu/node/273 (presented by the transcriptome sequencing project) (Zhao et al., 2013) and used for subsequent microsatellite mining. We first detected the microsatellite sequences from the unigene data set and identified unique microsatellites through an all-against-all BLAST search. Finally, we successfully designed primer pairs for 2087 unique EST-SSR loci. A bioinformatics analysis was implemented for microsatellite development and primer design using QDD version 3.1 (Meglécz et al., 2014). We selected 96 di- or trinucleotide loci with large numbers of repeats for primer synthesis and tested them in eight mature individuals collected from the Beijing Botanical Garden with permission. The sampled trees were introduced separately from natural populations in three counties (Lichuan [108.56°N, 30.18°E], Shizhu [28.28°N, 108.58°E], and Longshan [29.59°N, 106.50°E]) in the 1950s. Voucher specimens were deposited in the herbarium at Beijing Forestry University (accession no.: BFU-shuishan201403). Genomic DNA was extracted from the leaves of eight individuals using the cetyltrimethylammonium bromide (CTAB) method (Doyle and Doyle, 1987). Functional annotations were prepared for each EST-SSR locus with polymorphisms detected by querying the Pfam protein families database (Finn et al., 2014). The 96 selected primer pairs were validated by PCR using the M13-tail technique (Schuelke, 2000) to test for polymorphisms. Three primers were synthesized for each genotyping experiment: a 5′ M13-tailed forward primer, a reverse primer, and a fluorescently labeled M13 primer. M13 primers carrying FAM, HEX, TAMRA, and ROX (BGI, Beijing, China) labels were used separately. Each PCR mixture contained 5 μL of 2× MIX (New Industry, Beijing, China), 10–20 ng of template DNA, 1.6 pmol of reverse primer, 1.6 pmol of single fluorescently labeled M13 primer, and 0.4 pmol of forward primer in a final volume of 10 μL. PCR was performed as follows in a thermal cycler: 94°C for 5 min; 28 cycles of 94°C for 40 s, annealing at 55°C for 40 s, and elongation at 72°C for 45 s; 10 cycles of 94°C for 40 s, annealing at 53°C for 40 s, and elongation at 72°C for 45 s; and a final extension at 72°C for 10 min. The products were analyzed using an ABI 3730 Prism Genetic Analyzer (Applied Biosystems, Foster City, California, USA). The raw data were analyzed using GeneMarker version 2.62 (SoftGenetics, State College, Pennsylvania, USA). The number of alleles, observed heterozygosity (Ho), and expected heterozygosity (He) were calculated using GenAlEx version 6.5 (Peakall and Smouse, 2012). The allelic PIC was also calculated for each locus using CERVUS version 3.0 (Kalinowski et al., 2007). Of the 96 loci, 53 demonstrated the expected fragment sizes; the remaining loci failed to amplify any product. After capillary electrophoresis scoring, 42 of the 53 loci showed a clear, single peak for each allele, of which 23 (53.49%) were polymorphic and 19 were monomorphic. The EST-SSR loci evaluated did not overlap with putative polymorphic SSRs described in a previous study that detected SSR loci from the same transcriptome data set (Zhao et al., 2013). The primer sequences, repeat motifs, sizes, and functional annotations from Pfam are shown in Table 1. The number of alleles from these 23 polymorphic EST-SSRs varied from two to eight (average: 3). The Ho, He, and PIC ranged from 0 to 1.0, 0.117 to 0.813, and 0.110 to 0.789, respectively (Table 2). The mean He was 0.551, which is higher than that estimated using RAPDs (Chen et al., 2003). Indicators including the percent of polymorphic loci and He suggest considerable genetic variability for this endangered conifer. Target sequences for the 23 microsatellite loci are attached as Appendix S1.
Table 1.

Characteristics of 23 polymorphic EST-SSR loci developed for Metasequoia glyptostroboides.

LocusPrimer sequences (5′–3′)Repeat motifAllele size (bp)Functional annotationb
BQSS1F: GTTTCCTAGTTCTGCGCGAC(AT)8301CLP_protease, Ribosomal_S18, MGC-24
R: TTGCTGCTCATTTCATTCCA
BQSS2F: CATTGGCAAATTCACTCCAAT(AC)8335No hit
R: GACGAAGAGGCATTCCCAT
BQSS3F: GCCATTCGTATCTGTTTCCAA(AG)9229KH_1, KH_3, KH_2, KH_4, KH_5, SLS
R: GCGGTATTTACGAGACGAGC
BQSS7F: CTGGCACTCCTGATATGCCT(AAG)8154DEAD, Helicase_C, Helicase_C_2, AAA_22, TATR
R: GATGACTCGATGCAAGGGTT
BQSS9F: AAATCTCTATCGCCCTCCAA(AT)12187TauD, DUF1667, Glyoxalase_2
R: ACTTGGCGAACCTGTAATCC
BQSS10F: GGAGGCGTTAGTGGAGGAA(AG)3CG(AGG)8AG140No hit
R: ATCATTTGTCCCTCTGCTGC
BQSS17F: TCCTCGGTTTCTAGTGCAGG(ACC)8117No hit
R: CAAGGGAATGTACTCATCTCCA
BQSS18F: TCCTCCTCCAAATTATTGCG(AAC)9291RRM_1, RRM_6, RRM_5, PAM2
R: ATTTCAGAGGCATTTCGGTG
BQSS21F: CCAAATACTGGTCGGATGCT(AGG)8271AP2
R: TGCTCCTCCTCTTCTACCGA
BQSS26F: CTTTGGATTGCATACCTGGG(AAG)9300zf-C2H2_6, zf-C2H2_jaz, ThuA, zf-met, Tnp_zf-ribbon_2
R: GCTTTACAAGTCAGCGGAGG
BQSS27F: TTCTGTGACAATGGATGGGA(AT)8161eIF-5a, KOW, EFP_N
R: GCAAGGCTCTAAAGCTGGAA
BQSS29F: CTGGTTCAGGTTTGTGGGTC(AT)8290QH-AmDH_gamma
R: TTCATTTGAGCTTGGCGG
BQSS32F: CACTTCTGCTCTCTTCTCATCG(AG)15333FBA_2, Pox_E10
R: CACAGGAACAGGTAGGAAACG
BQSS34F: GGTCAAGAGCATCAGCTTCC(AT)8193VPS28
R: ATGGTTCGGATTATGGCAAA
BQSS35F: AAGATGAATGCAATCCAGGC(AG)9154HEAT_2
R: ATCAGGCAACTGTTTGGCAT
BQSS36F: AATCCTCAACACAAGGATGC(AG)9182GUCT
R: TTGTACCTTCGTTTGCCAAG
BQSS38F: CAAACAAACCAACCAACCAA(AT)9176No hit
R: CCTGAATTTGATCGAAGATGG
BQSS45F: TTGCTGTCATACTGTTGTCGTG(AG)9(AT)8A344No hit
R: CTTTGTTGAGTCTGTGCCGA
BQSS46F: GTGATGCAAGGTTAGTGCCC(AC)8139AHH
R: GCATGTGTGTGTTTGAGGGA
BQSS48F: TGGGTGAGTTCAAGAAGTTGG(AT)12277No hit
R: GTGTCCATACGCAAGGGTTT
BQSS49F: CAAGCCATAGCATAGGCACA(AAG)8343SOBP, Plexin_cytopl, 7tm_7, Herpes_US9
R: GATGGGTTGTCGAGTGGTTT
BQSS50F: CTTCAATCCACTTGTCTTGCC(ATGC)12128RP-C_C
R: GACAAGTAACCGGAACCGAA
BQSS70F: CTTTCATTTAGTGCATGGATGG(AC)9174No hit
R: TTTGCATTGTAATCTTGCGG

Annealing temperature (Ta) for all loci was 55°C.

Protein family names from function annotation prepared by querying the protein family database, Pfam (Finn et al., 2014).

Table 2.

Genetic properties of 23 polymorphic EST-SSR loci developed for Metasequoia glyptostroboides.

LocusAHoHeHWEbPIC
BQSS130.0000.594**0.511
BQSS230.6250.508ns0.428
BQSS320.3750.305ns0.258
BQSS720.5000.469ns0.359
BQSS940.7500.648ns0.592
BQSS1030.6250.539ns0.447
BQSS1720.2500.219ns0.195
BQSS1840.7500.609ns0.559
BQSS2120.0000.219**0.195
BQSS2620.3750.430ns0.337
BQSS2740.5000.719ns0.667
BQSS2920.0000.219**0.195
BQSS3230.0000.531**0.468
BQSS3420.1250.117ns0.110
BQSS3520.2500.219ns0.195
BQSS3631.0000.555*0.456
BQSS3850.7500.656ns0.595
BQSS4540.1430.704*0.646
BQSS4620.1250.117ns0.110
BQSS4880.5000.813ns0.789
BQSS4920.3750.305ns0.258
BQSS5020.0000.469**0.359
BQSS7020.6250.430ns0.337

Note: A = number of alleles; He = expected heterozygosity; Ho = observed heterozygosity; HWE = Hardy–Weinberg equilibrium; PIC = polymorphism information content.

Sequences were downloaded from the Clemson University Genomics Institute Metasequioa RNA-Seq transcriptome sequencing project (available at http://www.genome.clemson.edu/node/273).

ns = not significant; *P < 0.05; **P < 0.01; ***P < 0.001.

Characteristics of 23 polymorphic EST-SSR loci developed for Metasequoia glyptostroboides. Annealing temperature (Ta) for all loci was 55°C. Protein family names from function annotation prepared by querying the protein family database, Pfam (Finn et al., 2014). Genetic properties of 23 polymorphic EST-SSR loci developed for Metasequoia glyptostroboides. Note: A = number of alleles; He = expected heterozygosity; Ho = observed heterozygosity; HWE = Hardy–Weinberg equilibrium; PIC = polymorphism information content. Sequences were downloaded from the Clemson University Genomics Institute Metasequioa RNA-Seq transcriptome sequencing project (available at http://www.genome.clemson.edu/node/273). ns = not significant; *P < 0.05; **P < 0.01; ***P < 0.001.

CONCLUSIONS

We developed the first set of EST-SSR markers for M. glyptostroboides. The 23 polymorphic loci reported here will facilitate finer estimation of the genetic diversity and population structure of this species, as well as the development of functional studies and conservation strategies. Click here for additional data file.
  9 in total

1.  An economic method for the fluorescent labeling of PCR fragments.

Authors:  M Schuelke
Journal:  Nat Biotechnol       Date:  2000-02       Impact factor: 54.908

Review 2.  Functional markers in plants.

Authors:  Jeppe R Andersen; Thomas Lübberstedt
Journal:  Trends Plant Sci       Date:  2003-11       Impact factor: 18.313

3.  Comparison of different nuclear DNA markers for estimating intraspecific genetic diversity in plants.

Authors:  Hilde Nybom
Journal:  Mol Ecol       Date:  2004-05       Impact factor: 6.185

Review 4.  Using next-generation sequencing approaches to isolate simple sequence repeat (SSR) loci in the plant sciences.

Authors:  Juan E Zalapa; Hugo Cuevas; Huayu Zhu; Shawn Steffan; Douglas Senalik; Eric Zeldin; Brent McCown; Rebecca Harbut; Philipp Simon
Journal:  Am J Bot       Date:  2011-12-20       Impact factor: 3.844

5.  Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment.

Authors:  Steven T Kalinowski; Mark L Taper; Tristan C Marshall
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

6.  QDD version 3.1: a user-friendly computer program for microsatellite selection and primer design revisited: experimental validation of variables determining genotyping success rate.

Authors:  Emese Meglécz; Nicolas Pech; André Gilles; Vincent Dubut; Pascal Hingamp; Aurélie Trilles; Rémi Grenier; Jean-François Martin
Journal:  Mol Ecol Resour       Date:  2014-05-26       Impact factor: 7.090

7.  An EST dataset for Metasequoia glyptostroboides buds: the first EST resource for molecular genomics studies in Metasequoia.

Authors:  Ying Zhao; Shivegowda Thammannagowda; Margaret Staton; Sha Tang; Xinli Xia; Weilun Yin; Haiying Liang
Journal:  Planta       Date:  2012-11-03       Impact factor: 4.116

8.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

9.  Pfam: the protein families database.

Authors:  Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-11-27       Impact factor: 16.971

  9 in total
  2 in total

1.  Novel 28 microsatellite loci using high-throughput sequencing for an endangered species on Metasequoia glyptostroboides (Cupressaceae).

Authors:  Jing-Wen Wang; Tong-Lei Xu; Ghenet Weldegebriel Sereke; Rong Wang; Yuan-Yuan Li
Journal:  Mol Biol Rep       Date:  2020-02-08       Impact factor: 2.316

2.  Microsatellite markers: what they mean and why they are so useful.

Authors:  Maria Lucia Carneiro Vieira; Luciane Santini; Augusto Lima Diniz; Carla de Freitas Munhoz
Journal:  Genet Mol Biol       Date:  2016-08-04       Impact factor: 1.771

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.