Literature DB >> 31059560

Mining and characterization of novel EST-SSR markers of Parrotia subaequalis (Hamamelidaceae) from the first Illumina-based transcriptome datasets.

Yunyan Zhang1, Mengyuan Zhang1, Yimin Hu2, Xin Zhuang1, Wuqin Xu3, Pengfu Li1, Zhongsheng Wang1.   

Abstract

Parrotia subaequalis is an endangered Tertiary relict tree from eastern China. Despite its important ecological and horticultural value, no transcriptomic data and limited molecular markers are currently available in this species. In this study, we first performed high-throughput transcriptome sequencing of two individuals representing the northernmost (TX) and southernmost (SJD) population of P. subaequalis on the Illumina HiSeq 2500 platform. We gathered a total of 69,135 unigenes for P. subaequalis (TX) and 84,009 unigenes for P. subaequalis (SJD). From two unigenes datasets, 497 candidate polymorphic novel expressed sequence tag-simple sequence repeats (EST-SSRs) were identified using CandiSSR. Among these repeats, di-nucleotide repeats were the most abundant repeat type (62.78%) followed by tri-, tetra- and hexa-nucleotide repeats. We then randomly selected 54 primer pairs for polymorphism validation, of which 27 (50%) were successfully amplified and showed polymorphisms in 96 individuals from six natural populations of P. subaequalis. The average number of alleles per locus and the polymorphism information content values were 3.70 and 0.343; the average observed and expected heterozygosity were 0.378 and 0.394. A relatively high level of genetic diversity (HT = 0.393) and genetic differentiation level (FST = 0.171) were surveyed, indicating P. subaequalis maintained high levels of species diversity in the long-term evolutionary history. Additionally, a high level of cross-transferability (92.59%) was displayed in five congeneric Hamamelidaceae species. Therefore, these new transcriptomic data and novel polymorphic EST-SSR markers will pinpoint genetic resources and facilitate future studies on population genetics and molecular breeding of P. subaequalis and other Hamamelidaceae species.

Entities:  

Mesh:

Year:  2019        PMID: 31059560      PMCID: PMC6502335          DOI: 10.1371/journal.pone.0215874

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Parrotia subaequalis (H.T. Chang) R.M. Hao & H.T. Wei, the focal species of our study, is a member of the genus Parrotia C. A. Mey. in the Hamamelidaceae family. This species is a diploid (2n = 2x = 24) deciduous tree characterized by unique exfoliating bark, obovate leaves in green, yellow, red or purple, and distinct apetalous bisexual flowers [1, 2]. Therefore, P. subaequalis is widely cultivated as a horticultural and ornamental tree in North America, Europe and East Asia [3, 4]. However, the natural population size of the wild P. subaequalis has sharply declined due to its narrow geographic distributions in disjunct montane ecosystems of eastern China, serious habitat destruction and the species’ alternate-year fruit production [5, 6]. Additionally, as an endangered Tertiary relict tree, P. subaequalis is categorized as ‘extremely endangered’ by the International Union for Conservation of Nature (IUCN) [7] and the Chinese Plant Red Book (Grade I Key Protected Wild Plants) [8]. Thus, collection of the wild germplasm resources, plant breeding, and improvement of genetic variability of P. subaequalis has been attracting increasing amounts of attention from cultivators and researchers because of its high value in gardening applications and extant endangered survival. Currently, molecular markers are recognized as a reliable and indispensable approach in studies of plant genetics and breeding. Specifically, molecular markers such as randomly amplified polymorphic DNAs (RAPDs), amplified fragment length polymorphisms (AFLPs), inter-simple sequence repeats (ISSRs), and simple sequence repeats or microsatellites (SSRs), are widely used for genetic diversity assessment, gene mapping, marker assisted selection and molecular breeding [9-11]. Compared with other types of molecular markers, SSRs have many advantages in abundance, hypervariability, codominant inheritance and extensive genomic and transcriptomic coverage [12]. Based on the location of the original sequences used to identify simple repeats, SSRs can be divided into genomic SSRs (gSSRs) and expressed sequence tag-simple sequence repeats (EST-SSRs). EST-SSR markers are widely used to investigate the population genetic diversity, marker-assisted selection and molecular breeding because of their higher possibility of being functionally associated with important traits or pathways and higher levels of transferability to related species as compared to gSSRs [13-15]. In addition, a series of bioinformatics tools have been developed for automated SSR discovery and marker development, such as CandiSSR, which help users to efficiently identify candidate polymorphic SSRs (PolySSRs) from transcriptome datasets or multiple assembled genome sequences rather than in a traditional time-consuming and labor-intensive way [16]. Moreover, in the last decade, with the advent of high-throughput next-generation sequencing (NGS) technologies, including 454 Life Science GSFLX Titanium and the Illumina platform, we can have access to the abundant genetic resources of the species of interest in a rapid and cost-effective way [17-19]. The transcriptome refers to the complete set and quantity of transcripts in a cell at a specific developmental stage or under a physiological condition. NGS-derived transcriptome sequencing produces large EST datasets that are exploited for molecular marker development, novel gene identification and population genetic research related to adaptive traits and pathways [20, 21]. To date, there remains a lack of available transcriptomic databases of P. subaequalis and the previously studied types of molecular markers developed for P. subaequalis merely includes ISSRs and gSSRs [6, 22]. Thus, it is imperative to enlarge the transcriptomic resources for conservation and marker-assisted breeding of P. subaequalis. In this study, we first sequenced the transcriptomes of two individuals of P. subaequalis from the northern- and southernmost populations on the Illumina HiSeq 2500 platform. The objectives of our study were to (1) provide transcriptomic information for these two P. subaequalis transcriptomes, (2) undertake the mining and characterization of novel polymorphic EST-SSR markers for P. subaequalis based on the two transcriptome datasets, and (3) perform the assessment of genetic variation in six natural populations by EST-SSR markers and their cross-species transferability.

Materials and methods

Plant samples, RNA, and DNA isolation

For RNA sequencing, the young and fresh leaves of two individuals of P. subaequalis were collected from two natural populations: TX in Anhui Province and SJD in Jiangsu Province (China) (Fig 1 and S1 Table), respectively. Our field work in TX population was under the authority of Tianxia Mountain Landscape Management Administration; And Shanjuan Cave Scenic Spot Management Administration gave the permission of our collection in SJD population. The leaves were chosen to represent the northern- and southernmost distribution of P. subaequalis. Before RNA extraction, all samples were immediately frozen in liquid nitrogen and stored at –80°C. Total RNA for each individual was extracted using TRIzol Reagent (Invitrogen Life Technologies, Carlsbad, California, USA) and treated with DNase (TakaRa Bio, Shuzo, Kyoto, Japan) following the manufacturer’s instructions. The integrity of the RNA was evaluated by agarose gel electrophoresis and validated using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). The concentration of RNA was measured using a NanoDrop LITE spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA). To evaluate the polymorphisms of the EST-SSR markers developed from our transcriptome datasets and analyze the population genetic diversity of P. subaequalis, we collected samples from a total of 96 individuals of P. subaequalis from six natural populations (16 individuals per population) in China, including Shanjuan Cave (SJD), Mt. Huangbo (HBS), Mt. Tianxia (TX), Zhuxian Village (ZXC), Mt. Wangfo (WFS) and Mt. Longwang (LWS). The field studies in these locations were under the authority of the Shanjuan Cave Scenic Spot Management Administration, Huangbo Mountain Landscape Management Administration, Tianxia Mountain Landscape Management Administration, Zhuxian Village Committee, Wangfo Mountain Landscape Management Administration, and Longwang Mountain Landscape Management Administration, respectively. In addition, five related species in Hamamelidaceae (Parrotia persica, Parrotiopsis jacquemontana, Sycopsis sinensis, Distylium racemosum, and Hamamelis virginiana; S1 Table) were further selected for tests of cross-amplification of the polymorphic EST-SSR markers; for each species, five accessions were used. No specific permissions were required for these species’ collection, for they didn’t belong to the endangered or protected species. Representative voucher specimens of all plant materials were deposited in the Herbarium of Zhejiang University (HZU). Total genomic DNA was extracted from silica gel-dried leaves with Plant DNAzol Reagent (Invitrogen) following the manufacturer’s protocol. The quality of DNA was examined on 0.8% agarose gels stained with 1×GelRed (Biotium) and the concentration was checked using a NanoDrop LITE spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA).
Fig 1

The distribution range of six natural populations of Parrotia subaequalis in China.

Transcriptome sequencing, de novo assembly and annotation

Two next-generation sequencing (NGS) cDNA libraries were normalized using a NEBNext UltraTM RNA Library Prep Kit for Illumina (New England Biolabs, MA, USA) [23]. The mRNAs of each sample were purified and enriched using poly-T oligo-attached magnetic beads. The two cDNA libraries were then pooled together and sequenced in one lane of the HiSeq 2500 platform (Illumina Inc., San Diego, California, USA) at Beijing Genomics Institute (BGI, Shenzhen, China). The base calling and quality value calculations were performed using Illumina GA Pipeline version 1.6. After filtering the adaptor contamination and low-quality reads by Trimmomatic [24], the clean reads were assembled into transcripts using Trinity version 2.5 with the default parameters [25]. TGICL software [26] was then used to cluster similar transcripts, which generated non-redundant transcripts defined as unigenes for two individuals of P. subaequalis (Table 1). To annotate and identify the putative function of the unigenes, these sequences were subjected to a BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) search with a cut off E value of 10−5 against the following databases: National Center for Biotechnology Information (NCBI) non-redundant protein sequences (Nr), Swiss-Prot protein (http://www.expasy.ch/sprot/), NCBI non-redundant nucleotide sequences (Nt), Eukaryotic Orthologous Groups of proteins (KOG) database (http://www.ncbi.nlm.nih.gov/KOG), protein sequence analysis and classification (InterPro) database (http://www.ebi.ac.uk/interpro/) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database (http://www.genome.jp/kegg). In addition, gene ontology (GO) terms describing molecular functions, cellular components, and biological processes were assigned using the BLAST2GO program (B2G; http://www.blast2go.com) for further annotation of the unigenes in our study.
Table 1

Summary of the de novo assembly of two individuals of Parrotia subaequalis.

CategoryItemsNumber
P. subaequalis (TX)P. subaequalis (SJD)
Raw-readsTotal raw reads26,037,11926,666,948
Clean readsTotal clean reads25,448,38326,066,749
Q20 percentage97.94%98.21%
GC percentage46.48%46.54%
TranscriptsTotal number117,794145,619
Average length (bp)674672
N50 (bp)12681245
UnigenesTotal number69,13584,009
Average length (bp)890887
N50 (bp)15911602

Mining of EST-SSR markers and primer design

The potential polymorphic EST-SSR loci of P. subaequalis were identified from our two non-redundant unigenes datasets using CandiSSR [16]. The parameters used in CandiSSR were as follows: the flanking sequence length of 100, blast e-value cutoff of 1e-10, blast identity cutoff of 95, and blast coverage cutoff of 95. For each target EST-SSR, primers were automatically designed in the pipeline based on the Primer3 package [27] with default settings: PCR product size of 100 to 300 base pairs (bp), primer length of 18–25 bp, annealing temperature between 48 and 60°C, and CG content from 40 to 60%. The OLIGO version 6.67 (Molecular Biology Insights, Inc., Cascade, Co, USA) was then used to check for hairpin structures, potential primer dimers and the occurrence of mismatch of the above designed primer pairs.

EST-SSR polymorphism validation and transferability test

Based on the proportion of different EST-SSR repeats, we randomly chose 54 candidate polymorphic primer pairs for initial tests of amplification availability and optimal annealing temperature of each primer pair using six samples (one individual per population) of P. subaequalis. The gradient PCR amplifications were performed on a GeneAmp9700 DNA Thermal Cycler (Perkin-Elmer, Waltham, Massachusetts, USA) following the standard protocol of the AmpliTaq Gold 360 Master PCR kit (Thermofisher Biotech Company, Applied Biosystems, Foster City, California, USA) in a final volume of 15 μL, which contained 1 μL (50 ng) of genomic DNA, 7.5 μL AmpliTaq Gold 360 Master Mix, 5.5 μL of deionized water, and 0.5 μL of forward and reverse primers (10 μM). The procedure of PCR was 5 min initial denaturation at 95°C, 35 cycles of 45 s at 95°C, a temperature gradient for annealing from 48°C to 60°C for 30 s and 30 s synthesis at 72°C followed by a final 10-min extension step at 72°C and a 4°C holding temperature. The polymorphisms of the above successfully amplified loci were screened by means of fluorescence-based genotyping using 96 individuals from six natural populations of P. subaequalis. For all loci, the 5’ end of each forward primer was tagged with one of four fluorescent dyes (FAM, ROX, HEX or TAMRA), and PCR amplifications were performed on a GeneAmp9700 DNA Thermal Cycler (Perkin-Elmer, Waltham, Massachusetts, USA) using reaction volumes of 15 μL including 1μL (50 ng) of template DNA, 7.5 μL AmpliTaq Gold 360 Master Mix, 5.5 μL of deionized water, and 0.5 μL of reverse and fluorophore-labeled forward primers (10 μM). PCRs were run following an endpoint PCR procedure with initial denaturation for 5 min at 95°C followed by 35 cycles of 95°C for 45 s, 30 s annealing at the optimal primer temperature (Table 2) and 30 s synthesis at 72°C; ending with a final 10-min extension step at 72°C and a 4°C holding temperature. PCR products were analyzed on an ABI 3730XL DNA Analyzer (Applied Biosystems, Foster City, California, USA) with GeneScan LIZ 500 as an internal reference (Applied Biosystems). Electrophoresis peaks scoring and polymorphism identification were assayed by using GeneMarker v2.2.0 (SoftGenetics, State College, Pennsylvania, USA). All primer sequences obtained from this study were submitted to GenBank (Table 2).
Table 2

Characteristics of the 27 polymorphic EST-SSR markers developed for Parrotia subaequalis.

LocusPrimer sequences (5’- 3’)Repeat motifAllele size range (bp)aTa (°C)Fluorescent dyebGenBank accession no.BLAST top hit description [organism]cE-value
PasE6F: GCCAAACAACACCAACAAACC R: GTCGCCGATGGAGaGTAAGAC(AAG)5147–15353FAMMK238352
PasE20F: TGTGGTGACAAAAGACACAGT R: TGCTTGTCATACGATGATTC(AC)6184–20048FAMMK238353
PasE27F: TCTCTTCACCCATCTCCCCAT R: GTTGGGTGGGTTTCAGAGCT(AC)7172–17851HEXMK238354Zinc finger CCCH domain-containing protein 20 [Morus notabilis]2E-06
PasE83F: TGGCAGACAACGAAGGATGG R: CCATCTCGGTTGCCACTTCT(AGC)5155–16752HEXMK238355Zinc finger AN1 and C2H2 domain-containing stress-associated protein 11 [Quercus suber]1E-50
PasE108F: CTCCGTTGACCAAAACTGGAC R: CCAAAGAATCCTGCAAAGAAAGC(AT)6202–20859TAMRAMK238356
PasE156F: GCCGATCAAGATGCGGTTTC R: CGGGGCTCTTCTTCTCCATG(ATA)8193–20259TAMRAMK238357Titin like [Actinidia chinensis var. chinensis]1E-41
PasE159F: TACTGCAGAAGGCCATCAGC R: TGGTGAGATGGAGCTGCTTG(ATC)7163–17553TAMRAMK238358NAC transcription factor 25 [Populus trichocarpa]1E-11
PasE178F: CTAGTCCCAGCCAAACAGCA R: CATCGAGTGGCTCCAGAGTG(CAC)6123–12953ROXMK238359Eukaryotic translation initiation factor 5-like [Quercus suber]1E-18
PasE180F: GAAAGCCCACAGTGGTTCCT R: CGACTCACAACCTGCTCCTC(CAC)6152–16449TAMRAMK238360hypothetical protein B456_008G056900 [Gossypium raimondii]1E-21
PasE188F: GACCCTGCCCATCTTCTGTC R: GTGCAGTGTTCTGTCTCACG(CAT)6134–15556TAMRAMK238361
PasE198F: CGCCAAGGACAGTGATGAGT R: AAGTCGGGCCCGGAATATTC(CCG)6105–11153ROXMK238362Hypothetical protein T459_10871 [Capsicum annuum]1E-04
PasE205F: CTCCCGTACCTTCGATCACG R: TCTTCGGATGGAGGGTCACT(CGC)5132–13552ROXMK238363E3 ubiquitin-protein ligase CIP8 [Prunus avium]1E-26
PasE208F: CAGTGTGAGCTCAACGAGGT R: TCCTCGGCACTCCCTTAGAT(CGG)6164–17356FAMMK238364Hypothetical protein CDL12_15788 [Handroanthus impetiginosus]1E-18
PasE218F: TCGCTCTCCTCTGATCTGCT R: CAACCGCCATGCTTTCTCAC(CT)6116–12056ROXMK238365Hypothetical protein CICLE_v10030533mg [Citrus clementina]1E-08
PasE268F: TTGATTTCACTCCCGGCGAA R: ACTTTCTTGCCAGAGCGTGT(GA)7157–16356FAMMK238366
PasE290F: GCGAAAGATGAAGCGAAGAGG R: TCCACCATGAAACTGAGGCT(GAA)5151–16053TAMRAMK238367F-box protein SKIP22 [Populus trichocarpa]1E-17
PasE300F: GCTGGTGCTGAAGATGAGGA R: ACTCCTCTGCAACCTCCATTG(GAT)5187–19059HEXMK238368Polyprenyl synthetase [Parasponia andersonii]1E-58
PasE304F: TCCATGTAACAAGTAAGCGGCTA R: TCGTGTCTTCTCATTACTCCACA(GAT)7111–11456ROXMK238369
PasE348F: GCCGCCGATTCAAGAGATTC R: ACGATTCACCTCCGAACCTC(TA)6184–19049ROXMK238370SAC3 family protein A isoform X1 [Prunus yedoensis var. nudiflora]2E-06
PasE368F: AGCACAACGTACTCAACTCCT R: ACTACATACGCACCGCAGTT(TA)7155–17357FAMMK238371Hypothetical protein CCACVL1_08931 [Corchorus capsularis]1E-12
PasE380F: ACATCAATAGAGGATCGGTT R: TGTGAGCACACCAAACTATG(TA)8200–20453ROXMK238372Hypothetical protein DAPPUDRAFT_104540 [Daphnia pulex]2E-56
PasE425F: AACCCACCATCACCACCATC R: GCTCGTCTTGAAACCGCATC(TC)7151–15753ROXMK238373Protein KINESIN CHAIN-RELATED 1-like [Olea europaea var. sylvestris]1E-04
PasE447F: GGGTGAGGTGGAGTTAAGGC R: CTTCCGGTATTGCACCCACA(TCG)7150–15652FAMMK238374
PasE452F: GTGGTTGTGGAAAGAGAGGGT R: GTCTGCTGCTGATGCTGTTG(TCT)5175–17856HEXMK238375Auxin-responsive protein IAA26-like isoform X2 [Ziziphus jujuba]1E-09
PasE480F: TGTTGTTGTGCTGATGACTGT R: TCCCCTTAGGCTACCATGCT(TGA)5101–10452HEXMK238376Hypothetical protein AQUCO_01400195v1 [Aquilegia coerulea]1E-38
PasE486F: TGTCATGCATCACCCCAAGG R: GCCGCCATGTCAACAAAACA(TGT)5198–20149TAMRAMK238377Hypothetical protein GOBAR_DD26384 [Gossypium barbadense]1E-19
PasE487F: TGAATGGACAAAACCAGGCT R: AGGCCCCTTCAGTAAATCACT(TTA)5178–18157HEXMK238378Leucoanthocyanidin reductase [Vaccinium ashei]3E-32

Note: a Size range values based on 96 individuals.

b Forward 5’-label.

c The corresponding sequences of the 27 EST-SSRs were blasted against the GenBank nonredundant database using BLASTX.

─ = not found.

Note: a Size range values based on 96 individuals. b Forward 5’-label. c The corresponding sequences of the 27 EST-SSRs were blasted against the GenBank nonredundant database using BLASTX. ─ = not found. In addition, transferability tests among the other five Hamamelidaceae species, i.e., five accessions each for Parrotia persica, Parrotiopsis jacquemontana, Sycopsis sinensis, Distylium racemosum, and Hamamelis virginiana (S1 Table) were assessed using the same PCR conditions described above. PCR products were detected using 2% agarose gels and amplification was considered successful when one clear distinct band was visible in the expected size range. GeneMarker v2.2.0 (SoftGenetics, State College, Pennsylvania, USA) was used to score the electrophoresis peaks.

Evaluation of population genetic diversity and variation and test of bottleneck effect

The number of alleles (A), observed heterozygosity (Ho), expected heterozygosity (He) and polymorphism information content (PIC) were calculated for each locus and population using CERVUS v3.0 [28]. FSTAT v2.9.3 [29] were employed to estimate the following genetic diversity parameters of each locus and six natural populations of P. subaequalis: total genetic diversity for the species (HT), genetic diversity within populations (HS) and population genetic differentiation coefficients (FST and GST). The frequency of null alleles and their bias on genetic diversity were evaluated based on the expectation maximization method implemented in FreeNA [30]. Deviation from Hardy-Weinberg equilibrium (HWE) for each population and linkage disequilibrium for each primer pair were tested using a Markov chain (dememorization 1,000, 100 batches, 1,000 iterations per batch) through GENEPOP v4.2 [31]. Analysis of molecular variance (AMOVA) was performed to partition the total genetic variance among and within populations using ARLEQUIN v3.11 [32]. The program BOTTLENECK v1.2.02 [33] was used to detect the population bottleneck effect (i.e. reductions in effective population size) over past or more recent time scales under three different models of microsatellite evolution (Infinite allele model, IAM; Stepwise mutation model, SMM; Two-phased model of mutation, TPM).

Results

De novo assembly of Parrotia subaequalis transcriptome datasets and functional annotation of unigenes

Using Illumina high-throughput RNA sequencing technology, a total of 26,037,119 and 26,666,948 raw reads (of 125 bp length) were generated for P. subaequalis (TX) and P. subaequalis (SJD), respectively. After stringent quality inspection and data filtering, 25,448,383 and 26,066,749 high-quality clean reads were obtained for P. subaequalis (TX) with 97.94% Q20 bases (base quality greater than 20) and P. subaequalis (SJD) with 98.21% Q20 bases. The total length of the clean reads was 3.62 Gb for P. subaequalis (TX) and 3.91 Gb for P. subaequalis (SJD). The GC percentage of P. subaequalis (TX) and P. subaequalis (SJD) were 46.48% and 46.54% (Table 1). The two raw sequencing datasets were uploaded to the NCBI Sequence Read Archive (SRA, https://www.ncbi.nlm.nih.gov/Traces/sra; Biosample accession SAMN10502180 for P. subaequalis (TX) and SAMN10509852 for P. subaequalis (SJD)). With the help of Trinity version 2.5, these above clean reads were de novo assembled into 117,794 transcripts with an average length of 674 bp and an N50 length of 1268 bp for P. subaequalis (TX) and 145,619 transcripts with an average length of 672 bp and an N50 length of 1245 bp for P. subaequalis (SJD) (Table 1). Subsequently, using TGICL software, we gathered 69,135 unigenes with an average length of 890 bp and an N50 length of 1591 bp for P. subaequalis (TX) and 84,009 unigenes with an average length of 887 bp and an N50 length of 1602 bp for P. subaequalis (SJD) (Table 1). Among the unigenes in P. subaequalis (TX), the length of 48,285 unigenes (69.84%) ranged from 300 to 1000 bp, while the other 20,850 unigenes (30.16%) were more than 1000 bp in length. Of the unigenes in P. subaequalis (SJD), 59,643 unigenes (70.00%) had a length range of 300 to1000 bp, and 24,366 unigenes (30.00%) had a length of more than 1000 bp. The length distributions of these two unigene datasets are shown in Fig 2.
Fig 2

Length distribution of assembled unigenes of two Parrotia subaequalis transcriptomes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD).

Length distribution of assembled unigenes of two Parrotia subaequalis transcriptomes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD). Sequence similarity searching was conducted using the BLAST algorithm specifying E values of less than 10−5 for annotation of unigenes. For P. subaequalis (TX), of the 69,135 total unigenes, 42,978 (62.17%) were successfully annotated in at least one database and 11,958 (17.30%) were annotated in all databases. Specifically, among the annotated unigenes, 38,187 (55.24%) had hits in the Nr database, 34,760 (50.28%) in Nt, 32,331 (46.77%) in InterPro, 28,880 (41.77%) in KOG, 28,306 (40.94%) in KEGG, 25,661 (37.12%) in Swiss-Prot and 21,004 (30.38%) in GO. For P. subaequalis (SJD), we found that 51.78% (43,499) consensus sequences showed homology with sequences in the Nr database, 47.49% (39,895) in Nt, 43.51% (36,556) in InterPro, 39.29% (33,004) in KOG, 38.47% (32,322) in KEGG, 34.33% (28,840) in Swiss-Prot and 28.12% (23,624) in GO. Taken together, of the 84,009 total unigenes, 60.03% (50,429) were successfully annotated in at least one database and 15.49% (13,015) were annotated in all databases. We used the BLAST2GO program to annotate and analyze the function of unigenes in two individuals of P. subaequalis against the GO database. It comprehensively classifies the properties of genes into three categories: biological process, cellular components and molecular function. Based on sequence similarity, 21,004 unigenes (30.38%) in P. subaequalis (TX) and 23,624 unigenes (28.12%) in P. subaequalis (SJD) were classified into three main GO categories and 55 sub-categories (Fig 3 and S2 and S3 Tables). For the two individuals of P. subaequalis, the three largest sub-categories of biological process were “metabolic process”, “cellular process” and “single-organism process”; Of the cellular components, “cell”, “cell part” and “membrane” were the most highly represented terms. Among fourteen different molecular function categories, “catalytic activity” and “binding” were the two most matched classes (Fig 3 and S2 and S3 Tables).
Fig 3

Gene ontology (GO) classification of assembled unigenes of two Parrotia subaequalis transcriptomes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD).

Gene ontology (GO) classification of assembled unigenes of two Parrotia subaequalis transcriptomes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD). Furthermore, KEGG analysis was used to help us focus on the biological pathways and functions of the gene products of P. subaequalis. The results showed that 28,306 unigenes (40.94%) in P. subaequalis (TX) and 32,322 unigenes (38.47%) in P. subaequalis (SJD) were grouped into 21 biological pathways that fell under six larger groups (cellular processes, environmental information processing, genetic information processing, human disease, metabolism and organismal systems) (Fig 4 and S4 and S5 Tables). Among these 21 pathways, “global and overview maps”, “carbohydrate metabolism”, “translation”, “folding, sorting and degradation”, “amino acid metabolism” and “signal transduction” were the major biological pathways in the two individuals of P. subaequalis (Fig 4 and S4 and S5 Tables).
Fig 4

Classification map of KEGG metabolic pathways of two Parrotia subaequalis assemble unigenes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD).

Classification map of KEGG metabolic pathways of two Parrotia subaequalis assemble unigenes.

(A) Parrotia subaequalis (TX); (B) Parrotia subaequalis (SJD).

Frequency and distribution of candidate polymorphic EST-SSRs

Based on our two non-redundant unigenes datasets, a total of 497 candidate polymorphic EST-SSRs with an average length of 17 bp (S6 Table) were identified using CandiSSR. Of these EST-SSRs, di-nucleotide repeats (DNRs) were the most abundant repeat type (312; 62.78%), followed by tri- (TNRs; 177; 35.61%), tetra- (TTRs; 6; 1.21%) and hexa-nucleotide repeats (HNRs; 2; 0.40%) (Fig 5). Among the DNRs, AT/TA (37.18%) was quite dominant followed by AG/TC (27.24%) and CT/GA (20.83%). CTG/AAG (10.17%) was the most abundant motif for TNRs followed by AGC/GCG (9.04%) (Fig 5 and S6 Table). There were no obvious dominant motifs among the TTRs and HNRs.
Fig 5

Frequency and distribution of candidate polymorphic EST-SSRs in the two Parrotia subaequalis transcriptomes.

Polymorphisms and transferability assessment of EST-SSR markers

Of 497 candidate polymorphic EST-SSRs, primer pairs were designed for 488 EST-SSR loci (98.19%; S7 Table). The remaining loci were inappropriate for primer modeling or the DNA flanking sequences of these loci were too short to design primer pairs. From the 488 primer pairs, based on the proportion of different EST-SSR repeats, we randomly chose 54 primer pairs (S7 Table) for initial testing using six individuals (one sample per population) of P. subaequalis to ensure the availability and optimal annealing temperature of these primer pairs. After excluding those that gave poor amplification or produced a complex pattern with multiple bands in an initial screening, 44 primer pairs were selected for further tests of polymorphism and transferability. To validate the polymorphisms of these 44 EST-SSR loci, fluorescence-based genotyping was performed using 96 individuals from six natural populations of P. subaequalis. Finally, 27 polymorphic primer pairs were selected for transferability and further population genetic studies, and all of these EST-SSR sequences have been deposited in GenBank with the accession numbers from MK238352 to MK238378 (Table 2). All EST-SSR markers were successfully cross-amplified and exhibited polymorphisms in five congeneric Hamamelidaceae species except for one loci (PasE380) for Sycopsis sinensis and two loci (PasE188 and PasE380) for Hamamelis virginiana, showing a transferability rate of 92.59% (Table 3).
Table 3

Fragment sizes detected in cross-amplification tests of the 27 EST-SSR markers in the related five species of the Hamamelidaceae group.

LocusSycopsis sinensis(N = 5)Distylium racemosum(N = 5)Hamamelis virginiana(N = 5)Parrotiopsis jacquemontiana(N = 5)Parrotia persica(N = 5)
PasE6180–192153–159183–192180–195153–159
PasE20178–184186–192180–190182–192190–200
PasE27168–176172–180174–182170–178172–178
PasE83152–164155–161155–164149–164155–167
PasE108200–204198–206200–208200–206200–210
PasE156187–196190–199184–193190–202193–205
PasE159169–175166–175169–178163–175163–175
PasE178120–129123–129126–132117–129120–129
PasE180155–161164–170152–161155–164152–161
PasE188149–158158–164140–152134–152
PasE198102–105105–111102–108108–114105–111
PasE205129–135132–138132–135132–135129–135
PasE208161–170167–176164–173167–173164–173
PasE218118–124130–136118–126122–130116–120
PasE268167–173165–173163–173173–177165–173
PasE290151–163151–157151–160145–157151–160
PasE300187–190184–193187–193190–199187–190
PasE304114–120111–123111–120111–126114–120
PasE348176–184174–180180–186178–186184–190
PasE368159–165145–159151–157155–161155–161
PasE380198–204198–202200–204
PasE425159–166163–166155–161155–163151–157
PasE447150–159147–156153–159147–153150–156
PasE452172–178169–178169–181172–178178–181
PasE480101–107104–110101–110104–107101–104
PasE486201–210198–204195–201198–204201–204
PasE487178–184181–187178–181178–184178–184

Note: ─ = amplification failed.

a Voucher and locality information are provided in S1 Table.

Note: ─ = amplification failed. a Voucher and locality information are provided in S1 Table.

Characterization of EST-SSR markers and population genetic diversity and variation

As a result, these above 27 polymorphic EST-SSRs in total yielded 100 alleles with an average of 3.70 alleles and a range of 1 to 8 alleles per locus. The polymorphism information content per locus over all populations varied from 0.060 to 0.597, and the observed and expected heterozygosity ranged from 0.063 to 0.906 and from 0.061 to 0.666 (Table 4). At the population level, average estimates of genetic diversity were medium (HO = 0.378, HE = 0.394), being highest in population WFS (HO = 0.459, HE = 0.366) and lowest in population SJD (HO = 0.358, HE = 0.297) (Table 4). And a high frequency of null alleles was detected in PasE188 and PasE425 (v>5%) for the 27 EST-SSR loci. No significant linkage disequilibrium was observed for any pair of loci. Three loci deviated significantly from HWE expectations (P < 0.001) in some populations (PasE20 in HBS; PasE180 in HBS and ZXC; PasE368 in HBS, TX and LWS), which might be due to the Wahlund effect of specific populations (Table 4).
Table 4

Genetic diversity of the 27 polymorphic EST-SSR markers in six natural populations of Parrotia subaequalis.

SJD (N = 16)HBS (N = 16)TX (N = 16)ZXC (N = 16)WFS (N = 16)LWS (N = 16)Total (N = 96)
LocusAHoHePICAHoHePICAHoHePICAHoHePICAHoHePICAHoHePICAHoHePIC
PasE620.1250.1210.11020.1250.1210.11030.3750.4010.33420.7500.5080.37120.3130.4170.32320.1880.4660.34930.3130.5300.422
PasE2030.5630.4460.3783*1.0000.6270.530*60.7500.5930.54641.0000.6750.60230.5630.4580.40140.5000.4250.38380.7290.5760.553
PasE2730.8130.5340.41220.4380.3530.28330.4380.4330.35420.6880.4660.34920.2500.3150.25820.6880.5140.37430.5520.4700.368
PasE8330.3130.2800.24820.6880.4660.34930.3130.2840.25720.5630.4170.32330.6880.5870.48240.6250.7240.65350.5310.5860.515
PasE10830.4380.3730.32720.2500.2260.19520.0630.0630.05910.0000.0000.00050.2500.4330.40030.1880.1790.16650.1980.2200.208
PasE15630.8130.6590.56730.6880.4860.38620.1250.3150.25830.5630.5380.45120.7500.4840.35920.3130.4660.34940.5420.6130.554
PasE15930.6880.5420.41631.0000.6690.57530.8130.5460.41920.0630.0630.05930.5630.5220.45031.0000.6210.51640.6880.5910.500
PasE17840.7500.6390.55920.6880.4660.34920.5630.4980.36620.5000.4440.33730.3130.4010.33440.6880.7000.61640.5830.6120.531
PasE18030.8750.5890.4965*1.0000.7140.638*30.9380.6470.5513*1.0000.5460.419*30.9380.6630.56830.6880.5790.49650.9060.6660.597
PasE18820.3130.4170.32340.8130.6630.57730.3750.3250.28120.3130.2720.22940.3750.6210.54720.1250.4840.35960.3850.5040.458
PasE19810.0000.0000.00020.0630.0630.05910.0000.0000.00010.0000.0000.00020.1250.1210.11020.1880.1750.15530.0630.0610.060
PasE20510.0000.0000.00010.0000.0000.00010.0000.0000.00020.5000.5080.37110.0000.0000.00020.0630.1750.15520.0940.1620.148
PasE20830.2500.2320.21030.6880.5560.44740.8750.6790.60720.5000.3870.30520.3750.4840.35930.3750.4760.39840.5100.5780.515
PasE21820.2500.2260.19520.9380.5140.37420.6250.4440.33720.7500.4840.35930.7500.5240.42820.3750.3150.25830.6150.4340.347
PasE26830.3130.2800.24820.6250.4440.33720.3750.3870.30520.5000.5080.37120.4380.4980.36620.7500.4840.35930.5000.4860.385
PasE29020.0630.0630.05930.1250.1230.11620.0630.0630.05920.5000.3870.30530.1880.2320.21030.3130.5220.45040.2080.2550.241
PasE30010.0000.0000.00010.0000.0000.00010.0000.0000.00020.3130.2720.22910.0000.0000.00020.1880.1750.15520.0830.0800.077
PasE30420.5630.4980.36620.1250.1210.11020.1880.1750.15520.1880.1750.15520.2500.2260.19520.5630.5140.37420.3130.3440.283
PasE34820.5630.4980.36630.8750.5340.41210.0000.0000.00030.6250.4940.43130.6250.4860.41620.1880.1750.15540.4790.4260.392
PasE36820.3130.2720.2293*0.8750.5730.456*2*0.0000.4840.359*10.0000.0000.00030.2500.4540.3935*0.3130.5180.477*80.2920.4600.430
PasE38030.7500.5990.51120.4380.3530.28330.3750.4110.35430.3130.5060.39730.6250.6150.52220.5000.5080.37130.5000.5820.488
PasE42520.6880.4980.36620.3750.3150.25820.1880.3530.28320.5000.5080.37140.3130.3810.34430.3750.5240.42840.4060.5070.401
PasE44710.0000.0000.00020.1880.1750.15520.6250.4840.35920.5630.4170.32320.0630.0630.05920.1880.2720.22930.2710.2720.245
PasE45210.0000.0000.00010.0000.0000.00020.1250.3150.25810.0000.0000.00020.1250.1210.11020.1250.1210.11020.0630.0990.094
PasE48010.0000.0000.00020.0630.0630.05920.0630.0630.05920.1250.1210.11020.1880.1750.15520.0630.0630.05920.0830.0800.077
PasE48610.0000.0000.00010.0000.0000.00020.3750.3150.25810.0000.0000.00020.1250.3870.30510.0000.0000.00020.0830.1360.126
PasE48720.2250.2650.20020.3750.3150.25820.3130.4980.36620.0630.0630.05920.2500.2260.19520.3750.3150.25820.2290.3060.258
Average2.20.3580.2970.2442.30.4610.3310.2372.30.3310.3250.2632.00.4030.3240.2502.60.3590.3660.3072.50.3680.3890.3143.70.3780.3940.343

Note: A = number of alleles sampled; Ho = observed heterozygosity; He = expected heterozygosity; N = number of individuals sampled; PIC = polymorphism information content.

a Voucher and locality information are provided in S1 Table.

* Significant deviation from Hardy-Weinberg equilibrium (P < 0.001).

Note: A = number of alleles sampled; Ho = observed heterozygosity; He = expected heterozygosity; N = number of individuals sampled; PIC = polymorphism information content. a Voucher and locality information are provided in S1 Table. * Significant deviation from Hardy-Weinberg equilibrium (P < 0.001). At the level of the species, our results showed that total genetic diversity of P. subaequalis (HT) was 0.393 and genetic diversity within populations (HS) was 0.336 (S8 Table). Overall, FST and GST across the six natural populations of P. subaequalis were 0.171 and 0.147, representing a much higher genetic differentiation between populations. The AMOVA revealed that 16.74% of the total variation was attributed to differences among six populations and that 83.26% was contributed by differences within populations (P < 0.001; S9 Table), indicating the genetic variation of P. subaequalis mainly existed in individuals within populations. Besides, bottleneck analysis found only one population of P. subaequalis in Zhuxian Village of Anhui Province (ZXC) could have experienced the significant recent bottleneck under the three different models of microsatellite evolution (S10 Table).

Discussion

Characterization of the Parrotia subaequalis transcriptome using next-generation sequencing technologies

In recent years, the use of next-generation sequencing (NGS) technologies have become increasingly prevalent because of its high-throughput genomic and transcriptomic data output for model or non-model organisms at reasonable prices and schedules [34-36]. In the present study, we characterized the transcriptomes of two individuals of P. subaequalis using RNA-sequencing technology on the Illumina HiSeq 2500 platform for the first time. Raw data of these two transcriptomes are currently available to the public. Approximately five Gb of data length for each individual of P. subaequalis were generated and assembled into unigenes. As a result, the mean length of the unigenes of P. subaequalis (TX) was 890 bp and 887 bp for P. subaequalis (SJD), suggesting that the large number of reads with paired-end information and high sequencing depth produced much longer unigenes than reported in previous transcriptome studies of Neolitsea sericea (mean length 733 bp) [37], Sesamum indicum (mean length 629 bp) [38] and Pennisetum purpureum (mean length 586 bp) [39]. In terms of the annotation of unigenes, the results showed a large part of unigenes (62.17% in P. subaequalis (TX) and 55.24% in P. subaequalis (SJD)) had homologs in public databases like Nr, Nt, InterPro, KOG, KEGG, Swiss-Port and GO. These annotated unigenes could provide valuable information for future studies on P. subaequalis. A minority of the unigenes (37.83% in P. subaequalis (TX) and 44.76% in P. subaequalis (SJD)) failed to match any proteins in the above public databases, which may be attributable to the large amount of short-length (< 500 nt) unigenes (Fig 2) or the limited publicly available genomic and transcriptomic information for P. subaequalis. Further explanations for the low hit possibility of short sequences were the lack of a characterized protein domain or the short query sequences [38], resulting in false-negative results. GO is a worldwide classification database for gene function; in our study, “metabolic process”, “cellular process”, “catalytic activity” and “binding” were the four most matched categories in two individuals of P. subaequalis (Fig 3 and S2 and S3 Tables). Additionally, KEGG analysis of the annotated unigenes showed that “global and overview maps”, “carbohydrate metabolism”, “translation”, “folding, sorting and degradation”, “amino acid metabolism” and “signal transduction” were the primary biological pathways in the two individuals of P. subaequalis (Fig 4 and S4 and S5 Tables). Overall, these findings here will greatly enrich the transcriptomic resources for further research on gene discovery, molecular mechanisms and biological pathways of P. subaequalis.

Mining and utilization of polymorphic EST-SSR markers in conservation genetics

Prior to our study, molecular marker studies of P. subaequalis were conducted with ISSR, chloroplast SSR and nuclear SSR [6, 22], while no EST-SSR markers had been reported. EST-SSR markers are powerful molecular markers for analyzing population genetic diversity, cross transferability rate, molecular breeding and functions [40, 41]. With the wide application of the NGS technologies, the increasing number of transcriptome sequences have provided abundant resources for EST-SSR applications for research and genetic improvements. In addition, a number of bioinformatics software have been developed for SSR mining, such as MISA [42] and SSR Primer [43]. However, to date, these tools have not integrated a computational solution for systematic assessment of SSR polymorphic status, resulting in poor efficiency of polymorphic SSR identification and time-consuming experiments. The newly developed pipeline, CandiSSR, could help users detect candidate polymorphic SSRs with high efficiency [16]. Therefore, in the present study, using CandiSSR, we successfully and efficiently mined 497 candidate polymorphic EST-SSR markers from the two comparative transcriptomic datasets. Then, 54 randomly chosen primer pairs were used for validation of the polymorphism, and 27 primer pairs (50%) were proven to be polymorphic among 96 individuals from the six natural populations. Such high success ratios indicated that this kind of molecular development method with the aid of CandiSSR was highly efficient and considerably successful. Among 497 candidate polymorphic EST-SSR markers, in agreement with previous reports from many other dicotyledonous plant taxa such as Arabidopsis, peanut, cabbage, pea, grape, soybean, sunflower [44], dinucleotide motifs (DNRs) were found to be the most frequent motif type (62.78%) in P. subaequalis, followed by TNRs (35.61%), TTRs (1.21%) and HNRs (0.40%) (Fig 5). Among the DNRs, AT/TA (37.18%) was quite dominant, followed by AG/TC (27.24%) and CT/GA (20.83%). CTG/AAG (10.17%) was the most abundant motif type for TNRs, followed by AGC/GCG (9.04%). Our results were consistent with previous reports on tree peony [45], radish [46] and sweet potato [47]. In our study, the GC/CG repeat motif was found in only 0.01% (Fig 5) of the dinucleotide repeats. As is well-known, a common feature in most dicotyledonous plants is the rarity of GC/CG in dinucleotide motifs [37, 44, 48], which was has been explained by the low GC content of dicotyledons [49]. Furthermore, using the 27 polymorphic EST-SSR markers, 100 alleles were found across the 96 individuals of P. subaequalis from six natural populations. The range of the number of alleles per locus was from 1 to 8 with a mean of 3.70 alleles, which was lower than the range from 2 to 14 and mean of 5.33 alleles in the gSSRs of P. subaequalis [6]. The average gene diversity (He) and PIC value of the 27 polymorphic EST-SSR markers were 0.394 and 0.343, representing a moderate level of gene polymorphism compared to the gSSRs (mean: He = 0.558; PIC = 0.515) reported in Zhang et al. [6]. We observed a considerably higher level of transferability (92.59%) in five congeneric Hamamelidaceae species than the gSSR (66.67%) reported by Zhang et al. [6]. The much higher level of cross-transferability and the slightly lower degree of gene polymorphism of EST-SSRs than of gSSRs reflected the highly conserved character of the flanking sequences of EST-SSRs and the low mutation frequency of EST sequences. Additionally, our EST-SSR survey of six natural population of P. subaequalis revealed a relatively high level of genetic diversity (HT = 0.393; HS = 0.336; S8 Table) and a little higher genetic differentiation level (FST = 0.171; S8 Table) at the level of species, suggesting P. subaequalis maintained high levels of species diversity in the long-term evolutionary history despite its restricted and highly disjunct distribution range. The observation of genetic diversity and bottleneck test among six wild P. subaequalis populations indicated that WFS was the most variable population, while SJD and ZXC was the two more endangered populations that we should pay more attention to their protection and preservation. The Wangfo Mountain (WFS) was considered as one of the biodiversity refugia since the Tertiary in China [50, 51] and few human activities were found there, thus contributing the highest genetic diversity in WFS population in some degree. While based on our field observations, SJD population was located in the scenic area of Shanjuan Cave and the population ZXC lied in a village, the human interference including farming and foresting may result in the lower level of genetic diversity and recent bottleneck. In summary, the polymorphic EST-SSR markers developed here will provide a powerful tool for further studies on conservation genetics and molecular breeding of P. subaequalis and other Hamamelidaceae species.

Conclusions

This study is the first to assemble and characterize the transcriptomes of two individuals of P. subaequalis using RNA-sequencing technologies on the Illumina HiSeq 2500 platform. This large set of annotated unigenes and pathways will remarkably enlarge the transcriptomic resources and putative gene functions of P. subaequalis. In addition, we successfully and efficiently developed the first set of 27 novel polymorphic EST-SSR markers for P. subaequalis from the two transcriptomic datasets. These polymorphic EST-SSR markers displayed a relatively high genetic diversity in P. subaequalis and high transferability in five related Hamamelidaceae species, suggesting that they are useful and powerful molecular tools to facilitate future studies on population genetics, molecular breeding and germplasm identification of P. subaequalis and other Hamamelidaceae species. Taken together, these results produced by our study indicated that high-throughput next-generation sequencing technology is a cost-effective and convenient approach to mining abundant novel molecular resources for non-model organisms.

Locality and voucher information for populations of Parrotia subaequalis and the Hamamelidaceae species used in this study.

(DOCX) Click here for additional data file.

Go classification of Parrotia subaequalis (TX) unigenes.

(XLS) Click here for additional data file.

Go classification of Parrotia subaequalis (SJD) unigenes.

(XLS) Click here for additional data file.

KEGG classification for unigenes of Parrotia subaequalis (TX).

(XLS) Click here for additional data file.

KEGG classification for unigenes of Parrotia subaequalis (SJD).

(XLS) Click here for additional data file.

The candidate polymorphic EST-SSRs of two individuals of Parrotia subaequalis.

(XLS) Click here for additional data file.

The primer pairs of candidate polymorphic EST-SSRs of two individuals of Parrotia subaequalis and 54 primer pairs (highlighted in the table) selected for the polymorphism validation and transferability tests.

(XLS) Click here for additional data file.

Genetic diversity of the 27 polymorphic EST-SSR loci for Parrotia subaequalis.

(DOCX) Click here for additional data file.

Analysis of molecular variance (AMOVA) within/among six P. subaequalis populations using EST-SSR markers.

(DOCX) Click here for additional data file.

Bottleneck detection for six natural populations of P. subaequalis.

(DOCX) Click here for additional data file.
  35 in total

1.  Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes.

Authors:  Michele Morgante; Michael Hanafey; Wayne Powell
Journal:  Nat Genet       Date:  2002-01-22       Impact factor: 38.330

Review 2.  Microsatellites within genes: structure, function, and evolution.

Authors:  You-Chun Li; Abraham B Korol; Tzion Fahima; Eviatar Nevo
Journal:  Mol Biol Evol       Date:  2004-02-12       Impact factor: 16.240

3.  Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species.

Authors:  Siva P Kumpatla; Snehasis Mukhopadhyay
Journal:  Genome       Date:  2005-12       Impact factor: 2.166

4.  Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment.

Authors:  Steven T Kalinowski; Mark L Taper; Tristan C Marshall
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

5.  An approach to transcriptome analysis of non-model organisms using short-read sequences.

Authors:  Lesley J Collins; Patrick J Biggs; Claudia Voelckel; Simon Joly
Journal:  Genome Inform       Date:  2008

Review 6.  Next-generation DNA sequencing methods.

Authors:  Elaine R Mardis
Journal:  Annu Rev Genomics Hum Genet       Date:  2008       Impact factor: 8.929

7.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Authors:  T Thiel; W Michalek; R K Varshney; A Graner
Journal:  Theor Appl Genet       Date:  2002-09-14       Impact factor: 5.699

8.  Transcriptome sequencing of field pea and faba bean for discovery and validation of SSR genetic markers.

Authors:  Sukhjiwan Kaur; Luke W Pembleton; Noel O I Cogan; Keith W Savin; Tony Leonforte; Jeffrey Paull; Michael Materne; John W Forster
Journal:  BMC Genomics       Date:  2012-03-20       Impact factor: 3.969

9.  Comparative Transcriptomics of Strawberries (Fragaria spp.) Provides Insights into Evolutionary Patterns.

Authors:  Qin Qiao; Li Xue; Qia Wang; Hang Sun; Yang Zhong; Jinling Huang; Jiajun Lei; Ticao Zhang
Journal:  Front Plant Sci       Date:  2016-12-15       Impact factor: 5.753

10.  CandiSSR: An Efficient Pipeline used for Identifying Candidate Polymorphic SSRs Based on Multiple Assembled Sequences.

Authors:  En-Hua Xia; Qiu-Yang Yao; Hai-Bin Zhang; Jian-Jun Jiang; Li-Ping Zhang; Li-Zhi Gao
Journal:  Front Plant Sci       Date:  2016-01-07       Impact factor: 5.753

View more
  4 in total

1.  Development of novel EST microsatellite markers for genetic diversity analysis and correlation analysis of velvet antler growth characteristics in Sika deer.

Authors:  Boyin Jia; Guiwu Wang; Junjun Zheng; Wanyun Yang; Shuzhuo Chang; Jiali Zhang; Yuan Liu; Qining Li; Chenxia Ge; Guang Chen; Dongdong Liu; Fuhe Yang
Journal:  Hereditas       Date:  2020-06-26       Impact factor: 3.271

Review 2.  An overview of remote monitoring methods in biodiversity conservation.

Authors:  Rout George Kerry; Francis Jesmar Perez Montalbo; Rajeswari Das; Sushmita Patra; Gyana Prakash Mahapatra; Ganesh Kumar Maurya; Vinayak Nayak; Atala Bihari Jena; Kingsley Eghonghon Ukhurebor; Ram Chandra Jena; Sushanto Gouda; Sanatan Majhi; Jyoti Ranjan Rout
Journal:  Environ Sci Pollut Res Int       Date:  2022-10-05       Impact factor: 5.190

3.  Identification of Genic SSRs Provide a Perspective for Studying Environmental Adaptation in the Endemic Shrub Tetraena mongolica.

Authors:  Zhenhua Dang; Lei Huang; Yuanyuan Jia; Peter J Lockhart; Yang Fong; Yunyun Tian
Journal:  Genes (Basel)       Date:  2020-03-18       Impact factor: 4.096

4.  Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora.

Authors:  Yingling Wan; Min Zhang; Aiying Hong; Yixuan Zhang; Yan Liu
Journal:  Genes (Basel)       Date:  2020-02-19       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.