Literature DB >> 29188146

Characterization and development of EST-derived SSR markers in Sinowilsonia henryi (Hamamelidaceae).

Zuo-Zhou Li1, Hua Tian1, Jin-Ju Zhang1,2.   

Abstract

PREMISE OF THE STUDY: Polymorphic microsatellite markers were developed to reveal the genetic diversity of extant populations and the mating system of Sinowilsonia henryi (Hamamelidaceae). METHODS AND
RESULTS: In this study, nuclear simple sequence repeat (SSR) markers were developed using the Illumina high-throughput sequencing technique (RNA-Seq). The de novo-assembled transcriptome generated a total of 64,694 unique sequences with an average length of 601 bp. A total of 2941 microsatellite loci were detected. Of the 121 tested loci, 13 loci were polymorphic and eight were monomorphic among 72 individuals representing three natural populations of the species. The number of alleles per locus ranged from one to four, and the observed and expected heterozygosity at population level were 0.00-1.00 and 0.10-0.66, respectively.
CONCLUSIONS: The developed expressed sequence tag (EST)-SSRs will be useful for studying genetic diversity of S. henryi as well as assessing the mating system among Sinowilsonia species.

Entities:  

Keywords:  Hamamelidaceae; RNA-Seq; Sinowilsonia henryi; microsatellite

Year:  2017        PMID: 29188146      PMCID: PMC5703181          DOI: 10.3732/apps.1700080

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


The tree genus Sinowilsonia Hemsl. is a member of the Hamamelidaceae family and comprises only one species, S. henryi Hemsl. This species is narrowly distributed in the mountains of central China at an elevation of 600–1400 m (Zhang et al., 2003). Currently, the natural habitats of this species are severely deteriorated and fragmented, with population sizes ranging from as few as five individuals to approximately 50 flowering plants (Zhou et al., 2014). Thus, S. henryi has been listed as an endangered plant species in the China Plant Red Data Book (Fu and Jin, 1992). Knowledge of genetic diversity and genetic structure of extant populations is essential to the formulation of effective conservation and management strategies for threatened species (Frankham et al., 2002). Due to their codominance, hypervariability, and reliable scorability, microsatellite markers have been widely used in population genetic studies (Selkoe and Toonen, 2006). However, microsatellite markers for S. henryi are currently not available. High-throughput RNA sequencing (RNA-Seq) is one of the most useful next-generation sequencing techniques for identifying microsatellites. In the current study, we developed and characterized 21 expressed sequence tag–simple sequence repeat (EST-SSR) markers for S. henryi using RNA-Seq.

METHODS AND RESULTS

Total RNAs were isolated from young leaves using a cetyltrimethylammonium bromide (CTAB) procedure (Chang et al., 1993). The poly(A)+ RNA (mRNA) was purified with the RNA Clean-up Kit (Invitrogen, Carlsbad, California, USA) according to the manufacturer’s instructions. The purified RNA was subsequently fragmented into small pieces (200 bp) by the fragmentation buffer. Then, the cleaved RNA fragments were used for first-strand cDNA synthesis using reverse transcriptase (Invitrogen) with random hexamer primers. Subsequently, second-strand cDNA was synthesized using RNase H and DNA polymerase I (Tiangen, Beijing, China). Illumina paired-end sequencing adapters were then ligated to the ends of the 3′-adenylated cDNA fragments. The cDNA library was sequenced by Shanghai Haiyu Biotechnology Co. Ltd. on the Illumina HiSeq 2000 instrument (Illumina, San Diego, California, USA). Before assembly, raw reads were filtered to remove those containing adapter or low-quality reads (more than 20% of nucleotides with Q-value ≤ 10) and reads containing poly N (>10% ambiguous base calls). Transcriptome assembly was performed using the Trinity package (version 2013-02-25) with the default parameters (Grabherr et al., 2011). A total of 28.7 million 300-bp, clean, paired-end reads were obtained. All clean reads are available from the National Center for Biotechnology Information (NCBI) Short Read Archive (SRA) database (Bioproject accession no. PRJNA394173). De novo assembly of clean reads resulted in 64,694 unique sequences with an average length of 601 bp and an N50 length of 999 bp. The MIcroSAtellite identification tool (MISA; Thiel et al., 2003) was used to screen for the presence of microsatellites. The parameters used to identify microsatellites were seven repeats for di-, five for tri- and tetra-, four for penta-, and three for hexanucleotide repeats. Subsequently, SSR primers were designed with minimum GC content of 40% and an expected product size ranging from 100 to 280 bp using Primer3 (Rozen and Skaletsky, 1999). A total of 8892 SSRs containing repeats from di- to pentanucleotides were identified from 64,694 unique sequences. Dinucleotides were the most abundant repeat type (5232), followed by trinucleotides (2198), hexanucleotides (1035), pentanucleotides (259), and tetranucleotides (168). The dinucleotide repeat (AG/CT)n (3646) was followed by (AT/AT)n (1192), (AC/GT)n (384), and (CG/CG)n (11). Among the trinucleotide repeat motifs, the most frequent SSR motif was AAG/CTT (667), followed by AAT/ATT (314), AGC/CTG (301), and ATC/ATG (252) (Table 1). Of the 8892 identified SSRs, 2941 (33%) were suitable for designing locus-specific primers (Appendix S1).
Table 1.

Frequency of repeat motifs in nonredundant Sinowilsonia henryi ESTs.

No. of repeats
SSR motifs3456789101112>12Total
AC/GT1068671645430384
AG/CT7178021268742111603646
AT/TA28327532524859111192
CG/GC541100011
AAC/GTT59171360000095
AAG/CTT300200164300000667
AAT/ATT14710361300000314
ACC/GGT1114733300000194
ACG/CGT35141040010063
ACT/AGT199210000031
AGC/CTG1399464400000301
AGG/CCT814432700000164
ATC/ATG1365754500000252
CCG/CGG73319400000117
Tetra-146220000000168
Penta-252700000000259
Hexa-8441910000000001035

Note: — = number of repeats not calculated.

Frequency of repeat motifs in nonredundant Sinowilsonia henryi ESTs. Note: — = number of repeats not calculated. SSR loci with a minimum of 10 repeats for dinucleotides and seven for trinucleotides were selected for amplification. A total of 121 primer pairs were selected and used for further characterization. Eight individuals of S. henryi from Wuhan Botanical Garden, China, were collected to initially assess microsatellite polymorphism. Genomic DNA was isolated using the CTAB method (Doyle and Doyle, 1987). PCR reactions were performed in a 10-μL reaction mixture (final volume) containing approximately 50 ng of genomic DNA, 0.2 μM each of forward and reverse primer, 10 mM Tris-HCl (pH 8.4), 50 mM (NH4)2SO4, 1.5 mM MgCl2, 0.2 mM dNTPs, and 1 unit Taq polymerase (Fermentas, Vilnius, Lithuania). The PCR cycling program included 5 min of initial denaturation at 94°C; followed by 35 cycles of 50 s at 94°C, 50 s at 56–60°C depending on the primer pair (Table 2), and 1 min at 72°C; followed by a final 10-min extension step at 72°C. The PCR products were separated on a 6% polyacrylamide denaturing gel of high resolution with silver stain. A 25-bp marker ladder (Promega Corporation, Madison, Wisconsin, USA) was used to identify the alleles.
Table 2.

Characterization of 21 EST-SSR primers developed in Sinowilsonia henryi.

LocusPrimer sequences (5′–3′)Repeat motifAllele size range (bp)Ta (°C)BLASTX top hit descriptionE-valueGenBank accession no.
SH01F: TTTGACCCCGAAACAACAGC(CAT)7 209–213 58 MF503975
R: TGATACCGCTCAAGTCTCCC
SH02F: CATCACCTTCTGCTGGAACG(TC)10 223–231 58Hypothetical protein EUGRSUZ_G03166 3E-40 MF503976
R: ACCCCGGAGCATATATCAGC
SH03F: CCACCTCGTTCTTCTCGTCT(ATC)7 210–213 58Phospho-N-acetylmuramoyl-pentapeptide-transferase 2E-176 MF503977
R: CCTGACGGTAAAGAGAAACGC
SH04F: GAGTCGGAGTCCCATTTTGC(TTC)7 258–275 60NUDIX domain-containing protein 3E-165 MF503978
R: GTCTTCGAACATGAGGCGTC
SH05F: TATGCTAGTGGTGGTGCTGT(GCA)7 195–202 58 MF503979
R: TAGTCGTCGGGCTCATCATC
SH06F: ATTGAAGGCGTTTGGATCCG(GCC)7 148–158 58 MF503980
R: TGGCTTCCCTCTCGTCTTTT
SH07F: TGACATGGAGGGTAGTGTGG(ATG)7 183–186 58 MF503981
R: TCACCTCTTCCATTGCCTTCT
SH08F: GAAGCTGGAGTTTGTGACGG(GTT)8 214–225 58 MF503982
R: CTTCGGGGCCTATAGTTGGT
SH09F: GGGGTGTTGTCCATTGATACAG(ACC)7 232–240 58CBL-interacting protein kinase 07 0 MF503983
R: CCAGCAGTTGAAGTTCAGGAG
SH10F: AACCAAATCAGGCTCGCTTT(AGC)7 225–239 59Pre-mRNA-splicing factor SYF1 0 MF503984
R: CCGCTGCCAGATGAAATTGA
SH11F: GGATTGCCATCATGCTGTTG(TC)10 209–215 58Transmembrane protein 230 1E-61 MF503985
R: AGCAAATTTGGCCACTGGAG
SH12F: GGCATCCACAGTGTTGCTAG(TC)10 154–156 58 MF503986
R: ACTTCTGGGGCCATTTCCTT
SH13F: AAGGACGAGGATGAATGGGG(GCG)7 265–268 56 MF503987
R: CCCAATTCCCCTCGAGAAGT
SH14F: TCACCATCATCACCACCTTCA(TTG)7 175 56ABC transporter G family member 5-like 0 MF510515
R: AGGCTCATGGGTTTACAGCT
SH15F: AGCAAGAGGACCAACACTCT(AAG)7 200 58 MF510516
R: TGCTGCTTTTACTTCCCCTC
SH16F: CCAAGAGACCCCACCAACTA(GCT)7 256 56 MF510517
R: AGACGTTGCCTCAGTCTTGT
SH17F: TGGCTTCCAACCTCCTCAAA(ACA)8 250 56 MF510518
R: GGTGGGGTGGAAGAAGAGAG
SH18F: ACCCGCGATCATACTGACAA(CTG)7 165 56DExH-box ATP-dependent RNA helicase 3E-35 MF510519
R: GGTCCGTCATCACTTCTCCT
SH19F: GAGCAAACCCACAATCCAGA(GAG)7 200 58 MF510520
R: GCTGCCATGGTGAAGAAACA
SH20F: GGGTGGGGAGAATAGGGAAG(CT)10 200 56NADP-dependent malic enzyme 0 MF510521
R: AGAGGGAGAGAGGGTCACAA
SH21F: CCATATCCGCCGCCAATAAG(GGT)7 275 58Receptor-like protein 1, putative isoform 2 2E-35 MF510522
R: GCTCAATTTGCTACCTTTGAAG

Note: Ta = annealing temperature.

Characterization of 21 EST-SSR primers developed in Sinowilsonia henryi. Note: Ta = annealing temperature. Of the 121 primer pairs tested, 21 successfully amplified the target fragments (Table 1); of these, 13 loci were polymorphic (SH01–SH13), while eight were detected as monomorphic (SH14–SH21; Table 2). The level of genetic variability was estimated by genotyping 72 individuals of S. henryi from three wild populations (Appendix 1). For each locus, the number of alleles (A), observed heterozygosity (Ho), and expected heterozygosity (He) were estimated using the program GENEPOP version 3.4 (Raymond and Rousset, 1995). Null alleles were detected at three loci (SH03, SH04, and SH07) using the program CERVUS 2.0 (Marshall et al., 1998). In the SNJ population, A ranged from one to three, He ranged from 0 to 0.60, and Ho ranged from 0 to 1.00. In the FS population, A ranged from one to three, He ranged from 0 to 0.66, and Ho ranged from 0 to 0.80. In the WD population, A ranged from one to four, He ranged from 0 to 0.63, and Ho ranged from 0 to 0.63. Three loci deviated from Hardy–Weinberg equilibrium after correction for multiple tests (Table 3). The observed departures from Hardy–Weinberg equilibrium may be due to null alleles. Significant linkage disequilibrium was observed in 10 pairs of loci before correction for multiple tests (P < 0.05). However, no loci were observed to be in linkage disequilibrium after correction for multiple tests (P < 0.0006). The sequences containing microsatellites were BLASTed against the NCBI nonredundant protein database using BLASTX with a threshold of E-value < 2.00E-5. Ten loci showed significant similarities to known proteins in the NCBI nonredundant protein database (Table 2).
Table 3.

Genetic diversity of 13 SSR loci in three populations of Sinowilsonia henryi.

LocusSNJ (N = 15)FS (N = 25)WD (N = 32)
AHeHoAHeHoAHeHo
SH0120.180.1320.300.2830.470.41
SH0230.560.8830.520.2430.590.63
SH0320.500.5010.000.00***10.000.00***
SH0420.310.3830.630.3230.480.38
SH0530.601.0030.250.2820.420.41
SH0630.510.7530.370.3230.410.38
SH0710.000.0020.420.2010.000.00***
SH0820.220.2530.620.7220.480.38
SH0910.000.00***20.080.0810.000.00***
SH1020.380.2530.630.5630.630.59
SH1130.530.7530.660.8040.360.38
SH1220.500.7520.340.3620.490.47
SH1320.220.2520.390.3620.440.34
Average2.150.350.452.460.400.352.310.370.44

Note: A = number of alleles; He = expected heterozygosity; Ho = observed heterozygosity; N = number of individuals sampled.

Locality and voucher information are provided in Appendix 1.

Denotes significant departure from Hardy–Weinberg equilibrium after Bonferroni correction (P < 0.0006).

Genetic diversity of 13 SSR loci in three populations of Sinowilsonia henryi. Note: A = number of alleles; He = expected heterozygosity; Ho = observed heterozygosity; N = number of individuals sampled. Locality and voucher information are provided in Appendix 1. Denotes significant departure from Hardy–Weinberg equilibrium after Bonferroni correction (P < 0.0006).

CONCLUSIONS

In the current study, a total of 2941 primer pairs were successfully designed based on transcriptome sequences. In total, 121 PCR primers of SSR loci were used for validation of amplification and polymorphism; of these, 13 revealed microsatellite polymorphism. To the best of our knowledge, this is the first study to develop microsatellites for S. henryi. These EST-derived SSRs could provide valuable tools for studying genetic diversity and assessing the mating system among Sinowilsonia species. In addition, because EST-derived SSRs may be associated with functional genes, the remaining untested 2820 SSRs and 21 loci developed in the current study may be useful for examining adaptive variation using genome scan methods. Click here for additional data file.
Appendix 1.

List of vouchers of Sinowilsonia henryi used in this study.

Population codeNLocationVoucher no.aGeographic coordinatesAltitude (m)
SNJ15Shennongjia Mountain, Hubei ProvinceQ. G. Ye 110231°30′09″N, 110°24′03″E1405
FS25Yangchashan, Fang County, Hubei ProvinceQ. G. Ye 110831°53′01″N, 110°27′53″E1201
WD32Wudang Mountain, Hubei ProvinceQ. G. Ye 110932°40′59″N, 111°01′01″E1035

Note: N = number of individuals sampled.

All vouchers are deposited at the Wuhan Botanical Garden Herbarium (HIB), Wuhan, Hubei Province, China.

  5 in total

1.  Primer3 on the WWW for general users and for biologist programmers.

Authors:  S Rozen; H Skaletsky
Journal:  Methods Mol Biol       Date:  2000

2.  Microsatellites for ecologists: a practical guide to using and evaluating microsatellite markers.

Authors:  Kimberly A Selkoe; Robert J Toonen
Journal:  Ecol Lett       Date:  2006-05       Impact factor: 9.492

3.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Authors:  T Thiel; W Michalek; R K Varshney; A Graner
Journal:  Theor Appl Genet       Date:  2002-09-14       Impact factor: 5.699

4.  Statistical confidence for likelihood-based paternity inference in natural populations.

Authors:  T C Marshall; J Slate; L E Kruuk; J M Pemberton
Journal:  Mol Ecol       Date:  1998-05       Impact factor: 6.185

5.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

  5 in total
  2 in total

1.  Identification and development of microsatellite (SSRs) makers of Exbucklandia (HAMAMELIDACEAE) by high-throughput sequencing.

Authors:  Cuiying Huang; Qianyi Yin; Dipak Khadka; Kaikai Meng; Qiang Fan; Sufang Chen; Wenbo Liao
Journal:  Mol Biol Rep       Date:  2019-04-15       Impact factor: 2.316

2.  Identification and development of microsatellite markers in Hamamelis mollis (Hamamelidaceae).

Authors:  Qianyi Yin; Cuiying Huang; Yanshuang Huang; Sufang Chen; Huagu Ye; Qiang Fan; Wenbo Liao
Journal:  Appl Plant Sci       Date:  2018-10-23       Impact factor: 1.936

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.