Literature DB >> 30828505

Development and characterization of genomic SSR markers for Tamarix chinensis (Tamaricaceae).

Ruhua Zhang1, Qiang Wen2, Li-An Xu3.   

Abstract

PREMISE OF THE STUDY: We developed a set of microsatellite markers to study the population genetic structure, mating system, and interspecific hybridization of Tamarix chinensis (Tamaricaceae), an alkali- and salt-tolerant shrub endemic to China, Korea, and Japan. METHODS AND
RESULTS: Using Illumina sequencing, we developed 10 polymorphic and 11 monomorphic microsatellite primers. High levels of polymorphism were detected in four T. chinensis populations. Allele numbers ranged from two to 11, and the levels of observed and expected heterozygosity ranged from 0.182 to 0.846 and from 0.165 to 0.794, respectively. The polymorphism information content values ranged from 0.201 to 0.803. Cross-species amplification showed two to 15 alleles per locus in 24 individuals from one natural population of the congener T. ramosissima, and the levels of observed and expected heterozygosity ranged from 0.042 to 0.864 and from 0.041 to 0.892, respectively.
CONCLUSIONS: These markers should be useful for exploring the population genetic structure, mating system, and gene flow of T. chinensis.

Entities:  

Keywords:  Illumina sequencing; Tamaricaceae; Tamarix chinensis; cross‐species amplification; simple sequence repeat (SSR) markers

Year:  2019        PMID: 30828505      PMCID: PMC6384293          DOI: 10.1002/aps3.1219

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Belonging to the family Tamaricaceae, Tamarix chinensis Lour. is an alkali‐ and salt‐tolerant, loosely branched deciduous shrub or 3–6‐m‐tall tree. It is naturally distributed from the temperate to subtropical zones in China, Korea, and Japan, inhabiting riverbeds, sandy floodplains, deserts, and coastal tidal flats (Baum, 1978) and has been naturalized in much of the western United States since its introduction in the mid‐nineteenth century (Whitcraft et al., 2007). The level of genetic diversity and mating system of the species are not well known. A few studies have focused on the genetic diversity and population genetic structure of T. chinensis using random amplified polymorphic DNA (RAPD) (Zhao et al., 2008) and inter‐simple sequence repeat (ISSR) markers (Jiang et al., 2011). Codominant simple sequence repeat (SSR) markers are powerful tools for population variation analysis and for the estimation of gene flow through genotypic exclusion in a number of tree species (Vahdati et al., 2015). To date, expressed sequence tag (EST)–SSR markers have been developed through mining of EST databases of Tamarix L. spp., and the transferability across species including T. chinensis, T. gallica L., T. aphylla (L.) H. Karst, T. jordanis Boiss., T. nilotica (Ehrenb.) Bunge, and T. tetragyna Ehrenb. was tested (Terzoli et al., 2013). A set of 10 genomic microsatellites of T. ramosissima Ledeb. was isolated using the biotinylated‐oligonucleotide capture method (Gaskin et al., 2006), two of which showed polymorphism in Chinese T. chinensis material (Zhang, 2011). Although microsatellite primers have been developed for some Tamarix spp., none have yet been identified specifically for T. chinensis. We obtained a large number of DNA sequences based on high‐throughput sequencing and characterized SSRs distributed in the genome of T. chinensis. A set of 10 polymorphic SSR molecular markers of T. chinensis were developed and the transferability was tested in its congener T. ramosissima.

METHODS AND RESULTS

Fresh leaves of one T. chinensis individual from a natural population of Binzhou City, Shandong Province, China (Appendix 1), were collected for DNA extraction and Illumina sequencing. Genomic DNA was extracted using a DNeasy kit (QIAGEN, Venlo, The Netherlands). Genomic libraries were constructed using the method for RAD sequencing as described by Baird et al. (2008). A Qubit 2.0 kit (Thermo Fisher Scientific, Waltham, Massachusetts, USA) was used to evaluate the quality of the libraries, and the Agilent 2100 bioanalyzer (Agilent Technologies, Santa Clara, California, USA) was used to check the sizes of the libraries after they were diluted to 1 ng/μL. Sequencing was conducted on the HiSeq 2500 high‐throughput sequencing system (Illumina, San Diego, California, USA) to generate 125‐bp paired‐end reads by a commercial company (Novogene Co. Ltd., Beijing, China). Raw data were cleaned up by trimming the adapters and low‐quality reads with a custom script by the company who did the sequencing, and by filtering reads with read depth of <10 and >400 using CD‐HIT‐EST (Li and Godzik, 2006) to avoid false positives. Assembly of paired reads was performed using Velvet version 1.1.06 (Zerbino and Birney, 2008). Contigs with a length of less than 125 bp were deleted. Raw sequences were deposited to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA accession PRJNA492209). A total of 818,142 contigs were generated with an average length of 323 bp and an N50 length of 474 bp. The MISA perl script (Thiel et al., 2003) was used to search for SSRs with mono‐, di‐, tri‐, tetra‐, penta‐, and hexanucleotide motifs with a minimum repeat number of 14, eight, six, five, four, and four, respectively. Compound microsatellites were defined as having two or more motifs separated by an interval of ≤100 bp. A total of 31,140 SSRs were identified using the MISA perl script for 28,454 contigs. There were 2567 contigs that contained more than one SSR, and there were 2027 compound SSRs. Di‐ and trinucleotide motifs were the most abundant, comprising 41.53% and 41.24%, respectively. SSR primers were designed with Primer3 version 1.1.4 (Rozen and Skaletsky, 1999) with the following qualifications: primer length range from 18 to 22 bp and annealing temperature 55–60°C. Twenty‐four primers were synthesized by a commercial company (GenScript, Nanjing, China). In total, 58 individuals of T. chinensis from four natural populations and 24 individuals of T. ramosissima from one population were sampled (Appendix 1). We collected no more than 20 individuals for each T. chinensis population because the sampled T. chinensis individuals were scattered along riverbanks, often 50 m apart, and some individuals were damaged. Genomic DNA was extracted with the DNeasy kit (QIAGEN) and tested on agarose gel. As an initial primer test, two individuals were randomly chosen from each of the four studied T. chinensis populations (Appendix 1) to run PCR using the 24 primer pairs. The PCR had a 10‐μL volume and contained: 1 μL of 1× buffer (Tiangen, Beijing, China), 1 μL of MgCl2 (20 mM), 1 μL of dNTPs (2.5 mM each), 0.2 μL of forward primer (10 μM), 0.2 μL of reverse primer (10 μM), 0.2 μL of Taq polymerase (5 U/μL) (TaKaRa Bio Inc., Kyoto, Japan), 0.5 μL of DNA (50 ng/μL), and 5.9 μL of ddH2O. The PCR profile was as follows: 4 min denaturation at 94°C; followed by 30 cycles of 30 s denaturation at 94°C, 30 s annealing step at 55°C, and 30 s elongation at 72°C; and a final extension step at 72°C for 3 min. PCR products were tested by polyacrylamide gel electrophoresis (PAGE) as an initial polymorphism test. The verified polymorphic primers (altogether 10 polymorphic SSRs, Table 1) were synthesized by GenScript (Nanjing, China) with forward primers labeled with 6‐FAM. The fluorescent‐labeled primers were used to run PCR in 58 T. chinensis individuals and 24 T. ramosissima individuals using the reaction volume and PCR profile as described above. PCR products combined with deionized formamide, using GeneScan 500 LIZ (Thermo Fisher Scientific) as a size standard, were run on a 3730 Genetic Analyzer (Thermo Fisher Scientific), and alleles were assigned to bins in GeneMarker version 2.6.7 (SoftGenetics, State College, Pennsylvania, USA). The levels of observed (H o) and expected heterozygosity (H e) and number of alleles were calculated in GenAlEx 6.502 (Peakall and Smouse, 2005). The GENEPOP allele .txt file was exported from GenAlEx for Hardy–Weinberg equilibrium (HWE) and linkage disequilibrium analysis in GENEPOP (version 1.2) (Raymond and Rousset, 1995). The polymorphism information content (PIC) value of each locus was calculated in CERVUS version 3.0.3 (Kalinowski et al., 2007), according to the following formula: , in which f is the frequency of the ith allele and n is the allele number. Significance levels were tested with sequential Bonferroni corrections (Rice, 1989).
Table 1

Characteristics of 21 novel genomic microsatellite markers developed for Tamarix chinensis

Locusa Primer sequences (5′–3′) Repeat motifAllele size range (bp)GenBank accession no.
T. chinensis T. ramosissima
TC1F: ATGTGGGGAGGTGGAGTG(CTT)10 115–127115–121 MG856343
R: AATGTATGCAGACAAAAGT
TC3F: AAAGCAGGTGAGATTGAA(TTA)11 150–204153–201 MG856344
R: ACACCCTAATCCACATAAC
TC4F: TATCCCGAGGTTGTAAAT(AAT)11 162–201162–186 MG856345
R: GCTGCTGGTCACCACTAA
TC5F: GTCTGCCTAAGAAGTCGC(TCTT)8 189–221186–218 MG856346
R: CGGAAATAAGGGAGAAAT
TC6F: GATAAGGTTTTGACGATT(ATA)11 158–236158–230 MG856347
R: TCTAGTCACCACCATCCC
TC7F: GGTCCTTTAGGTTCTTCC(TATT)8 194–214194–206 MG856348
R: TATGGCCTCAACTATCTT
TC8F: TTTGAGTTTGACGATGTA(AAT)11 208–244208–238 MG856349
R: GATTGACCGTGTTTTAGT
TC12F: TAAGAAGGGTAGAGGAGA(AAG)11 281–341281–338 MG856350
R: TAATCAATAGTCACAAGG
TC17F: AGTAGAGGCAAAGGTTAT(TGA)10 335–347332–344 MG856351
R: CTCAAAAGTCCCTCATAG
TC19F: GAGGGTGGGCAAGAAATG(TTTA)8 369–409369–409 MG856352
R: TGACGCAGCAGTAGTGTA
TC2F: CATTGTCATCATCCCACT(AAT)10 151151 MG856353
R: TCTTGTTGCCGACTTTGT
TC9F: CGAAACTAATAACCCTAA(TAA)10 205205 MG856354
R: CTATCCCTGCCGACTCAA
TC10F: CAACTTTCCACCCTTCTT(ATT)11 239239 MG856355
R: ATTCCGAGGCTACACTTA
TC11F: CAGTGTTATTGAAGGGTT(TTA)10 256256 MG856356
R: GATTGTTGATGCGGATGG
TC13F: TTCTAACCCAAAACACTC(TATT)8 273273 MG856357
R: ATGAAAATCCTTCCTTGT
TC14F: AAATGATGTGCTTGTCGT(AAG)10 290290 MG856358
R: TTTAATAGCTTCTTGGAG
TC15F: CTTAGCCTTAGCACTTGG(TAT)11 317317 MG856359
R: TAACTTCCCTCTTACTCC
TC16F: TCTTCGGGTTGAGATTAC(CATA)10 320320 MG856360
R: TAAGGGCTTGTTTGGGAG
TC21F: ATAATCTCCACCCTGCCAACA(ATT)11 382382 MG856361
R: AACCACCATCCACTACCACATC
TC22F: TCCTCTACCCTTTCTTGC(AAT)10 383383 MG856362
R: ATTCCCAGTTCCACCACA
TC24F: TTAATGCAGTCACGAGTT(ATT)10 409409 MG856363
R: GTGGTAATGTGGACGAAT

Annealing temperature for all loci was 55°C.

Characteristics of 21 novel genomic microsatellite markers developed for Tamarix chinensis Annealing temperature for all loci was 55°C. In total, 21 (GenBank accession no.: MG856343–MG856363) out of 24 primers amplified the target sequences. Of these, 11 were monomorphic and 10 were polymorphic in the PAGE test (Table 1). The 10 polymorphic primers showed high polymorphism in both species (Table 2). For T. chinensis, the allele number per locus per population varied from two to 11, with the highest average allele number (6.2) for the Kenli (KL) population, followed by Fangshan (FS) (5.6), Lijin (LJ) (5.2), and Helou (HK) (4.3). Levels of H o and H e per locus for each population ranged from 0.182 to 0.846 and 0.165 to 0.794, respectively, with the highest average H e for the FS population (0.577), followed by KL (0.543), HK (0.542), and LJ (0.534). The PIC values for each locus ranged from 0.201 to 0.803.
Table 2

Level of polymorphism of 10 microsatellite loci developed for Tamarix chinensis in four T. chinensis populations and one T. ramosissima population.a

Locus T. chinensis T. ramosissima
KL (N = 18)HK (N = 13)LJ (N = 13)FS (N = 14)CY (N = 24)
A H o H e PIC A H o H e PIC A H o H e PIC A H o H e PIC A H o H e PIC
TC130.5580.5170.46930.3640.3100.47830.5560.5260.59430.3000.4550.56630.3870.4570.399
TC3100.812o.7940.80350.6770.5930.61270.5240.6160.67380.6260.7230.717130.8640.8920.882
TC440.4140.5070.51630.3370.4150.46840.384* 0.4970.40450.414* 0.5350.48760.5910.6850.610
TC560.6670.5830.63240.6600.6780.66970.7500.7260.72340.5560.5620.60580.7390.8120.788
TC6110.6070.7330.64570.5410.5010.62790.6530.5820.605110.5490.6450.653150.609* 0.7720.745
TC740.6320.5680.60330.5480.5370.60640.6210.6780.71730.5230.5850.56840.6670.6210.546
TC880.7170.6340.62260.5620.6670.63670.6130.5460.60880.7010.7570.628100.7350.8190.791
TC1280.5960.5150.50260.6590.5530.59760.5490.5120.50470.6240.6570.663100.7120.7320.713
TC1730.2220.2040.26430.283* 0.4340.31020.1820.1650.20130.2310.2100.38520.0420.0410.040
TC1950.7060.6300.64330.846* 0.5410.51230.5380.4940.58440.6360.6450.69380.7080.6670.612
Mean6.20.5930.5430.5704.30.5480.5420.5725.20.5370.5340.5615.60.5160.5770.5977.90.6050.6500.601

A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; N = sample size for each population; PIC = polymorphism information content.

Voucher and locality information are provided in Appendix 1.

*Indicates that H o departs significantly from H e under Hardy–Weinberg equilibrium (P < 0.01).

Level of polymorphism of 10 microsatellite loci developed for Tamarix chinensis in four T. chinensis populations and one T. ramosissima population.a A = number of alleles; H e = expected heterozygosity; H o = observed heterozygosity; N = sample size for each population; PIC = polymorphism information content. Voucher and locality information are provided in Appendix 1. *Indicates that H o departs significantly from H e under Hardy–Weinberg equilibrium (P < 0.01). For T. ramosissima, the allele number per locus varied from two to 15, with an average allele number per locus of 7.9. Levels of H o and H e ranged from 0.042 to 0.864 and 0.041 to 0.892, with an average H o and H e of 0.605 and 0.650, respectively. The PIC values ranged from 0.040 to 0.882. HWE testing revealed no loci (for KL), one locus (for LJ and FS), and two loci (for HK) for T. chinensis; for the single population of T. ramosissima one locus deviated from equilibrium significantly (P < 0.01; Table 2). No significant linkage disequilibrium was detected between markers after Bonferroni correction for both species.

CONCLUSIONS

This is the first report of genomic microsatellites for T. chinensis. The 10 polymorphic markers showed comparatively high genetic variation, transferability to congeneric species, little or no deviation from HWE, and were in linkage equilibrium. These properties make them especially useful for genetic analysis of population genetic structure, mating system, and gene flow in T. chinensis and its congeners.
SpeciesLocalityGeographic coordinatesPopulation code N Voucher specimen accession no.c
Tamarix chinensis Lour.Kenli, Shandong, China37°48′06″N, 119°02′17″EKL18Tch‐KL01‐ZR
T. chinensis Hekou, Shandong, China38°13′19″N, 118°50′30″EHK13Tch‐HK02‐ZR
T. chinensis Lijing, Shandong, China38°02′13″N, 118°44′30″ELJ13Tch‐LJ03‐ZR
T. chinensis Fangshan, Beijing, China39°63′19″N, 115°78′30″EFS14Tch‐FS04‐ZR
T. chinensis d Binzhou, Shandong, China37°22′16″N, 118°03′22″EBZ1Tch‐BZ05‐ZR
T. ramosissima Ledeb.Changyi, Shandong, China37°05′16″N, 119°21′22″ECY24Tra‐CY01‐ZR

N = number of individuals sampled.

Vouchers are deposited in Linyi University, Shandong Province, China.

All individuals were sampled from natural stands.

ZR = Ruhua Zhang.

Individual used for DNA extraction and Illumina sequencing.

  8 in total

1.  Primer3 on the WWW for general users and for biologist programmers.

Authors:  S Rozen; H Skaletsky
Journal:  Methods Mol Biol       Date:  2000

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

3.  Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment.

Authors:  Steven T Kalinowski; Mark L Taper; Tristan C Marshall
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  ANALYZING TABLES OF STATISTICAL TESTS.

Authors:  William R Rice
Journal:  Evolution       Date:  1989-01       Impact factor: 3.694

6.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Authors:  T Thiel; W Michalek; R K Varshney; A Graner
Journal:  Theor Appl Genet       Date:  2002-09-14       Impact factor: 5.699

7.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

8.  Rapid SNP discovery and genetic mapping using sequenced RAD markers.

Authors:  Nathan A Baird; Paul D Etter; Tressa S Atwood; Mark C Currey; Anthony L Shiver; Zachary A Lewis; Eric U Selker; William A Cresko; Eric A Johnson
Journal:  PLoS One       Date:  2008-10-13       Impact factor: 3.240

  8 in total
  1 in total

1.  Seventeen new microsatellites for Tamarix gallica and cross-amplification in Tamarix species.

Authors:  Alejandro Terrones; Ana Juan
Journal:  Appl Plant Sci       Date:  2020-01-22       Impact factor: 1.936

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.