Literature DB >> 25202574

Development and testing of new gene-homologous EST-SSRs for Eucalyptus gomphocephala (Myrtaceae).

Donna Bradbury1, Ann Smithson1, Siegfried L Krauss1.   

Abstract

PREMISE OF THE STUDY: New microsatellite (simple sequence repeat [SSR]) primers were developed from Eucalyptus expressed sequence tags (ESTs) and optimized for genetic studies of the southwestern Australian tree E. gomphocephala, which is severely impacted by tree health decline and habitat fragmentation. • METHODS AND
RESULTS: A total of 133 gene-homologous EST-SSR primer pairs were designed for Eucalyptus, and 44 were screened in E. gomphocephala. Of these, 17 produced reliable amplification products and 11 were polymorphic. Between two and 13 alleles were observed per locus, and observed heterozygosities ranged from 0.172 to 0.867. All 17 EST-SSRs that amplified E. gomphocephala cross-amplified to at least one of E. marginata, E. camaldulensis, and E. victrix. •
CONCLUSIONS: This set of EST-SSR primer pairs will be valuable tools for future population genetic studies of E. gomphocephala and other eucalypts, particularly for studying gene-linked variation and informing seed-sourcing strategies for ecological restoration.

Entities:  

Keywords:  EST-microsatellite; Eucalyptus gomphocephala; Myrtaceae; SSR mining; ecologically important genetic variation; tuart

Year:  2013        PMID: 25202574      PMCID: PMC4103447          DOI: 10.3732/apps.1300004

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


The Australasian tree genus Eucalyptus L’Hér. (Myrtaceae) comprises more than 700 species (Brooker, 2000) and is economically significant to the forestry industry. Vast genomic resources for model species have allowed molecular markers to be developed in related nonmodel taxa of conservation significance. Eucalyptus gomphocephala DC. (subg. Symphyomyrtus, common name tuart) is endemic to the Swan Coastal Plain of southwestern Australia, and is a culturally iconic species in the region. Severe pathogen-mediated tree decline (Cai et al., 2010), together with extensive habitat fragmentation in urbanized areas, has required rapid conservation actions for the species. To inform seed sourcing strategies for ecological restoration and forest management in the context of climate change, measures of adaptive population genetic diversity and structure are critical, but no microsatellite markers (simple sequence repeats [SSRs]) have yet been tested or optimized for E. gomphocephala. Specifically, SSRs within gene coding regions such as expressed sequence tags (ESTs) will be valuable for investigating variation in functional and potentially adaptive regions of the genome. Here, we design novel PCR primer pairs for SSRs within Eucalyptus ESTs, targeting those that are homologous to annotated genes and therefore those with putative ecological relevance. For the first time, we optimize a subset for utility in population genetic analysis of E. gomphocephala to inform future conservation programs. To demonstrate the markers’ broader utility in the genus, we report cross-species amplification in three additional ecologically important species (E. camaldulensis Dehnh., E. marginata Sm., E. victrix L. A. S. Johnson & K. D. Hill). This is the first set of gene-homologous EST-SSRs to our knowledge that have been reported and tested in natural populations of nonmodel eucalypt species of ecological and conservation significance. These EST-SSR primers add value to a recently expanding set of EST-SSRs being made available for Eucalyptus, increasing the variety of genes that can be investigated in novel population and conservation genetic studies of nonmodel species.

METHODS AND RESULTS

A total of 36,001 Eucalyptus EST sequences were downloaded from GenBank in 2007 and screened for microsatellite repeats with Tandem Repeats Finder (Benson, 1999) using default parameters, and organized using Tandem Repeats Database version 2.30 (Gelfand et al., 2007). A total of 1098 repeats were detected in 1073 (3%) of 36,001 ESTs. Sequences were clustered using the sequence assembly program CAP3 (Huang and Madan, 1999) with default parameters, into a set of 128 nonredundant contigs and 401 singletons (Appendix S1 and S2). A total of 154 of 529 (29%) ESTs were homologous to various annotated genes, following BLASTN searches of the National Center for Biotechnology Information’s nucleotide (nr/nt) collection, using an E-value threshold of 10−10. PCR primers were designed for all 154 sequences using Primer3 (Rozen and Skaletsky, 2000). Annotated gene homologs were specifically targeted so that loci could inform hypotheses relating to adaptive genetic variation in future studies of E. gomphocephala. All primer pairs and putative gene functions are provided in Appendix S3. We prioritized a random subset of 44 EST-SSRs for further PCR optimization in E. gomphocephala. Primer pairs were synthesized by GeneWorks Pty Ltd (Hindmarsh, Australia) and initially tested for amplification products on a screening panel of seven E. gomphocephala individuals. DNA was extracted from freeze-dried leaf material using a NucleoSpin 8/96 Plant II Core Kit, with buffer set PL2/3 (Macherey-Nagel GmbH & Co., Düren, Germany), as per the manufacturer’s instructions. PCRs were carried out in a total volume of 10 μL, containing 10 ng genomic DNA template, 1× PCR Polymerization Buffer containing dNTPs, 0.2 μM of each primer, 0.44 units Taq DNA polymerase, and 2 mM of MgCl2 (reagents from Fisher Biotech, Perth, Western Australia, Australia). Thermocycling was carried out as follows: denaturation at 94°C for 5 min; followed by 30 cycles of 94°C for 1 min, annealing at 55°C for 30 s, and extension at 72°C for 30 s; followed by a final extension at 72°C for 15 min. Annealing temperatures (Ta) were optimized for each locus and products were visualized on 2% agarose gels stained with SYBR Safe (Invitrogen, Carlsbad, California, USA). A set of 17 (39%) primer pairs reliably amplified a product of expected size in E. gomphocephala. These loci were screened for allele size polymorphism and genotyped in multiplex PCR of between two and four loci per reaction, using a commercial kit (PCR Multiplex Kit using Q Solution; QIAGEN, Hilden, Germany). Forward primers were 5′ end-labeled with WellRED fluorescent dyes (D2, D3, D4; Sigma-Aldrich, St. Louis, Missouri, USA). Reactions were carried out in a total volume of 12.5 μL, containing 5–30 ng DNA template, 1× QIAGEN Multiplex PCR Master Mix, 0.5× Q-solution, and 0.2 μM of each primer (with some exceptions, Table 1). PCR was conducted with an initial activation step at 95°C for 15 min; followed by 30 cycles of 94°C for 30 s, annealing at Ta (Table 1) for 90 s, and extension at 72°C for 60 s; followed by final extension at 72°C for 10 min. Products were genotyped using a CEQ 8800 Genetic Analysis System and analyzed using CEQ Fragment Analysis Software (Beckman Coulter, Brea, California, USA). Two loci (EGM09 and EGM24) exhibited amplification failures or dubious peaks in multiplex and were analyzed in singleplex for further reactions. Following genotyping tests, 11 EST-SSRs were polymorphic and produced reliable electrophoretic profiles in E. gomphocephala (Tables 1 and 2). For these loci, we further investigated the most likely gene annotations by conducting a BLASTX search against proteins in the UniRef100 database (European Molecular Biology Laboratory–European Bioinformatics Institute [EMBL-EBI], http://www.ebi.ac.uk/Tools/blastall/) (Table 1).
Table 1.

Characteristics of 11 new EST-SSR loci developed and optimized for Eucalyptus gomphocephala.

LocusPrimer sequences (5′–3′)Repeat motifGenBank accession no.aSource speciesbPutative function and E-valuecDyeATAllele size range (bp)Md[Final]Ta (°C)
EGM09F: ATTTGCTGAAGTGGGTCTCG(AG)17ES594818E. globulusRicinus communis glucan endo-1,3-beta-glucosidase, putative [2.0e-68]D44160–1660.256
R: ACAGGTCCAGAAGCATGAGC
EGM12F: GCGCCGAGAATCAATACG(CAG)10CD668471E. tereticonisPopulus deltoides CONSTANS-like protein CO1 [3.0e-22]D34188–197A0.256
R: GTAGCTGTTGGCAGCTTTGG
EGM14F: CACTGCCACTTACCAGAGTCG(CT)18CB968019E. grandisGlycine max heat shock transcription factor 21 [1.0e-17]D48350–372B0.154
R: CCTCCACCATCTCGAACG
EGM24F: CCTGCAACGCTTCTCGTC(CT)17ES589764E. globulusPrunus dulcis putative S-adenosylmethionine decarboxylase (SAMDC) [1.0e-26]D23204–2120.256
R: TCTGTATTGAGGCTCGCGTA
EGM25F: CCAGAAGCAACCTCAATTTCC(TC)15ES589925E. globulusArabidopsis thaliana metal transporter Nramp3 [1.0e-109]D22350–356C0.156
R: AGCCACAGCAGGGAGTAGC
EGM30F: AGTGCAGCACCTTTCAGACC(AG)18CU398186E. gunniiRicinus communis chlorophyll A/B binding protein, putative [5.0e-45]D313225–255C0.156
R: AAGATTGATTGCTAGATCAGTCACC
EGM35F: ATACGCGTCCCAGTGATTTC(AG)18Contig 92*E. globulusRicinus communis fructose-1,6-bisphosphatase, cytosolic [1.0e-143]D29196–212C0.156
R: AGGAGCAGACGAACTTGCAT
EGM37F: TGAGGTCACTTCAAGCACCAAGA(GCTTA)5Contig85*E. globulusGossypium hirsutum quinone oxidoreductase [1.0e-106]D42256–261B0.02554
R: GGAAGCGGCAACAACCTTAACA
EGM46F: ATATTCGGCCTCTTCGCATT(CAG)4(AG)12Contig18*E. gunniiGlycine max desiccation protectant protein Lea14 homolog [4.0e-79]D213233–257A0.256
R: ACCTTGGCGTTGTACTCGAC
EGM47F: TCGTTCGGTTTCTGTTCTGA(AATCG)6Contig79*E. grandis + E. gunniiNicotiana tabacum small GTPase Rab2 [3.0e-65]D2394–114C0.0556
R: ACATCCTTCGATCCAACCAG
EGM48F: TCACACTCCAATCTCCAACG(CT)12CU396026E. gunniiQuercus macrocarpa aquaporin PIP2;1 [1.0e-125]D26141–155A0.256

Note: AT = total number of alleles observed, based on 60 individuals from two natural populations of E. gomphocephala; Dye = WellRED dye label; [Final] = final concentration of primer pairs in PCR reaction (μM); M = PCR multiplex group; Ta = annealing temperature.

GenBank accession number of source EST, or nonredundant contig number. Contigs are marked with an asterisk (*) and were derived from multiple redundant singleton ESTs. Contig sequence assembly is reported in Appendix S1; EST contig FASTA sequences are reported in Appendix S2.

Species from which EST(s) were derived.

Putative EST function based on a BLASTX search of the UniRef100 database; E-value of the match is given in brackets.

Loci allocated the same letter (A, B, or C) were amplified together in the same multiplex PCR reaction; a dash (—) indicates the locus was amplified in singleplex.

Table 2.

Population genetic properties of the 11 newly developed EST-SSRs in two natural populations of Eucalyptus gomphocephala.

Yalgorup National Park (n = 30)Ludlow Tuart Forest (n = 30)
LocusATHoHeATHoHe
EGM0940.4000.63940.2330.558
EGM1240.6000.62940.6210.640
EGM1470.7330.77980.7330.776
EGM2430.3450.58030.1720.506
EGM2520.2330.25520.2070.238
EGM3090.6670.728120.8330.839
EGM3590.6670.83260.7670.748
EGM3720.4330.49520.3000.406
EGM46120.8670.81690.8620.861
EGM4730.5330.51530.5000.452
EGM4860.8330.77650.8620.794

Note: AT = total number of alleles observed; He = expected heterozygosity; Ho = observed heterozygosity; n = number of individuals sampled.

Geographic coordinates for each population are: Yalgorup National Park = 32°51′17″S, 115°39′54″E; Ludlow Tuart Forest = 33°34′42″S, 115°29′42″E.

Characteristics of 11 new EST-SSR loci developed and optimized for Eucalyptus gomphocephala. Note: AT = total number of alleles observed, based on 60 individuals from two natural populations of E. gomphocephala; Dye = WellRED dye label; [Final] = final concentration of primer pairs in PCR reaction (μM); M = PCR multiplex group; Ta = annealing temperature. GenBank accession number of source EST, or nonredundant contig number. Contigs are marked with an asterisk (*) and were derived from multiple redundant singleton ESTs. Contig sequence assembly is reported in Appendix S1; EST contig FASTA sequences are reported in Appendix S2. Species from which EST(s) were derived. Putative EST function based on a BLASTX search of the UniRef100 database; E-value of the match is given in brackets. Loci allocated the same letter (A, B, or C) were amplified together in the same multiplex PCR reaction; a dash (—) indicates the locus was amplified in singleplex. Population genetic properties of the 11 newly developed EST-SSRs in two natural populations of Eucalyptus gomphocephala. Note: AT = total number of alleles observed; He = expected heterozygosity; Ho = observed heterozygosity; n = number of individuals sampled. Geographic coordinates for each population are: Yalgorup National Park = 32°51′17″S, 115°39′54″E; Ludlow Tuart Forest = 33°34′42″S, 115°29′42″E. To screen genetic diversity of the 11 EST-SSRs, we genotyped 60 E. gomphocephala individuals from two natural populations: ‘Yalgorup’ (voucher: R. Davis 1390, PERTH 04473167) and ‘Ludlow’ (voucher: C. A. Gardner, PERTH 01350501) (Table 2). Genetic diversity parameters were calculated with GenAlEx version 6.4.1 (Peakall and Smouse, 2006). Deviation from Hardy–Weinberg equilibrium (HWE) and linkage disequilibrium among loci were calculated with GENEPOP version 4.0.10 (Raymond and Rousset, 1995; Rousset, 2008), using the Bonferroni correction for multiple testing. The total number of alleles per locus ranged from two to 13 (mean = 6) (Table 1). Observed and expected heterozygosities ranged from 0.172 to 0.867 and 0.238 to 0.861, respectively (Table 2). Significant deviation from HWE was detected for EGM09 at Ludlow. We did not detect evidence of linkage disequilibrium between any pair of loci in more than one population. The potential presence of null alleles and their frequency (r) in each population were estimated using MICRO-CHECKER version 2.2.3 (van Oosterhout et al., 2004), and were predicted for EGM09 and EGM24 in both populations (r range = 0.17–0.29) and for EGM35 at Yalgorup (r = 0.09). The potential for stuttering was predicted for EGM09 at Ludlow. All 17 primer pairs that initially amplified E. gomphocephala DNA were screened for cross-amplification in three additional species, belonging to diverse sections within two subgenera: E. camaldulensis (sect. Exsertaria) and E. victrix (sect. Adnataria) (both subg. Symphyomyrtus), and E. marginata (sect. Longistylus) (subg. Eucalyptus) according to the methods described above. Samples were collected from natural populations (Appendix 1). Sixteen out of 17 (94%) loci cross-amplified within subg. Symphyomyrtus, and 14 out of 17 (82%) cross-amplified across the subgenus boundary to subg. Eucalyptus, demonstrating high transferability rates within the genus.
Appendix 1.

Cross-species amplification of 17 EST-SSR loci from Eucalyptus gomphocephala to three additional eucalypt species from two subgenera.

LocusAnnotationdSourceefE. gomphocephalaE. camaldulensisE. victrixE. marginata
EGM09Glucan endo-1,3-beta-glucosidaseE. globulus++++
EGM12CONSTANS-like protein CO1E. tereticornis++++
EGM14Heat shock transcription factor 21E. grandis++++
EGM24Putative S-adenosylmethionine decarboxylase (SAMDC)E. globulus++++
EGM25Metal transporter Nramp3E. globulus+NA+NA
EGM30Chlorophyll A/B binding protein, putativeE. gunnii++++
EGM35Fructose-1,6-bisphosphatase, cytosolicE. globulus++++
EGM37Quinone oxidoreductaseE. globulus++++
EGM46Desiccation protectant protein Lea14 homologE. gunnii++++
EGM47Small GTPase Rab2E. grandis + E. gunnii++++
EGM48Aquaporin PIP2;1E. gunnii++++
EGM28Chloroplastic phosphoglycerate kinaseE. globulus + E. grandis+++NA
EGM19Aquaporin (PIP1)E. globulus + E. gunnii++++
EGM33Synaptobrevin-related protein 1 (SAR1)E. tereticornis++++
EGM34Rapid alkalinization factor (RALF)E. globulus++++
EGM26Magnesium transporter CorA-like family protein MRS2-2E. globulus++NANA
EGM42bZIP (basic leucine zipper) transcription factorE. globulus++++

Note: + = positive amplification; NA = no amplification or inconsistent amplification.

Eucalyptus gomphocephala is taxonomically classified within the monotypic sect. Bolites (Brooker, 2000) but has been shown to have affinities with sect. Bisectae I according to ITS (Steane et al., 2007) and Diversity Arrays Technology (DArT) analyses (Steane et al., 2011).

Taxonomic information for tested species: E. gomphocephala DC. = sect. Bolites/Bisectae I, subg. Symphyomyrtus; E. camaldulensis Dehnh. = sect. Exsertaria, subg. Symphyomyrtus; E. victrix L. A. S. Johnson & K. D. Hill = sect. Adnataria, subg. Symphyomyrtus; E. marginata Sm. = sect. Longistylus, subg. Eucalyptus.

Source locations: E. camaldulensis = 23°06′S, 119°34′E (n = 2) and 23°10′S, 119°45′E (n = 1); E. victrix = 22°55′S, 119°10′E (n = 3); E. marginata = 31°56′S, 115°46′E (n = 1), 32°46′S, 116°26′E (n = 1), and 32°42′S, 116°03′E (n = 1).

Predicted gene annotation based on BLAST searches.

Species from which the original EST was derived.

Taxonomic information for source species: E. globulus Labill. = sect. Maidenaria, subg. Symphyomyrtus; E. gunnii Hook. f. = sect. Maidenaria, subg. Symphyomyrtus; E. tereticornis Sm. = sect. Exsertaria, subg. Symphyomyrtus; E. grandis W. Hill = sect. Latoangulatae, subg. Symphyomyrtus.

CONCLUSIONS

This new set of 11 polymorphic EST-SSR loci will enable the characterization of population genetic diversity and structure throughout the species’ range of E. gomphocephala in conjunction with putatively neutral, nuclear genomic SSRs (gSSRs). Given their gene-linked nature, the EST-SSRs can be used to test for signatures of selection in natural populations. The high transferability of the loci demonstrates their suitability for application to other species of ecological, conservation, or economic importance in the genus. The broader set of 133 EST-SSRs represents a resource that could be exploited to optimize additional gene-linked EST-SSRs for Eucalyptus and further expand the types of genes available for investigation. Data obtained using these markers will be particularly valuable to inform seed sourcing and conservation strategies in natural E. gomphocephala populations. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  7 in total

1.  CAP3: A DNA sequence assembly program.

Authors:  X Huang; A Madan
Journal:  Genome Res       Date:  1999-09       Impact factor: 9.043

2.  Primer3 on the WWW for general users and for biologist programmers.

Authors:  S Rozen; H Skaletsky
Journal:  Methods Mol Biol       Date:  2000

3.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

4.  genepop'007: a complete re-implementation of the genepop software for Windows and Linux.

Authors:  François Rousset
Journal:  Mol Ecol Resour       Date:  2008-01       Impact factor: 7.090

5.  Population genetic analysis and phylogeny reconstruction in Eucalyptus (Myrtaceae) using high-throughput, genome-wide genotyping.

Authors:  Dorothy A Steane; Dean Nicolle; Carolina P Sansaloni; César D Petroli; Jason Carling; Andrzej Kilian; Alexander A Myburg; Dario Grattapaglia; René E Vaillancourt
Journal:  Mol Phylogenet Evol       Date:  2011-02-16       Impact factor: 4.286

6.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

7.  TRDB--the Tandem Repeats Database.

Authors:  Yevgeniy Gelfand; Alfredo Rodriguez; Gary Benson
Journal:  Nucleic Acids Res       Date:  2006-12-14       Impact factor: 16.971

  7 in total
  2 in total

1.  Paternity analysis reveals wide pollen dispersal and high multiple paternity in a small isolated population of the bird-pollinated Eucalyptus caesia (Myrtaceae).

Authors:  N Bezemer; S L Krauss; R D Phillips; D G Roberts; S D Hopper
Journal:  Heredity (Edinb)       Date:  2016-08-17       Impact factor: 3.821

Review 2.  Microsatellite resources of Eucalyptus: current status and future perspectives.

Authors:  Murugan Sumathi; Ramasamy Yasodha
Journal:  Bot Stud       Date:  2014-10-25       Impact factor: 2.787

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.