Literature DB >> 28924513

Identification and characterization of microsatellite loci in the tuliptree, Liriodendron tulipifera (Magnoliaceae).

Ricardo Gutiérrez-Ozuna1, Matthew B Hamilton1.   

Abstract

PREMISE OF THE STUDY: Twenty-three polymorphic microsatellite loci (simple sequence repeats) were identified and characterized for Liriodendron tulipifera (Magnoliaceae), a species native to eastern North America, to investigate its genetic diversity, population structure, and mating system. METHODS AND
RESULTS: Using Illumina HiSeq paired-end reads from genomic DNA, searches for repeat motifs identified approximately 280,000 potentially amplifiable loci. Of 77 loci tested, 51 amplified consistently. When genotyped using 30 to 52 total adult trees from three old-growth populations in Maryland, Virginia, and New Jersey, USA, 23 loci were polymorphic. These loci exhibited four to 13 alleles, and observed and expected heterozygosities ranged from 0.233 to 0.865 and 0.272 to 0.876, respectively.
CONCLUSIONS: The microsatellite marker loci presented here will be valuable in population genetic studies of L. tulipifera because they do not suffer from ascertainment bias and show high polymorphism.

Entities:  

Keywords:  Liriodendron tulipifera; Magnoliaceae; microsatellites; next-generation sequencing; population genetic studies; simple sequence repeats (SSRs); tuliptree

Year:  2017        PMID: 28924513      PMCID: PMC5584817          DOI: 10.3732/apps.1700032

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Liriodendron tulipifera L., commonly known as tuliptree, tulip poplar, or yellow poplar, is a pioneer tree in the family Magnoliaceae native to eastern North America. It has a wide geographic distribution in the southeastern and mid-Atlantic United States and occurs in diverse habitats. To facilitate population genetic analyses of effective population size and population structure, we developed genomic microsatellite (simple sequence repeat [SSR]) markers without the potential limitations of previously reported SSRs. Liriodendron tulipifera SSRs have been developed from expressed sequence tags (ESTs; Xu et al., 2006, 2010; Yang et al., 2012; Zhang et al., 2015) located in or near functional genes, and consequently, they are more likely to be affected by natural selection (Ellis and Burke, 2007). Liriodendron chinense (Hemsl.) Sarg. genomic (noncoding, nontranscribed) microsatellite loci have been cross-amplified in L. tulipifera (Yao et al., 2008). Cross-species amplification of microsatellite loci might result in ascertainment bias, where polymorphism is reduced when loci are transferred to related species (Ellegren et al., 1995). Preliminary tests of loci from Yao et al. (2008) carried out with 10 L. tulipifera individuals showed low polymorphism (results not shown). Nonneutral evolution or ascertainment bias can potentially impact the estimation of population genetic parameters. Therefore, we identified and characterized polymorphic genomic microsatellite loci in L. tulipifera using Illumina next-generation sequencing and a bioinformatics pipeline.

METHODS AND RESULTS

Microsatellite development

Total DNA from leaves of one L. tulipifera individual collected on the main campus of Georgetown University in Washington, D.C., USA, was extracted using the DNeasy Plant Mini Kit (QIAGEN, Valencia, California, USA). A genomic DNA library for Illumina paired-end sequencing was prepared from 4 μg of DNA following the PCR-free library prep kit from Illumina (Illumina, San Diego, California, USA). DNA was sheared to 550 bp and sequenced as 150 bp paired-end reads on an Illumina HiSeq 2500 at the Biocomplexity Institute of Virginia Tech (Blacksburg, Virginia, USA). We used PAL_FINDER_v0.02.04 (Castoe et al., 2012) to extract reads containing perfect microsatellites (uninterrupted and identical repeats). The reads were imported to PAL_FINDER and analyzed in two different ways: (1) as Illumina paired-end reads filtered to include ≥12 tri-, ≥10 tetra-, ≥8 penta-, and ≥6 hexanucleotide repeats, and (2) as FASTQ sequence files converted to FASTA format, treated as 454 single-end reads, and filtered to include ≥15 di-, ≥10 tri-, ≥8 tetra-, ≥6 penta-, and ≥4 hexanucleotide repeats. One potential advantage of using both methods is the development of loci with a broader range of amplification fragment sizes. In both cases, we identified microsatellite loci with flanking sequences suitable for PCR primer design or potentially amplifiable loci (PALs). Raw reads were deposited in the National Center for Biotechnology Information (NCBI) Short Read Archive (SRA; BioProject no. PRJNA331147, BioSample no. SAMN05417503). Summaries of reads containing microsatellite repeats and PALs (with primer sequences) detected using both methods are available in Appendices S1 and S2. We selected a set of 77 PALs to empirically assess amplification using three individuals. We amplified each locus in 25-μL PCR reactions (1× OneTaq Standard reaction buffer, 160 μM dNTPs, 0.2 μM forward primer, 0.2 μM reverse primer, 0.625 units OneTaq DNA polymerase [New England BioLabs, Ipswich, Massachusetts, USA], 1 μL template DNA [concentration was not determined], and ddH20 to 25 μL). Thermocycling conditions were 94°C (30 s); followed by 30 cycles of denaturation at 94°C (30 s), annealing at 50–61°C (30 s, Table 1), and extension at 68°C (30 s); and a final extension of 68°C (5 min). Fifty-one primer pairs yielded products of the expected size without nonspecific amplification and were then tested for polymorphism in seven individuals, by visualizing PCR products on 3% agarose gels. Of these 51 loci, 23 were polymorphic and used to genotype 30 to 52 total individuals collected from three old-growth locations in the native range of L. tulipifera (Appendix 1). Because L. chinense, the other single species in Liriodendron L., has a restricted geographic distribution in China and Vietnam, we were not able to test for cross-species amplification. As cross-amplification of genomic SSRs has limited success in plants (Merritt et al., 2015) and success declines as genetic divergence increases (Barbará et al., 2007), we did not test for cross-amplification in other Magnoliaceae.
Table 1.

Characteristics of 23 polymorphic genomic microsatellite loci isolated from Liriodendron tulipifera.

LocusPrimer sequences (5′–3′)Repeat motifObserved amplicon size (bp)Ta (°C)Fluorescent labeling methodaIdentifierb
Lt006F: GTGGTAACGCATGAGATGGC(AAG)12416–43758M13-labeledKX869968
R: TGAGCTTTCCATCAGGTGAGC
Lt011F: GCATGGACATGGTGTAACCC(AAT)12223–250586-FAM1KX869967
R: TTCCATGTGTGCCCTACTGC
Lt014F: CTCACATGAACAACAAGGAAACC(AAT)16108–12658M13-labeled HWI-1KL163:137:H7DJKADXX:1:2116:14211:16050 1:N:0:CTTGTA
R: CTGGGATCTGCACTGATTGG
Lt023F: CGTGGCCAGCATCTTGTAGC(AAC)16113–14058M13-labeled HWI-1KL163:137:H7DJKADXX:1:2214:4579:5389 1:N:0:CTTGTC
R: CAAGAACAACGAGGCAAACAGC
Lt025F: AGTTGGGAATTGGGACAAGG(ATC)13105–12057M13-labeled HWI-1KL163:137:H7DJKADXX:1:2114:7509:71461 1:N:0:CTTGTA
R: TTCAGTGCTCCAGTTTTCACG
Lt032F: GCTCCTACAAACATCAAAAGC(AAG)1395–12250M13-labeled HWI-1KL163:137:H7DJKADXX:1:2109:2516:54908 1:N:0:CTTGTA
R: CAAAACCCATTTCGTGTTCC
Lt035F: CACAGAGCTTGGTGCTTTACG(AAT)1689–11956NED1 HWI-1KL163:137:H7DJKADXX:1:1115:14496:36717 1:N:0:CTTGTA
R: AAGTCCATGTTCCACTCATTCG
Lt036F: TTGAAGTTTGAATCCCCATCC(ATC)13107–14350VIC3 HWI-1KL163:137:H7DJKADXX:1:1109:19109:85000 1:N:0:CTTGTA
R: TGATTGGGCCATGTTAATCG
Lt043F: TCATCTTCCTTTGGTTTGCC(AAG)12119–149506-FAM3 HWI-1KL163:137:H7DJKADXX:1:2103:5825:78158 1:N:0:CTTGTA
R: TGGGGATTTGACAGAGAACG
Lt052F: TGGTCCCGAGATTGTTCACC(AAC)1494–11557M13-labeled HWI-1KL163:137:H7DJKADXX:1:2106:8132:35499 1:N:0:CTTGTA
R: TCTTACCCACCACAACCATCG
Lt054F: CCTCGTAGTGTTGATCGTTGC(AAT)1688–12157NED2 HWI-1KL163:137:H7DJKADXX:1:1214:10945:60086 1:N:0:CTTGTA
R: TCAACCTTCCACCAGTGTCC
Lt059F: CTGCCCCTTCAAATTCTTGG(AGG)1293–11155NED3 HWI-1KL163:137:H7DJKADXX:1:2102:16887:46302 1:N:0:CTTGTA
R: AATTGCGTGAAGCTCAGACC
Lt060F: CCTACTCTTCCGGAGTTTCG(AAT)16126–15057M13-labeled HWI-1KL163:137:H7DJKADXX:1:1215:8150:67169 1:N:0:CTTGTA
R: GGGATGGGGTGGAATATAAGC
Lt061F: TTGGCGGATGATTGAGAGC(AAG)14122–14655PET3 HWI-1KL163:137:H7DJKADXX:1:2205:15636:87476 1:N:0:CTTGTA
R: TTAATGCCGTGGGTTCTGC
Lt064F: AAGGATGACTTTCACTGAGG(AAT)13120–156536-FAM2 HWI-1KL163:137:H7DJKADXX:1:1203:17908:71099 1:N:0:CTTGTA
R: CATTGGGACTTTATTTCTCTCC
Lt066F: TCTGGCCCTTGATACTGTGG(ATC)12134–15257M13-labeled HWI-1KL163:137:H7DJKADXX:1:1109:1769:97275 1:N:0:CTTGTA
R: CCCACTTGGGTGTTTCAGG
Lt068F: AAACTCCCTAACAAGGGTCTCC(AAT)1689–13155VIC2 HWI-1KL163:137:H7DJKADXX:1:2211:4001:84575 1:N:0:CTTGTA
R: ACCACAACACAGAAACAATGGG
Lt070F: TTCTCCGCCATCGTCTTACC(ATC)12112–133576-FAM1 HWI-1KL163:137:H7DJKADXX:1:2211:16362:43312 1:N:0:CTTGTA
R: CTAATGAACGGTCGGGATGG
Lt075F: CATCGCCATTGTTTTCTCTGC(AAT)13119–13455M13-labeled HWI-1KL163:137:H7DJKADXX:1:1201:4554:96377 1:N:0:CTTGTA
R: TAACTGCCTGCGTATCATCC
Lt077F: TATCCAAACGGCCCTTAACC(AAT)1484–10555HEX1 HWI-1KL163:137:H7DJKADXX:1:1214:2279:95306 1:N:0:CTTGTA
R: GATCACAAAACTCCCACATGC
Lt079F: GGGAGACTCGGCTTTAATCGCC(ATC)1276–9461PET2 HWI-1KL163:137:H7DJKADXX:1:2215:19866:60074 1:N:0:CTTGTA
R: GAGTGGAGTAGCGGGACAGG
Lt080F: GGCCTGAATTCCTTTGTTCCC(AAT)12130–15157M13-labeled HWI-1KL163:137:H7DJKADXX:1:1107:4232:34637 1:N:0:CTTGTA
R: CCCTCAATGTACACGCTTGC
Lt081F: ATGGATTCCGGCAAGTCTCC(AAT)1796–12357M13-labeled HWI-1KL163:137:H7DJKADXX:1:2115:20835:65097 1:N:0:CTTGTA
R: TGAGGAAGAGAACAAACAGGGG

Note: Ta = annealing temperature.

PCR multiplex sets are indicated as 1, 2, or 3.

Numbers are either GenBank accession numbers or Illumina sequence identifiers associated with NCBI’s Short Read Archive (BioProject no. PRJNA331147, BioSample no. SAMN05417503).

Characteristics of 23 polymorphic genomic microsatellite loci isolated from Liriodendron tulipifera. Note: Ta = annealing temperature. PCR multiplex sets are indicated as 1, 2, or 3. Numbers are either GenBank accession numbers or Illumina sequence identifiers associated with NCBI’s Short Read Archive (BioProject no. PRJNA331147, BioSample no. SAMN05417503). For fragment analyses, PCR products were fluorescently labeled either using primers tailed with a 5′ M13(–21) sequence following Schuelke (2000) or using primers with a 5′ fluorophore and amplified in multiplex (Table 1). In the tailed primer labeling method, two PCR reactions were carried out using the same reverse primer. The first PCR used an M13(–21)-tailed locus-specific forward primer, while the second used a universal fluorescently labeled M13(–21) as a forward primer. The products of the first PCR were purified using StrataPrep PCR Purification Kit (Agilent Technologies, Santa Clara, California, USA) and then used as the template for the second PCR. Fluorescent products were electrophoresed on an ABI PRISM 3100 Genetic Analyzer (Applied Biosystems, Foster City, California, USA), and amplicon sizes were estimated with either orange or red DNA size standard (MCLAB, San Francisco, California, USA) and GeneMapper software 3.7 (Applied Biosystems) using the Local Southern sizing algorithm.

Microsatellite data analysis

Genotypes appeared diploid, displaying at most two alleles per locus per individual. Data were analyzed by sampled location and as a pooled population (Table 2). For each locus, number of alleles, observed heterozygosity (Ho), expected heterozygosity under random mating (He), and polymorphism information content (; Botstein et al., 1980) for the pooled population were estimated using CERVUS 3.0.3 (Kalinowski et al., 2007). We used GENEPOP 4.2 (Rousset, 2008) to test deviation from Hardy–Weinberg expected heterozygote frequency (HWE) using default values for Markov chain parameters, and to estimate the fixation index (F = [He − Ho]/He; Hamilton, 2009) for the pooled population.
Table 2.

Genetic properties by individual and pooled sampled locations of 23 polymorphic microsatellite markers developed in Liriodendron tulipifera.

LocusMontpelier (n = 20)Frog Canyon (n = 20)Saddler’s Woods (n = 12)Pooled locations (n = 52)
NAHeHobNAHeHobNAHeHobNAHeHobPICFNullIIM
Lt0061060.7890.8001020.1000.1001030.6890.7003060.6370.5330.5910.1660.074
Lt0112060.7260.6001970.8340.7891250.7790.8335190.7890.7250.7490.0810.039
Lt0141060.7680.6001050.8000.8001060.8160.7003070.8210.7000.7820.1490.078
Lt0231050.5050.5001050.6160.6001060.7790.8003060.7670.6330.7150.1770.075
Lt0251040.6580.9001020.2680.3001030.4260.3003040.4620.5000.420−0.0850.056
Lt0321050.7000.7001050.7110.9001050.7790.7003080.8090.7670.7730.0530.036
Lt0351970.7500.263***1260.8150.167***1140.5710.273*42100.8260.238***0.7930.7140.402†
Lt0362080.8560.8502080.8560.7501260.8120.9175290.8500.8270.8220.0270.017
Lt0432060.8050.7502050.7030.300***1270.8620.667**5270.8070.558***0.7710.3110.098†
Lt052940.6990.6671060.6840.7001050.8050.9002970.7820.7590.7440.0300.036
Lt0541970.7640.5792070.7820.7001060.8260.100***49110.8060.531***0.7730.3440.171†
Lt0592050.5830.6502040.6740.500*1250.7170.9175260.6940.6540.6460.0590.030
Lt0601050.7740.6001050.6950.5001070.7890.7003090.7750.6000.7330.2280.101
Lt0612040.4580.4502030.5550.050***1230.3010.3335260.4920.269***0.4320.4550.130†
Lt0642050.7500.6001770.6930.176***1250.7250.41749100.8250.408***0.7910.5080.238†
Lt0661040.6950.6001040.5740.6001020.5210.5003040.5980.5670.5190.0530.064
Lt06820120.8370.90020100.8730.85012100.7830.83352130.8760.8650.8550.0130.014
Lt0702030.5780.4501950.6290.6321230.6630.6675150.6320.5690.5570.1020.059
Lt0751020.3950.5001020.2680.1001020.1000.1003040.2720.2330.2540.1450.168
Lt0772040.7420.7001960.7250.8421260.6410.7505170.7650.7650.7170.0000.033
Lt0792040.3910.4002030.1880.2001230.4200.4175240.3200.3270.297−0.0210.025
Lt0801030.5420.200*1050.7740.200**1040.6160.300*3060.6540.233***0.5870.6470.438†
Lt0811060.8110.6001050.7740.7001060.7630.8003070.8090.7000.7660.1370.065

Note: A = number of alleles; F = fixation index; He = expected heterozygosity under random mating; Ho = observed heterozygosity; n = number of individuals sampled; N = number of individuals genotyped; NullIIM = estimate of null allele frequency given the individual inbreeding model (IIM); PIC = polymorphism information content.

See Appendix 1 for locality and voucher information.

Significant deviation from Hardy–Weinberg expected genotype frequencies: *P < 0.05, **P < 0.01, ***P < 0.001.

†95% highest posterior density interval does not include zero.

Genetic properties by individual and pooled sampled locations of 23 polymorphic microsatellite markers developed in Liriodendron tulipifera. Note: A = number of alleles; F = fixation index; He = expected heterozygosity under random mating; Ho = observed heterozygosity; n = number of individuals sampled; N = number of individuals genotyped; NullIIM = estimate of null allele frequency given the individual inbreeding model (IIM); PIC = polymorphism information content. See Appendix 1 for locality and voucher information. Significant deviation from Hardy–Weinberg expected genotype frequencies: *P < 0.05, **P < 0.01, ***P < 0.001. †95% highest posterior density interval does not include zero. Population genetic parameters are listed for each sampled location showing loci exhibited two to 12 alleles, with almost all alleles common to each location (Table 2). Lack of population differentiation (FST = 0.077 estimated using GENEPOP) justified pooling genotypes from the three locations. In the pooled population, observed and expected heterozygosities ranged from 0.233 to 0.865 and 0.272 to 0.876, respectively. Six loci showed significant deviations from HWE (Table 2) with deficits of heterozygotes that could be attributed to numerous causes. One hypothesis is nonrandom mating, which we tested using INEST (Chybicki and Burczyk, 2009) with the individual inbreeding model (IIM) run for 50,000 burn-in and 500,000 total cycles. The estimated average coancestry coefficient over all loci (f) was 0.041, with a 95% highest posterior density interval of [0.000, 0.085], indicative of an outcrossing species. Another hypothesis for the deficit of heterozygosity is the presence of null alleles. Frequencies of null alleles estimated with INEST using IIM are listed in Table 2. The six loci with significant deficits of heterozygotes also showed evidence of null allele frequencies greater than zero. When comparing models including null alleles (n), mating among relatives (f), and genotyping failures (b) using INEST, the full model (nfb) was found to best fit the data by the lowest deviance information criterion value (DIC = 5905.837), showing that all three parameters contributed to observed genotype frequencies. The nb model, without mating among relatives, exhibited the closest DIC value (5916.991), but the difference was greater than 10, indicating stronger support for the nfb model (Igor J. Chybicki, personal communication).

CONCLUSIONS

The 23 microsatellite markers developed here do not suffer from ascertainment bias and show high levels of polymorphism. Taken together with thousands of PALs, this study provides useful resources for population genetic studies of L. tulipifera. Click here for additional data file. Click here for additional data file.
Appendix 1.

Locality and voucher information for the Liriodendron tulipifera samples used in this study.

LocationCounty, StateLatitudeLongitudeVoucher no.an
Frog Canyon, Smithsonian Environmental Research CenterAnne Arundel, Maryland38.884284−76.552695MARY102199120
James Madison’s MontpelierOrange, Virginia38.226667−78.179444MARY102199020
Saddler’s WoodsCamden, New Jersey39.900722−75.057750MARY102198912

Note: n = number of individuals sampled.

A voucher sample for each location was deposited in the Norton-Brown Herbarium (MARY), University of Maryland, College Park, Maryland, USA.

  11 in total

1.  Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment.

Authors:  Steven T Kalinowski; Mark L Taper; Tristan C Marshall
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

Review 2.  EST-SSRs as a resource for population genetic analyses.

Authors:  J R Ellis; J M Burke
Journal:  Heredity (Edinb)       Date:  2007-05-23       Impact factor: 3.821

Review 3.  Cross-species transfer of nuclear microsatellite markers: potential and limitations.

Authors:  Thelma Barbará; Clarisse Palma-Silva; Gecele M Paggi; Fernanda Bered; Michael F Fay; Christian Lexer
Journal:  Mol Ecol       Date:  2007-09       Impact factor: 6.185

4.  genepop'007: a complete re-implementation of the genepop software for Windows and Linux.

Authors:  François Rousset
Journal:  Mol Ecol Resour       Date:  2008-01       Impact factor: 7.090

5.  Microsatellite 'evolution': directionality or bias?

Authors:  H Ellegren; C R Primmer; B C Sheldon
Journal:  Nat Genet       Date:  1995-12       Impact factor: 38.330

Review 6.  Construction of a genetic linkage map in man using restriction fragment length polymorphisms.

Authors:  D Botstein; R L White; M Skolnick; R W Davis
Journal:  Am J Hum Genet       Date:  1980-05       Impact factor: 11.025

7.  Characterization of 39 novel EST-SSR markers for Liriodendron tulipifera and cross-species amplification in L. chinense (Magnoliaceae).

Authors:  Ai-Hong Yang; Jin-Ju Zhang; Hua Tian; Xiao-Hong Yao
Journal:  Am J Bot       Date:  2012-10-29       Impact factor: 3.844

8.  Simultaneous estimation of null alleles and inbreeding coefficients.

Authors:  Igor J Chybicki; Jaroslaw Burczyk
Journal:  J Hered       Date:  2008-10-20       Impact factor: 2.645

9.  Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake.

Authors:  Todd A Castoe; Alexander W Poole; A P Jason de Koning; Kenneth L Jones; Diana F Tomback; Sara J Oyler-McCance; Jennifer A Fike; Stacey L Lance; Jeffrey W Streicher; Eric N Smith; David D Pollock
Journal:  PLoS One       Date:  2012-02-14       Impact factor: 3.240

10.  An empirical review: Characteristics of plant microsatellite markers that confer higher levels of genetic variation.

Authors:  Benjamin J Merritt; Theresa M Culley; Alina Avanesyan; Richard Stokes; Jessica Brzyski
Journal:  Appl Plant Sci       Date:  2015-08-17       Impact factor: 1.936

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.