Literature DB >> 25202546

Development and multiplexed amplification of SSR markers for Thuja occidentalis (Cupressaceae) using shotgun pyrosequencing.

Huaitong Xu1, Francine Tremblay2, Yves Bergeron2.   

Abstract

PREMISE OF THE STUDY: Sixteen novel, polymorphic, multiplexed microsatellite loci were developed for eastern white cedar (Thuja occidentalis) using simple sequence repeat (SSR)-enriched shotgun pyrosequencing. • METHODS AND
RESULTS: Sixteen loci were tested on a panel of 24 individuals from different populations. The number of observed alleles ranged from four to 22. Four sets of multiplex PCR for the 16 loci were then carried out on 60 individuals of two populations from islands of FERLD Duparquet Forest, Canada. Mean number of alleles, observed heterozygosity, and expected heterozygosity were respectively 5.75, 0.594, and 0.574 for Island 58, and 5.50, 0.704, and 0.624 for Island 134. •
CONCLUSIONS: Four sets of multiplex microsatellite loci can be used for future genetic studies, which includes investigating genetic diversity and structure, and fragmentation and regeneration studies.

Entities:  

Keywords:  454 GS-FLX Titanium; Thuja occidentalis; microsatellite marker; next-generation sequencing; population genetics; shotgun pyrosequencing

Year:  2013        PMID: 25202546      PMCID: PMC4105039          DOI: 10.3732/apps.1200427

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Eastern white cedar (Thuja occidentalis L.) is a native, wind-pollinated conifer with a broad distribution across North America (Fowells, 1965). The species’ range extends from the Gulf of St. Lawrence in the east to southeastern Manitoba in the west, and from James Bay in the north to Tennessee and North Carolina in the south (Fowells, 1965). A member of the Cupressaceae, it is also commonly called eastern arborvitae, American arborvitae, northern white cedar, Atlantic red cedar, and swamp cedar in English (USDA NRCS, 2013), and thuya occidental, cèdre, balai, cèdre blanc, thuier cèdre, and arborvitae in French (Brouillet et al., 2010). Eastern white cedar (EWC) is listed as endangered in Indiana, Massachusetts, and New Jersey, as a threatened species in Connecticut, Illinois, Kentucky, and Maryland, and of special concern in Tennessee (USDA NRCS, 2013). Genetic analyses previously conducted on EWC have been mainly based on allozyme markers (Hofmeyer et al., 2007), while highly polymorphic markers such as microsatellites have not been developed for EWC. We report on the development and characterization of microsatellite markers for EWC using shotgun pyrosequencing on a simple sequence repeat (SSR)–enriched library (Malausa et al., 2011).

METHODS AND RESULTS

Foliage of EWC individuals from 14 sites across northern Quebec (Appendix 1) was collected and maintained at −20°C before genetic analysis. Genomic DNA was extracted using the DNeasy Plant Mini Kit (QIAGEN, Hilden, Germany). DNA extracts of 14 individuals were combined and sent on dry ice to Genoscreen (Lille, France) for microsatellite-enriched GS-FLX library construction following the methodology developed by Malausa et al. (2011). Briefly, main steps included: (1) digestion of genomic DNA with RsaI (Fermentas International Inc., Burlington, Ontario, Canada); (2) enrichment of microsatellite sequences in fragmented DNA with eight types of probes (TG, TC, AAC, AAG, AGG, ACG, ACAT, ACTC), which was accomplished by using Dynabeads (Invitrogen, Carlsbad, California, USA); and (3) PCR amplification of enriched DNA with primers specific to the adapter sequences (Malausa et al., 2011). In total, 11,393 raw sequences with an average read length of 400 bp were obtained, with 2175 sequences containing microsatellite motifs. One hundred seventeen of the sequences successfully had primers designed for them using QDD software (Meglécz et al., 2010) using default parameters except optimal primer length of 22 bp (range 18–27 bp) and 50% GC content (range 40–60%).
Appendix 1.

Voucher information for Thuja occidentalis samples. All samples were preserved at the Institut de recherche sur les forêts, Université du Québec en Abitibi-Témiscamingue, Canada.

No.SiteLatitudeLongitudeLocationCountryYear of collection
1MZ149°52′31.44″N74°23′34.224″WChibougamauCanada2007
2MZ249°54′32.976″N74°19′21.396″WChibougamauCanada2007
3MZ349°57′12.636″N74°13′44.688″WChibougamauCanada2007
4MZ449°38′30.336″N74°20′2.58″WChibougamauCanada2007
5MZ548°55′39.792″N78°53′8.808″WJames BayCanada2007
6MZ649°25′23.412″N79°12′39.492″WJames BayCanada2007
7MZ749°51′30.708″N78°36′25.956″WJames BayCanada2007
8MZ849°53′0.564″N78°38′45.78″WJames BayCanada2007
9MZ949°51′21.924″N78°38′41.496″WJames BayCanada2007
10DZ148°32′24.72″N78°38′30.696″WAbitibiCanada2007
11DZ248°28′12.54″N79°27′8.46″WAbitibiCanada2007
12DZ348°28′47.244″N79°26′12.624″WAbitibiCanada2007
13DZ448°25′53.796″N79°24′6.588″WAbitibiCanada2007
14DZ548°15′6.656″N78°34′29.208″WAbitibiCanada2007
15DZ648°25′51.636″N79°23′2.976″WAbitibiCanada2007
16DZ748°12′4.752″N79°25′8.796″WAbitibiCanada2007
17CZ147°25′45.192″N78°40′42.528″WTémiscamingueCanada2007
18CZ247°25′0.084″N78°40′55.704″WTémiscamingueCanada2007
19CZ347°23′44.052″N78°43′53.904″WTémiscamingueCanada2007
20CZ447°20′2.18″N79°23′33.396″WTémiscamingueCanada2007
21CZ547°18′39.96″N78°30′55.8″WTémiscamingueCanada2007
22CZ647°27′14.22″N78°35′15.54″WTémiscamingueCanada2007
23CZ747°25′8.184″N78°40′42.384″WTémiscamingueCanada2007
24CZ847°24′56.844″N78°42′41.94″WTémiscamingueCanada2007
25IS5848°26′41.4″N79°15′51.9″WAbitibiCanada2008
26IS13448°27′52.5″N79°16′19.6″WAbitibiCanada2008
To minimize screening costs, we initially selected 48 of 117 pairs following criteria detailed in Lepais and Bacles (2011). In brief, we restricted our selection to loci with hexa-, tetra-, and dinucleotide motif types. Dinucleotide motif was limited to AC, CA, TG, GT, AG, GA, CT, and TC types, because AT and TA types are notoriously hard to amplify (Temnykh et al., 2001). Each of the selected 48 loci was initially tested for amplification with unlabeled primers (Invitrogen) on a screening panel that included seven EWC trees collected across northern Quebec (one tree per site) (Appendix 1) and a negative control. Amplifications were carried out in a total volume of 10 μL using four 96-well Mastercycler pro S PCR systems (Eppendorf Gmbh, Wesselling-Berzdorf, Germany). Each reaction mixture contained 1 μL of DNA extract, 5 μL of 2× QIAGEN Multiplex PCR Master Mix (QIAGEN), and a final concentration of 0.2 μM for each forward and reverse primer. The PCR program consisted of an initial heat-activation step at 95°C for 15 min, 36 cycles of three-step cycling (denaturation at 94°C for 30 s, annealing at 54°C for 90 s, extension at 72°C for 60 s), and a final extension at 60°C for 30 min. A total of 2.5 μL of PCR products were visualized on 3% agarose gel (Promega Corporation, Madison, Wisconsin, USA), with electrophoretic migration performed at 100 V for 20 min on the Bio-Rad Imaging System (Bio-Rad, Montreal, Canada). The initial test on 48 loci using agarose gel showed that 38 had amplified products, and four of 38 had three or more products in one PCR reaction (nonspecific). Among the remaining 34 loci, 16 had one or two amplification products, variable in size. Thus, we used all of them to verify polymorphisms and further design multiplex PCR. Each was tested with fluorescent dye–labeled primers (Applied Biosystems, Carlsbad, California, USA) on a panel that included 24 EWC individuals collected from 24 sites across northern Quebec (one individual per site) (Appendix 1) plus one negative control. PCR cycles were the same as those mentioned previously, except for increased annealing temperatures to achieve specific amplifications (Table 1). A total of 2 μL of 1:100 diluted PCR products labeled with four different dyes (6-FAM, VIC, NED, and PET; Applied Biosystems) were mixed with 8.35 μL of Hi-Di Formamide (Applied Biosystems) and 0.15 μL of GeneScan 500 LIZ Size Standard (Applied Biosystems), and sent to GenoQuebec (Montreal, Canada) for genotype reading on an ABI 3730 genetic analyzer. Results were analyzed with GeneMapper 3.7 software (Applied Biosystems).
Table 1.

Characteristics of 16 microsatellites validated for Thuja occidentalis.

Observed
LocusPrimer sequences (5′–3′)GenBank accession no.DyeRepeat motifSize (bp)aTa (°C)MGAMinMax
TO791F: AAGAGATTTATTTGCCCTCCGJX475983VIC(CA)1214157116133167
R: ATGGTTGATGGACTCCTTGG
TO605F: GAATAACTTCTTCTGGGAAAGATACAJX475984PET(AC)819059110174196
R: GAGGTGGAAAGAAGTGGATAAAA
TO328F: CCCGCAACACCTACTTGTCTJX475985FAM(TACA)72155714203215
R: TGCTCCATGTTTGAAGTTGC
TO53F: AAATGGCCCATAAGCACAAAJX475986NED(CA)51845816174184
R: GGATGTTTCCAGTTGACGGT
TO925F: TGTGTTTGTGGTGGCTGACTJX475987FAM(TG)2015158222129217
R: CATTCATACATTTCCCATCCA
TO727F: GAGATTCCTTTAAAATATTGGCATJX475988VIC(GA)1124157214233325
R: CCCTCCCATTCCTCTTAATG
TO659F: TGATGCACCAATTTTCTTTGGJX475989PET(CT)91915627181195
R: TGATGCACTTTAAGGTGTAGGG
TO29F: TGCAGTGTTAGTGGAGCAACTTJX475990NED(CA)516257214148186
R: TCATTGTTTATTCCCTAAGATGGA
TO737F: GAGCAAGAAGGAGAGTGGGAJX475991PET(AGAT)111246336102130
R: CCTAGGTTGCCTTGTTGTCC
TO587F: GTGCCAAACTTTTCAAGGTAAGAJX475992NED(CT)816762313139211
R: GCAAGAGCACAAATGATCACA
TO512F: TGCATAACAACTCTTCTTAAATCAGCJX475993FAM(CT)819463311146212
R: AGGTCCTATCTAGGTCTTAGACAACTT
TO503F: CTTGTCCGTCTGACATGTGTTTJX475994VIC(GA)819055312138202
R: CACATAGGTTAAGGGTAGTTTCCT
TO715F: CATCTACATGGTCGATGATTTAACJX475995VIC(AG)101066046100110
R: TATCCCAAACCAGCAAAACC
TO521F: CAAATATGGCACCAATGCCTJX475996PET(CT)812154417113239
R: CAATTTCCTCAGGTTTGGGA
TO418F: ATGCTTTTCTAACCCTTTTGGAJX475997NED(AC)72536148163255
R: TGATCAGTTGGATTTCTAGATTGC
TO20F: TTTGGCTTGTAGGTGGTTTTJX475998FAM(TG)519257415168204
R: CTCCATTTTGGAGTGTTGGT

Note: A = number of alleles observed; Max = maximum allele size observed during screening; MG = multiplex group; Min = minimum allele size observed during screening; Ta = annealing temperature.

Product size from shotgun pyrosequencing.

Characteristics of 16 microsatellites validated for Thuja occidentalis. Note: A = number of alleles observed; Max = maximum allele size observed during screening; MG = multiplex group; Min = minimum allele size observed during screening; Ta = annealing temperature. Product size from shotgun pyrosequencing. All 16 loci showed interpretable, repeatable, and polymorphic patterns (Table 1). We used Multiplex Manager (Holleley and Geerts, 2009) to design and optimize multiplex PCRs to find annealing temperatures for each multiplex group to ensure specific amplifications and avoid complementary sequences among primers. Primer pairs were multiplexed to reduce amplification costs (Table 1). Coamplifications of all multiplexed primers were tested on two populations (30 trees per population) sampled from islands in Lake Duparquet, northwestern Quebec (Table 2). PCR cycles were the same as those mentioned previously, except for multiplexed annealing temperatures (M1 at 57°C, M2 at 56°C, M3 at 55°C, M4 at 54°C). PCR products were genotyped as previously detailed.
Table 2.

Results of initial primer screening in Thuja occidentalis samples from Lake Duparquet, Lake Duparquet Research & Teaching Forest, Quebec, Canada.

Island 58 (N = 30)aIsland 134 (N = 30)a
LocusAHoHeFISNull alleles present (frequency)AHoHeFISNull alleles present (frequency)
TO535.000.5670.7050.212no4.000.9000.652−0.366no
TO3284.000.5670.5850.048no3.000.6670.491−0.343no
TO6053.000.2000.5800.665*yes (0.24)2.000.3670.4330.169no
TO79112.000.6670.7940.177no11.000.8000.8390.063no
TO297.000.5000.5430.096no7.000.7000.7120.034no
TO6594.000.4000.5810.326yes (0.11)6.000.5000.6080.194no
TO7272.000.8330.486−0.706*no5.000.9000.686−0.296no
TO92518.000.8000.8470.072no10.000.6670.7700.151no
TO5036.000.9670.644−0.487*no4.000.6330.521−0.199no
TO5125.000.3670.322−0.121no7.000.7330.556−0.305no
TO5875.001.0000.712−0.391*no7.000.8670.710−0.204no
TO7375.000.7330.634−0.139no4.000.8670.624−0.375no
TO202.000.4000.320−0.234no4.000.9330.676−0.366no
TO4183.000.0670.065−0.009no3.000.4330.4430.038no
TO5218.000.9330.777−0.185no8.000.8000.738−0.067no
TO7153.000.5000.5830.159no3.000.5000.5290.072no
Mean5.750.5940.5745.500.7040.624
SE1.0310.0680.0500.6580.0450.030

Note: A = number of alleles; FIS = inbreeding coefficient; He = expected heterozygosity; Ho = observed heterozygosity.

*P ≤ 5%; Bonferroni correction was applied, and indicative adjusted P value for 5% nominal level was 0.0031.

Geographical coordinates: Island 58 (48°26′41.4″N, 79°15′51.9″W), Island 134 (48°27′52.5″N, 79°16′19.6″W).

Results of initial primer screening in Thuja occidentalis samples from Lake Duparquet, Lake Duparquet Research & Teaching Forest, Quebec, Canada. Note: A = number of alleles; FIS = inbreeding coefficient; He = expected heterozygosity; Ho = observed heterozygosity. *P ≤ 5%; Bonferroni correction was applied, and indicative adjusted P value for 5% nominal level was 0.0031. Geographical coordinates: Island 58 (48°26′41.4″N, 79°15′51.9″W), Island 134 (48°27′52.5″N, 79°16′19.6″W). The number of different alleles per locus (A), observed heterozygosity (Ho), and expected heterozygosity (He) were calculated in GenAlEx version 6.2 (Peakall and Smouse, 2006). Inbreeding coefficient (FIS) and Hardy–Weinberg equilibrium (HWE) tests were done in FSTAT version 2.9.3 (Goudet, 2001). Null allele presence was checked in MICRO-CHECKER (Van Oosterhout et al., 2004). Mean values for A, Ho, and He were, respectively, 5.75, 0.594, and 0.574 on Island 58, and 5.50, 0.704, and 0.624 on Island 134 (Table 2). FIS ranged from −0.706 to 0.665 on Island 58, and from −0.357 to 0.194 on Island 134 (Table 2).

CONCLUSIONS

Shotgun pyrosequencing has proved to be effective for isolating microsatellite markers in EWC. The four sets of multiplex microsatellite loci that were developed here for the first time will facilitate future studies of population genetics in EWC, including investigating phylogeographic patterns of postglacial expansion in North America, and studying the impacts of habitat fragmentation on population genetic structure and gene flow. They will also help resolve questions regarding regeneration patterns in this species along postfire successions (Bergeron, 2000).
  6 in total

1.  QDD: a user-friendly program to select microsatellite markers and design primers from large sequencing projects.

Authors:  Emese Meglécz; Caroline Costedoat; Vincent Dubut; André Gilles; Thibaut Malausa; Nicolas Pech; Jean-François Martin
Journal:  Bioinformatics       Date:  2009-12-10       Impact factor: 6.937

2.  High-throughput microsatellite isolation through 454 GS-FLX Titanium pyrosequencing of enriched DNA libraries.

Authors:  Thibaut Malausa; André Gilles; Emese Meglécz; Hélène Blanquart; Stéphanie Duthoy; Caroline Costedoat; Vincent Dubut; Nicolas Pech; Philippe Castagnone-Sereno; Christophe Délye; Nicolas Feau; Pascal Frey; Philippe Gauthier; Thomas Guillemaud; Laurent Hazard; Valérie Le Corre; Brigitte Lung-Escarmant; Pierre-Jean G Malé; Stéphanie Ferreira; Jean-François Martin
Journal:  Mol Ecol Resour       Date:  2011-02-21       Impact factor: 7.090

3.  Multiplex Manager 1.0: a cross-platform computer program that plans and optimizes multiplex PCR.

Authors:  Clare E Holleley; Paul G Geerts
Journal:  Biotechniques       Date:  2009-06       Impact factor: 1.993

4.  Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential.

Authors:  S Temnykh; G DeClerck; A Lukashova; L Lipovich; S Cartinhour; S McCouch
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

5.  Comparison of random and SSR-enriched shotgun pyrosequencing for microsatellite discovery and single multiplex PCR optimization in Acacia harpophylla F. Muell. Ex Benth.

Authors:  Olivier Lepais; Cecile F E Bacles
Journal:  Mol Ecol Resour       Date:  2011-03-16       Impact factor: 7.090

6.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

  6 in total
  2 in total

1.  Chloroplast and Nuclear Genetic Diversity Explain the Limited Distribution of Endangered and Endemic Thuja sutchuenensis in China.

Authors:  Zhi Yao; Xinyu Wang; Kailai Wang; Wenhao Yu; Purong Deng; Jinyi Dong; Yonghua Li; Kaifeng Cui; Yongbo Liu
Journal:  Front Genet       Date:  2021-12-23       Impact factor: 4.599

2.  Microsatellite markers: what they mean and why they are so useful.

Authors:  Maria Lucia Carneiro Vieira; Luciane Santini; Augusto Lima Diniz; Carla de Freitas Munhoz
Journal:  Genet Mol Biol       Date:  2016-08-04       Impact factor: 1.771

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.