Literature DB >> 29876074

Development of high transferability cpSSR markers for individual identification and genetic investigation in Cupressaceae species.

Li-Sha Huang1, Yan-Qiang Sun1, Yuqing Jin1, Qiong Gao1, Xian-Ge Hu1, Fu-Ling Gao2, Xiao-Lei Yang3, Ji-Jun Zhu3, Yousry A El-Kassaby4, Jian-Feng Mao1.   

Abstract

Given the low substitution rate in plastomes, the polymorphic and codominant nature of chloroplast SSRs (cpSSRs) makes them ideal markers, complementing their nuclear counterpart. In Cupressaceae, cpSSRs are mostly paternally inherited, thus, they are useful in mating systems and pollen flow studies. Using e-PCR, 92 SSR loci were identified across six Cupressaceae plastomes, and primers were designed for 26 loci with potential interspecific transferability. The 26 developed cpSSRs were polymorphic in four genera, Platycladus, Sabina, Juniperus, and Cupressus and are suitable for Cupressaceae molecular genetic studies and utilization. We genotyped 192 Platycladus orientalis samples from a core breeding population using 10 of the developed cpSSRs and 10 nuclear SSRs, and these individuals were identified with high confidence. The developed cpSSRs can be used in (1) a marker-assisted breeding scheme, specifically when paternity identification is required, (2) population genetics investigations, and (3) biogeography of Cupressaceae and unraveling the genetic relationships between related species.

Entities:  

Keywords:  Cupressaceae; chloroplast SSRs; genetic diversity; haplotype; mating system; paternity test

Year:  2018        PMID: 29876074      PMCID: PMC5980425          DOI: 10.1002/ece3.4053

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

The cypress family, Cupressaceae, is the largest extant conifer family in terms of genera and the third largest in terms of species. The family includes 27 genera (17 of which are monotypic), with 142 species in total. Cupressaceae is also the most widely distributed gymnosperm family, present on all continents except Antarctica. Many of its species are vital sources of timber, and several genera are also important in ecological conservation and horticulture. Platycladus, which contains only one species, Platycladus orientalis, is native to China, Korea, and the Russian Far East (Cheng & Fu, 1978; Hu, Jin, Wang, Mao, & Li, 2015). As a pioneer species, P. orientalis is often used in ecological restoration projects (Jiang, Shi, Niu, & Yue, 2009; Li, He, & Ren, 2011). P. orientalis has a remarkable capability to absorb and accumulate atmospheric pollutants and heavy metal soil pollutants (Chu, 2012); thus, it is widely used for ecological remediation in densely populated cities in north China. Despite its importance, to date, little is known about this species’ population genetic diversity; however, scant nuclear genetic markers information is available (allozymes: (Xie, Dancik, & Yeh, 1992); AFLP: (Wang, Xing, Tang, & Feng, 2011); SSRs: (Jin et al., 2016)), all revealing low genetic diversity. Understanding P. orientalis population genetics and the extent of diversity in its germplasm collections are critical for the development of sound conservation and breeding strategies. The implementation of marker‐assisted breeding methods such as “Breeding without Breeding,” which is based on paternity assignment and pedigree reconstruction, requires the availability of diagnostic genetic markers (El‐Kassaby & Lstibůrek, 2009; El‐Kassaby, Cappa, Liewlaksaneeyanawin, Klápště, & Lstibůrek, 2011). The uniqueness of organelles genomes has been useful in plants population genetics and evolution due to their nonrecombinant nature and uniparental inheritance (Birky, 1995). In plants, the mitochondrial genome is highly variable and has high levels of intramolecular recombination (Olmstead & Palmer, 1994), rendering them challenging research tool (Provan, Powell, & Hollingsworth, 2001). However, the plastome generally lacks recombination and often has the potential to develop genetic makers (Ebert & Peakall, 2009). Furthermore, plastids are paternally inherited in most Cupressaceae species (Sakaguchi, Tsumura, Crisp, Bowman, & Isagi, 2014), making them ideal markers for monitoring gene flow via pollen and effective breeding and phylogeny tools (Vendramin, Lelli, Rossi, & Morgante, 1996). The high level of substitution conservation in the plastome confirms the feasibility of using cpSSRs to reveal their genome diversity. CpSSRs, as genetic markers, were first developed by Powell (Powell, Morgante, Andre et al., 1995), highlighting their high polymorphism and codominant inheritance, making them attractive genetic markers, coupled with the fact that only few loci are needed to identify unique genotypes (Schlotterer & Tautz, 1992). Plastome SSR loci are often distributed throughout the noncoding regions and show greater sequence variation than the coding regions and are characterized by low evolutionary rate and an almost nonexistent recombination rate (Powell, Morgante, Andre et al., 1995; Powell, Morgante, McDevitt, Vendramin, & Rafalski, 1995; Powell, Machray, & Provan, 1996), making them ideal molecular tools to complementing their nuclear markers counterparts (Huang et al., 2015; Provan et al., 2001). CpSSRs have been found to be transferable among related species because the regions flanking the cpSSR loci are conserved (Pan et al., 2014), thus, have the potential to evaluate both inter‐ and intraspecific variability (Powell, Morgante, Andre et al., 1995; Powell, Morgante, McDevitt et al., 1995). Here, we identified 92 cpSSRs loci in the plastomes of Cupressaceae species and developed 26 transferable SSR markers. The primers were screened for intraspecific polymorphism across different genera, and discrimination rate of the polymorphic loci was evaluated by genetic investigation of a core breeding population of P. orientalis. This study presents a set of polymorphic cpSSR markers which are transferable across diverse genera of Cupressaceae and demonstrates their value for genetic discrimination and diversity studies in this family.

MATERIALS AND METHODS

Study species, sample collection, and DNA extraction

To evaluate the polymorphism and interspecific transferability of the cpSSR markers with the designed primers, we sampled 48 individual trees representing Platycladus orientalis (n = 24), Sabina chinensis (n = 8), Juniperus formosana (n = 8), and Cupressus torulosa (n = 8). These trees were collected from different regions of China and planted in the National Tree Breeding Station for Platycladus orientalis in Jiaxian (Forest Farm of Jiaxian County, Henan, China) and the Institute of Forestry and Pomology (Beijing Academy of Agriculture and Forestry Sciences). Additionally, we characterized the genetic variation in the plastomes of a core breeding population (n = 192 parent trees) of P. orientalis growing in a seed orchard located in the National Tree Breeding Station for Platycladus orientalis in Jiaxian. The seed orchard was established in the mid‐1980s’, representing a wide range of P. orientalis. The sampled 192 parent trees originated from three populations (northern, southwestern, and eastern) covering the northern to southern Henan province (lat.: 33.33° (N) to 35.42° (N), long.: 112.17° (E) to 113.51° (E)); the populations are geographically divided by the Yellow River and the Taihang Mountains (the sampled trees 21–136 from a specific forest stand are regarded as a population) (Table 1).
Table 1

Origins of the 192 sampled Platycladus orientalis individuals

PopulationSample sizeLongitude (E)Latitude (N)
North138113.51°35.42°
East21113.43°33.33°
Southwest33112.17°34.36°
Origins of the 192 sampled Platycladus orientalis individuals

SSR detection and primer design

The GMATA2.1 (Wang & Wang, 2016) program for SSR mining and marker development was employed to identify SSR loci in the plastomes of six species Cupressus gigantean (GenBank ID: KT315754.1) (Li, Guo, & Zheng, 2016), Cupressus sempervirens (GenBank ID: KP099643.1), Juniperus monosperma (GenBank ID: NC_024022.1), Juniperus bermudiana (GenBank ID: KF866297.1), Juniperus scopulorum (GenBank ID: NC_024023.1), and Juniperus virgiana (GenBank ID: NC_024024.1) (Guo et al., 2014) using constraints of more than five repeats and a motif length between 2 and 10 bp. Electronic PCR (e‐PCR) refers to the process of recovering unique sequence‐tagged sites in DNA sequences by searching for subsequence that closely match the PCR primers and have the correct order, orientation, and spacing that they could plausibly prime the amplification of a PCR product of the correct molecular weight (Schuler, 1997). Therefore, e‐PCR presents a good tool to evaluate the designed primer pairs for interspecific amplification. We performed e‐PCR to select potentially interspecific transferable SSRs. The selected primers, which showed interspecific transferability in e‐PCR and potential polymorphism, were screened with M13 attached to the 5′ end of the forward primer. Primer3 (Rozen & Skaletsky, 1999) was employed to design primers for the loci with the following constraints: amplicon size of 120 ~ 400 bp, annealing temperature of about 60°C, and flanking region size ≤2,000 bp. Comparison of DNA sequences from the six species up to one hundred kb long and visualization of the alignments with annotations was generated using VISTA (Frazer, Pachter, Poliakov, Rubin, & Dubchak, 2012; Mayor et al., 2000). The feasibility of primer amplification of these 26 e‐PCR‐selected primer pairs was examined in other chloroplast genomes of seven Cupressaceae species from six genera by e‐PCR.

Primer screening and detection of cpSSR polymorphisms

In the initial screening, we amplified the designed markers from mixed DNA of each species across different genera. In a further screening, DNA from individual trees was used to amplify the cpSSR markers which showed clear amplification in the initial step, and polymorphism levels were determined by fluorescent‐based capillary electrophoresis with an ABI 3730 sequencer. PCRs were carried out in a total volume of 20 μl including 10 μl 2 × Taq PCR Mix, 4 μl (4 pmol) fluorescent‐dye‐labeled M13 primer, 2 μl (10 ng) genomic DNA, and 4 μl (4 pmol) mixed complementary forward and reverse primer. PCR amplifications were performed using Bio‐Rad T100™ Thermal Cycler and Bio‐Rad S1000™ Thermal Cycler with the following profile: 4‐min denaturing at 94°C; 20 cycles of 30‐s denaturing at 94°C, 30‐s annealing at 60°C (−0.5°C per cycle), and 45‐s extension at 72°C; 20 cycles of 30‐s denaturing at 94°C, 30‐s annealing at 50°C, and 45‐s extension at 72°C, with a final extension step of 72°C for 5 min. The amplification products were separated on a 1.0% agarose gel at 150 V for 15 min and visualized by GoldView staining. The number of observed alleles (N a), the effective number of alleles (N e), Shannon index (I), diversity index (h), and the unbiased diversity (uh) were calculated using GenALEx version 6.5 (Peakall & Smouse, 2012).

CpSSR evaluation with a core breeding population

We selected 24 primers for genotyping the 192 plastomes of P. orientalis, after excluding two primer pairs (N19 and N29) which displayed low polymorphism in the initial screening. Nuclear SSR polymorphism data for the same sample set were collected from the previous study (Jin et al., 2016). GeneCap version 1.4 (Wilberg & Dreher, 2004) was used to distinguish the different haplotypes by contrasting all alleles. Principal component analysis (PCA) was performed using the “adegenet” package (Jombart, 2008) to reference the redistribution of populations in R (Ihaka & Gentleman, 1996). The minimum genetic distance matrix was calculated using populations‐1.2.32 (http://bioinformatics.org/~tryphon/populations/), and the matrix was used to generate a molecular phylogenetic tree in Splitstree 4.14.2 (Huson, 1998). The neighbor network graph was generated by Splitstree 4.14.2, and the phylogenetic tree graph was displayed by FigTree v1.4.2 (Rambaut, 2012).

RESULTS

SSR analysis and primer design

Ninety‐two SSRs with more than five repeats and a motif length ranging between 2 and 10 bp were identified in the plastomes of six species of two genera, Cupressus and Juniperus. Dinucleotide repeats were the most abundant, with a count of 73 (79.35%), followed by 13 tri‐, 4 nona‐, 1 tetra‐, and 1 hepta‐nucleotide repeats (14.13%, 4.35%, 1.09%, and 1.09%, respectively) (Table S1). The locations of these 92 loci are presented in Figure [Link], [Link], [Link], [Link], [Link], [Link]. In the six species plastomes, the most common dinucleotide repeat was (AT/TA)n and the most common trinucleotides were (AGA/TCT)n and (TTC/GAA)n. After SSR detection, we performed e‐PCR. With e‐PCR, we selected 26 potentially interspecific transferable primer pairs which targeted 22 di‐, 3 tri‐, and 1 nona‐nucleotide SSRs. Among these 26 SSRs, 23 occurred in intergenic spacer regions, two were in coding sequence regions (N6 in ycf1 and N8 in rpoB), and one (N10) in an intron of tmG‐UCC (Table 2, Figure 1).
Table 2

Primer pairs of 26 cpSSRs designed by mining the plastomes of six species

SpeciesSequencesIDLocation T m Product sizeRepeat motif
Cupressus gigantean F: TTCTAGCTCGCACCCAAACT R: TTGTTTCGCCGATATGTTCA N1rbcL/accD IR60.04209(AC)6
Cupressus gigantean F: TGGTCATACCATTGCTGTTCA R: TGGGCTACTCTACGTGCTTT N2rps19/rp122 IR58.82388(AT)5
Cupressus gigantean F: TCGGAAGAAGAAGATGATATGTAGC R: CCCCAGATATGGAACTTTTGG N4rps12 Exon1/clpP IR60.33138(AAGGACAAA)5
Cupressus gigantean F: GGGAACAACCAGAATTGGAA R: GCCACTTTTATGGCACGACT N6ycf159.95365(TA)5
Cupressus gigantean F: GGGAAGCGGAAAGCTATTTT R: GGTAATCCACAGCAGCCAAT N8rpoB59.83265(AT)6
Cupressus gigantean F: CAAATTTCTCGCCAAGCTGT R: TGATTTCATCGGGTCGAATA N9atpF Exon2/atpA IR59.65210(TA)12
Cupressus gigantean F: TCGGGAACGAAAGAGAAAGA R:ACATAGATGTTATGGAGCAGAGC N10trnG‐UCC intron58.74290(CA)5
Cupressus gigantean F: TCCAATCTAGAACATCTCATCCAG R: CAGTGCTCTACCTAATCTGAAAGC N11trnG‐UCC/psaM IR59.21226(AT)6
Cupressus sempervirens F: TCGCCGCAATACTCCTAATC R: ATTCCGAAAAGATGGCTTCA N13rpl33/rps18 IR59.92277(AT)5
Cupressus sempervirens F: TCCATATCTGGTGGACAGGA R: GGGTTTTGGTCTTCTTCTTCG N14rps12 Exon1/clpP IR59.49141(AT)6
Cupressus sempervirens F: CCAGGTCGAGACAAGTGGAT R: GAAACCAATGCCCTAAGCAA N15rrn16/trnI‐GAU IR60.09350(AT)6
Cupressus sempervirens F: ATTCGATCCCTATCCGGTCT R: ACCAGAGCCATCAACCACTC N16psbZ/trnS‐UGA IR59.93253(AT)5
Juniperus bermudiana F: AAAATCGCCGCAATACTCCT R: ATTCCGAAAAGATGGCTTCA N18rpl33/rps18 IR60.29276(AC)6
Juniperus monosperma F: CGGACCTACCGACAGAACTC R: CCGAAGAAATAAGAAGCTGTATAGG N19rps12 Exon1/clpP IR59.35369(AGA)6
Juniperus monosperma F: CTTGCTCCTAGCCATGAAAA R: TGATTTCATCGGGTCGAATAG N20atpF/atpA IR59.01277(AT)5
Juniperus monosperma F: CTGTGATGCCGTTGATATTGA R: TGCCCATTATCCCTCTGTTC N22psbI/psbK IR59.73334(CA)5
Juniperus monosperma F: TCCTCTGCGATCTTTTATAGGG R: GGGAAGGATTGTTGGATTGA N23chlB/trnK‐UUU IR59.72270(AT)6
Juniperus bermudiana F: CCTCATACGGCTTCTCGTTC R: AAAATGAACCCCGAAGGATT N25rps12/trnV‐GAC IR59.74271(TA)5
Juniperus bermudiana F: TGATTTCATCGGGTCGAATA R: CAAATTTCTCGCCAAGCTGT N27aptA/aptF IR59.65314(AT)7
Juniperus bermudiana F: GAGATGAACGCAGAGCGTAA R: TGTTTGAGCGATTCCTACCC N28psbM/trnD‐GUC IR59.63203(AT)7
Juniperus bermudiana F: GCCATGGTAAGGCGTAAGTC R: TCAGTCAATGGGTTAGGTTCA N29trnT‐GGU/psbD IR58.83365(TA)5
Juniperus bermudiana F: GTAGCCAAGTGGTTCCAAGG R: CAATCTTACCGTCGATTCAGC N31trnH‐GUG/chlL IR59.66254(AT)5
Juniperus scopulorum F: GGACGGGAAAGGAAAAGAA R: TGGTATTCTTCTGGGTTTTGG N32rps12 Exon1/clpP IR59.01219(AT)5
Juniperus scopulorum F: CTGTTCCCCTGTGCATCATA R: AGGAGGAAAATCCGTTGGTT N33trnF‐GAA/trnL‐UAA IR59.66350(TCT)5
Juniperus virgiana F: TCAGTCAATGGGTTAGGTTCA R: GCCATGGTAAGGCGTAAGTC N34PsbD/trnT‐GGU IR58.83365(TCT)5
Juniperus virgiana F: TGTTTGAGCGATTCCTACCC R: GAGATGAACGCAGAGCGTAA N35trnD‐GUC/psbM IR59.63203(AT)6

T m, melting temperature; location refers to the genomic location in the plastomes of the six species (Cupressus gigantean, Cupressus sempervirens, Juniperus monosperma, Juniperus bermudiana, Juniperus scopulorum, and Juniperus virgiana); IR, intergenic region.

Figure 1

Comparison of plastomes from six species and the locations of the designed cpSSRs. The SSR loci of 26 primers were marked in the figure. This figure uses the chloroplast genome of Juniperus scopulorum (GenBank ID: NC_024023.1) as the reference, so it acts as the X axis. Alignment 1–5 (rows in each block of alignment) denotes Juniperus bermudiana, Juniperus monosperma, Juniperus virginiana, Cupressus sempervirens, and Cupressus gigantean, respectively. CNS, conserved noncoding sequences

Primer pairs of 26 cpSSRs designed by mining the plastomes of six species T m, melting temperature; location refers to the genomic location in the plastomes of the six species (Cupressus gigantean, Cupressus sempervirens, Juniperus monosperma, Juniperus bermudiana, Juniperus scopulorum, and Juniperus virgiana); IR, intergenic region. Comparison of plastomes from six species and the locations of the designed cpSSRs. The SSR loci of 26 primers were marked in the figure. This figure uses the chloroplast genome of Juniperus scopulorum (GenBank ID: NC_024023.1) as the reference, so it acts as the X axis. Alignment 1–5 (rows in each block of alignment) denotes Juniperus bermudiana, Juniperus monosperma, Juniperus virginiana, Cupressus sempervirens, and Cupressus gigantean, respectively. CNS, conserved noncoding sequences

Chloroplast SSR amplification and polymorphism detection

Initial screening via electrophoresis showed that amplicons were obtained for 24 of the 26 e‐PCR‐selected primer pairs, and amplification of two primer pairs (N14 and N23) was detected only in P. orientalis, S. chinensis, and J. formosana, and not in C. torulosa. Further analyses of the cpSSRs were carried out in these four species. In the second screening step, the amplicons of most of the primer pairs were monomorphic in the C. torulosa, while most of pairs showed polymorphism in J. formosana samples. These results may be due to the different degree of genetic variation among the samples. Seven primer pairs (N2, N11, N13, N16, N18, N25, and N35) demonstrated polymorphism across all four species. Primer pairs N19, N27, and N35 had the maximum effective number of alleles (N e = 5.33), while primer pair N35 generally showed the highest levels of polymorphism across all samples (Table 3). The two loci (N6 and N8) from coding regions and one locus (N10) from an intron region displayed levels of polymorphism comparable with other loci.
Table 3

Characteristics of 26 cpSSR markers in 48 individuals of Platycladus orientalis (n = 24), Sabina chinensis (n = 8), Juniperus formosana (n = 8), and Cupressus torulosa (n = 8)

Marker ID Platycladus orientalis Sabina chinensis Juniperus formosana Cupressus torulosa
N a N e I h uh N a N e I h uh N a N e I h uh N a N e I h uh
N1*73.881.630.740.7911.000.000.000.0043.561.320.720.8211.000.000.000.00
N2*53.061.300.670.7121.280.380.220.2532.460.970.590.6842.291.070.560.64
N4# 22.000.690.500.5311.000.000.000.0032.460.970.590.6811.000.000.000.00
N6~ 53.611.400.720.7621.280.380.220.2543.561.320.720.8211.000.000.000.00
N8*42.170.980.540.5611.000.000.000.0031.680.740.410.4611.000.000.000.00
N9*42.721.140.630.6821.280.380.220.2511.000.000.000.0011.000.000.000.00
N10# 21.920.670.480.5121.280.380.220.2554.571.560.780.8911.000.000.000.00
N11*21.600.560.380.4021.280.380.220.2543.561.320.720.8232.130.900.530.61
N13*43.431.290.710.7432.130.900.530.6132.130.900.530.6121.600.560.380.43
N14~ 64.481.640.780.8543.561.320.720.8254.001.490.750.8600.000.000.000.00
N15^ 65.001.700.800.8921.280.380.220.2554.001.490.750.8611.000.000.000.00
N16# 32.460.970.590.6321.280.380.220.2554.001.490.750.8621.600.560.380.43
N18# 32.460.970.590.6321.280.380.220.2553.201.390.690.7921.600.560.380.43
N1911.000.000.000.0021.280.380.220.2565.331.730.810.9311.000.000.000.00
N20*32.460.970.590.6321.280.380.220.2554.571.560.780.8911.000.000.000.00
N22# 21.800.640.440.4621.280.380.220.2542.911.210.660.7511.000.000.000.00
N23~ 32.310.940.570.6411.000.000.000.0021.280.380.220.2511.000.000.000.00
N25# 42.721.140.630.6821.280.380.220.2532.130.900.530.6121.280.380.220.25
N27*21.280.380.220.2321.280.380.220.2565.331.730.810.9311.000.000.000.00
N28~ 41.810.890.450.4931.680.740.410.4654.571.560.780.8900.000.000.000.00
N2921.130.230.120.1321.280.380.220.2532.460.970.590.6811.000.000.000.00
N31^ 22.000.690.500.5321.600.560.380.4354.001.490.750.8611.000.000.000.00
N32~ 54.261.510.770.8253.201.390.690.7964.571.670.780.8900.000.000.000.00
N33*54.241.520.760.8321.280.380.220.2542.911.210.660.7511.000.000.000.00
N34*22.000.690.500.5321.280.380.220.2532.911.080.660.7511.000.000.000.00
N35^ 32.671.040.630.8321.280.380.220.2565.331.730.810.9321.380.450.280.33

N a, observed number of alleles; N e, effective number of alleles; I, Shannon's information index; h, diversity; uh, unbiased diversity; *, #, ~, and ^ indicating polymorphic, monomorphic, low amplification rate, low specificity markers, respectively, characterized in a core breeding population of 192 individuals of P. orientalis.

Characteristics of 26 cpSSR markers in 48 individuals of Platycladus orientalis (n = 24), Sabina chinensis (n = 8), Juniperus formosana (n = 8), and Cupressus torulosa (n = 8) N a, observed number of alleles; N e, effective number of alleles; I, Shannon's information index; h, diversity; uh, unbiased diversity; *, #, ~, and ^ indicating polymorphic, monomorphic, low amplification rate, low specificity markers, respectively, characterized in a core breeding population of 192 individuals of P. orientalis. In addition, we also examined the feasibility of primer amplification of the 26 e‐PCR‐selected primer pairs in other chloroplast genomes of seven Cupressaceae species from six genera by e‐PCR. Eighteen primer pairs (N1, N2, N4, N6, N8, N9, N10, N11, N13, N16, N18, N19, N22, N25, N31, N33, N34, and N35) amplified successfully in the genome of Cupressus jiangeensis. Four pairs (N8, N9, N16, and N20) and three pairs (N2, N18, and N29) amplified successfully in the genomes of Calocedrus formosana and Thujopsis dolabrata, respectively. Five primer pairs (N1, N2, N9, N18, and N34), two pairs (N9 and N34), one pair (N18), and two pairs (N27 and N29) amplified successfully in the genomes of Thuja standishii, Cunninghamia lanceolata, Calocedrus macrolepis, and Hesperocyparis glabra, respectively (Table S2).

Application of cpSSR markers in genetic variation analysis

Among the 24 primer pairs characterized in the P. orientalis core breeding population (n = 192), ten pairs (N1, N2, N8, N9, N11, N13, N20, N27, N33, and N34) amplified well and showed polymorphism while the remaining were monomorphic and exhibited low amplification and specificity (Table 3). The performance of these ten primer pairs was uneven, and they displayed different degrees of polymorphism (Table 4). The core breeding population's genetic parameters were as follows: number of observed alleles (N a) ranged from 2 to 9, effective number of alleles (N e) ranged from 1.021 to 2.743, Shannon's information index (I) ranged from 0.058 to 1.294, diversity index (h) ranged from 0.021 to 0.635, and unbiased diversity (uh) ranged from 0.021 to 0.639 (Table 4). Interestingly, locus N8, from the rpoB gene coding region, displayed fairly high degree of diversity. Although only some of the loci produced lower polymorphisms, they nevertheless collectively presented a wide range of polymorphism. Ten polymorphic cpSSR loci revealed 134 unique genotypes in the 192 sampled trees. Statistical confidence for individual identification using these ten loci was moderate (p ID = .0086). Thus, not every tree had a unique multilocus genotype, and 28 of the 134 genotypes were assigned to 2–7 trees, with a total of 86 trees sharing genotypes.
Table 4

Characterization of 10 polymorphic cpSSR loci from 192 individuals of Platycladus orientalis

LociAllele size (bp) N a N e I h uh
N1247, 255, 283, 313, 343, 37162.7431.1810.6350.639
N2354, 356, 384, 416, 44851.1140.2770.1020.103
N8279, 281, 283, 285, 287, 289, 291, 293, 29992.4471.2940.5910.595
N9306, 308, 310, 312, 314, 31661.2290.4550.1870.188
N11250, 252, 284, 29241.4170.5330.2940.296
N13293, 29921.0510.1160.0480.049
N20211, 213, 215, 217, 219, 221, 223, 22581.3950.6780.2830.285
N27308, 310, 312, 314, 316, 218, 33071.2580.4900.2050.206
N33359, 413, 419, 44041.8420.8090.4570.460
N34362, 38621.0210.0580.0210.021

N a, observed number of alleles; N e, effective number of alleles; I, Shannon's information index; h, diversity; uh, unbiased diversity.

Characterization of 10 polymorphic cpSSR loci from 192 individuals of Platycladus orientalis N a, observed number of alleles; N e, effective number of alleles; I, Shannon's information index; h, diversity; uh, unbiased diversity. According to the polymorphism information of the ten loci, PCA partitioned 72.74 4.33, and 2.94% of the variance in the data along the first three axes, respectively, collectively accounting for 80.01% of the total variation. PCA did not show clear separation of the genotypes into discrete clusters (Figure S7). In the generated dendrogram, the 192 samples were grouped into seven main clades (Figure S8: note individuals color correspond to population origin). We combined the ten polymorphic cpSSR loci with ten previously developed nuclear SSR loci (Jin et al., 2016) to further investigate the efficiency of these combined markers in individual discrimination and population genetic analysis. The statistical confidence for individual identification was high (p ID = 0.0000545), and we obtained 192 unique genotypes corresponding to the 192 genotyped trees (i.e., each individual has a unique fingerprint). However, no clear genetic separation among the populations was observed in either the PCA, network, or phylogenetic tree. PCA partitioned 67.15, 3.21, and 3.06% of the variance along the first three components, respectively, and collectively accounting for 73.42% of the total variation (Figure 2). The neighbor network based on genetic distances among individuals produced a mixed network with no clear geographic structuring (Figure 3). All the sampled individuals were extensively mixed with each other and shared large amounts of recent admixtures (Figure 3).
Figure 2

Principal component analysis (PCA) of Platycladus orientalis populations’ plastomes based on the polymorphism information of 10 cpSSRs and 10 nuclear SSRs

Figure 3

Neighbor network of Platycladus orientalis individuals. The trees were marked with different colors indicating the geographic information

Principal component analysis (PCA) of Platycladus orientalis populations’ plastomes based on the polymorphism information of 10 cpSSRs and 10 nuclear SSRs Neighbor network of Platycladus orientalis individuals. The trees were marked with different colors indicating the geographic information

DISCUSSION

To establish a sound cpSSR genotyping platform for Cupressaceae species, we developed 26 polymorphic and interspecific transferable cpSSR markers for Cupressaceae species by whole plastome analyses and experimental examination. We further assessed intraspecific chloroplast diversity of 192 P. orientalis samples and demonstrated high confidence for individual identification using cpSSR accompanying with small set of nuclear SSRs. The cpSSRs found in this study are valuable genetic markers for its high degree of interspecific transferability and polymorphism and encourage the use of cpSSRs in population genetic research and coniferous tree breeding. CpSSRs are ideal molecular tools complementing nuclear genetic markers in phylogenetic and population genetic studies (El‐Kassaby & Lstibůrek, 2009; El‐Kassaby et al., 2011) and can potentially be used to study mating systems and track gene flow for widespread and ecologically/economically important conifer species for which plastomes are paternally inherited (Vendramin et al., 1996).

E‐PCR is a critical step to predicate the transferable SSRs

The high quality, versatility, and applicability of our primers demonstrate the viability of the implemented method for developing SSR loci. We first identified and characterized SSRs along with gene features in each given genome/sequence (Wang & Wang, 2016). In the following step, simulated marker mapping (e‐mapping) was performed across all genomes/sequences using a forward e‐PCR algorithm, allowing evaluating the transferability of the cpSSRs and calculating the potential intergenomic/intersequence polymorphism of each developed SSR locus (Schuler, 1997). Finally, primers were designed for SSR loci were chosen for their transferability and potential polymorphism. The e‐PCR step is unique compared with other SSR markers development studies. The high rate of polymorphism attained in the present study (100%) indicates that e‐PCR may be a key step providing valuable information to increasing the rate of generating polymorphic SSR markers. The amplification of 26 pairs of primers detected by e‐PCR in seven Cupressaceae species (Cupressus jiangeensis, Calocedrus formosana, Thujopsis dolabrata, Thuja standishii, Cunninghamia lanceolata, Calocedrus macrolepis, and Hesperocyparis glabra) showed strong versatility of the primers and also provided important instructions for the use of molecular markers for these species.

Variation across the chloroplast genome

Formed through a process of mutation known as slippage replication, SSRs are believed to be a key source of genetic variation for plastomes (Provan et al., 2001). Codon sequences are highly conserved in chloroplasts, whereas intron and intergenic regions are variable. In the present study, we found that most of the polymorphic cpSSRs were located in intergenic or intron regions (23 loci in intergenic spacer regions and one in an intron), which is consistent with previous studies (Powell, Morgante, Andre et al., 1995; Powell, Morgante, McDevitt et al., 1995; Powell et al., 1996; Provan et al., 2001). The distribution of polymorphisms indicates that the genetic relationship between samples is the significant factor determining the variation in the number of repeats. Successful amplification demonstrated the viability of the primers, which were designed using the plastome of related species. Some primers may generate PCR products among unrelated species; however, because the loci are highly conserved, it is expected that there will be no polymorphism among unrelated species (Cheng et al., 2005). We also found that some of the primers failed to show polymorphism between species, which may result from the proximity of the cpSSR locus to a very conservative coding sequence. Most interestingly, we found that the two loci from coding regions (N6 and N8) displayed polymorphism at a level comparable with other loci. N6 is located in the coding region of ycf1, one of the two largest ORFs in the plastome, which encodes the TIC214 protein involved in protein precursor import into chloroplasts (Drescher, Ruf, Calsa, Carrer, & Bock, 2000). Ycf1 products are essential for cell survival (Kikuchi et al., 2013), and it is among the four genes (together with matK, ndhF, and ccsA) in which single‐nucleotide variations and insertion/deletion frequencies were clearly higher than average, showing a signature of positive selection (Daniell, Lin, Yu, & Chang, 2016). Ycf1 was also identified as the most promising plastid DNA barcode for land plants (Dong et al., 2015). N8 is located in the coding region of rpoB, which encodes the beta subunit of a multisubunit RNA polymerase (Little & Hallick, 1988). RpoB was also identified as a marker suitable for phylogenetic study due to its relatively high substitution rate in this sequence (Olmstead & Palmer, 1994). Our study thus provides more evidence of SSR‐mediated insertion/deletion variation in two highly variable chloroplast genes (ycf1 and rpoB) in Cupressaceae species.

CpSSR and its application in P. orientalis

Pollen dispersal is an important mechanism for long‐distance gene flow in conifers (Adams, 1992) and potentially plays a major role in maintaining genetic diversity (El‐Kassaby & Davidson, 1991; O'Connell, Mosseler, & Rajora, 2007) and, in turn, promoting adaptive evolution and positive response to rapid climate change (Kremer et al., 2012). Paternity analysis and tracking the origin of pollen also have significant implication for studies of mating systems, conservation genetics, and breeding (Chaisurisri & El‐Kassaby, 1994; El‐Kassaby, 1995). SSRs have already been recognized as an efficient means for paternity analysis in conifer species (Cato & Richardson, 1996; El‐Kassaby et al., 2011). To enrich the genetic toolkit, we developed a set of cpSSRs for the Cupressaceae and evaluated their application in population genetic studies by analyzing a core breeding population of P. orientalis. In the present study, 86 trees shared 28 genotypes over 10 polymorphic cpSSR loci, but each individual owns unique fingerprint by adding the polymorphic information of nuclear SSR. Low levels of differentiation among the 192 sampled trees had already been revealed in a previous study in which 10 polymorphic nuclear SSRs were used (Jin et al., 2016); we also obtained similar results with cpSSR, and the results of these two primers were also consistent with the above results. It has been suggested that cpSSR markers represent ideal molecular tools to complement nuclear genetic markers (Huang et al., 2015) in investigations of population genetics and biogeography and to unravel the genetic relationships of closely related species. Further analysis can be carried on combining the cpSSR with nuclear SSR, as these primers have significant discrimination rate on individual identification. Finally, the utility of the developed cpSSRs along with the available nuclear SSRs in paternity assignment and pedigree reconstruction cannot be understated, specifically when applied in marker‐assisted breeding schemes that aimed at simplifying the breeding process and speeding generation turnover for capturing greater gains (El‐Kassaby & Lstibůrek, 2009; El‐Kassaby et al., 2011).

CONCLUSION

We developed 26 polymorphic cpSSR markers for Cupressaceae species. The evaluation with 192 P. orientalis samples of a core breeding population demonstrated moderate confidence of individual identification, but significantly high discrimination rate was attained after combining with 10 nuclear SSRs. The high degree of interspecific transferability and polymorphism of the developed cpSSRs proved that cpSSRs are a powerful tool for population genetics and breeding of conifer species.

CONFLICT OF INTEREST

None declared.

AUTHOR CONTRIBUTIONS

Conceived and designed the experiments: JFM. Performed the experiments: LSH YJ. Analyzed the data: YQS YJ XGH. Contributed reagents/materials/analysis tools: QG FLG XLY JJZ. Wrote the paper: LSH JFM YAEK.

DATA ACCESSIBILITY

Primers designed and evaluated: main body of text (Table 2). Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  33 in total

1.  The two largest chloroplast genome-encoded open reading frames of higher plants are essential genes.

Authors:  A Drescher; S Ruf; T Calsa; H Carrer; R Bock
Journal:  Plant J       Date:  2000-04       Impact factor: 6.417

2.  Slippage synthesis of simple sequence DNA.

Authors:  C Schlötterer; D Tautz
Journal:  Nucleic Acids Res       Date:  1992-01-25       Impact factor: 16.971

3.  adegenet: a R package for the multivariate analysis of genetic markers.

Authors:  Thibaut Jombart
Journal:  Bioinformatics       Date:  2008-04-08       Impact factor: 6.937

4.  A set of primers for the amplification of 20 chloroplast microsatellites in Pinaceae.

Authors:  G G Vendramin; L Lelli; P Rossi; M Morgante
Journal:  Mol Ecol       Date:  1996-08       Impact factor: 6.185

5.  Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species.

Authors:  Daniel Ebert; Rod Peakall
Journal:  Mol Ecol Resour       Date:  2009-01-28       Impact factor: 7.090

6.  Polymorphic simple sequence repeat regions in chloroplast genomes: applications to the population genetics of pines.

Authors:  W Powell; M Morgante; R McDevitt; G G Vendramin; J A Rafalski
Journal:  Proc Natl Acad Sci U S A       Date:  1995-08-15       Impact factor: 11.205

7.  Breeding without breeding.

Authors:  Yousry A El-Kassaby; Milan Lstibůrek
Journal:  Genet Res (Camb)       Date:  2009-04       Impact factor: 1.588

Review 8.  Long-distance gene flow and adaptation of forest trees to rapid climate change.

Authors:  Antoine Kremer; Ophélie Ronce; Juan J Robledo-Arnuncio; Frédéric Guillaume; Gil Bohrer; Ran Nathan; Jon R Bridle; Richard Gomulkiewicz; Etienne K Klein; Kermit Ritland; Anna Kuparinen; Sophie Gerber; Silvio Schueler
Journal:  Ecol Lett       Date:  2012-02-28       Impact factor: 9.492

9.  Breeding without breeding: is a complete pedigree necessary for efficient breeding?

Authors:  Yousry A El-Kassaby; Eduardo P Cappa; Cherdsak Liewlaksaneeyanawin; Jaroslav Klápště; Milan Lstibůrek
Journal:  PLoS One       Date:  2011-10-03       Impact factor: 3.240

Review 10.  Chloroplast genomes: diversity, evolution, and applications in genetic engineering.

Authors:  Henry Daniell; Choun-Sea Lin; Ming Yu; Wan-Jung Chang
Journal:  Genome Biol       Date:  2016-06-23       Impact factor: 13.583

View more
  4 in total

1.  Comparative Analysis of the Complete Chloroplast Genomes in Allium Section Bromatorrhiza Species (Amaryllidaceae): Phylogenetic Relationship and Adaptive Evolution.

Authors:  Junpei Chen; Dengfeng Xie; Xingjin He; Yi Yang; Xufeng Li
Journal:  Genes (Basel)       Date:  2022-07-19       Impact factor: 4.141

2.  Development and Characterization of Microsatellite Markers Based on the Chloroplast Genome of Tree Peony.

Authors:  Qi Guo; Lili Guo; Yuying Li; Haijing Yang; Xiaoliang Hu; Chengwei Song; Xiaogai Hou
Journal:  Genes (Basel)       Date:  2022-08-26       Impact factor: 4.141

3.  Molecular markers from the chloroplast genome of rose provide a complementary tool for variety discrimination and profiling.

Authors:  Changhong Li; Yongqi Zheng; Ping Huang
Journal:  Sci Rep       Date:  2020-07-22       Impact factor: 4.379

4.  Initial Characterization of the Chloroplast Genome of Vicia sepium, an Important Wild Resource Plant, and Related Inferences About Its Evolution.

Authors:  Chaoyang Li; Yunlin Zhao; Zhenggang Xu; Guiyan Yang; Jiao Peng; Xiaoyun Peng
Journal:  Front Genet       Date:  2020-02-20       Impact factor: 4.599

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.