| Literature DB >> 24415843 |
Braulio J Soto-Cerda, Sylvie Cloutier.
Abstract
Genomic microsatellites (gSSRs) and expressed sequence tag-derived SSRs (EST-SSRs) have gained wide application for elucidating genetic diversity and population structure in plants. Both marker systems are assumed to be selectively neutral when making demographic inferences, but this assumption is rarely tested. In this study, three neutrality tests were assessed for identifying outlier loci among 150 SSRs (85 gSSRs and 65 EST-SSRs) that likely influence estimates of population structure in three differentiated flax sub-populations (FST = 0.19). Moreover, the utility of gSSRs, EST-SSRs, and the combined sets of SSRs was also evaluated in assessing genetic diversity and population structure in flax. Six outlier loci were identified by at least two neutrality tests showing footprints of balancing selection. After removing the outlier loci, the STRUCTURE analysis and the dendrogram topology of EST-SSRs improved. Conversely, gSSRs and combined SSRs results did not change significantly, possibly as a consequence of the higher number of neutral loci assessed. Taken together, the genetic structure analyses established the superiority of gSSRs to determine the genetic relationships among flax accessions, although the combined SSRs produced the best results. Genetic diversity parameters did not differ statistically (P > 0.05) between gSSRs and EST-SSRs, an observation partially explained by the similar number of repeat motifs. Our study provides new insights into the ability of gSSRs and EST-SSRs to measure genetic diversity and structure in flax and confirms the importance of testing for the occurrence of outlier loci to properly assess natural and breeding populations, particularly in studies considering only few loci.Entities:
Keywords: Adaptive variation; Linum usitatissimum; Microsatellites; Neutral variation; Outlier loci; Population structure
Year: 2013 PMID: 24415843 PMCID: PMC3881565 DOI: 10.1007/s11105-013-0568-1
Source DB: PubMed Journal: Plant Mol Biol Report ISSN: 0735-9640 Impact factor: 1.595
Genetic diversity analysis of three flax sub-populations identified by STRUCTURE
| Overall | South Asian | South American | North American |
| |
|---|---|---|---|---|---|
| Allele number | 408 | 275 | 345 | 354 | 0.0001* |
| Allele/locus | 2.72 | 1.83 | 2.30 | 2.36 | 0.0001* |
| Major allele frequency | 0.75 | 0.81 | 0.77 | 0.78 | 0.0897 n.s. |
| Exp. heterozygosity | 0.35 | 0.26 | 0.32 | 0.31 | 0.0198* |
| Allelic richnessa | 2.71 | 1.81 | 2.21 | 2.03 | 0.0001* |
| Private allelesa | – | 17 | 41 | 20 | 0.0001* |
| Pop. specific | 0.19 | 0.30 | 0.14 | 0.19 | 0.0001* |
| PIC value | 0.30 | 0.21 | 0.27 | 0.26 | 0.0069* |
n.s. nonsignificant
aCorrected by population size (n = 10)
*P < 0.05, statistical significance was tested by ANOVA
Candidate SSR outlier loci for balancing selection between three STRUCTURE sub-populations of flax
| Locus name | Accession numbera | Outlier test | BLASTn homology against nt | BLASTx homology against nr | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ln RHc | Nucleotide sequence | Identity (%) |
| Reference | Amino acid sequence | Identity (%) |
| Reference | ||
| LGM19 | EU831048 |
| −2.15 |
| 72 | 6 | XM002320441 |
| 70 | 1 | XP002525312 |
| LGM26 | EU830802 |
| −1.87 n.s. | No hits found | – | – | – | No hits found | – | – | – |
| LGM45A | EU829744 |
| −2.01 |
| 84 | 8 | CU228355 |
| 77 | 5 | XP002524642 |
| LM52 | EX720477 |
| −2.01 |
| 83 | 4 | XM002306612 |
| 59 | 2 | XP002306648 |
| LM70 | EH791974 |
| −3.71 | No hits found | – | – | – | No hits found | – | – | – |
| LM73 | EH791736 |
| −1.98 |
| 100 | 1 | AB096099 | No hits found | – | – | – |
aGenBank accession number from which the SSR marker was designed
b P value significance determined using LOSITAN (Beaumont and Nichols 1996)
cSignificant value ln RH calculated according to Kauer et al. (2003). Values between −1.96 and 1.96 are not significant
Fig. 1SSR neutrality tests in the three STRUCTURE sub-populations. a LOSITAN: distribution of empirical F ST values is shown as a function of expected heterozygosity. The upper and lower dashed lines indicate the 99 and 1 % confidence limits, respectively. The intermediate dashed line depicts the median value. b Bayescan: distribution of log-transformed Bayes factors and locus specific F ST. The dashed and solid lines indicate log10 BF of 1.5 and 2 corresponding to posterior probabilities of locus effects of 0.97 and 0.99, respectively. c Standardized ln RH: comparison between STRUCTURE sub-populations II and III. Dashed lines represent the 95 % confidence interval
Bottleneck analysis for the putative neutral and balancing SSR loci in flax
| Neutral (144 loci) | Balancing (6 loci) | |||||||
|---|---|---|---|---|---|---|---|---|
| Sign test | One-tailed Wilcoxon | Sign test | One-tailed Wilcoxon | |||||
| SSM | TPM | SSM | TPM | SSM | TPM | SSM | TPM | |
| Sub-population I (10) | 1 × 10−3** | 3 × 10−4*** | 1 × 10−5*** | 2.6 × 10−4*** | 0.340 | 0.273 | 0.093 | 0.093 |
| Sub-population II (18) | 0.177 | 2 × 10−3** | 0.107 | 1.7 × 10−4*** | 0.671 | 0.643 | 0.562 | 0.562 |
| Sub-population III (32) | 0.299 | 3.2 × 10−4*** | 0.310 | 2 × 10−3** | 0.570 | 0.610 | 0.953 | 0.921 |
Sign test and one-tailed Wilcoxon values are P values for heterozygosity excess. The analysis was performed for two models of mutation, step-wise mutation model (SMM) and two-phase mutation model (TPM). Numbers in brackets represent the number of accessions per STRUCTURE sub-population
**P < 0.01; ***P < 0.001
Fig. 2Estimation of the hypothetical ancestral groups for 60 flax accessions based on gSSRs, EST-SSRs, and combined SSRs using STRUCTURE with (left panel) and without (right panel) the six outlier loci. Each individual is represented by a vertical column partitioned into K colored segments proportional to their ancestry probability to each sub-population. Dashed lines indicate the reference sub-populations identified by combined SSRs