| Literature DB >> 32543001 |
William J Gammerdinger1, Melissa A Toups1, Beatriz Vicoso1.
Abstract
Sewall Wright developed FST for describing population differentiation and it has since been extended to many novel applications, including the detection of homomorphic sex chromosomes. However, there has been confusion regarding the expected estimate of FST for a fixed difference between the X- and Y-chromosome when comparing males and females. Here, we attempt to resolve this confusion by contrasting two common FST estimators and explain why they yield different estimates when applied to the case of sex chromosomes. We show that this difference is true for many allele frequencies, but the situation characterized by fixed differences between the X- and Y-chromosome is among the most extreme. To avoid additional confusion, we recommend that all authors using FST clearly state which estimator of FST their work uses.Entities:
Keywords: zzm321990Fzzm321990STzzm321990; sex chromosomes
Mesh:
Year: 2020 PMID: 32543001 PMCID: PMC7689734 DOI: 10.1111/1755-0998.13210
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 7.090
Description of values for p and q in males and females
| Subpopulation 1 (males) | Subpopulation 2 (females) | Average of the subpopulations |
| |
|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
FIGURE 1Various estimates of F ST for a fixed difference between the X‐ and Y‐chromosome when (a) using equal subpopulation sample sizes for two subpopulations, males and females, and (b) using unequal subpopulation sample sizes for the two subpopulations, males and females, while keeping the total sample size constant
FIGURE 2Comparison of the nonzero components of π B in Hudson, Slatkin, et al. (1992) estimator and π T in Nei’s (1973) estimator for biallelic SNPs in (a) two subpopulations and (b) an XY system. Each bar under the alleles represents a nonzero comparison that occurs in the formulation of π B or π T. The curly bracket beneath females in the sex chromosome comparison illustrates that females are homomorphic for this allele despite being diploid and thus only one nonzero comparison is made
FIGURE 3Visualizations of Nei (1973), Hudson, Slatkin, et al. (1992) and the difference between the two estimators. (a) Estimates of F ST using the Nei (1973) estimator with white being no differentiation and dark blue being complete differentiation. (b) Estimates of F ST using the Hudson, Slatkin, et al. (1992) estimator given infinitely large subpopulation sizes with white being no differentiation and dark blue being complete differentiation. (c) A heatmap of the difference between Hudson, Slatkin, et al. (1992) estimator and Nei’s (1973) estimator for F ST (Hudson, Slatkin, et al. (1992) minus Nei (1973)) given infinitely large subpopulation sample sizes and the allele frequencies of p in subpopulations 1 and 2. Warmer colours show more difference between the estimators, while cooler colours show less difference between the estimators. Because the assignment of p and q along with subpopulation 1 and 2 is arbitrary, we have placed black boxes at all of the locations that could fit the description of a fixed difference between the X‐ and Y‐chromosome and provided an arrow to the scenario we outlined in Table 1. Dotted lines show an F ST estimate equal to 0.1, dashed lines show an F ST estimate equal to 0.5 and solid lines show an F ST estimate equal to 0.9. Black dotted, dashed and solid lines are used to signify Nei’s (1973) estimator in panels (a) and (c), while purple dotted, dashed and solid lines are used to signify Hudson, Slatkin, et al. (1992) estimator in panels (b) and (c)
Software packages for estimating F ST and their estimates using mock input. These input files contained fixed differences between the X‐ and Y‐chromosome for various sample sizes of males and females
| Package (version) | Option | 5 Males and 5 females | 10 Males and 10 females | 20 Males and 20 females | 5 Males and 15 females | 15 Males and 5 females | Referenced estimator |
|---|---|---|---|---|---|---|---|
|
| weir‐fst‐pop | 0.5 | 0.5 | 0.5 | 0.667 | 0.4 | Weir and Cockerham, ( |
|
| F_ST.stats: nucleotide.F_ST | 0.444 | 0.474 | 0.487 | 0.444 | 0.483 | Hudson, Slatkin, et al. ( |
| F_ST.stats: nuc.F_ST.pairwise | 0.444 | 0.474 | 0.487 | 0.444 | 0.483 | Hudson, Slatkin, et al. ( | |
| F_ST.stats: Nei.G_ST | 0.333 | 0.333 | 0.333 | 0.333 | 0.333 | Nei ( | |
| F_ST.stats: Nei.G_ST.pairwise | 0.333 | 0.333 | 0.333 | 0.333 | 0.333 | Nei ( | |
| F_ST.stats: Hudson.H_ST | 0.296 | 0.316 | 0.325 | 0.45 | 0.163 | Hudson, Boos, et al. ( | |
| F_ST.stats: Hudson.G_ST | 0.286 | 0.310 | 0.322 | 0.378 | 0.195 | Hudson, Boos, et al. ( | |
|
| diffCalc(fst = TRUE) | 0.5 | 0.5 | 0.5 | 0.667 | 0.4 | Weir and Cockerham ( |
| diffCalc() | 0.286 | 0.310 | 0.322 | 0.3023 | 0.3023 | Nei and Chesser ( | |
|
| pairwise.fst | 0.333 | 0.333 | 0.333 | 0.429 | 0.2 | Nei ( |
| genet.dist(method = Nei87) | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | Nei ( | |
| pairwise.neifst | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | Nei ( | |
| basic.stats(fst) | 0.333 | 0.333 | 0.333 | 0.333 | 0.333 | Nei ( | |
| genet.dist(method = WC84) | 0.5 | 0.5 | 0.5 | 0.667 | 0.4 | Weir and Cockerham, ( | |
| pairwise.WCfst | 0.5 | 0.5 | 0.5 | 0.667 | 0.4 | Weir and Cockerham, ( | |
|
| Fst | 0.5 | 0.5 | 0.5 | 0.667 | 0.4 | Weir and Cockerham, ( |
|
| Compute pairwise FST | 0.444 | 0.474 | 0.487 | 0.647 | 0.362 | Excoffier, Smouse, and Quattro ( |
|
| Gene Flow and Genetic Differentiation: GST | 0.286 | 0.310 | 0.322 | 0.378 | 0.194 | Nei ( |
| Gene Flow and Genetic Differentiation: GammaSt | 0.333 | 0.333 | 0.333 | 0.429 | 0.2 | Nei ( | |
| Gene Flow and Genetic Differentiation: Fst | 0.444 | 0.474 | 0.487 | 0.444 | 0.483 | Hudson, Slatkin, et al. ( |
This implementation appears to use a weighting factor.
These estimates are most consistent with Nei and Chesser (1983), which is also discussed in Hudson, Boos, et al. (1992).
These metrics appear to use a weighting factor, while Nei (1982) and Nei and Chesser (1983) state that in most practices the subpopulations can be assumed to be weighted equally.
The referenced estimator is consistent with F′ST in Nei (1987).