Literature DB >> 36067235

Polygenic signals of sex differences in selection in humans from the UK Biobank.

Filip Ruzicka¹, Luke Holman^2,3, Tim Connallon¹.

Abstract

Sex differences in the fitness effects of genetic variants can influence the rate of adaptation and the maintenance of genetic variation. For example, "sexually antagonistic" (SA) variants, which are beneficial for one sex and harmful for the other, can both constrain adaptation and increase genetic variability for fitness components such as survival, fertility, and disease susceptibility. However, detecting variants with sex-differential fitness effects is difficult, requiring genome sequences and fitness measurements from large numbers of individuals. Here, we develop new theory for studying sex-differential selection across a complete life cycle and test our models with genotypic and reproductive success data from approximately 250,000 UK Biobank individuals. We uncover polygenic signals of sex-differential selection affecting survival, reproductive success, and overall fitness, with signals of sex-differential reproductive selection reflecting a combination of SA polymorphisms and sexually concordant polymorphisms in which the strength of selection differs between the sexes. Moreover, these signals hold up to rigorous controls that minimise the contributions of potential confounders, including sequence mapping errors, population structure, and ascertainment bias. Functional analyses reveal that sex-differentiated sites are enriched in phenotype-altering genomic regions, including coding regions and loci affecting a range of quantitative traits. Population genetic analyses show that sex-differentiated sites exhibit evolutionary histories dominated by genetic drift and/or transient balancing selection, but not long-term balancing selection, which is consistent with theoretical predictions of effectively weak SA balancing selection in historically small populations. Overall, our results are consistent with polygenic sex-differential-including SA-selection in humans. Evidence for sex-differential selection is particularly strong for variants affecting reproductive success, in which the potential contributions of nonrandom sampling to signals of sex differentiation can be excluded.

Entities: Chemical

Mesh：

Year: 2022 PMID： 36067235 PMCID： PMC9481184 DOI： 10.1371/journal.pbio.3001768

Source DB: PubMed Journal: PLoS Biol ISSN： 1544-9173 Impact factor: 9.593

Introduction

Adaptation of a population to its environment requires heritable genetic variation for fitness [1]. Although many populations show substantial genetic variation for fitness components [2]—including life history traits such as maturation rate, lifespan, mating success, and fertility [2,3]—genetic trade-offs between components or between different types of individuals in a population, limit adaptive potential [4]. For example, a mutation that increases the probability of survival to adulthood might simultaneously decrease adult reproductive success (e.g., [5]), weakening the mutation’s net fitness effect [4]. In addition to slowing adaptation [6-8], genetic trade-offs can increase standing genetic variation [2,9], give rise to balancing selection [10,11], and favour evolutionary transitions between mating systems [12,13], modes of sex determination [14], and genome structures [15-18]. Sexually antagonistic (SA) genetic polymorphisms—in which the alleles that benefit one sex are harmful to the other—are a type of genetic trade-off that may be common in sexually reproducing species [19]. Theory shows that SA polymorphisms are likely to arise when mutations differentially affect trait expression in each sex or when mutations similarly affect traits under divergent directional selection between the sexes [20]. Empirical quantitative genetic studies imply that both conditions are frequently met in nature [21-24] and, accordingly, that SA polymorphisms contribute to phenotypic variation in a range of plant and animal populations (e.g., [25-27]), including humans [28-31]. Although there is now abundant evidence that SA polymorphisms contribute to phenotypic variation, efforts to identify and characterise SA alleles in genomic data face 2 formidable challenges [32]. First, methods using explicit fitness measurements to identify SA polymorphisms (e.g., genome-wide association studies (GWAS) of fitness [33]) are rarely feasible, because it is challenging to obtain fitness measurements for large numbers of genotyped individuals under natural conditions [2]. Second, methods using allele frequency differences between adult females and males as genomic signals of SA viability selection (e.g., between-sex F estimates [32,34-43]) are limited in several ways: They have low power to detect SA loci, they cannot distinguish SA selection from sex differences in the strength of selection, they are susceptible to artefacts generated by population structure and mis-mapping of sequence reads to sex chromosomes [32,40,41,44], and they neglect fitness components other than viability, such as reproductive success [32,45]. Previous studies of human genomic data [32,34-36,43,44,46] have been affected by one or more of these issues, such that we currently lack robust evidence of SA genomic variation in humans. More generally, these impediments help to explain the limited catalogue of SA polymorphisms across species [47-49], which currently comprises a handful of loci with exceptionally large phenotypic effects (e.g., [50-54]). Despite these challenges, new datasets and analytical approaches provide opportunities to identify robust genomic signals of SA selection. First, massive “biobank” datasets, which are widely used in human genomics, sometimes include both genotype and offspring number data [29,55] that can be used to detect loci with SA effects on reproductive components of fitness [32]. Second, estimates of allele frequency differences between sexes—though ill-suited for confidently identifying individual SA loci affecting viability—may nevertheless be amenable to genome-wide tests for polygenic SA viability selection [32,34]. Third, population genomic metrics of sex-differential selection (e.g., between-sex F) may include an appreciable proportion of genuine SA loci in the upper tails of their distributions, providing a set of candidate loci that can collectively yield insights into the general properties of SA polymorphisms (e.g., their functional characteristics and evolutionary dynamics), despite uncertainty about individual candidates. Here, we extend [32,34] and develop new statistical tests based on F metrics of between-sex allele frequency differentiation to detect polygenic signals of sex-differential selection affecting viability, reproduction, and total fitness during a full generational cycle. Applying these tests to the UK Biobank [55]—a dataset comprising quality-filtered genotype and offspring number data for approximately 250,000 men and women—reveals polygenic signals of sex-differential and SA polymorphism. We corroborate these results by using mixed-model statistics that explicitly control for systematic differences in the genetic ancestry of female and male individuals. We minimise potential sequencing artefacts and further show that sex-differentiated polymorphisms are preferentially situated in functional, phenotype-altering genomic sequences. Finally, we use genetic diversity data to examine modes of evolution affecting sex-differentiated sites.

Results

Genomic signals of sex differences in selection: Theoretical predictions

Previous studies have examined sex-differential effects of genetic variation during the zygote-to-adult stage by comparing allele frequencies between adult females and males [32,34,36-40,44]. By contrast, our analytical approach combines allele frequency with offspring number data to estimate sex-differential effects during a full generational life cycle (Fig 1). To illustrate the approach, consider a large, well-mixed population containing many polymorphic, biallelic, autosomal loci. At fertilisation, mendelian inheritance equalises allele frequencies between the sexes (Fig 1, left box). In the zygote-to-adult stage, loci with sex-differential effects on survival accumulate allele frequency differences between the adults of each sex (e.g., the black allele becomes enriched in adult males and deficient in adult females because it improves zygote-to-adult survival in males but reduces it in females; Fig 1, middle box). Among the adults, alleles with sex-differential effects on reproductive success have different transmission rates to the next generation from surviving females versus surviving males (e.g., the black allele is enriched among the male gametes contributing to fertilisation but deficient among female gametes, thus increasing its transmission to offspring of males but decreasing transmission to offspring of females; Fig 1, right box).

Fig 1

Partitioning signals of sex differences in selection among fitness components.

Partitioning signals of sex differences in selection among fitness components.

A pair of autosomal alleles are represented by white and black dots, representing female- and male-beneficial alleles, respectively; , and depict sex-specific frequency estimates for a given allele at different stages of the life cycle (see main text for details). Autosomal allele frequencies are equalised between sexes at fertilisation (left box; females, top; males, bottom), resulting in negligible allele frequency differentiation at this stage of the life cycle. Differentiation between sexes can arise in the sample of adults (middle box) due to sex differences in viability selection among juveniles (orange arrow) and in the projected gametes (right box) due to sex differences in LRS among adults (green arrow). Data on sex-specific allele frequencies and LRS thus allow the estimation of sex-differential effects of genetic variants on each fitness component (including overall fitness; purple arrow), despite the absence of allele frequency data among zygotes (left box) and gametes (right box), which are inferred and not directly observed. LRS, lifetime reproductive success. Adult allele frequencies, coupled with offspring number data per individual, thus provide an opportunity to estimate sex-differential effects of genetic variation during a complete life cycle, even though zygotic and gametic allele frequencies are inferred and not directly observed. Below, we apply our approach to the UK Biobank, a dataset that includes genotypes and reported offspring numbers (hereafter “lifetime reproductive success” or LRS, following standard terminology [29]) among putatively post-reproductive adults (ages 45 to 69 after filtering; see Materials and methods). For a biallelic autosomal locus with alleles A1 and A, we denote and the respective estimated frequencies of the A1 allele in adult males and females of the UK Biobank. The projected frequencies of A1 in paternal and maternal gametes contributing to fertilisation are: where M and F represent the cumulative LRS of males and females, respectively, with genotype ij (e.g., M11, M12, and M22 correspond to genotypes A1A1, A1A2, and A2A2). Using F [56], we partition between-sex allele frequency differentiation over 1 generation into 3 components: (i) differentiation among adults, which includes effects of sex-differential survival (hereafter “adult F;” see [32,34,45]); (ii) sex-differential variation in adult LRS (hereafter “reproductive F”); and (iii) sex-differential variation in overall fitness (hereafter “gametic F”). Single-locus estimates of adult, reproductive, and gametic F are defined, respectively, as: where and .

F distributions in the absence of sex-differential selection

In the absence of sex differences in selection (e.g., under neutrality or under sexually concordant (SC) selection of equal magnitude and direction in each sex), with large sample sizes, negligible Hardy–Weinberg deviations at birth, and excluding single-nucleotide polymorphisms (SNPs) with very low minor allele frequencies, we show that the adult, reproductive, and gametic metrics converge, respectively, to the following distributions: where each X is an independent chi-square random variable with 1 degree of freedom, N and N denote adult sample sizes, μ and μ denote mean LRS, and denote variances in LRS, and and quantify sex-specific departures from Hardy–Weinberg equilibrium in the sample of adults (Section A in S1 Appendix). In datasets such as the UK Biobank, there is also between-site variation in the number of genotyped individuals and the extent of Hardy–Weinberg deviations in the adult sample. The null distributions described by Eqs [3A–3C] are easily adjusted to account for this between-site variation (see Materials and methods). Relative to the null distributions in Eqs [3A–3C], sex differences in selection inflate each metric (Section A in S1 Appendix). These inflations may arise due to polymorphisms under sex-differential selection and neutral polymorphisms that hitchhike with selected polymorphisms. However, linkage disequilibrium (LD) alone cannot inflate genome-wide in the absence of genuine selected polymorphisms (Section B in S1 Appendix). As such, inflations represent reliable signals of sex-differentially selected polymorphism [32], provided: (i) technical artefacts are controlled (as shown below); (ii) sex-specific population structure is controlled; and (iii) males and females are sampled at random (though (iii) is not a requirement for reproductive ; see Discussion). To simplify the presentation, we first present analyses using F metrics, but we return to non-F metrics in the section titled “Controlling for sex-specific population structure.”

Genomic signals of sex differences in selection: Empirical data

UK Biobank SNP data

The sample size in the UK Biobank, after removing individuals that were closely related, had a recorded ancestry other than “White British,” or had missing LRS data, was N = 249,021 (N = 115,531 males and N = 133,490 females). We removed rare polymorphic sites (MAF < 1%), sites with low genotype or imputation quality, and sites with high potential for artefactual between-sex differentiation based on criteria identified by Kasimatis and colleagues [44] (i.e., between-sex differences in missing rates, deficits of minor allele homozygotes, and heterozygosity levels exceeding what can be plausibly be explained by sex differences in selection; see Section C in S1 Appendix). Reassuringly, none of the 8 sites that Kasimatis and colleagues [44] identified as false positives for sex-differential viability selection appear among the quality-filtered, LD-pruned, imputed SNPs (N = 1,051,949) that are the focus of our analyses.

Observed F distributions relative to null distributions

We tested for sex differences in selection by calculating adult, reproductive, and gametic (Eqs [2A–2C]) in the UK Biobank and contrasting these estimates against: (i) their respective theoretical null distributions (Eqs [3A–3C]); and (ii) empirical null distributions (generated by a single random permutation of male and female labels among individuals or, in the case of reproductive , a single permutation of LRS among individuals of each sex; see Materials and methods). All 3 metrics showed greater between-sex differentiation than predicted by their theoretical and empirical null distributions, consistent with sex differences in selection with respect to mortality, LRS, and total fitness. Mean adult in the observed data was larger than predicted by both null distributions (theoretical null: 2.039 × 10−6; permuted null: 2.043 × 10−6; observed: 2.104 × 10−6; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 2A and 2D), with a 14.1% and 13.7% excess of SNPs in the top percentile of the theoretical and empirical nulls, respectively (χ2 tests, p < 0.001). Mean reproductive was also larger than predicted by both nulls (theoretical null: 8.731 × 10−7; permuted null: 8.749 × 10−7; observed: 8.900 × 10−7; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 2B and 2E), with a 7.4% and 5.0% excess of SNPs in the top percentile of the theoretical and empirical nulls ( tests, p < 0.001). Moreover, mean gametic was larger than predicted by both nulls (theoretical null: 2.908 × 10−6; permuted null: 2.907 × 10−6; observed: 2.974 × 10−6; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 2C and 2F), with a 9.0% and 7.8% excess of SNPs in the top percentile of the theoretical and empirical nulls (χ2 tests, p < 0.001).

Fig 2

Polygenic signals of sex-differential selection: Inflation in metrics relative to their nulls. (A–C) Percentage of sites (coloured, observed; grey, permuted) falling into each of 100 quantiles of the theoretical null distributions of adult (A), reproductive (B), and gametic (C). Theoretical null data (x-axes) were generated by simulating values (nSNPs = 1,051,949) from a chi-square distribution with 1 degree of freedom. For each locus, observed and permuted values were scaled by the multiplier of the relevant theoretical null distributions (i.e., the multiplier in Eqs [3A–3C] for adult, reproductive, and gametic , respectively; see Materials and methods). In the absence of sex differences in selection, approximately 1% of observed SNPs should fall into each quantile of the null (dashed line). LOESS curves (±SE) are presented for visual emphasis. (D–F) Difference between the mean of observed and empirical null data for each metric (i.e., adult, reproductive, and gametic , respectively) (top), and the difference between observed and theoretical null data (bottom), across 1,000 bootstrap replicates. Vertical line intersects zero (no difference between observed and null data). As in panels (A–C), values were scaled by the relevant theoretical null distributions. The code and data needed to generate this figure can be found at https://github.com/filipluca/polygenic_SA_selection_in_the_UK_biobank and https://zenodo.org/record/6824671. SNP, single-nucleotide polymorphism. Signals of sex differences in selection in adult, reproductive, and gametic were polygenic. For example, genetic variants situated in genomic regions with high LD tended to explain more SNP heritability of each metric than variants situated in low-LD regions, as predicted if each sex-differential fitness component has a polygenic basis (Section D in S1 Appendix). Moreover, no individual locus had a p-value below the Bonferroni-corrected threshold of 4.753 × 10−8, implying that the significant overall inflations were not driven by a small number of strongly sex-differentiated polymorphisms (adult : minimum p- and q-values = 2.237 × 10−7 and 0.176; reproductive : minimum p- and q-values = 3.925 × 10−7 and 0.413; gametic : minimum p- and q-values = 4.152 × 10−6 and 0.821).

Forms of sex-differential selection: Theoretical predictions

The elevations reported above indicate the presence of polygenic sex-differential selection in the UK Biobank. However, the signals could have arisen because of SA selection, because of sex differences in the strength but not the direction of selection (i.e., sex-differential SC selection), or a combination of both scenarios. To partition signals affecting LRS into SA and SC components, we examined the effects of a given allele on LRS in each sex relative to the other. Specifically, estimates of the product should tend to be negative when alleles have SA effects and positive when alleles have SC effects (Fig 3A). A new metric, termed “unfolded reproductive , ” provides a standardised measure of the product of sex-specific effects on LRS:

Fig 3

Partitioning signals of sex-differential selection into SA and SC components reveals their joint contributions.

(A) As in Fig 1, , and depict sex-specific frequency estimates for a given allele at different stages of the life cycle. Under SA selection (top), the white allele is female-beneficial and the black allele is male-beneficial, which tends to generate negative values of unfolded reproductive . Under SC selection (bottom), the black allele is beneficial in both sexes, which tends to generate positive values of unfolded reproductive . (B) Percentage of sites (turquoise: observed; grey: permuted) falling into each of 100 quantiles of the theoretical null distributions of unfolded reproductive . Theoretical null data (x-axes) were generated by simulating values (nSNPs = 1,051,949) from the null (i.e., the product of 2 standard normal distributions). In the absence of sex-differential selection, approximately 1% of observed SNPs should fall into each quantile of the null (dashed line). LOESS curves (±SE) are presented for visual emphasis. (C) Difference, for unfolded reproductive , between the mean observed and empirical null data (top) and between observed and theoretical null data (bottom), across 1,000 bootstrap replicates. The vertical line intersects zero, indicating no difference between the observed and null data. Differences between observed and null data were obtained separately for negative and positive values of unfolded reproductive . This illustrates that there is enrichment of SNPs in both tails of the null. The code and data needed to generate this figure can be found at https://github.com/filipluca/polygenic_SA_selection_in_the_UK_biobank and https://zenodo.org/record/6824671. SA, sexually antagonistic; SC, sexually concordant; SNP, single-nucleotide polymorphism.

Partitioning signals of sex-differential selection into SA and SC components reveals their joint contributions.

Forms of sex-differential selection: Empirical data

As with previous metrics, we calculated unfolded reproductive (Eq [4]) and contrasted it against its theoretical and empirical null distributions—the latter generated by a single random permutation of LRS among the individuals of each sex. Doing so revealed that both SC and SA sites contribute to the polygenic signal of sex-differential selection affecting LRS. As predicted under SC selection, we observed an enrichment of sites in the upper quantiles of the null distributions of unfolded reproductive (mean among sites with > 0; theoretical null: 0.637; permuted null: 0.640; observed: 0.694; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 3B and 3C). As predicted under SA selection, we observed a smaller but significant enrichment of sites in the lower quantiles of the null (mean among sites with < 0; theoretical null: –0.635; permuted null: –0.638; observed: –0.651; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 3B and 3C).

Controlling for sex-specific population structure

In principle, polygenic elevations can arise entirely in the absence of genuine sex differences in selection if there are systematic differences in ancestry (population structure) between sexes in the sampled population [32,45]. We therefore replicated our analyses using mixed-model association tests that are analogous to but which explicitly correct for sex-specific population structure (see also Section F in S1 Appendix). We first re-evaluated signals of sex differences in viability selection present in adult by performing a GWAS of sex [32,43,44] using standardised estimates of the log-odds ratio (; see Materials and methods). Like adult quantifies between-sex allele frequency differences among adults; moreover, it controls for population structure by including a kinship matrix of genome-wide relatedness between individuals and principal components that capture structure-induced axes of genetic variation (see Materials and methods). As expected, was highly correlated with adult (r ± SE = 1.046 ± 0.020; p < 0.001), and mean was elevated relative to its empirical null distribution (null : 5.236 × 10−7; observed: 5.323 × 10−7; Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 4A and 4D), with 8.9% excess of SNPs in the top percentile of the empirical null (χ2 test, p < 0.001).

Fig 4

Structure-corrected metrics reaffirm -based signals of sex-differential selection. (A–C) Percentage of sites falling into each of 100 quantiles of the empirical null distributions of , |t|, and unfolded t. In the absence of sex differences in selection, approximately 1% of observed SNPs should fall into each quantile of the null (dashed line). LOESS curves (±SE) are presented for visual emphasis. (D–F) Difference between the mean of each metric in observed and empirical null data across 1,000 bootstrap replicates. Vertical line intersects zero (no difference between observed and null data). For unfolded t, differences between observed and null data were obtained separately for negative and positive values. This illustrates that there is enrichment of SNPs in both tails of the null. The code and data needed to generate this figure can be found at https://github.com/filipluca/polygenic_SA_selection_in_the_UK_biobank and https://zenodo.org/record/6824671. SNP, single-nucleotide polymorphism. We then re-evaluated signals of sex-differential selection through reproductive success by performing separate GWAS for LRS in females and males, each corrected for population structure, and quantifying the difference between female and male effect sizes using a t-statistic (|t|; see Materials and methods). As expected, |t| was highly correlated with reproductive (r ± SE = 1.025 ± 0.059, p < 0.001) and mean |t| was elevated relative to its empirical null (null = 0.796, observed = 0.811, Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 4B and 4E), with an 11.9% excess of SNPs in the top percentile of the empirical null (χ2 test, p < 0.001). We also developed an analogue of unfolded reproductive , termed unfolded t (see Materials and methods), to partition signals of sex-differential reproductive selection into SA and SC components. As with unfolded reproductive , SC selection should generate an enrichment of values in the upper quantiles of its null, while SA selection should generate an enrichment of values in its lower quantiles; unlike unfolded reproductive , this metric also controls for population structure. Corroborating previous results, we observed an excess of high values of unfolded t (mean t among sites with t > 0; permuted null = 0.639, observed = 0.692, Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001; Fig 4C and 4F) and an excess of low values of unfolded t (mean t among sites with t < 0; permuted null = –0.639, observed = –0.649, Wilcoxon and Kolmogorov–Smirnov tests, p < 0.001), signalling the presence of SC and SA polymorphisms, respectively. Finally, we examined genetic correlations between metrics. These analyses showed that metrics of sex-differential LRS selection were not significantly correlated with metrics of sex-differential mortality selection across loci (Fig 5A). For example, the genetic correlation (estimated via LD score regression) between adult and reproductive was –0.24 (SE = 0.16, p = 0.13) and the genetic correlation between and |t| was –0.16 (SE = 0.16, p = 0.31).

Fig 5

Indications that sex-differentiated loci are more likely to be functional and contribute to trait variation.

(A) Genetic correlations between metrics of sex-differential selection. Positive correlations (orange) imply that alleles have similar sex-specific effects on given fitness components, while negative correlations (purple) imply that alleles have opposing sex-specific effects on given fitness components; * denotes unadjusted p < 0.05. (B) Enrichments (±SE) of sex-differentiated loci in major functional categories. For each metric, enrichments were calculated as the relative SNP heritability (as a fraction of total SNP heritability) explained by a given functional category, divided by the relative number of SNPs (as a fraction of all SNPs) present in a given functional category. Dashed line = 1 (no enrichment). “Negative” and “Positive” refer to negative and positive values (i.e., SA and SC components, respectively) of unfolded reproductive and unfolded t metrics. (C) Genetic correlations between metrics of sex-differential selection and various UK Biobank phenotypes (as analysed by the Neale laboratory). Metrics of sex-differential selection have been polarised, such that positive correlations (red) suggest that higher trait values are more beneficial to females than males (for the relevant fitness component), while negative correlations (blue) suggest that higher trait values are more beneficial to males than females (see Discussion for caveats surrounding this interpretation); ** denotes FDR-adjusted p < 0.05 and * denotes unadjusted p < 0.05. The code needed to generate this figure can be found at https://github.com/filipluca/polygenic_SA_selection_in_the_UK_biobank and https://github.com/lukeholman/UKBB_LDSC, with data at https://zenodo.org/record/6824671. FDR, false discovery rate; SA, sexually antagonistic; SC, sexually concordant; SNP, single-nucleotide polymorphism.

Indications that sex-differentiated loci are more likely to be functional and contribute to trait variation.

Functional and phenotypic effects of sex-differentiated loci

If sex-differentiated loci reflect genuine sex-differential selection—rather than random chance, genotyping errors, or population structure—such polymorphisms should be preferentially found in functionally important regions in the genome. We therefore conducted enrichment tests, both to support our inference that sex-differential selection is occurring and to explore functional effects of sex-differentiated loci. We first used LD score regression [57] to test whether sites with high sex-differentiation tend to be found in major functional categories in the genome (coding, 3′UTR and 5′UTR regions). If a given category is enriched for genuine selected SNPs, the expected heritability tagged by these SNPs (i.e., what LD score regression measures) should exceed the fraction of SNPs present in that functional category. While functional enrichment estimates were noisy and thus not statistically distinguishable from 1 (no enrichment) after multiple-testing correction (Fig 5B), each estimate consistently exceeded 1 across functional categories and metrics, suggesting that sex-differentiated loci are more likely to have phenotype-altering effects than expected by chance. Further evidence for the phenotype-altering effects of sex-differentiated loci was sought through direct comparisons between metrics of sex-differential selection and the Neale laboratory database of UK Biobank GWAS. Specifically, we used cross-trait LD score regression [58] to estimate genetic correlations between metrics of sex-differential selection and 30 phenotypes, chosen for their medical relevance and/or relationship to phenotypic sex differences. Though many significant associations did not survive multiple testing correction (Fig 5C), several disease-relevant and quantitative traits (age at menarche, body fat percentage, diseases of the eye and adnexa, fluid intelligence, injury, neuroticism score, SHBG [sex hormone binding globulin], standing height) represent candidates for sex-differential viability and LRS selection, while other traits (testosterone, high blood pressure) represent candidates for sex-differential viability selection.

Modes of evolution of sex-differentiated loci: Theoretical predictions

To gain insight into the modes of evolution affecting sex-differentiated sites, we investigated the association between metrics of sex-differential selection and MAF in the UK Biobank. In the absence of any contemporary sex differences in selection, all between-sex metrics should be independent of MAF (Section G in S1 Appendix). In the presence of sex-differential selection, the association between each metric and MAF can potentially be positive or negative, depending on the patterns of contemporary and historical selection affecting loci throughout the genome. A positive covariance between and MAF should arise when alleles subject to sex-differential selection often segregate at intermediate frequencies, as may occur under a history of balancing selection or drift (Section G in S1 Appendix) or non-equilibrium scenarios such as incomplete selective sweeps. In contrast, a negative association between MAF and between-sex is expected for loci that have evolved under sex-differential purifying selection (Section G in S1 Appendix). This negative covariance arises because purifying selection disproportionately lowers the frequency of large-effect alleles (those generating larger values) relative to small-effect alleles [59]. In short, positive associations with MAF indicate that purifying selection is not the dominant mode of evolution affecting loci under sex-differential selection and instead signal a recent history of balancing selection, positive selection, or drift. While associations between metrics of sex-differential selection and MAF provide insights into relatively recent and contemporary patterns of selection affecting sex-differentiated sites, they do not provide insights into their deeper evolutionary histories. To examine this, we tested the specific hypothesis that sex-differentiated sites are subject to long-term balancing selection, as predicted for SA polymorphisms under certain scenarios of selection and dominance [10]. Under long-term balancing selection, we would expect sex-differentiated (and linked) loci to be old, to exhibit low between-population , to exhibit high genetic diversity, and to disproportionately co-localise with previous candidates for long-term balancing selection, compared to less sex-differentiated sites with similar allele frequencies in the UK Biobank.

Modes of evolution of sex-differentiated loci: Empirical data

Examining the relationship between MAF and metrics of sex-differential selection in the UK Biobank data revealed consistently positive correlations (adult = 0.009, p < 0.001; : ρ = 0.006, p = 0.216; reproductive , ρ = 0.006, p < 0.001; |t|: ρ = 0.005, p < 0.001; gametic , ρ = 0.007, p < 0.001; Fig 6A–6D), with all correlations stronger in observed than null data (Section H in S1 Appendix). Given the absence of negative correlations between MAF and each metric, we can reject purifying selection as the dominant mode of evolution affecting sex-differentiated sites. The positive correlations instead suggest that balancing selection, drift, or incomplete selective sweeps characterise the evolution of sex-differentiated loci.

Fig 6

Modes of evolution of sex-differentiated sites.

Modes of evolution of sex-differentiated sites.

(A–D) Mean MAF, in the UK Biobank, across 100 quantiles of the null for each metric of sex-differential selection. For metrics, x-axes correspond to Fig 2A–2C (and Fig 3B for unfolded reproductive ). For mixed-model metrics, x-axes correspond to Fig 4A–4C. LOESS curves (±SE) are presented for visual emphasis. (E-H) Mean age of the alternative (i.e., non-reference) allele across 100 quantiles of the null for each metric of sex-differential selection. Each panel corrects for ascertainment bias of allele frequencies among highly sex-differentiated sites (i.e., Fig 6A–6D). For visualisation purposes, this was done by averaging, in each quantile, allele age across 20 quantiles of alternative allele frequency in the UK Biobank (such that UK Biobank alternative allele frequency is approximately equal across quantiles). LOESS curves (±SE) are presented for visual emphasis. The code and data needed to generate this figure can be found at https://github.com/filipluca/polygenic_SA_selection_in_the_UK_biobank and https://zenodo.org/record/6824671. We then tested the hypothesis that long-term balancing selection has shaped the evolutionary histories of sex-differentiated loci. We focused our analyses on 4 measures of balancing selection: allele age estimates from the Atlas of Variant Age database [60], between-population and Tajima’s D estimates from 2 non-European populations from the 1000 Genomes Project [61], and 3 sets of candidate loci for long-term balancing selection [62-64]. In each case, we looked for associations between metrics of sex-differential selection and balancing selection, while controlling for ascertainment bias of intermediate-frequency alleles (which are, on average, older and thus more likely to be under long-term balancing selection irrespective of the strength of sex-differential selection) among highly sex-differentiated sites (see Materials and methods). Overall, we found little support for the hypothesis of long-term balancing selection affecting sex-differentiated loci. After corrections for multiple testing across metrics of sex-differential selection (see Section I in S1 Appendix, for full statistical results), we found weak or absent associations with allele age (Fig 6E–6H), between-population (Section I in S1 Appendix), genetic diversity (Section I in S1 Appendix), or previous candidates for balancing selection (Section I in S1 Appendix). We found some indications that candidate SA alleles (i.e., loci with negative values of unfolded reproductive and unfolded t) were older than the genome-wide average (Fig 6H), and loci experiencing strong SC selection (i.e., positive values of unfolded reproductive and unfolded t) were younger (Fig 6H).

Discussion

Sex differences in directional selection on phenotypes have been reported in a wide range of animal taxa [19,21-23,65], including post-industrial human populations [28-30], yet population genomic signals of sex-differential selection—let alone SA selection—have been extremely difficult to establish. The reason is simple: Sexual reproduction equalises autosomal allele frequencies between the sexes every generation, restricting genetic divergence and, in effect, preventing the use of common tests to infer sex differences in selection (e.g., McDonald–Kreitman tests for positive selection, F outlier tests for spatially varying selection [66-68]). Published studies using human genomic data illustrate the challenges of studying polymorphisms with sex-differential fitness effects [32,45], including sample sizes that may be insufficient for detecting polygenic signals of sex-differential selection, lack of controls for population structure or technical artefacts, and/or absence of data concerning reproductive fitness components.

Signals of sex-differential selection in the UK Biobank

We developed a theoretical framework for studying genomic variation with sex-differential effects across a complete life cycle. Our approach extends current work based on between-sex allele frequency differentiation among adults—a potential signal of sex-differential viability selection among juveniles [32,34,45]—to further include reproductive success components and total fitness. Applying this approach to data from a quarter-million UK adults, we present evidence for polygenic signals of sex-differential selection in humans. Specifically, UK Biobank individuals showed sex differences in allele frequencies—both among adults and their (projected) offspring—that consistently exceeded expectations defined by our theoretical null models for viability, reproductive, and total fitness and persisted after controlling for potential artefacts arising from mis-mapping of reads to sex chromosomes [44]. Although we focussed on F as our metric of differentiation for a variety of reasons (its simplicity, amenability to theoretical modelling, and rich history in population genetic studies of adaptation [66-68]), an important drawback of F is its inability to control for systematic sex differences in the genetic ancestry of sampled individuals. We therefore used F analogues based on mixed-model association tests to control for sex-specific population structure. These F analogues corroborated F-derived signals of sex-differential selection on each component, with clear enrichments in the upper tails of each null distribution. Additional support for genuine sex-differential selection came from functional enrichment analyses, which, despite noisy individual estimates, consistently indicated that sex-differentiated sites were situated in functional genomic regions and contributed to variation for many phenotypes. An important limitation of metrics of sex-differential selection affecting non-LRS fitness components (i.e., adult F, gametic F, and their mixed-model analogues) applied to the UK Biobank is that UK Biobank individuals are sampled through active participation. Consequently, as noted by Pirastu and colleagues [43], sex differences in the genetic basis of individuals’ predisposition to take part in the UK Biobank may generate sex differences in adult allele frequencies. To support this argument, Pirastu and colleagues [43] reported significantly greater SNP heritability of sex (a polygenic measure of sex differences in allele frequencies) in biobanks relying on active participation than in biobanks using passive participation. However, their analysis is inconclusive because the passive participation studies they analysed were smaller (NBiobank Japan = 178,242, NFinnGen = 150,831, NiPsych = 65,891) than active participation studies (NUK Biobank = 452,302, N23andme = 2,462,132). Thus, differences in statistical power between studies (and/or differences in the extent of sex-differential viability selection between populations) could account for their results. Moreover, the positive point estimates of SNP heritability for passive participation studies suggest that substantial allele frequency differences between the sexes are possible. For example, mortality after fertilisation, but before birth, is very high in humans (on the order of 50% [69]), giving ample opportunity for mortality in early life to generate allele frequency differences between sexes. In sum, neither their study nor ours can conclusively distinguish the relative contributions of sex-differential selection and participation bias to allele frequency differentiation between female and male adults, though both sources likely contribute. Importantly, participation bias should not affect metrics of sex-differential selection relating to LRS. Reproductive and its mixed-model analogue, |t|, control for allele frequency differences between samples of adults of each sex and rule out factors that might otherwise affect estimated adult allele frequencies in the UK Biobank (e.g., mis-mapping of reads to sex chromosomes, participation biases [43]). Elevations in these metrics thus provide the most compelling evidence for sex-differential selection in the UK Biobank (see also [46]). Moreover, they are consistent with previous observations in post-industrial human populations, including variation in female and male LRS [70] (a necessary precondition for sex-differential selection), widespread sex differences in the genetic basis of quantitative traits (e.g., in the UK Biobank [71]), and sex-differential selection on phenotypes (e.g., height [29,30] and multivariate trait combinations [70]), which should collectively lead to genome-wide polymorphisms with sex-differential effects on fitness and fitness components [20].

Distinguishing between SA and SC forms of sex-differential selection

Having established signals of sex-differential selection affecting LRS, we developed a new test for investigating the form of selection—SC or SA—affecting these genomic variants by quantifying the product of a genetic variant’s effect on LRS in each sex. Applying our test to UK Biobank data showed that both types of variant contribute to signals of sex-differential selection on LRS, with SC variants contributing comparatively more enrichment in the upper tail of the null of unfolded reproductive (and its mixed-model analogue, unfolded t) than SA variants contribute in the lower tail of the null. That signals of SC polymorphism were more pronounced than SA polymorphism is perhaps unsurprising, given that most traits are likely to be subject to SC rather than SA selection [29]. Moreover, alleles subject to identical SC selection in each sex will contribute to the upper tail of unfolded reproductive , but will not contribute to the lower tail (or to other metrics of sex-differential selection), which might also account for greater apparent signal of SC than SA selection in these analyses. Nonetheless, some human traits have been shown to be under SA selection—most notably standing height, which positively covaries with male LRS and negatively covaries with female LRS [28-30]. The enrichment of sites in the lower tails of unfolded reproductive and unfolded t is consistent with these previous observations. Our finding that variants that increase height tended to have male-beneficial and female-detrimental effects (i.e., as reflected by a negative correlation between height and t) is particularly reassuring and validates the intuition that SA selection at the phenotypic level (e.g., over height) gives rise to SA variation throughout the genome.

Modes of evolution affecting sex-differentiated loci

We found that sex-differentiated sites had, on average, more intermediate frequencies than less sex-differentiated sites. This finding has several implications. First, we expect no association between metrics of sex-differentiation and MAF in the absence of sex-differential selection. Therefore, these positive associations represent an independent strand of support for the argument that sex-differential selection is shaping patterns of genome-wide variation in the UK Biobank. Second, the positive associations imply that a model of sex-differential purifying selection, in which variants are maintained at mutation-selection-drift balance, is inadequate to explain enrichments of sex-differentiated sites. Sex-differential purifying selection is instead expected to generate negative associations between MAF and the extent of sex-differentiation (a negative association that is indeed observed for many quantitative traits [72]). Finally, the positive associations between sex-differentiation and MAF are consistent with a variety of scenarios, such as recent evolutionary histories of balancing selection, genetic drift, or incomplete selective sweeps. Balancing selection or drift can both generate a broad spectrum of allele frequency states at SA loci, in which intermediate-frequency SA variants dominate signals of sex-differential selection. Alternatively, SC alleles with unequal fitness effects in each sex could have recently swept to intermediate frequencies and these variants now dominate signals of sex-differential selection. Although positive associations between metrics of sex-differential selection and MAF indicate that balancing selection may be present, our analyses did not reveal clear signals of long-term balancing selection among sex-differentiated sites. The absence of such signals may stem from several factors. First, SA polymorphisms are only predicted to experience balancing selection under narrow conditions [10,73], so SA loci may not experience balancing selection at all. Second, balancing selection could affect sex-differentiated polymorphisms but be too recent to generate a clear statistical signal in our analyses [74]. Third, long-term balancing selection at sex-differentiated loci may be present but effectively weak, owing to relatively small N in humans [75] and the high susceptibility of SA alleles to genetic drift [73,76]. Fourth, long-term balancing selection may be present, but statistical tests for it may be too weak to stand out from the background noise of false positives in our metrics and the datasets used to quantify balancing selection [77]. How do we reconcile these results with previous work in Drosophila melanogaster indicating that candidate SA polymorphisms segregate across worldwide populations and even species [33]? A parsimonious explanation for these contrasting findings is that the effectiveness of balancing selection is lower in humans than fruit flies due to much smaller N. Indeed, given the pronounced sensitivity of SA balancing selection to genetic drift [73,76], we should expect the relationship between signals of SA and balancing selection to vary with N. Moreover, previous work in D. melanogaster focussed on SA polymorphisms [33] to the exclusion of SC polymorphisms, whereas our metrics capture both forms of sex-differential variation, thus weakening the power of tests for associations with signals of balancing selection. Interestingly, when we partitioned signals of sex-differentiation into SA and SC components, we found indications that candidate SA sites were indeed older, which implies that SA balancing selection may be present but masked by sex-differential SC polymorphisms. Overall, evidence that sex-differentiated, including SA, polymorphisms contribute to standing genetic variation—as in our study—is at present much stronger than evidence that they are maintained by balancing selection.

Directions for future research

Our analyses suggest a number of fruitful directions for further research. First, given the difficulty of distinguishing participation bias from selection in signals of between-sex allele frequency differentiation among adults, conclusively establishing the presence of sex-differential viability selection in genomic data remains an important research direction. Parent-offspring trio analyses that control for participation effects [78], or replication of our analysis strategy in large datasets sampled through passive rather than active participation, may yield the evidence required. Second, the extent to which variants with positive effects on mortality in a given sex have similar or opposing effects on reproduction bears further examination. Our finding that genetic correlations between metrics of viability and reproductive selection were not significantly different from zero indicates a range of possible scenarios. It may suggest that variants affecting each fitness component are independent (i.e., because alleles affecting each component are genuinely independent), that between-sex allele frequency differentiation among adults is a poor signal of sex-differential viability selection or that a similar fraction of loci have concordant and antagonistic effects, thus also generating no net correlation. Finally, given the increasing availability of genotypic and LRS data, further work could attempt to replicate our analysis strategy in different populations and species. Many taxa exhibit greater variance for reproductive success than humans [79], generating higher potential for detecting polygenic signals of sex-differential selection. In line with this, polygenic inflations of adult have previously been documented in modest samples of pipefish and flycatchers [32,38,39], suggesting that sex differences in selection might be stronger in those species than in humans. Moreover, these samples are less susceptible to ascertainment bias because individuals do not actively participate and because sampling can often be randomised with respect to sex. While we expect that polygenic signals of sex-differential selection will replicate across populations of a species (see, for example, Zhu and colleagues [35]’s replication of the association between testosterone and adult allele frequency differences in Fig 5C), we caution that there may be relatively little overlap in terms of the most sex-differentiated polymorphisms. One reason is that environmental differences between populations (e.g., cultural differences in family planning between human populations) could alter the set of causal loci under sex-differential selection. Another reason is that the noisiness of polygenic signals of sex-differential selection [32,45], along with the near certainty that most polymorphic loci have small effects on fitness [80], generates variation in the set of candidate sex-differential polymorphisms identified across populations [81], even if causal sex-differential polymorphisms do not differ.

Materials and methods

Ethics statement

The UK Biobank has Research Tissue Bank approval from the North-West Multi-centre Research Ethics Committee. Approval for using UK Biobank data for this specific project, from participants consenting to share anonymised data, was granted under project number 52049.

Quality control of UK Biobank data

We used sample-level information provided by the UK Biobank (see [55] for details) to perform individual-level (phenotypic) quality controls. Specifically, we excluded individuals with high relatedness (third degree or closer), non-“white British” ancestry, high heterozygosity, and high missing rates. We also excluded individuals whose reported sex did not match their inferred genetic sex, aneuploids, and individuals with missing or unreliable LRS data (as detailed below). We processed LRS data as follows. LRS data were obtained from UK Biobank field 2405 “Number of children fathered” for males, and field 2734 “Number of live births” for females. Previous observations of positive genetic correlations between offspring and grand-offspring numbers across generations [82] indicate that offspring number represents a good proxy for LRS in post-industrial human populations. Because some individuals were asked to report offspring number at repeated assessment points, we considered the maximum offspring number reported as the definitive value of LRS for that individual. Though misestimation of LRS for each individual cannot be definitively excluded (e.g., individuals may misreport and include non-biological children, individuals may reproduce after data collection), we minimised this possibility by removing individuals: (i) younger than 45 years of age (this cutoff was chosen for consistency with previous research [29] and because Office for National Statistics data indicates that reproduction is very limited for UK individuals aged 45 and over); (ii) reporting fewer offspring at a later assessment point than at an earlier assessment point; (iii) with 20 or more reported offspring numbers (large offspring numbers often ended in zero—e.g., 20, 30, 50, 100—and were thus considered less reliable). Furthermore, uncounted LRS data add imprecision but should not systematically bias our analyses. In addition to site-level quality controls implemented by the UK Biobank [55], we used PLINK and PLINK2 [83] to remove imputed sites that were non-diallelic, had MAF <1%, missing rates >5%, p-values < 10−6 in tests of Hardy–Weinberg equilibrium, and INFO score ≤0.8, denoting poor imputation quality. While these cutoffs restrict our analyses to a nonrandom subset of all genetic variation, they guard against sequencing artefacts in the UK Biobank and help remove sites (e.g., those with MAF <1%) which have little potential to carry statistical signal of sex-differentiation relative to noise induced by sampling error.

Additional artefact filtering in UK Biobank data

Mis-mapping of autosomal reads to sex chromosomes can generate between-sex allele frequency differences among adults in the absence of sex differences in selection [44]. In light of scant direct evidence for SA polymorphisms in humans and still-developing bioinformatic methods for distinguishing artefacts from genuine sex-differential selection [40,44,84-86], our primary concern was to reduce the chance of mapping errors. We did so by excluding: (i) sites with heterozygosity levels that exceeded what could plausibly be expected under SA selection (see below and Section C in S1 Appendix); (ii) sites with a deficit of minor allele homozygotes; and (iii) sites exhibiting large differences in missing rate between sexes. These 3 patterns have previously been shown to correlate with mis-mapping of reads to sex chromosomes [44]. While these filters reduce the chance of false positives, they also potentially increase chance of false negatives and therefore represent a slightly conservative test of sex-differential selection. For example, the removal of sites with high heterozygosity levels is expected to remove sites under strong (but not weak or moderately strong) sex-differential selection; similarly, the removal of sites with large missing rate differences between sexes may remove genuine polymorphisms with sex-differential effects. To remove sites with artificially inflated heterozygosity, we estimated F for each SNP as: where P denotes the frequency of heterozygotes for a given locus and the sex-averaged allele frequency. For a SA locus at polymorphic equilibrium, the distribution of is well approximated by a normal distribution with expectation and variance as follows: where n is total sample size of adults, p the minor allele frequency, and smax = max(s, s) with s and s representing male and female selection coefficients (Section C in S1 Appendix). To identify SNPs with excess heterozygosity, we compared in the observed data to expected under strong SA selection (smax = 0.2) by performing a 1-tailed Z-test for excess heterozygosity. We thus obtained p-values for each locus, corrected p-values for multiple testing using Benjamini–Hochberg false discovery rates (FDR) [87], and removed sites with FDR q-values below 0.05. To identify sites with a deficit of minor allele homozygotes, we compared the observed frequency of minor allele homozygotes to the expected frequency under Hardy–Weinberg equilibrium (p2, where p is the frequency of the minor allele) by performing a 1-tailed binomial test, removing sites with FDR q-values below 0.05. Tests for excess heterozygosity and deficits of minor allele homozygotes were performed across all individuals (regardless of sex) and also for each sex separately. Sites were removed if they exhibited q-values below 0.05 in any of the 3 tests (i.e., both sexes combined, females, and males). Finally, to assess differences in missing rate between the sexes, we performed a χ2 test, removing sites with FDR q-values below 0.05.

Quantifying polygenic signals of sex differences in selection

-based metrics

We used to quantify allele frequency differences between sexes. is a simple metric, well established in evolutionary biology research, amenable to theoretical modelling (as in Eqs [3A–3C]), and independent of MAF in the absence of sex-differential selection (unlike, say, raw allele frequency differences [32]). We obtained allele frequencies in adults of each sex directly from sequence data (after filtering individuals and sites, as described above) and used them to calculate adult for each polymorphic site. We obtained allele frequencies among projected gametes using LRS data (as per Eq [1]) and used them to calculate reproductive , gametic , and unfolded reproductive (as per Eqs [2A–2C] and [4]).

Statistical comparisons of null and observed distributions

Null distributions for metrics were theoretically derived (see Sections A and E in S1 Appendix). The theoretical null distributions apply to genome-wide data in which the sample of female and male sequences, mean and variance in LRS, and Hardy–Weinberg deviations, are constant across loci. In practice, there is variation in sample sizes, mean LRS, variance in LRS, and the extent of Hardy–Weinberg deviations between loci. To take these factors into account, we let the multiplier in Eqs [3A–3C] vary in terms of its sample size ( and per diploid locus i), mean and variance in LRS ( and , and and , per diploid locus i) and the extent of Hardy–Weinberg deviations in the sample ( and per diploid locus i). We then scaled by the multiplier, such that, for each locus: These scaled estimates, which correct for site-specific variation, can then be compared to a chi-square distribution with 1 degree of freedom. For unfolded reproductive , no scaling is required because site-specific adjustments are already taken into consideration in the definition of the metric (Eq [4]). Null distributions were also obtained empirically, through permutation, as follows. For adult and gametic , we performed a single permutation of female and male labels and recalculated (scaled by the multiplier, as above) in permuted data. For reproductive and unfolded reproductive , we performed a single permutation of LRS values within each sex—without permuting sex—and recalculated the statistic (scaled by the multiplier, as above) in permuted data. Permuting LRS without permuting sex is appropriate for reproductive and unfolded reproductive because it allows allele frequencies to differ between adult males and females (as would happen if, for example, sex-differential viability selection is occurring among juveniles) but randomises the effects of genotype on LRS, thus ensuring that only estimation error can contribute to the empirical null. We performed a single permutation for each metric because performing large numbers of permutations was computationally unfeasible and because we were focussed on testing a cumulative signal of selection across loci, rather than establishing significance at the single-locus level. To test for elevations in observed data relative to the (theoretical or empirical) nulls, we LD-pruned the dataset (settings “—indep-pairwise 50 10 0.2” in PLINK) and ran Wilcoxon rank-sum and Kolmogorov–Smirnov tests. These tests assess differences in the median and distribution of the observed and null data, respectively. As a complementary way of comparing observed and null data, we quantified enrichment of observed values in the top 1% of each null using a χ2 test. Finally, we estimated the difference between the mean value of the metric in the observed data and the mean value of the metric in each null, obtaining 95% confidence intervals and empirical p-values through bootstrapping (1,000 replicates; where each replicate consists of the set of relevant SNPs, sampled with replacement).

Case-control GWAS of sex

To complement the test for sex-differential viability selection based on adult , we performed a GWAS of sex [32,43,44]. By analogy to adult , loci with sex-differential effects on viability in a GWAS of sex will tend to have relatively large absolute log-odds ratios (corresponding to relatively large allele frequency differences between sexes). Unlike adult , the GWAS of sex approach additionally permits the inclusion of covariates that account for population structure and other possible confounders [32,43,44]. We used BOLT-LMM to run a mixed-model GWAS [88] using a kinship matrix to account for population structure. The kinship matrix was constructed from an LD-pruned set of quality-filtered imputed SNPs (LD-pruning settings as above). We added individual age (field 54), assessment centre (field 21003), and the top 20 principal components derived from the kinship matrix, as fixed-effect covariates. To facilitate comparisons with adult , we standardised the regression coefficients (log-odds ratios) from the GWAS by allele frequency, such that: where is the log-odds ratio and is the sex-averaged allele frequency among adults. To obtain permuted values, we performed a single permutation of female and male labels and recalculated the statistic in the permuted data.

t-Statistics for sex-differential effects on LRS

To complement the test for sex differences in selection on LRS based on reproductive , we performed a GWAS of LRS in each sex separately, using a mixed-model GWAS, which allowed us to correct for population structure in effect size estimates. Following [89], we quantified differences in male and female effect size estimates by means of a t-statistic, defined as: where β and β are estimated effect sizes obtained from sex-stratified GWAS (implemented in BOLT-LMM as above), SE and SE are sex-specific standard errors, and ρ is the between-sex rank correlation among genome-wide LD-pruned loci. Relative to permuted data (obtained using an identical procedure to that implemented for reproductive ), loci with elevated |t| in observed data denote candidate loci with sex-differential effects on LRS. To examine the relative contributions of SA and SC components to signals of sex-differential reproductive selection, while correcting for sex-specific population structure, we developed an analogue of unfolded reproductive , termed “unfolded t,” as: Relative to permuted data, SA selection (i.e., opposing signs of β and β) will tend to generate an excess of negative values of unfolded t, while SC selection (i.e., same sign of β and β) will tend to generate an excess of positive values of unfolded t.

Functions and phenotypic effects of sex-differentiated loci

We used stratified LD score regression [57] to examine whether sex-differentiated loci were more likely to be situated in putatively functional genomic regions (e.g., coding or UTR regions) than expected by chance. This method partitions the heritability from GWAS summary statistics into different functional categories, while accounting for differences in LD (and thus, increased tagging of a given causal locus) in different regions of the genome (with LD quantified from European-ancestry samples from the 1000 genome project, and restricted to SNPs also present in the HapMap 3 reference panel [57]). Because LD score regression requires signed summary statistics as input, we first transformed our (unsigned) metrics of sex-differential selection to signed metrics (e.g., metrics and were transformed to Z-scores, |t| was transformed to t), where positive and negative values denote female- and male-beneficial effects of the focal allele, respectively. Enrichments for 3 putatively functional categories (coding, 3′UTR, 5′UTR) were then calculated as the fraction of total heritability explained by a given category divided by the fraction of all SNPs in a given category. Note that we calculated enrichment for these categories while implementing the “full baseline model,” which includes 50 further categories. This model has been shown to provide unbiased enrichments for focal categories [57] and for total SNP heritability [90] (estimates of total SNP heritability were used in Section D in S1 Appendix). We used cross-trait LD score regression [58] to examine genetic correlations between metrics of sex-differential selection and a suite of phenotypic traits, as well as between the metrics of sex-differential selection. The method calculates genetic correlations between pairs of traits while taking into account LD-induced differences in the extent of tagging of causal loci across the genome. We computed genetic correlations between each metric of sex-differential selection (transformed to a signed statistic, as above, such that higher values of the signed metric are more likely to benefit females than males) and an initial list of 43 traits (subsequently filtered to 30 after removing traits where an accurate genetic correlation, defined as SE < 0.2, could not be estimated) (http://www.nealelab.is/uk-biobank/), and used FDR correction (across metrics and traits) on resulting p-values.

Associations with MAF in the UK Biobank

To test for associations between metrics of sex differences in selection and MAF, we estimated a Spearman’s rank correlation between each metric and MAF. We also tested whether the relationship between metrics of sex differences in selection and MAF was more pronounced in the observed data than in the (theoretical or empirical) nulls by estimating the difference between correlations in the observed data and correlations in the null data among 1,000 bootstrap replicates (as above), thereby generating 95% confidence intervals and empirical p-values.

Allele ages

If sex-differentiated variants experience sufficiently strong and sustained balancing selection relative to the countervailing effects of genetic drift, we expect them to be older than the genome-wide average [74]. We used the Atlas of Variant Age database to obtain allele age estimates for genome-wide variants [60]. Estimates of allele age in this database apply to the non-reference (i.e., alternative) allele and are derived from coalescent modelling of the time to the most recent common ancestor using the “Genealogical Estimation of Variant Age” method (see [60] for details). Estimates of allele age make use of genomic data from: (i) the 1000 Genomes Project; (ii) the Simons Genome Diversity Project; and (iii) both datasets combined. For each site in the UK Biobank, we obtained the median estimate of allele age from the combined dataset (when available), from the 1000 Genomes Project, or the Simons Genome Diversity Project (when neither alternative estimate was available).

Between-population F and Tajima’s D in non-European populations

If candidate SA variants experience sufficiently strong balancing selection maintaining a fixed polymorphic equilibrium, they should exhibit lower-than-average allele frequency differences between populations [74] and larger-than-average allele frequency diversity within populations. We used bcftools [91] to obtain allele frequency data from 2 non-European populations from the 1000 Genomes Project: Yoruba Nigerians (YRI, N = 108) and Gujarati Indians (GIH, N = 103). We then estimated between-population as: where and are allele frequency estimates in the relevant pair of populations and We also used vcftools [92] to calculate Tajima’s D, a metric of genetic diversity which takes on elevated values under certain evolutionary and demographic scenarios, including balancing selection, in 10 kb windows across the genome.

Previous candidates for balancing selection

If candidate SA variants experience strong balancing selection, they should disproportionately co-occur with previously identified candidates for balancing selection. We used 3 independent sets of candidate sites for balancing selection to investigate this possibility: (i) the dataset of Andrés and colleagues [62], which consists of 64 genes exhibiting elevated polymorphism (as determined using the Hudson–Kreitman–Aguadé test) and/or intermediate-frequency alleles across 19 African-American or 20 European-American individuals; (ii) the dataset of DeGiorgio and colleagues [64], which consists of 400 candidate genes exhibiting elevated T1 or T2 statistics among 9 European (CEU) and 9 African (YRI) individuals. T1 or T2 statistics quantify the likelihood that a genomic region exhibits levels of neutral polymorphism that are consistent with a linked balanced polymorphism; (iii) the dataset of Bitarello and colleagues [63], which consists of 1,859 candidate genes exhibiting elevated values of “non-central deviation” (NCD) statistics. NCD statistics also quantify the likelihood that given genomic regions are situated nearby a balanced polymorphism, using polymorphism data from 50 random individuals from 2 African (YRI; LWK) and European (GBR; TSI) populations and divergence data from a chimpanzee outgroup. We assigned each site in the UK Biobank dataset to a gene using SnpEff [93] and categorised sites as candidates or non-candidates for balancing selection based on whether they were annotated as belonging to a candidate or non-candidate gene in each of the 3 aforementioned datasets.

Statistical associations between metrics of sex-differential selection and balancing selection

To test whether signals of sex differences in selection were associated with signals of balancing selection, we performed Spearman’s rank correlations between alternative allele age (scaled by alternative allele frequency in the UK Biobank, to control for ascertainment bias) and each metric of sex differences in selection. For between-population and Tajima’s D, we performed multiple linear regressions, with the relevant metric of sex differences in selection as the independent variable and MAF in the UK Biobank as a fixed-effect covariate (to control for ascertainment bias). For previous candidates for balancing selection, we performed multiple logistic regressions, where candidate/non-candidate status was the binary response variable, with the relevant metric of sex differences in selection as the independent variable and MAF in the UK Biobank as a fixed-effect covariate. In the case of regressions involving between-population was first log-transformed to meet normality assumptions. In the case of Tajima’s D analyses (which are window-based rather than SNP-based), we averaged independent variables across 10 kb windows before performing regressions.

Supporting information.

Section A. Theoretical null distributions for F estimates. Section B. Hitchhiking estimates in between-sex F. Section C. Defining upper bounds for excess heterozygosity in F estimates arising from SA selection. Section D. Polygenicity of signals of sex differences in selection. Section E. Null model for unfolded reproductive F. Section F. Correcting for sex-specific population structure. Section G. Sex differences in selection and the relation between F and MAF. Section H. Associations between metrics of sex differences in selection and MAF. Section I. Associations between metrics of sex differences in selection and candidates for balancing selection. (PDF) Click here for additional data file. 7 Oct 2021 Dear Dr Ruzicka, Thank you for submitting your manuscript entitled "Polygenic signals of sexually antagonistic selection in contemporary human genomes" for consideration as a Research Article by PLOS Biology. Your manuscript has now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to send your submission out for external peer review. However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire. Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed the checks it will be sent out for review. If your manuscript has been previously reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time. If you would like to send your previous reviewer reports to us, please specify this in the cover letter, mentioning the name of the previous journal and the manuscript ID the study was given, and include a point-by-point response to reviewers that details how you have or plan to address the reviewers' concerns. Please contact me at the email that can be found below my signature if you have questions. Please re-submit your manuscript within two working days, i.e. by Oct 11 2021 11:59PM. Login to Editorial Manager here: https://www.editorialmanager.com/pbiology During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit. Given the disruptions resulting from the ongoing COVID-19 pandemic, please expect delays in the editorial process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible. Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission. Kind regards, Roli Roberts Roland Roberts Senior Editor PLOS Biology rroberts@plos.org 12 Oct 2021 Submitted filename: Ruzicka et al. Response to Previous Reviewer comments.docx Click here for additional data file. 1 Dec 2021 Dear Dr Ruzicka, Thank you for submitting your manuscript "Polygenic signals of sexually antagonistic selection in contemporary human genomes" for consideration as a Research Article at PLOS Biology. Your manuscript has been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by four independent reviewers. IMPORTANT: We'd like to apologise for an unfortunate glitch in our process. When we called in the additional metadata in preparation for peer review, our template letter invited you to include any previous reviews from other journals. You did so, correctly, but our system should have then alerted us to this fact, at which point we would have taken those reviews into account, potentially contacting the other journal, and then highlighting this fact to our reviewers. Unfortunately this alert failed, and your paper went out to new reviewers, but including your reviews and rebuttal. You will see that most of the reviewers nevertheless acknowledge this fact, and have commented accordingly. Overall, we hope that this process will prove useful, despite not providing the expedited journey that we intended. I'll apologise similarly to the reviewers for the confusion. When I described the situation to the Academic Editor, s/he said "I hope that you can convey that [the required revision] is at the milder end of the spectrum. It has been reviewed already, and several of the new reviewers noted that they were therefore reluctant to require yet more analyses. The suggested analyses make sense to me, and would be helpful, but I think that the main issue is to present the work more cautiously, which should be straightforward." I hope that this additional advice will be helpful when deciding how to revise your manuscript. In light of the reviews (below), we will not be able to accept the current version of the manuscript, but we would welcome re-submission of a much-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent for further evaluation by the reviewers. We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology. **IMPORTANT - SUBMITTING YOUR REVISION** Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript: 1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript. *NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point. You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response. 2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type. *Re-submission Checklist* When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record. Please make sure to read the following important policies and guidelines while preparing your revision: *Published Peer Review* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *PLOS Data Policy* Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5 *Blot and Gel Data Policy* We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Roli Roberts Roland Roberts Senior Editor PLOS Biology rroberts@plos.org ***************************************************** REVIEWERS' COMMENTS: Reviewer #1: [identifies himself as Daniel Berner] Comments on PBIOLOGY-D-21-02588R1, 'Polygenic signals of sexually antagonistic selection in contemporary human genomes' I have digested this study as a new reviewer, and to save time, I have haphazardly but not systematically checked how the authors addressed previous reviewer criticism. This study uses a very large human data set, including whole-genome genotype data and information on offspring number, to look for genome-wide signatures of sexually antagonistic (SA) selection. The key analyses include the evaluation of the observed distribution of intersex differentiation (Fst) against null expectations, the association of Fst with MAF, the location of candidate SA SNPs in (non)functional genomic regions, and looking for evidence of balancing selection at candidate SA SNPs. The study performs quite rigorous data filtering to reduce the risk of detecting spurious SA signals, and the theoretical framework underlying their analyses is very carefully developed in my view. The key findings include a collective (but of course not SNP-level) pattern consistent with SA selection on both survival and reproductive output emerging from several analyses, and that SA candidate loci are enriched for functional sites. The study finds no evidence, however, that SA candidate loci are under balancing selection. The quest for demonstrating SA selection is currently an active field in biology, so the study is certainly topical. Moreover, I feel that methodologically, the study is quite rigorous and in many aspects goes beyond what has previously been presented in the field. Of course there have been (more or less problematic) previous attempts to detect SA selection in humans, so this work is not addressing an entirely novel question or producing earth-shattering results. Nevertheless, the overall quality of the investigation and evidence seems outstanding to me, and I feel it is a strength, not a weakness, to revisit important existing research problems with new data and methods. Also, the writing is very clear and analyses are well justified and explained; I really enjoyed reading this manuscript and feel the investigation is well suited for publication in PLoS Biology. I consider this work in very good shape (it is a revision, after all) and have only relatively minor comments. Perhaps the most serious would be to consider testing for elevated genetic diversity around the SA candidates, as a complementation to the balancing selection analysis. Here the details, not sorted by relevance: 1) Line 89 (and L139-140 etc): I was first puzzled by this association between Fst and MAF being taken as evidence of SA selection, because for the allele frequency difference, a positive correlation to MAF exists under pure neutrality (because low-MAF polymorphisms cannot yield high differentiation; Roesti et al. 2012 BMC Evol Biol). I thus performed a brief simulation experiment on the association between differentiation and MAF in the absence of SA selection. This indicated that indeed the raw intersex allele frequency difference is correlated positively to MAF, but Fst is not. Given that the Fst-MAF correlation is just one of the signals taken as evidence of SA selection by the authors, and that its theoretical basis is well developed in the Methods section, I trust this conclusion, although it goes somewhat beyond my direct intuition. 2) L114: This passage makes me wonder about loci that may have antagonistic fitness effects within a sex across the life cycle. E.g., loci that may increase female survival but reduce female fertility. Is this possibility addressed in the paper (I may have overlooked)? 3) L189 and elsewhere: I think in line with what professional statisticians recommend, the term 'statistically significant' should be discarded (e.g., Wasserstein et al. 2019, American Statistician). This dichotomous label is not needed; can simply give the P-value, or even better, the parameter estimates along with their confidence intervals. 4) L296-302: I think a signature consistent with balancing selection would be elevated genetic diversity in the close (i.e., tightly physically linked) vicinity of the candidate SA polymorphisms. I encourage the authors to explore this. For instance, take your best few hundred or so SA candidate SNPs, and retrieve all polymorphisms occurring within windows of, say, 4 kb around them. Then do the same for a large number of non-SA-candidate SNPs. A prediction would be that around the SA candidate SNPs, genetic variation is elevated. Not sure what would be the best diversity metric, but Haenel et al. (2019, Evolution Letters) suggest that the density of high-MAF SNPs is a particularly sensitive diversity measure. 5) L351-355: You here argue that sample size could be an issue, but how large are the Biobank Japan and Finngen data bases? 6) L424: These (or analogous) predictions for mapping artifacts have also been described in previous studies, which could perhaps be acknowledged: https://doi.org/10.1186/1745-6150-7-17 https ://doi.org/10.1111/1755-0998.12613 https ://doi.org/10.3390/genes 10040320 https ://doi.org/10.1111/mec.15255 7) L439, ‚eliminate mapping errors'. Perhaps tone this down slightly; using such a threshold-based approach, it is unlikely that errors are completely eliminated, as some errors may just pass below the detection criterion. 8) L473: Put 'proxy' in singular. 9) Figure 2A-C: How about adding a LOESS smoother to the observed data? Also, in B, is there an explanation as to why the permuted data also seem to show an excess of SNPs in the top Fst quantiles? 10) L902 and 915: I generally think that a lot of significance testing we report is unnecessary. Here, for instance, the histograms convey the full information, no need for P values in my view. Reviewer #2: The work has been reviewed before by three reviewers and so I will comment on the responses to these first. 1. Generally, the authors have responded well to the comments of Reviewer #1 and Reviewer #2. 2. I agree with the previous reviewers regarding the fact that the mixed-model association FST values should be the focus of the manuscript. It would streamline the presentation, especially of the results, which remain hard to follow, and as the authors state in their response to Reviewer #3, it is these estimates that have the best chance to control for population artefacts (though see below). 3. I do not feel they have addressed the concern of Reviewer #3 in their final major point regarding the breakdown of the data into signals of viability and reporductive fitness. 4. The authors need to conclusively demonstrate why there is a differenc between the theoretical and empirical nulls, rather than providing conjecture that it could stem from deviations from HWE. I am loathe to propose further work given that the manuscript has already been under review. However, I strongly feel that there are some key issues that remain unresolved. The authors claim that this is "the first study to present unambiguous signals of sex-differential selection in human genomes" and they claim to show wide-spread, polygenic, sex-differential selection that has a major impact on genomic variation. This is a fairly bold claim and as such requires stronger evidence than I feel has been presented here. 1. I am concerned that there is the potential for sample ascertainment to give extensive bias. The UK Biobank is a healthier sample than the general UK population and the sample is only 46% male. So what causes females to be more likely to register to the UK Biobank? Are there any traits with a genetic basis that could be associated with this ascertainemnt, which then translates ascertainment into sex biases at the frequencies of different alleles. I suspect so, and there is nothing one can do about it. This issue goes beyond discussing this as a potential caveat. As the authors make extrodinary claims - that sexual antagonism maintains variation at a substantial number of locations across the genome (i.e. it is polygenic) - there is a requirement for more evidence then a single association study. I would be more convinced if they replicated the top associations within another biobank. I also feel they need to show that ascertainment of individuals could not cause the results observed here. They also need to control for age-at-enrolment within their analyses. 2. Does between-sex FST increase with the age group of the participants? Women have a higher life expectancy and so will be over-represented in the older age classes. LRS, as measured by number of children, has varied greatly due to cultural factors over the past decades, with family sizes generally becomming smaller over time. As women simply survive longer and are more likely to enter the study in later-life, could these patterns not simlply represent cultural changes across generations linked to viability selection? 3. The authors should use LDScore regression to calculate the correlation between LRS and FST from the mixed model summary statistics. Also as a way to test for enrichment using annotation groups. This would be far more robust (controlling for intercept terms which may reflect population stratification) than the t-statistics presented here. 4. The null model for LRS should not be based on randomised trait values within each sex but rather based on a simulated heritable trait with equal mean and variance across the sexes and genetic correlation = 1. 5. Please also fit PCs of the chromosomes as fixed effects when calculating the LOCO regression coefficients. BoltLMM controls for general population stratification on chromosomes other than the focal, but not the one in which the marker effect size is being estimated. Reviewer #3: Ruzicka et al. develop a suit of methods to test for contemporary sexually-antagonistic (SA) viability selection—using allele frequency differences between males and females in a large cohort—and reproductive selection—using the number of children reported for said individuals. They apply these methods to the UK Biobank and reject a null model of zero SA selection in this cohort. The paper has many notable strengths: Readers will benefit from the inclusion of a theoretical expectation and the permutation-based null alongside it; the control for population structure is novel (though the authors, unfortunately, do not ask whether it successfully removed some of the stratification due to participation bias that Pirastu et al., Nature Genetics 2021 reported); the estimates of SA selection that they derive will be useful for the community and the analysis is one of the most thorough I have seen in this space. The breakdown into the fitness components is cool. At the same time, key choices made by the authors need to be revisited in order for the manuscript to merit publication. Most importantly, these include presenting the most conservative version of their analyses in the main text rather than in the supplement—in particular setting a proper null expectation and control for LD in all analyses. If these issues, as detailed below, can be solved, I think the work would make for a valuable contribution to the literature. The authors do well to clarify that they are not estimating the strength of sexually antagonistic selection (SAS) or its importance in shaping patterns of genetic variation. Rather, they set more modest goals of testing a null hypothesis of no contemporary SA selection, and pointing to the likeliest candidate targets of SAS. Given this fact, however, I would encourage the authors to show an LD-pruned and population structure-aware version of their tests as main figures and text. The current presentation of the results in the main text is anti-conservative, as it is inflated by both factors. At several points along the paper, the authors seem to implicitly assume "all or nothing" theoretical models or interpretation that weaken their analyses and conclusions somewhat. One example is in the relationship reported for minor allele frequency and Fst (around line 212)—a part I might, as the previous cycle's reviewer 1 had suggested, either remove or further flesh out possible interpretations and caveats to the conclusion. An example caveat is ascertainment bias. Do the authors observe similar relationships with allele frequency in a different sample (with relatively diverged ancestry)? Another example is in the overarching assumption that alleles are either experience the same selective pressures in males and females or are subject to SA selection. What about the (seemingly more likely, agnostically) case of sex-specific selection, in which selective effects differ to some unknown extent between the sexes? Yet another example of the "all or nothing" interpretive approach is in a section entitled "Evolutionary Analysis of candidate SA SNPs" (lines 282-302. The authors test for a correlation between measures of balancing selection and between-sex Fst. We might, at best, learn that there is no evidence for a very strong hypothesis of a monotonic relationship between contemporary SA selection and long-term balancing selection—but this is a rather weak statement, and one that the authors still need to put in substantial work to establish. (Because, as in other parts of the paper, there is no proper control for LD or consideration of estimation noise). The authors also innovate in their control for population structure stratification. They replace Fst with the test statistic Lst, the log-odds ratio of an allele being carried by a male relative to a female, beyond what can be explained by the main axes of genetic ancestry (i.e., by including principal components of the genotype matrix as covariates). Is this approach enough to remove some of the stratification reported by Pirastu et al.? Or does everything that follows boil down to differences in interpretation of the same signals, where Ruzicka et al. interpret them as SA selection and Pirastu et al. interpret them as artefacts due to participation bias? Readers would benefit from some discussion on this. The analysis of correlation with GWAS effect estimates seems invalid to me, because of the same two familiar reasons: It does not set up a null, empirical or otherwise, and does control for LD in any way. For the latter, if complex traits like testosterone are the trait of interest, genetic correlations can be tested for rather than raw correlations. Speaking of T, the observation "Loci associated with T had lower than average gametic Fst" needs to be fleshed out: There are no statistics to back it up, and the authors do not suggest what it might mean. This remains especially unclear to me given that variation in testosterone levels have highly distinct genetic bases in the two sexes (Flynn et al., Eur J Hum Genetics 2021; Sinnott-Armstrong, Naqvi et al., eLife 2021). Minor comments: - Can the authors cite sources supporting the choice of cutoff age of 45 for child bearing / having? Also, does the UKB questionnaire ask only about biological children? - In the UKB (an order of a million chromosomes sampled at each site), a 1% cutoff means essentially no rare variation will be observed, further weakening the conclusion of the analysis on the relationship between minor allele frequency and SA selection. Reviewer #4: This review is incomplete (due to a mistake on the reviewer's part about timing of the deadline), so some aspects of the how the methods yielded the results observed may not have been fully considered by the reviewer. General comment (written quickly this morning): I really liked this paper -- I think it shows a conservative approach for testing for the effect of sexually antagonistic selection in humans, which is an area of considerable interest. The authors seem to have gone to extensive lengths to try to control a range of sources of error, considering other previous analyses and implementing controls from them. If anything, I think some of these controls may cause an underestimation of the effects. As with anything highly polygenic and driven by subtle allele frequency changes, I am still not sure there isn't some unconsidered variable driving these observations, but I have been mulling it over for a few weeks now and nothing obvious would make sense. Thus, I think it's a good candidate for PLOS Biology. Well written and clear. Other comments organized from previous notes: Major comments: 1. Any explanation for why the difference between observed and theoretical is so much larger than observed and null for panel 2E? The other ones look like good agreement between theoretical and null distributions. This one is looking at the difference between adult and offspring allele frequencies in male vs. difference in females. I was surprised by the differences in this figure and felt like I would have liked some more explanation about why that might occur. Minor comments: Line 159: Is it really true that intermediate frequency alleles inflate FST_hat more than low frequency alleles under drift? I thought the whole point of FST is that it's representing drift irrespective of allele frequency. Not sure about this particular point. Line 174: I can appreciate that to be conservative and make it harder to detect SA selection, you need to adopt these filtering criteria, but couldn't differences in missing rates be caused by an indel that has been favoured in one sex? Would this increase chance of false negatives? Or are these really egregious bionformatic errors? I guess the method described on line 425-441 is thought to only exclude loci under implausibly strong SA selection? (given human mortality data)? Figure 2: I can appreciate that you'd like to maintain the same colour scheme for panels D-F as in A-C, but I didn't read the caption carefully at first and I thought that the open bars were somehow representing the null and coloured bars representing the data. If you could somehow more clearly emphasize on the Figure itself that these are representing the difference between observed - null and observed - theoretical, that would help. I think making both hollow and filled bars the same colour would help me notice this more easily. Maybe add a legned with a hollow box and a filled box? Line 205: I'm not sure that this is evidence about polygenic. If we knew nothing about linkage, it could be possible that all the enriched high-FST SNPs occurred in a single block of highly linked alleles with one causal allele under strong selection that simply wasn't quite enough to get above the bonferroni. Line 262: what are the effect sizes here? With these types of tests, significant p-values can be found with very small effects, as the sample size is very large. Can you provide some estimate of odds ratio or something? (how much more likely is a high FST to be found in a genic vs. non-genic region). Line 269: It wasn't totally clear to me what was being done here without going to the methods, so a little more explanation would help. What is the assocation that is being tested on line 272? Is this testing somehow whether the high FST SNPs are enriched among the GWAS candidate loci? It's explained clearly on line 555 but would be easier to understand with a bit more detail in the results where it's first introduced. Line 280: Interesting that loci affecting testosterone had lower than average FST -- but I guess maybe that's a conflict that has already been resolved to the sex chromosomes? Line 368: Interesting that there was no signature of balancing selection on these loci -- reasons outlined here are appropriate. 2 Jun 2022 Submitted filename: Ruzicka et al. revised Reply to reviewers.docx Click here for additional data file. 6 Jul 2022 Dear Dr Ruzicka, Thank you for your patience while we considered your revised manuscript "Polygenic signals of sex differences in selection in contemporary humans" for publication as a Research Article at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor and two of the original reviewers. Based on the reviews , we are likely to accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewers. Please also make sure to address the following data and other policy-related requests. IMPORTANT: a) Please make the title more declarative. We suggest "UK Biobank data reveal polygenic signals of sex differences in selection in contemporary humans" - this contains an active verb and mentions the source of the data. b) Please attend to the remaining modest requests from reviewer #3. c) Please address my Data Policy requests below; specifically, we need you to supply the numerical values underlying Figs 2ABCDEF, 3BC, 4ABCDEF, 5ABC, 6ABCDEFGH, SD1, SD2, SF1, SF2, three unnumbered Figs in SG, SH1, SI1, SI2. I note that your Github depositions are currently empty; please complete these so that we can check policy compliance. In addition, we need a citeable, permanent record of the data, e.g. in Zenodo. d) Please also cite the location of the data clearly in each main and supplementary Fig legend, e.g. “The data and code needed to generate this Figure can be found in https://github.com/filruzicka/polygenic_SA_selection_in_the_UK_biobank https://github.com/lukeholman/UKBB_LDSC and https://zenodo.org/record/XXXXXX.” As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript. We expect to receive your revised manuscript within two weeks. To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following: - a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list - a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable) - a track-changes file indicating any changes that you have made to the manuscript. NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines: https://journals.plos.org/plosbiology/s/supporting-information *Published Peer Review History* Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *Press* Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article. *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please do not hesitate to contact me should you have any questions. Sincerely, Roli Roberts Roland Roberts, PhD Senior Editor, rroberts@plos.org, PLOS Biology ------------------------------------------------------------------------ DATA POLICY: You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797 Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms: 1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore). 2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication. Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figs 2ABCDEF, 3BC, 4ABCDEF, 5ABC, 6ABCDEFGH, SD1, SD2, SF1, SF2, three unnumbered Figs in SG, SH1, SI1, SI2. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values). IMPORTANT: Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend. Please ensure that your Data Statement in the submission system accurately describes where your data can be found. ------------------------------------------------------------------------ DATA NOT SHOWN? - Please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of these data or provide figures presenting the results and the data underlying the figure(s). ------------------------------------------------------------------------ REVIEWERS' COMMENTS: Reviewer #1: I have now gone through the revision (PBIOLOGY-D-21-02588R2) of the manuscript I reviewed earlier. I have no further criticism; I think the authors have made a very careful and extensive revision suited for publication. I feel the paper will make a fine contribution to PLoS Biol! Reviewer #3: [identifies himself as Arbel Harpak] Overall, I think the authors did an excellent job at addressing reviewers' comments seriously and thoroughly. I include a few comments / questions I am curious about below, but strongly believe that at this point the authors have earned the right to leave a small fraction of (now seven!) reviewers' musings unattended… 1. I did not understand the response to my last comment asking why alleles with maf <1% were excluded. I think that in practice examining only very common alleles may give a warped view of selection. 2. I remain curious about the weird correlation of SD-selection statistics with testosterone (or, in the current version, lack of correlation). A seemingly very related 2022 bioRxiv preprint by Zhu et al. examined the relationship between sex differences in genetic effects on traits and male-female Fst. Only one trait repeatedly (in three datasets; though much smaller than the UKB) came up as slightly significant---testosterone. Can the authors comment on whether or not this is discrepant with their results, and why? On that note, perhaps some of the GWAS analyzed in Fig. 5C should be repeated with a sex-stratified GWAS. In particular extremes like testosterone where genetic effects on the trait are highly sex-specific. 3. I appreciate the efforts by the authors to make the language and the presentation more careful, but I think there's still some room for improvement. One example: The legend of figure 5 says that "positive correlations indicate that "a trait is more beneficial in females than in males". This is both ill-defined (I believe the authors you mean higher trait values are favored) and overstates. The statement can be made about the benefit of alleles associated with the trait; but given plausible complications, such as pleiotropy, recruitment biases… I think any claim from a single-trait test that does not consider correlations between traits, levels of background selection, recruitment or other modes of population stratification is at best suggestive about the focal trait being under selection. The authors should try and be extra cautious when it comes to claim about sex-specific selection, especially when they themselves fully acknowledge the looming caveat of study recruitment biases. 18 Jul 2022 Submitted filename: Ruzicka et al. revised Reply to Reviewers 2.docx Click here for additional data file. 27 Jul 2022 Dear Dr. Ruzicka, Thank you for the submission of your revised Research Article "Polygenic signals of sex differences in selection in humans from the UK Biobank" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Nick Barton, I am pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes. Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process. PRESS We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf. We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/. Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. Sincerely, Paula Jauregui on behalf of Roland G Roberts, PhD, PhD Senior Editor PLOS Biology rroberts@plos.org

87 in total

9. Detection of Allelic Frequency Differences between the Sexes in Humans: A Signature of Sexually Antagonistic Selection.

Authors: Elise A Lucotte; Romain Laurent; Evelyne Heyer; Laure Ségurel; Bruno Toupance
Journal: Genome Biol Evol Date: 2016-06-02 Impact factor: 3.416