Literature DB >> 35078366

An unbiased test reveals no enrichment of sexually antagonistic polymorphisms on the human X chromosome.

Abstract

Mutations with beneficial effects in one sex can have deleterious effects in the other. Such 'sexually antagonistic' (SA) variants contribute to variation in life-history traits and overall fitness, yet their genomic distribution is poorly resolved. Theory predicts that SA variants could be enriched on the X chromosome or autosomes, yet current empirical tests face two formidable challenges: (i) identifying SA selection in genomic data is difficult; and (ii) metrics of SA variation show persistent biases towards the X, even when SA variants are randomly distributed across the genome. Here, we present an unbiased test of the theory that SA variants are enriched on the X. We first develop models for reproductive FST-a metric for quantifying sex-differential (including SA) effects of genetic variants on lifetime reproductive success-that control for X-linked biases. Comparing data from approximately 250 000 UK Biobank individuals to our models, we find FST elevations consistent with both X-linked and autosomal SA polymorphisms affecting reproductive success in humans. However, the extent of FST elevations does not differ from a model in which SA polymorphisms are randomly distributed across the genome. We argue that the polygenic nature of SA variation, along with sex asymmetries in SA effects, might render X-linked enrichment of SA polymorphisms unlikely.

Entities: Chemical

Keywords: FST; empirical population genomics; humans; sex chromosomes; sexually antagonistic selection; theoretical models

Mesh：

Year: 2022 PMID： 35078366 PMCID： PMC8790371 DOI： 10.1098/rspb.2021.2314

Source DB: PubMed Journal: Proc Biol Sci ISSN： 0962-8452 Impact factor: 5.349

Introduction

Adaptation requires genetic variation for fitness, yet we know surprisingly little about its genetic basis [1,2]. Is fitness variation mostly attributable to rare variants maintained by recurrent mutation, or to common variants maintained by various forms of balancing selection? To what extent are the frequencies of fitness-affecting variants influenced by genetic drift, departures from equilibrium or gene flow from neighbouring populations? To what extent do loci with large versus small fitness effects contribute to genome-wide fitness variation? Also, how are these genetic variants distributed across the genome? These are some of the most pressing questions in evolutionary biology, yet they are also the most challenging to answer [3]. Sexually antagonistic (SA) polymorphisms, wherein the alleles of a locus have opposing fitness effects on each sex, are an important class of fitness-affecting genetic variant. The basic conditions giving rise to SA fitness effects—including sex differences in selection on traits expressed by both sexes, and sex-differential phenotypic effects of mutations—are permissive [4] and have been widely documented in natural and experimental populations [5-8]. Accordingly, direct evidence for SA genetic variation has been reported in numerous quantitative genetic studies (e.g. [9-11]; see also [12]). Yet despite this, our understanding of the genomic distribution, fitness effects and evolutionary dynamics of SA variants remains poor [13-16]. An intriguing and oft-debated question is whether SA polymorphisms are more likely to be found on autosomes or the X chromosome. This question has inspired an extensive body of theoretical research (e.g. [17-23]) that has identified conditions where the X chromosome should be enriched, or deficient, for SA polymorphisms. Rice [18], for example, highlighted conditions under which the X is a more permissive genomic location for balancing selection of SA alleles, while Fry [20] later emphasized the sensitivity of Rice's predictions to the dominance relations between SA alleles, about which little is known [24]. Moreover, when SA polymorphisms are maintained by recurrent mutation rather than balancing selection, the X chromosome should typically harbour fewer SA polymorphisms, owing to stronger purifying selection and elevated rates of genetic drift at X-linked compared to autosomal loci [25-28]. In short, empirical studies—rather than theory alone—are needed to resolve whether the X, autosomes, or neither chromosome type, are enriched for SA polymorphisms. So far, two formidable challenges have hampered empirical research on this question. The first is the logistical difficulty of identifying SA polymorphisms using current genomic approaches (e.g. [29-33]), which typically lack the statistical power needed to confidently identify SA loci [16,29,34,35]. The second is that X-linkage inflates empirical signals of SA polymorphism, which can lead to biases in detection and erroneous inferences of elevated SA polymorphism on the X [23]. These biases arise, on the one hand, from less constrained conditions for allele frequency divergence between the sexes at X-linked compared to autosomal loci [23,29,30], and, on the other hand, from the inherently stronger effects of X-linked compared to autosomal polymorphisms on male fitness variances and cross-sex fitness covariances [23,28,36-38]. These twin challenges of estimation and inference render previous tests for an enrichment of SA polymorphisms on the X chromosome ambiguous [30,39-47]. Here, we develop and implement an unbiased test for the enrichment of SA polymorphisms on the X chromosome. Building on recently developed methods for characterizing polygenic signals of SA polymorphism in population genomic data [48], we first test for polygenic signals of SA polymorphism on the X chromosome and autosomes, using genotypic and lifetime reproductive success (LRS) data from approximately 250 000 males and females from the UK Biobank [49]. Our metric of SA polymorphism, ‘reproductive FST’, quantifies sex differences in the genetic basis of LRS among UK Biobank adults, and thus controls for X-autosome biases in between-sex allele frequency differences that may arise between fertilization and reproductive maturity [23,29,30]. We then develop an idealized model of reproductive FST in which the prevalence and attributes of SA polymorphisms do not systematically differ between the X and autosomes. By comparing polygenic signals of SA polymorphism in the UK Biobank to this theoretical baseline, we control for both the elevated sampling variance and stronger effects of X-linked polymorphisms on the reproductive FST metric [23,28,36-38], allowing us to assess whether SA polymorphisms are disproportionately autosomal or X-linked.

Theoretical background

Our empirical analyses address two questions. First, is there a polygenic signal of SA polymorphism on the human X chromosome in the UK Biobank dataset (an autosomal signal was previously reported by Ruzicka et al. [48])? Second, do the magnitudes of these signals imply an elevation of SA polymorphisms on the X relative to autosomes? To formally test each question, we developed mathematical models for ‘reproductive FST’—a metric that potentially captures sex-differential (including SA) effects of alleles on LRS (see below and [48])—and compared empirical data to each model. For the first question, we developed null models for the distribution of reproductive FST in the absence of sex differences in selection. For the second question, we developed models of reproductive FST in which SA polymorphisms are present and inflate reproductive FST on both the X and autosomes, yet neither chromosome is enriched for them (i.e. there are no systematic differences between chromosomes in the abundance or attributes of SA polymorphisms). This second model accounts for other factors that are likely to differ between the X and autosomes, including the effects of diploidy versus haploidy on the expression of fitness variation. We elaborate on each model of reproductive FST below.

Definition of reproductive FST

Our models and empirical analyses focus on bi-allelic loci (i.e. the vast majority of polymorphic sites in population genomic datasets). For an X-linked locus with alleles A1 and A2, females have three genotypes (A1A1, A1A2 and A2A2) and males have two (A1 and A2). Let n11,, n12, and n22, represent the number of females of each genotype recorded in the UK Biobank dataset, respectively (N = n11, + n12, + n22,). The frequency of the A1 allele in the female sample is . Likewise, n1, and n2, represent the number of males from the UK Biobank that carry alleles A1 and A2 (N = n1, + n2,), and is the A1 allele frequency in the male sample. Letting F represent the total number of offspring produced by females carrying genotype ij (ij = {11, 12, 22}), and M represent the total number of offspring produced by males carrying genotype i (i = {1, 2}), the expected frequencies with which females and males transmit the A1 allele to their offspring (respectively) areReproductive FST is a standardized measure of sex differences in allele transmission, owing to sex-specific genetic variation for LRS, and controlling for allele frequency differences between adults [48]. It is defined aswhere . The same expression applies to autosomal loci, once male allele frequencies are adjusted to include diploidy in males, i.e.: and .

Reproductive FST in the absence of sex differences in selection

With large sample sizes (as in the UK Biobank), no sex differences in selection and excluding polymorphic loci in which the minor allele frequency (MAF) is very low (i.e. less than 1% in the UK Biobank; see below), reproductive FST estimates are well-approximated by a χ2 distribution, with estimates for an X-linked and autosomal locus, respectively, asandwhere X is a χ2 random variable with one degree of freedom, μ, μ, and correspond to the means (μ, μ) and variances (, ) for female and male LRS (respectively) in the UK Biobank, is a measure of the deviation of female genotype frequencies from Hardy–Weinberg expectations (we define as in [16,34], where when heterozygotes are in excess of Hardy–Weinberg predictions, and when heterozygotes are deficient; see the electronic supplementary material); is the deviation in the male sample. With genotype frequencies near Hardy–Weinberg expectations and , these expressions simplify—for X-linked and autosomal loci, respectively—toand

Reproductive FST assuming a random genomic distribution of sexually antagonistic polymorphisms

To test whether SA polymorphisms are enriched on the X chromosome, we must first define a model for reproductive in which SA polymorphisms are present, randomly distributed across the genome, and where the frequencies and fitness effects of SA polymorphisms do not systematically differ between the X and autosomes. We expect that there will be many specific evolutionary genetic scenarios that can lead to equivalent patterns of SA polymorphism between autosomes and the X. For example, when SA polymorphisms are maintained at equilibrium under balancing selection, there are specific conditions of dominance between SA alleles that generate identical equilibrium frequencies of balanced SA alleles between the X and autosomes [23]. More complicated contexts of selection, including interactions between recurrent mutation, genetic drift and the distributions of sex-specific selection and dominance coefficients, may also sometimes result in similar patterns of SA polymorphism on the X and autosomes (although we expect that levels of polymorphism maintained by recurrent mutation will tend to be higher on autosomes, given the enhanced purifying selection and genetic drift that have been predicted [25] and documented (e.g. [28,50] for X-linked genes). In our baseline model of equivalence between the X and autosomes, we, therefore, focus on the case where SA polymorphisms are maintained at equilibrium under balancing selection, which allows us to set up a testable prediction regarding the signal of SA polymorphism on the X versus the autosomes. Whether this idealized scenario is a good model for SA polymorphism in the genomes of humans or other species is a point we return to in the Discussion. For simplicity, we consider the case where adult genotype frequencies are approximately equal between the sexes and close to Hardy–Weinberg expectations, as is largely the case within the UK Biobank dataset. For a set of n and n polymorphic SA loci on the X chromosome and autosomes, respectively, the expected inflation of mean (relative to null expectations defined in equations (2.4a,b)) isandwhere and denote mean among these loci (see the electronic supplementary material). The terms in square brackets capture effects of sex-differential selection on . Note that when there are no sex differences in selection (i.e. the terms in square brackets evaluate to zero), there will be no inflation (i.e. and ), and and conform to the null models in equation (2.4). When fitness effects of SA alleles are small [19,51], single-generation allele frequency changes at X-linked and autosomal loci (respectively) are well-approximated byand(see the electronic supplementary material). SA selection is expected to maintain genetic variation (i.e. conditions for balancing selection are met) when the strength of selection for a given locus is relatively symmetric between the sexes, in which case the terms in the square brackets of equations (2.6a) and (2.6b) must sum to zero (i.e. ). Note that this condition applies whether or not there are dominance interactions, including dominance reversals [24,52-54], between SA alleles at a given locus. Substituting this identity into equations (2.5a) and (2.5b), we obtain:and With no systematic differences between the X and autosomes in the density of polymorphic SA loci, or the frequencies and fitness effects of their alleles, then the summation terms in and should have the same magnitudes, and the X-to-autosome ratio for the expected amount of FST inflation will be In other words, we expect a 2.25-fold higher inflation of for the X compared to the autosomes in the absence of chromosomal differences for SA polymorphisms. This additional inflation for X-linked loci arises because haploidy in males inflates the contributions of X-linked loci to the variance for male fitness components (relative to autosomal loci) [23]. In practice, loci contributing to an inflation of FST may either be targets of selection or in linkage disequilibrium (LD) with such targets. While the prediction in equation (2.8) applies to targets of SA selection, it will remain applicable to linked loci provided their degree of LD with target loci does not systematically differ between the X chromosome and autosomes (see the electronic supplementary material). In our comparisons of the two chromosome types (see below), we used LD pruning to minimize the differential effects of hitchhiking between chromosomes. Because LD tends to be higher on the human X chromosome compared to autosomes [55], any uncontrolled chromosomal bias in the effects of LD on FST inflation should tend to increase the likelihood of identifying a signal of enriched SA polymorphism on the X, which makes our eventual conclusion (that there is no such signal) a conservative one. Below, we empirically estimate the amount of inflation in for the X and the autosomes in the UK Biobank (i.e. inflation relative to the average amount of noise in FST estimates, per chromosome), and compare these estimates to the baseline prediction of 9/4.

Methods

Quality-filtering of UK Biobank data

Access to UK Biobank data was granted under project number 52 049. We employed the same quality-filtering settings for X-linked loci in this study as employed for autosomal loci by Ruzicka et al. [48]. Briefly, we excluded individuals with high relatedness (3rd degree or closer), non-White British ancestry, high heterozygosity and missing rates, individuals whose reported sex and genetic sex differed, individuals whose age was less than 45 years, and aneuploids. Across retained individuals, we excluded non-diallelic sites, sites with MAF < 0.01, missing rates greater than 5%, p-values < 10−6 in tests of Hardy–Weinberg equilibrium, and imputation INFO score less than or equal to 0.8. Because there are few genotyped sites on the X chromosome, our analyses are focused on imputed data. Note that, unlike Ruzicka et al. [48], we did not filter our data for possible artefacts arising from mis-mapping of sequence reads to sex chromosomes [35,56] because reproductive FST controls for allele frequency differences between adults and, in effect, for artefacts arising when estimating adult allele frequencies.

Quantifying polygenic signals of sexually antagonistic polymorphism

For each X-linked locus, we estimated allele frequencies in adults ( and ). To quantify LRS, we used reported offspring numbers from UK Biobank field 2405 ‘Number of children fathered’ for males, and field 2743 ‘Number of live births' for females (see [48] for further details on quality-filtering). These data were used to estimate allele frequency transmission ( and ; equation (2.1)) and reproductive for each polymorphic locus (equation (2.2)). We generated a null distribution for reproductive by simulating loci from the theoretical null model (i.e. equation (2.3a)), with the sample sizes per locus (N and N per locus) matching those in the UK Biobank data, and estimates of the sex-specific mean and variance for LRS based on all individuals included in the dataset. We also generated an empirical null distribution for through a single permutation of LRS values (without permuting sex) and re-calculating for each locus using permuted data. This permutation procedure ensured that neither sex differences in the allele frequencies among adults, nor sex differences in the relationship between genotype and LRS, contribute to in the permuted data, leaving only estimation error (including error arising from locus-specific heterogeneity in LRS values owing to variation in missing rates across single nucleotide polymorphisms (SNPs)) to contribute to in permuted data. We chose to perform a single permutation of LRS values for computational efficiency and because our focus was on testing significance across the set of loci, rather than establishing statistical significance for individual loci. We tested whether the distribution of in observed data differed from both null distributions using Wilcoxon sum-rank and Kolmogorov–Smirnov tests. To consolidate our inference that inflations reflect genuine phenotypic effects (as opposed to technical artefacts), we assessed whether sites with elevated were more likely to be functional. We first obtained variant effect predictions for each SNP using annotations from the hg19 reference genome in SNPEff [57], with sites categorized as ‘intergenic’ or ‘genic’, the latter defined broadly to include coding and potential regulatory genomic regions. We then performed logistic regressions, in which genic/intergenic was the binary response variable and the independent variable, to assess relationships between and genic/intergenic status. We also assessed whether associations with genic status were greater in observed than null (both simulated or permuted) data by re-calculating the regression coefficient of on genic status among 1000 bootstrap replicates of the data, where one replicate consists of the set of SNPs sampled with replacement.

Comparing signals of sexually antagonistic polymorphism on autosomes and the X

We compared on autosomes to on the X chromosome by first estimating the ratio of mean on the X relative to the autosomes (for simulated, permuted and observed data). We also estimated and —each calculated as the difference between the observed mean and the mean from either the simulated or permuted null data—and the resulting to ratio. We obtained confidence intervals and empirical p-values for these ratios by sampling autosomal and X-linked loci with replacement, estimating the ratios in the resampled data and repeating this procedure across 1000 bootstrap replicates. In comparisons of autosomal and X-linked data, we focus on a set of LD-pruned loci (PLINK settings ‘–indep-pairwise 50 10 0.2’) to avoid biases arising from differences in the intensity of linked selection for autosomal and X-linked loci (see above).

Results

Signals of sexually antagonistic polymorphism for X-linked loci

The total sample size, after quality-filtering, includes 249 021 individuals (N = 115 531 and N = 133 490), N = 229 196 imputed polymorphic sites on the X chromosome and N = 7 851 642 imputed polymorphic sites on autosomes. We first compared reproductive on the X chromosome to a distribution of simulated from the theoretical null distribution (based on equation (2.3a)) and to an empirical null distribution obtained by permuting LRS values within each sex (see Methods). As shown for autosomal loci previously (figure 1; [48]) and as predicted under SA selection, mean for X-linked loci was elevated relative to both theoretical and empirical null distributions (theoretical ; permuted; permuted observed ; Wilcoxon rank-sum and Kolmogorov–Smirnov tests, all p < 0.001; figure 1; electronic supplementary material, figure S7). We observed a 14.8% enrichment of observed sites in the top 1% quantile of the theoretical null distribution (expected number of sites = 2292; observed number of sites = 2632; χ2 = 23.592, p < 0.001; figure 1; electronic supplementary material, figure S7) and a 8.4% enrichment of observed sites in the top 1% quantile of the empirical null distribution (expected number of sites = 2292; observed number of sites = 2484; χ2 = 7.719, p = 0.005). There were no individual large-effect loci contributing to this signal: the minimum χ2 p-value across all sites (1.213 × 10−5) was well above the Bonferroni-corrected threshold (2.182 × 10−7) and the minimum false dicovery rate q-value was 0.807. Thus, genomic signals of SA polymorphism for X-linked loci are polygenic.

Figure 1

Polygenic signals of SA polymorphism on the X chromosome and autosomes relative to ‘no sex-differential selection’ nulls. (a) Proportion of X-linked sites (grey, permuted; pink, observed) and autosomal sites (grey, permuted; green, observed) in each of 100 quantiles of a simulated null distribution for , with the null distributions described by equations (2.3a) and (2.3b), respectively. In the absence of sex-differential selection, approximately 1% of sites should fall into each quantile of the simulated null distribution. (b) Observed/permuted ratio of the proportion of sites in each of 100 quantiles of the simulated null distribution, for X-linked (pink) and autosomal (green) sites, respectively. Both the above panels used the full set of N = 229 196 and NAuto. = 7 851 642 imputed sites because: (i) signals of SA polymorphism relative to null distributions cannot be artificially inflated by LD between sites (when considering X-linked and autosomal sites separately; see [48]); and (ii) power to detect X versus autosome differences is reduced by LD pruning. We present equivalent figures for LD-pruned data as electronic supplementary material, figure S7. (Online version in colour.) If genic sites (broadly defined to include coding and regulatory regions) are more likely to have phenotypic effects than intergenic sites, and if inflations reflect genuine phenotypic effects of SA polymorphisms, we should observe that sites with high are more likely to be genic. As found for autosomes previously [48], we detected a positive association between and genic status among X-linked polymorphisms (SNP coded as genic based on genome annotations; see Methods; binomial generalized linear model (GLM), log odds ratio (logOR) ± 95 confidence interval (CI) = 16 399.81[12 189.41–20 601.39], p < 0.001; electronic supplementary material, figure S8). By contrast, we found no significant association between permuted and genic status (binomial GLM, logOR ± 95 CI = 3822.99[–679.01–8312.16], p = 0.096; electronic supplementary material, figure S8), or between simulated and genic status (binomial GLM, logOR ± 95 CI = 2107.52[–2408.14–6610.39], p = 0.360), and the association between and genic status was stronger in the observed than permuted data (1000 bootstrap replicates of the difference between the log odds-ratio in observed and permuted data = 12 680.27[6417–18 382.84], empirical p < 0.001). While these associations represent suggestive evidence inflations are attributable to genuine phenotypic effects (and align with previous enrichment of candidate SA sites in functional genomic regions; e.g. [47]), we emphasize that the evidence is not definitive, as non-genic sites may be functional as well.

No evidence that sexually antagonistic polymorphisms are enriched on either chromosome type

Evidence for elevated reproductive among X-linked loci (relative to the ‘no sex-differential selection’ null described by equation (2.3a)), together with previous evidence for elevated reproductive on autosomes [48] (relative to the ‘no sex-differential selection’ null described by equation (2.3b)), suggest that SA polymorphisms segregate on both chromosome types. These findings provided motivation to compare signals of SA polymorphism on autosomes relative to the X. We first compared mean between the X chromosome and autosomes (using a set of LD-pruned sites; N = 29 859; NAuto. = 1 056 003), which should be larger for X-linked than autosomal loci (even in the absence of any SA polymorphism) because of the smaller sample sizes, and thus higher sampling variances, for X-linked loci. Accordingly, the ratio of mean X-linked to autosomal (±95 CI) was greater than one (figure 2a), whether in simulated values based on theoretical null distributions (, , ), permuted data (, , ), or observed data (, , ).

Figure 2

Comparing polygenic signals of SA polymorphism on the X chromosome and autosomes. (a) Distribution of the ratio of mean () based on 1000 bootstrap replicates for each of our three classes of data: simulated data from the theoretical null distribution (top), permuted data (middle) and observed data (bottom), illustrating the elevation in X-linked relative to autosomal (i.e. ), even in the absence of SA polymorphism (i.e. in simulated and permuted data). (b) Distribution of the ratio of estimated X-linked to autosomal inflation in ( and ), across 1000 bootstrap replicates. The top panel uses observed and simulated data to estimate and , while the bottom panel uses observed and permuted data to estimate and . The dashed vertical line shows the theoretically predicted 9/4 X-to-autosome ratio when randomly distributed balanced SA polymorphisms account for the inflations of FST on each chromosome type. In both panels, we used a set of LD-pruned sites, rather than the full data, to avoid biases arising from differences in the extent of hitchhiking between autosomes and the X (see Methods). (Online version in colour.) To assess whether SA polymorphisms are enriched on autosomes or the X, we estimated, for each chromosome type, the degree of inflation relative to the null (i.e. and ; see Theoretical Background) as the mean observed minus the fraction of that is attributable to sampling variance (i.e. mean simulated or permuted ). We then compared the ratio of X to autosome inflation to a baseline model in which the chromosome types exhibit no differences in SA polymorphism (i.e. ; equation (2.8)). We estimated as 1.770[0.828–3.290] based on observed relative to simulated values, and as 1.086[–0.877–3.291] based on observed relative to permuted values. Whereas enrichment of SA polymorphisms on the X would predict a ratio greater than 2.25, both point estimates fell below 2.25, though neither difference was statistically significant (empirical p = 0.853 and p = 0.864 for simulated and permuted data, respectively). Overall, there is no compelling evidence for an enrichment of SA polymorphisms on either chromosome type.

Discussion

Research on the genetic basis of SA fitness variation is motivated, in part, by its relevance to broader questions about adaptive evolution, such as the distribution of dominance coefficients among adaptive mutations [1,24,58], the extent of balancing selection in genomes [3] and the evolution of sex chromosome systems [59-62]. For example, evidence for enrichment of SA variants on the X chromosome might imply that SA polymorphisms evolve under balancing selection, with male-beneficial variants typically recessive and female-beneficial variants typically dominant [18,19]. On the other hand, evidence for enrichment of SA variants on autosomes might imply that SA variants exhibit co-dominant fitness effects [17], experience beneficial reversals of dominance [20] and/or have evolutionary dynamics dominated by directional selection or drift [26,27]. Yet empirically assessing whether SA polymorphisms are typically X-linked or autosomal is extremely challenging [23,34,35]. Here, we overcome the logistical hurdle of detecting SA polymorphisms by using a study organism—humans—in which there is previous evidence for SA selection [10,63,64] and a study population—the UK Biobank—in which sample sizes are large enough to detect polygenic signals of SA polymorphism [48]. We overcome the bias towards elevated X-linked between-sex allele frequency differentiation among adults [23,29,30] by using a metric of SA polymorphism—reproductive FST—that controls for allele frequency differences among adults [48]. And we overcome the biases arising from larger effects of X-linked alleles on metrics of SA variation, owing to elevated X-linked sampling variances and X-linked haploidy in males [23,36], by developing models for reproductive FST estimates that account for these effects. Our main finding was that X-linked sites displayed inflations consistent with polygenic SA selection (as shown for autosomal sites previously [48]) but the extent of inflation did not significantly differ from the 9/4 X-autosome ratio predicted by an idealized model in which SA polymorphisms are evenly distributed across the genome. These results accord well with re-analyses of previous empirical research in a range of species [23], which reveal little compelling evidence for enrichment of SA polymorphisms on the X chromosome once biases towards enhanced X-linked effects are accounted for. Indeed, enrichment of SA polymorphisms on the X chromosome might be considered unlikely for additional reasons. In particular, SA variants are likely to exhibit small fitness effects owing to the polygenic nature of fitness variation [28,65,66], as also evidenced by the absence of large-effect SA loci in this dataset. Small-effect mutations, including SA mutations, are highly susceptible to genetic drift and potentially more likely to be co-dominant [66,67]—conditions that both favour autosomal enrichment of SA polymorphisms [17,26,27]. Furthermore, fitness effects of SA variants may be asymmetric between sexes owing to well-documented sex differences in strategies employed to achieve reproductive success [8,68-70]. This could render SA variants susceptible to directional (rather than balancing) selection, which also tends to favour autosomal enrichment of SA polymorphisms [26,27].

Limitations of our analysis approach and future directions

Though our analyses correct for several pre-existing biases, they present some limitations. In terms of the data, the relatively small number of independently segregating polymorphisms across the human X chromosome (N ∼ 30 000 LD-pruned imputed sites) reduces power to detect differences between autosomal and X-linked sites. While there is prior support for SA selection on phenotypes [64] and for polygenic signals of autosomal SA polymorphism [48] in this population, the extent to which inflations are driven by SA polymorphisms, as opposed to some loci showing sex differences in directional selection and neutral sites linked to selected polymorphisms, is unclear. Empirical work in systems where fitness can be measured relatively easily, sufficiently large samples of genomic sequences can be obtained, and experimental work can be carried out to test the fitness effects of candidate SA polymorphisms (e.g. the dioecious plant Silene latifolia; [9]), represent promising avenues for further research. Although there is little prospect of obtaining Biobank-scale samples of genome sequences in non-human organisms, we suspect that genetic variation for fitness may often be greater than in humans, increasing the likelihood that polygenic signals of SA selection will be detectable [16,48]. Regarding limitations of theory, previous research (including our own) has predominantly used single-locus models to make predictions about the genomic distribution of SA polymorphisms [17-23,27,71], despite the (likely) polygenic nature of fitness variation. Though we can extend single-locus predictions to polygenic scenarios when we assume that loci segregate independently and fitness effects are multiplicative, this simplifying assumption may be problematic when the extent of linked selection differs systematically between autosomal and X-linked loci [25]. Moreover, the 9/4 ratio of inflation is a prediction for signals of SA polymorphism assuming a random genomic distribution of balanced SA polymorphisms, yet balancing selection is only one of the multiple modes of evolution that can potentially affect SA polymorphisms [26,27,65]. Empirical data from the UK Biobank are consistent with the 9/4 prediction, yet our failure to reject this model should not be interpreted as evidence that the evolutionary scenario underlying this prediction is a reasonable description of genome-wide SA polymorphism in humans (or other species). Other evolutionary scenarios (e.g. interactions between recurrent mutation, genetic drift, and the distributions of sex-specific selection and dominance coefficients) might also lead to predictions close to 9/4. Finally, most current theoretical models of SA variation fall firmly within the classical population genetic tradition [72], in which the fitness effects of SA variants are arbitrarily assigned rather than explicitly modelled. Because fitness effects of genetic variation are properties of the distribution of mutations affecting traits under selection, models of adaptation that incorporate these features of biology (e.g. Fisher's Geometric model [4,58] and other trait-based population genetic models [65]) may bring us closer to a robust theory for the fitness effects, dynamical properties and genomic distribution of SA polymorphisms.

69 in total

8. A general population genetic framework for antagonistic selection that accounts for demography and recurrent mutation.

Authors: Tim Connallon; Andrew G Clark
Journal: Genetics Date: 2012-01-31 Impact factor: 4.562

9. Association between sex ratio distortion and sexually antagonistic fitness consequences of female choice.

Authors: Tim Connallon; Erin Jakubowski
Journal: Evolution Date: 2009-03-17 Impact factor: 3.694

10. Evaluating human autosomal loci for sexually antagonistic viability selection in two large biobanks.

Authors: Katja R Kasimatis; Abin Abraham; Peter L Ralph; Andrew D Kern; John A Capra; Patrick C Phillips
Journal: Genetics Date: 2021-03-03 Impact factor: 4.562