Literature DB >> 34855745

Observable variations in human sex ratio at birth.

Yanan Long^1,2,3, Qi Chen⁴, Henrik Larsson^4,5, Andrey Rzhetsky^2,3,6.

Abstract

The human sex ratio at birth (SRB), defined as the ratio between the number of newborn boys to the total number of newborns, is typically slightly greater than 1/2 (more boys than girls) and tends to vary across different geographical regions and time periods. In this large-scale study, we sought to validate previously-reported associations and test new hypotheses using statistical analysis of two very large datasets incorporating electronic medical records (EMRs). One of the datasets represents over half (∼ 150 million) of the US population for over 8 years (IBM Watson Health MarketScan insurance claims) while another covers the entire Swedish population (∼ 9 million) for over 30 years (the Swedish National Patient Register). After testing more than 100 hypotheses, we showed that neither dataset supported models in which the SRB changed seasonally or in response to variations in ambient temperature. However, increased levels of a diverse array of air and water pollutants, were associated with lower SRBs, including increased levels of industrial and agricultural activity, which served as proxies for water pollution. Moreover, some exogenous factors generally considered to be environmental toxins turned out to induce higher SRBs. Finally, we identified new factors with signals for either higher or lower SRBs. In all cases, the effect sizes were modest but highly statistically significant owing to the large sizes of the two datasets. We suggest that while it was unlikely that the associations have arisen from sex-specific selection mechanisms, they are still useful for the purpose of public health surveillance if they can be corroborated by empirical evidences.

Entities: Chemical

Mesh：

Year: 2021 PMID： 34855745 PMCID： PMC8638995 DOI： 10.1371/journal.pcbi.1009586

Source DB: PubMed Journal: PLoS Comput Biol ISSN： 1553-734X Impact factor: 4.475

Introduction

Because human male gametes bearing X or Y chromosomes are equally frequent (being produced by meiosis symmetrically partitioning two sex chromosomes), and because ova bear only X chromosomes, one would expect a sex ratio at conception of exactly [1]. Indeed, a recent study using fluorescent in situ hybridization and array comparative genomic hybridization showed that the sex ratio at conception (SRC) was statistically indistinguishable from [2]. Nevertheless, the apparent sex ratio at birth (SRB), also known as the secondary sex ratio, has been documented to significantly deviate from under various circumstances, suggesting that a proportion of embryos are lost between conception and birth. At least three processes may affect the observed SRB. First, female-embryo pregnancies may terminate early in development, driving the SRB up. It has been documented that these excess female-embryo losses tend to occur primarily during the first and early-second trimesters of pregnancy. Second, male-embryo deaths would drive the apparent SRB down. Male-embryo losses have indeed been observed to occur during the late-second and third trimesters [3]. Third, SRB may be affected by peri-conceptual maternal hormonal levels [4, 5]. Past studies proposed that the SRB can fluctuate with time and may be driven by a number of environmental factors, such as chemical pollution, events exerting psychological stress on pregnant women (such as terrorist attacks and earthquakes), radiation, changes in weather, and even seasons of conception (Table 1).

Table 1

Exogenous factors reported in the literature to have an impact on the SRB [6, 14].

A “-” indicates that sample sizes were not mentioned in the articles reporting or reviewing the corresponding results.

Exogenous Factor	Number of Studies	Sample Size
Dioxins [6]	13	291
Polychlorinated biphenyls (PCBs) [6]	9	98
1,2-Dibromo-3-chloropropane (DBCP) [6]	2	29
Dichlorodiphenyltrichloroethane (DDT) [6]	4	1623
Hexachlorobenzene (HCB) [6]	2	262
Vinclozolin [6]	1	95
Multiple pesticides [6]	5	382
Lead [6]	5	6566
Methylmercury [6]	1	4808
Multiple metals [6]	10	1015
Non-ionizing radiation [6]	12	2926
Ionizing radiation [6]	15	4959
Seasonality [7, 8]	2	-
Ambient temperature [9–12]	4	-
Economic stress [13]	1	-
Terrorist attacks [14]	2	-

Exogenous factors reported in the literature to have an impact on the SRB [6, 14].

A “-” indicates that sample sizes were not mentioned in the articles reporting or reviewing the corresponding results. While there are multiple studies which have observed the positive associations between air pollution and spontaneous abortion [15, 16], most of those conclusions based on analyses of relatively small samples (Table 1), severely curtailing their statistical power. In this study, we harnessed the power of 2 very large datasets: the MarketScan insurance claim data [17] in the United States (which records the health events of more than 150 million unique Americans, with more than 3 million unique newborns recorded between 2003 to 2011), and Sweden’s birth registry data (covering the birth and health trajectories of over ∼ 3 million newborns from 1983 to 2013) [18]. Our present study is the first systematic investigation of numerous chemical pollutants and other environmental factors using large datasets from two continents.

Methods

Data

The IBM Health MarketScan dataset [17] represents 104, 565, 671 unique individuals and 3, 134, 062 unique live births. The Swedish National Patient Register [18] record health statistics for over ten million individuals, and 3, 260, 304 unique live births. We juxtaposed time-stamped birth events in the two countries with exogenous factor measurements retrieved from the US National Oceanic and Atmospheric Administration, the US Environmental Protection Agency (EPA), the Swedish Meteorological and Hydrological Institute and Statistics Sweden. We used a subset of the MarketScan data that contained information on livebirths between 2003 to 2011 with county information encoded in Federal Information Processing Standards (FIPS) codes and a family link profile indicating the composition of the households in the dataset. The date, geographic distribution, and the mothers of the newborns can be directly extracted from these datasets. For environmental factors, we used the Environmental Quality Index (EQI) data compiled by the United States Environmental Protection Agency [19, 20].

Cluster analysis

In order to simplify subsequent analyses, we first performed hierarchical clustering analysis on the Spearman’s rank correlation coefficients matrix (ρ), using the Ward’s method, which reduced the the EQI dataset’s dimensionality. We then used the R-language [21] package pvclust [22] to minimize the total within-cluster variance [23]. The resulting dendrogram and list of factors can be found in the SI. Each cluster contains at least two factors and is represented by the mean of all the elements in the cluster.

Regression analysis

We used multilevel Bayesian logistic regression with random effects implemented in the R-language package rstan [24]. To facilitate model building, we used the R-language package brms [25] with default priors. Sampling was performed with the No-U-Turn sampler (NUTS) [26] with 500 warm-up steps and 1500 iteration steps with 28 Markov chains, of which the convergence was asseessed using the statistic [27]. The model for the jth factor (predictor) is given as follows: where p is the probability that a newborn is male, x is the vector representing the jth factor, their coefficients, and α[ the intercept for the kth group-level, representing states or counties in the US, and kommuner (municipalities) or län (counties) in Sweden, whenever applicable. The group-level effect was modeled for a single random effect by and for two random effects, representing e.g. state- and county-specific effects, by where η and ν are independent of each other and for all j [28]. Moreover, we partitioned the independent variables into septiles, so that , with one regression coefficient for each of the six septiles other than the first, which was treated as baseline [29]. We applied logistic regression in two ways. First, to assess the effect of environmental factors, we regressed each of the individual factors’ septiles against the SRB, with each sample point representing a county. Therefore, each septile, aside from the baseline, has a coefficient. Second, to test whether maternal diagnostic history (DX) affected the SRB, we regressed a DX’s indicator variables against the SRB, with each sample point representing a newborn/mother pair. For model selection in both cases, we performed repeated (10 times) 10-fold cross-validation and calculated the information criterion relative to the null model (where x = 0, i.e. the model was comprised solely of the intercept). We computed the average difference in information criterion (ΔIC) and standard error (SE) for each factor obtained from leave-one-out (LOO) cross-validation [30], and used the Benjamini–Yekutieli method to adjust for multiple comparisons [31].

Univariate time-series analysis

To assess the effect of one-off, stressful events on the SRB, we used two different time series techniques. First, we fitted seasonal univariate autoregressive integrated moving average (sARIMA) models using the Box-Jenkins method [32], in conjunction with monthly (28-day periods) and weekly live birth data up to the event and then performed an out-of-sample prediction. An sARIMA model is given by where AR indicates the autoregression term, I the integration term, and MA the moving average term (an “s” before any of the above stands for “seasonal”). Moreover, y indicates the observed univariate time series of interest, L is the lag operator such that L(y) = y, ε’s white noises, S ⩾ 2 the degree of seasonality (i.e., the number of seasonal terms per year, chosen to be 4 in our study), and ϕ’s, θ’s, Φ’s, and Θ’s are model parameters to be estimated. We used the auto.arima function from the R-language package forecast [33, 34] to fit the data, which performed a step-wise search on the (p, d, q, P, D, Q) hyperparameter space and compared different models by using the Bayesian Information Criterion (BIC) [35]. We confirmed the optimalx models’ goodness-of-fit using the Breusch-Godfrey test on the residuals, which tested for the presence of autocorrelation up to degree S [36-38]. On the other hand, we fitted the same data as above to Bayesian structural time series (BSTS) models, which are state-space models given in the general form by [39]: where y is the observed time series and α the unobserved latent state. In particular, we used the local linear trend model with additional seasonal terms [39, 40]: Here, we define ; Q is a t-invariant block diagonal matrix with diagonal elements and . Finally, we denote , which implies that both Z and T are t-invariant matrices of 0’s and 1’s such that Eqs 10–13 hold. We used the R package CausalImpact [41], which in turn relied on the R package bsts [42] as backend, to fit the data.

Correlation and causality

To test whether the SRB was effected by ambient temperature, we grouped daily SRB data and temperatures into 91-day (13-week) periods and calculated the Pearson correlation coefficient (r) between each SRB and ambient temperature. We then performed the Student’s t-test for the null hypothesis that the true correlation is 0. Furthermore, we fitted the SRB/temperature pair to a vector autoregression (VAR) model for a maximum lag order of 4 (52 weeks), using the BIC as the metric for model selection, and then tested for the null hypothesis of the non-existence of Granger causality using the F-test [43].

Results

We start by describing the negative results (i.e. a lack of a significant association), concordant across the two datasets. Our model selection rejected the whole spectrum of models that allow for periodic, annual SRB changes [7, 8]. For both US and Swedish datasets, the best-fitting model described the SRB as lacking seasonality throughout the year. Similarly, when we tested the claim that ambient temperatures during conception affect the SRB [9-12], we found that neither dataset supported this association. Both the Student’s t-test and the F-test concluded that the SRB was independent of ambient temperature measurements (Table G in S1 Appendix). A comparison of each dataset’s environmental measurements revealed that Sweden enjoyed both lower variations and lower mean values of measured concentrations of substances in the air. Unfortunately, the Swedish dataset also provided fewer measured pollutants, which made our cross-country analysis more difficult. Fig 1 shows a comparison of pollutant concentration distributions in both countries. The US environmental measurements dataset presented its own difficulty, as many pollutants appeared highly collinear in their spatial variation. To address this, we performed a cluster analysis on the environmental factors, subdividing them into 26 clusters (Table 2 and Fig B in S1 Appendix). All pollutants within the same cluster were highly correlated, while the correlation between distinct clusters was much smaller, allowing for useful association inferences between SRB changes and environmental states.

Fig 1

Airborne health-related substances and their association with the SRB.

Table 2

Pollutant clusters discovered by applying the Ward’s method to the EQI raw measurements dataset.

Cluster number	factor
1	a_hcbd_ln,a_hccpd_ln
2	a_nitrobenzene_ln,a_dma_ln
3	a_2clacephen_ln,a_bromoform_ln
4	a_pnp_ln,a_toluene_ln
5	a_be_ln,a_se_ln
6	a_dmf_ln,a_edb_ln,a_edc_ln
7	a_teca_ln,a_procl2_ln,a_cl4c2_ln,a_vycl_ln,county_pop_2000
8	a_benzyl_cl_ln,a_me2so4_ln
9	mean_zn_ln,mean_cu_ln
10	mean_al_pct,mean_p_pct
11	numdays_close_activity_tot,numdays_cont_activity_tot
12	mean_as_ln,mean_se_ln
13	a_glycol_ethers_ln,a_etn_ln,a_vyac_ln
14	mean_na__pct_ln,mean_mg_pct_ln,mean_ca_pct_ln
15	a_cs_ln,a_edcl2_ln
16	a_ccl4,a_mtbe_ln
17	pct_harvest_acres,herbicides_ln,insecticides_ln
18	a_112tca_ln,a_ch3cn_ln
19	a_hcb_ln,a_pcp_ln,a_pcbs_ln
20	mg_ln_ave,k_ln_ave
21	pct_defoliate_acres_ln,pct_disease_acres_ln,pct_nematode_acres_ln
22	a_so2_mean_ln,a_no2_mean_ln,a_o3_mean_ln,so4_mean_ave
23	med_hh_value,med_hh_inc
24	rate_food_env_pos_log,rate_rec_env_log
25	ca_ln_ave,nh4_mean_ave
26	w_as_ln,w_ba_ln,w_cd_ln,w_cr_ln,w_cn_ln
	w_fl_ln,w_hg_ln,w_no3_ln,w_no2_ln,w_se_ln
	w_sb_ln,w_be_ln,w_ti_ln,w_endrin_ln
	w_lindane_ln,w_methoxychlor_ln,w_toxaphene_ln
	w_dalapon_ln,w_deha_ln,w_oxamyl_ln,w_simazine_ln
	w_dehp_ln,w_picloram_ln,w_dinoseb_ln
	w_hccpd_ln,w_carbofuran_ln,w_atrazine_ln
	w_alachlor_ln,w_heptachlor_ln,w_heptachlor_epox_ln
	w_24d_ln,w_silvex_ln,w_hcb_ln,w_benzoap_ln
	w_pcp_ln,w_124tcib_ln,w_pcb_ln,w_dbcp_ln
	w_edb_ln,w_xylenes_ln,w_chlordane_ln,w_dcm_ln
	w_odcb_ln,w_pdcb_ln,w_vcm_ln,w_11dce_ln
	w_t12dce_ln,w_edc_ln,w_111trichlorane_ln
	w_ccl4_ln,w_pdc_ln,w_trichlorene_ln,w_112tca_ln
	w_c2cl4_ln,w_cl1benz_ln,w_benzene_ln,w_toluene_ln
	w_ethylbenz_ln,w_stryene_ln,w_alpha_ln,w_dce_ln

Airborne health-related substances and their association with the SRB.

A: Comparison of airborne pollutant concentrations across the US (cyan violin plots) and Sweden (pink violin plots). Only 4 air components, fine particulate matter (PM2.5), coarse particulate matter (PM10), sulfur dioxide (SO2), and nitrogen dioxide (NO2) are measured in both countries. US counties appear to have higher mean pollution levels and are more variable in terms of pollution. B-M: A sample of 12 one-environmental factor logistic regression models that are most explanatory with respect to SRB. For each environmental factor, we partition counties into 7 equal-sized groups (septiles), ordered by levels of measurements, so that the first septile corresponds to the lowest and the highestnth septile to the highest concentration. Each plot shows bar plots of regression coefficients and 95% confidence intervals (error bar) of the second to the seventh septiles, with the first septile chosen as the reference level. We rank the 12 models by the statistically significant factor’s association strength with at least one statistically significant coefficient by decreasing ΔIC; septiles whose coefficients are not significantly different from 0 at the 95% confidence level have been plotted with a reduced alpha level. Blue bars represent positive coefficients, whereas red bars represent negative coefficients. “Negative food-related businesses” is a term used by the Environmental Protection Agency’s Environmental Quality Index team and is explained as “businesses like fast-food restaurants, convenience stores, and pretzel trucks.” “Percent vacant units” stands for “percent of vacant housing units.” Substances contributing to clusters 10 and 25 are listed in Table 2. See Table K in S1 Appendix for more details regarding the factors’ and clusters’ identities. Using the US dataset, we were able to validate the findings of a number of previous studies regarding the association between the SRB and exogenous factors (Table 3). Specifically, our data suggests that aluminium (Al) in air, chromium (Cr) in water and total mercury (Mg) quantity drive the SRB up, while lead (Pb) in soil appears to be associated with a decreased SRB. Meanwhile, we have found no evidence for a number of previous reports, indicated with a dash in the second column in Table 3. We also established several new environmental associations that have not been reported before (Table 4, Figs 1 and 2). Fig 1 show that increased pollutant levels appear to be associated with both increased and decreased SRB values (Plates E,F,H,I, and J, and the remaining Plates, respectively). In the case of PCBs (polychlorinated biphenyls), on which the literature has reported conflicting evidences [44], we found a positive correlation with the SRB. Since the sample sizes of the studies published thus far were very small (cf. Table 1), our PCBs result would have substantially larger statistical power.

Table 3

Test results for factors selected from the literature reports (Table 1).

We included a factor only if both its ΔIC and the coefficient of at least one of its septiles was statistically significant.

Factor name	effect
PCBs (air and water)	↑
DBCP (water)	−
Lead (land)	↓
Lead (air)	−
Aluminium (air)	↑
Chromium (air)	−
Chromium (water)	↑
Arsenic (land)	−
Arsenic (water)	↑
Cadmium (air and water)	−
Total mercury deposition	↑
Violent crime rate	−
Unemployed rate	−
Working out of county (long commute)	−

Table 4

Test results for additional factors with statistically significant effects.

We included a factor only if both its ΔIC and the coefficient of at least one of its septiles was statistically significant.

Factor name	effect
Iron	↓
Nitrate	↑
2-Nitropropane	↑
Carbon monoxide	↑
Bis-2-ethylhexyl phthalate	↓
Ethyl chloride	↑
Isophorone	↑
Hydrazine	↓
Phosphorus	↑
Quinonline	↓
Extreme drought	↑
Traffic fatality rate	↑
Industrial permits per 1000 km of stream	↓
Animal units	↓
Irrigation	↓
Negative food related businesses	↓
Renter occupation	↓
Vacant units	↑

Fig 2

County-level geographical septile distribution for the first 12 statistically significant factors with at least one statistically significant coefficient ranked by decreasing ΔIC.

Test results for factors selected from the literature reports (Table 1).

We included a factor only if both its ΔIC and the coefficient of at least one of its septiles was statistically significant.

Test results for additional factors with statistically significant effects.

We included a factor only if both its ΔIC and the coefficient of at least one of its septiles was statistically significant.

County-level geographical septile distribution for the first 12 statistically significant factors with at least one statistically significant coefficient ranked by decreasing ΔIC.

The factors labelled A–M are the same as shown in Fig 1, Plates B–M and are ordered identically in both figures. Base map was taken from https://github.com/hrbrmstr/albersusa/blob/master/inst/extdata/composite_us_counties.geojson.gz. The geographic distribution of these pollutants varies remarkably, as seen in Fig 2. For example, lead in land (Fig 2) appears to be enriched in the northeast, southwest, and mid-east US, but not in the south. Hydrazine (Fig 2) appears to follow capricious, blotch-like shapes in the eastern US, each blotch likely centered at a factory emitting this pollutant. Total mercury deposition in water (Fig 2) mostly affects eastern US states with the heaviest load in the northeastern states. It is this variability in the environmental distribution of various substances that allowed us to tease out these individual associations. Finally, when we tested links between two stressful events in the US (Hurricane Katrina and the Virginia Tech shooting) and the SRB using seasonal autoregressive integrated moving-average (sARIMA) models and state-space models (SSMs) (see the Univariate time-series analysis section in Methods), we were able to identify significant associations only in the case of the Virginia Tech shooting—the SRB was lower than expected 34 weeks after the event (see Figs 3 and 4, cf. Tables E(c) and F(c) in S1 Appendix).

Fig 3

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with seasonal ARIMA models.

The blue shade is the 95% confidence level. The observed SRBs for the first five months after the intervention are presented by red dots, whereas the observed SRBs for 7 to 9 months after the intervention are presented by purple dots. A: Hurricane Katrina, all states; B: Hurricane Katrina, Louisiana and Mississippi only; C: Virginia Tech shooting, all states; D: Virginia Tech shooting, adjacent states only.

Fig 4

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with state space models.

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with seasonal ARIMA models.

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with state space models.

Discussion

While SRB fluctuations in space and time are well-documented and non-controversial, there is a diverse range of competing theories striving to explain SRB changes in terms of mechanistic selective pressure [45]. The most frequently mentioned theory is the Trivers–Willard hypothesis (TWH), named after the researchers who proposed it [46]). The TWH postulates that, because the cost of rearing children is much higher for females than for males, in favourable, resource-rich environments, males would have more offspring than females, and vice versa in unfavourable conditions. Natural selection would then favour individuals with higher fitness, where fitness is equated to individuals’ reproductive success (in this case, the number of offspring reaching reproductive age). According to the TWH, natural selection pushes the SRB up (more males) in favourable conditions, and down (more females) in unfavourable environment. More explicitly stated, the TWH depends on the following three assumptions [5, 46, 47]: Assumption A1. The condition of a mother during parental investment is correlated with the condition of her offspring; in other words, mothers in better conditions have offspring that will be in better conditions. Assumption A2. The condition of the offspring persists after parental investment ends, and is positively correlated with the offspring’s reproductive success. Assumption A3. Males have larger variability in reproductive success than females and, as a result, they are more susceptible to sexual selection. From these assumptions the TWH makes the following deductive inference on SRB variability: Conclusion C1. The SRB varies such that females in favourable conditions have more male offspring, and in unfavourable conditions, more female offspring. Assumption A3 is called Bateman’s principle [48] (BP), and was suggested in a classic fruit fly genetics study on sexual selection. The original experimental results with Drosophila melanogaster indicated that males benefited more from multiple mating than females in terms of fitness. This asymmetry was thought to have originated in anisogamy, which means that a sperm is much smaller than an ovum and therefore requires less resources. Unfortunately, this result was never replicated (see [49, 50] for critiques of Bateman’s methodology). Nevertheless, a modified version of BP, which generalizes anisogamy to parental investment, has enjoyed prominence among evolutionary biologists [51]. One of the critiques of BP claims that male cost of reproduction is in reality much higher than suggested by Bateman. This is because Bateman failed to account for the fact that males do not produce sperms stoichiometrically to match the number of female-produced ova. Instead, they produce semen, a mixture of a very large number of male gametes and accompanying secretions, rich in nutrients and other substances beneficial to reproduction [52]. Therefore, once the full range of investment patterns across life history (e.g. intrasex competition, secondary sexual characteristics, territorial defence) has been taken into account, it is unclear if reproductive investments of females exceed those of males [53, 54]. Faced with such criticisms as well as an increasing amount of evidence from species across the animal kingdom that did not conform to BP [55], supporters of BP have responded that sex differences ultimately originated from historical anisogamy [56, 57], and that there have also been subsequent ecological factors independent of anisogamy that drove sexual dimorphism having to do with resource competition between the sexes, which may not result in stronger selection on males [58, 59]. Moreover, as a counter-challenge to the former point, supporters of BP also refer to aggregate results in favour of BP, including a phylogenic meta-analysis by Janicke et al. in which significant differences in reproductive success variances in species across the animal kingdom were found [60]. This reworking allowed for a potential remedy for BP, namely by generalizing it as follows [55, 61, 62]: Assumption A3*. The sex with the larger reproductive success variance is more susceptible to sexual selection. From this, the generalized version of the TWH follows: Conclusion C1*. The SRB should vary such that females in favourable conditions have more offspring of the sex more susceptible to sexual selection, and in unfavourable conditions, more offspring of the sex less susceptible to sexual selection. This version of BP is consistent with “sex-role reversals” observed in many species, in which females exhibit larger susceptibility to sexual selection. In addition, it allows for sufficient flexibility such that the identity of “the sex more susceptible to sexual selection” may be influenced by exogenous conditions [58]. Candidates for the identity of that sex include higher variance in number of (adult) offspring and higher variance in parental investments [5, 63]. Nevertheless, under this revised framework, for sexually dimorphic selection patterns to develop and persist as opposed to randomly fluctuating across time [64], one inevitably has to invoke the sexual cascade hypothesis: a small initial difference (e.g. anisogamy) in sex-related phenotype will “snowball” into larger, persistent patterns through hereditary feedback loops [65-67]. Such cascading has also featured in the above-mentioned meta-analysis discussion regarding high-level explanatory patterns among animal species [60], bracketing all differences in sex-related traits into one-dimensional sexual selection [55]. There is a plethora of other competing theories, e.g. [64] and [62], which predict largely stochastic variations of sex-related phenotypes, emphasizing the role social and ecological factors have played in shaping plastic sex-roles [55, 68, 69]. Even if the last point may still be somewhat contentious [66], BP and (by extension) the TWH are, at the very least, not the only game in town when it comes to explaining and predicting patterns related to sexual selection: male and female phenotypes of a given species in a given environment are most likely the results of a large number of exogenous factors without any single one of them being particularly dominant [70, p. 177]. One key ramification of the above analysis is that the TWH cannot provide a comprehensive account of the range of exogenous factors associated with SRB variation under the kind of circumstances present in our study. Further, the empirical success of the TWH is mixed, with only 50% of studies confirming it, and around 20% of studies producing statistically significant results in the opposite direction [47], which is consistent with our finding that many different pollutants might be assumed to be “bad” for mothers (e.g. pollutants, traffic fatality rates, junk food) had associations with SRB in opposite directions. The scepticism against the applicability of the TWH in contemporary human societies is further strengthened by two recent population studies in Sweden with large sample sizes (4.7 and 5.7 million live births, respectively), which found no SRB heritability [71, 72]. In particular, Zietsch et al. have demonstrated that there exists neither within-individual SRB auto-correlation (contra Assumption A1) nor similarity in the SRB for children of siblings (contra Assumption A2). They also concluded that within-family SRB was associated with the final family size, suggesting that SRB variations may have been the result of SRB-aware family planning [71]. Taken together, such evidence also places other adaptive (i.e. via heritable sexual selection) theories explaining SRB variations, such as adaptive versions of hormonal hypothesis [63], maternal dominance hypothesis [73, 74] and the Bruce effect [75] in the same predicament. Appealing to evolutionary history (i.e. TWH was in operation in the past but not at present, or TWH is an effect of some vestigial evolutionary mechanism) is of no help here, since an adaptive selection mechanism cannot explain why and how, at some point in history, the heritability was lost [76]. In other words, if SRB is ever influenced by some factor(s) at least partially heritable, then SRB itself would have to be heritable as well, which the results from Zietsch et al. rule out. Thus, our results are better interpreted as supporting the overwhelming influence of random Mendelian segregation on the SRB (cf. [77] which claims complete attribution of SRB variation to Mendelian segregation in some non-human species), such that SRB variations are at least primarily due to non-adaptive (e.g. socio-cultural [71, 78, Ch. 14]) causal factors, possibly including those common to both changes in the SRB and associated exogenous factors. By way of conclusion, we note that the literature includes substantial reports on the relationship between the SRB and public health [3], and we would like to consider the question of whether the SRB can be used as an indicator for public health events and, if so, whether the relationship between the SRB and certain diseases reveals causal relationships. As the preceding discussion demonstrates, even if the existence of adaptive causal relationships between environmental factors and the SRB may be unlikely (contra [79]), associations—including the ones presented in this work—may be used as signals for (adverse) public health conditions, as long as they are established experimentally. To this end, we reiterate that there are agreements between the associations established in our work and those in the literature [6, 14], and that our results do support the non-monotonous, dose-response profiles frequently reported in the literature [44] (Table 3). Therefore, future research programmes might instead focus on exploring and validating the associations between SRB and environmental factors that reliably predict adverse public health effects for certain subpopulations [72] using large datasets with covariates sampled frequently across considerable spatio-temporal ranges [71]. Another interesting direction would be to determine the potentially non-adaptive physiological mechanisms.

Limitations

Unlike some of the recent studies [80], we did not have access to the sex of stillbirths, which would have enabled us to probe negative selection in utero against frail males [3]. When quantifying pollutants in the US, we used the EPA air quality raw data, which was an average of measurements taken over a short period of time, rather than over years or decades, which would have enabled long-term and causal analyses. Neither did it include information for individual exposures to those factors, which might render a straightforward interpretation of our results subject to ecological fallacies. Finally, the subjects in our US study were commercially-insured and had medical claims, which likely came from a different probability distribution to the general population in the US.

Additional results, figures and tables.

Fig A. Distribution of the SRB in the US and Sweden at the county level (US) or the kommun level (Sweden). Fig B. Dendrogram with statistically significant clusters (95% level) in red boxes. Table A. Differences in information criterion (ΔIC) and their standard errors (SE) of individual factors with fixed-effect only. Non-significant factors are omitted. Table B. Differences in information criterion (ΔIC) and their standard errors (SE) of individual factors with fixed-effect only. Non-significant factors are omitted. Fig C. Time series plots and out-of-sample forecasts for SRB data grouped into 28-day periods and fitted with seasonal ARIMA models. The blue shade is the 95% confidence level. The observed SRBs for the first 5 months after the intervention are presented by red dots, whereas the observed SRBs for 7–9 months after the intervention are presented by purple dots. See also Table C. Table C. Out-of-sample forecasts for the first 10 months after the intervention using SRB data grouped into 28-day periods and fitted with seasonal ARIMA models. Any period of which the observed SRB is outside of the 95% confidence level is marked by an asterisk (*). Figure D. Time series plots and out-of-sample forecasts for SRB data grouped into 28-day periods and fitted with state-space models. The blue shade is the 95% confidence level. The observed SRBs for the first 5 months after the intervention are presented by red dots, whereas the observed SRBs for 7–9 months after the intervention are presented by purple dots. See also Table D. Table D. Out-of-sample forecasts for the first 10 months after the intervention using SRB data grouped into 7-day periods and fitted with state-space models. Any period of which the observed SRB is outside of the 95% confidence level is marked by an asterisk (*). Fig E. Time series plots and out-of-sample forecasts for SRB data grouped into 28-day periods and fitted with seasonal ARIMA models. The blue shade is the 95% confidence level. The observed SRBs for the first 5 months after the intervention are presented by red dots, whereas the observed SRBs for 7–9 months after the intervention are presented by purple dots. See also Table E. Table E. Out-of-sample forecasts for the first 10 months after the intervention using SRB data grouped into 28-day periods and fitted with seasonal ARIMA models. Any period of which the observed SRB is outside of the 95% confidence level is marked by an asterisk (*). Fig F. Time series plots and out-of-sample forecasts for SRB data grouped into 28-day periods and fitted with state-space models. The blue shade is the 95% confidence level. The observed SRBs for the first 5 months after the intervention are presented by red dots, whereas the observed SRBs for 7–9 months after the intervention are presented by purple dots. See also Table F. Table F. Out-of-sample forecasts for the first 10 months after the intervention using SRB data grouped into 7-day periods and fitted with state-space models. Any period of which the observed SRB is outside of the 95% confidence level is marked by an asterisk (*). Table G. p-values for t- and F-tests on the correlation between Sweden’s SRB and temperature and precipitation in Sweden. Table H. Differences in information criteria (ΔIC) and their standard errors (SE) of individual factors at the kommun (municipality) level, with random effect at the län (county) level. Table I. Differences in information criteria (ΔIC) and their standard errors (SE) of individual factors at the län (county) level. Table J. Contingency table of maternal diagnosis history versus the sex of livebirths. Table K. List of variable names used in the main text and their corresponding definitions and units (if applicable). (PDF) Click here for additional data file. 30 Aug 2021 Dear Dr Rzhetsky, Thank you very much for submitting your manuscript "Observable Variations in Human Sex Ratio at Birth" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations. I apologise for the long wait for reviews, with which there were some difficulties. Both reviewers recommended revisions, and I would like to invite you to submit a revised manuscript that has considered and responded to the reviewers’ points. I do not agree with Reviewer 2’s comments about your ‘confusion’ as to evolutionary past vs. present, nor their claim the Zietsch et al. results do not bear on the T-W assumptions you laid out. Zietsch et al. (2021 https://doi.org/10.1098/rspb.2021.0304 ) provided a response to the cited commentary, and this article may help to clarify the issue. I think your treatment of T-W was very nice and clear, but you may wish to briefly address this issue in your revision so as other readers don’t have the same question. Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Brendan Zietsch Guest Editor PLOS Computational Biology Nina Fefferman Deputy Editor PLOS Computational Biology *********************** A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately: [LINK] Dear Dr Rzhetsky, Thank you for submitting your work to PLoS Computational Biology. I apologise for the long wait for reviews, with which there were some difficulties. I have received reviews from two experts and have carefully read the paper myself. Both reviewers recommended revisions, and I would like to invite you to submit a revised manuscript that has considered and responded to the reviewers’ points. I do not agree with Reviewer 2’s comments about your ‘confusion’ as to evolutionary past vs. present, nor their claim the Zietsch et al. results do not bear on the T-W assumptions you laid out. Zietsch et al. (2021 https://doi.org/10.1098/rspb.2021.0304 ) provided a response to the cited commentary, and this article may help to clarify the issue. I think your treatment of T-W was very nice and clear, but you may wish to briefly address this issue in your revision so as other readers don’t have the same question. Kind regards, Brendan Zietsch Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors explored the factors that underlie deviations in the human sex ratio in humans from Sweden and the United States of America. The authors used a large dataset (> 150 million people) to discover that increased levels of a number of pollutants affect human sex ratios at birth. These pollutants could induce higher or lower sex ratios at birth, depending on the pollutant. It is a well written paper on an interesting topic and deserving of publication in PLOS Computational Biology. I only have some very minor comments. 1. The analyses are complicated. Why did you choose a bayesian approach? I’m not critical of the statistic approach taken. However, a justification for the statistical approach would be great to see in the methods. 2. There are quite a few parameters in your model. Could overparameterization be an issue here? Granted, the sample size is very large. 3. Could you please provide the R code used to analyse this data 4. Could you also assess mean income across the populations in your models or another socioeconomic measure? Or can you justify why that is unnecessary or irrelevant? Reviewer #2: Review of “Observable Variations in Human Sex Ratio at Birth” by Yanan Long, Qi Chen, Henrik Larsson, and Andrey Rzhetsky The authors present analyses of seasonal, social, and environmental influences on the sex ratio at birth. Their analyses are based upon two large databases (IBM and Sweden). Each contains about three million live births. They report that there is no influence of season on the sex ratio at birth. They also report significant associations between the sex ratio at birth and the level of various social “factors” (e.g., traffic fatality rate) and environmental factors (pollutants). The data on social factors comes from several sources, including US NOAA, EPA See Table S11 for list of factors. The core of the statistical analyses is multilevel Bayesian logistic regression with random effects. The analyses appear to be performed correctly. I have a several concerns about this manuscript. The authors do not appear to understand some of the literature that they cite. For example, they write (p. 2): Because human male gametes bearing X or Y chromosomes are equally frequent (being produced by meiosis symmetrically partitioning two sex chromosomes), and because ova bear only X chromosomes, one would expect a sex ratio at conception of exactly ½ [1] and they cite ([1]) Fisher (1930) for this claim. Fisher’s treatment of the evolution of the sex ratio contains no mention of sex chromosomes, equal segregation, and certainly does not involve a claim of the sex ratio at conception being or expected to be ½. In fact, he wrote (p. 159): [The attainment of the sex ratio of the equal investment equilibrium via differential mortality of males] is brought about by a somewhat larger inequality in the sex ratio at conception. There are many articles that could be correctly cited for the claim that one would expect an even sex ratio at conception (see Orzack et al. 2015 for citations in which this claim is made.) The authors discuss their results and how they are related to the Trivers-Willard hypothesis (TWH) (pp. 11-13). They conclude (p. 12): One key ramification of the above analysis is that the TWH cannot provide comprehensive account of the range of exogenous factors associated with SRB variation under the kind of circumstances present in our study. I am skeptical as to the relevance of the TWH to human populations (and those of other species) but the authors’ conclusion is not anchored in the specifics of their results. What are needed are specific analyses of these data that bear on the predictions of the TWH. In this context, the authors mention the study of Zietsch et al. (2020) and claim that: In particular, Zietsch et al. have demonstrated that there exists neither within-individual SRB auto-correlation (contra Assumption A1) nor similarity in the SRB for children of siblings (contra Assumption A2). Their study contains no analyses that bear directly on these assumptions as defined by the present authors (p. 11): Assumption A1. The condition of a mother during parental investment is correlated with the condition of her offspring; in other words, mothers in better conditions have offspring that will be in better conditions. Assumption A2. The condition of the offspring persists after parental investment ends, 225 and is positively correlated with the offspring’s reproductive success. In this context, it also appears that the authors have confused the evolutionary past with the evolutionary present. The Zietsch et al. results and those of others do suggest that there is little genetic variation for the sex ratio in human populations. Beyond that, they do not necessarily imply anything about the past influence of the selective process described by the TWH (cf. Orzack and Hardy 2021). The current human sex ratio may reflect the past influence of the TWH dynamic even if that dynamic does not operate currently. That said, while I think that its realized past influence is likely negligible, it is important to note that opinions differ. At minimum, the authors need to do a better job of marshaling evidence for their claim and addressing the claims that the TWH is an important influence of human sex ratios (cf. Navara 2018). Finally, the authors do not correctly represent some of the prior literature pertaining to environmental influences on the human sex ratio. They write (p. 8) Using the US dataset, we were able to validate the findings of a number of previous studies regarding the association between the SRB and exogenous factors (Table 3). Specifically, our data suggests that PCBs (polychlorinated biphenyls), aluminium (Al) in air, chromium (Cr) in water and total mercury (Mg) quantity drive the SRB up, while lead (Pb) in soil appears to be associated with a decreased SRB. This statement implies, for example, that the influence of PCBs on the human sex ratio is resolved. This implication is incorrect for two reasons. The first is that there are conflicting results in the literature, with some showing an increased sex ratio with PCB exposure and others the opposite (Vartiainen et al. 1999; Weisskopf et al. 2003; Mackenzie et al. 2005; Hertz-Picciotto et al. 2008; Terrell et al. 2009, 2011; Nieminen et al. 2013; Leijs et al. 2014). The second reason is that, if anything, a common understanding is that, in fact, PCB exposure is associated with a decrease in sex ratio, not an increase as claimed by the authors. At minimum, the authors need to acknowledge these heterogeneous results and how their results relate to them. Better would be an attempt to explain if and how their results help resolve the discrepancies among studies. Fisher, R. A. 1930: The Genetical Theory of Natural Selection. Clarendon Press, Oxford. Hertz-Picciotto, I., T. A. Jusko, E. J. Willman, R. J. Baker, J. A. Keller, S. W. Teplin, and M. J. Charles. 2008: A cohort study of in utero polychlorinated biphenyl (PCB) exposures in relation to secondary sex ratio. Environmental Health 7:1–8. Leijs, M. M., L. M. van der Linden, J. G. Koppe, K. Olie, W. M. C. van Aalderen, and G. W. ten Tusscher. 2014: The influence of perinatal and current dioxin and PCB exposure on reproductive parameters (sex-ratio, menstrual cycle characteristics, endometriosis, semen quality, and prematurity): a review. Biomonitoring 1:1–15. Mackenzie, C. A., A. Lockridge, and M. Keith. 2005: Declining sex ratio in a first nation community. Environmental health perspectives 113:1295–1298. Navara, K. J. 2018: Choosing Sexes : Mechanisms and Adaptive Patterns of Sex Allocation in Vertebrates. Springer International Publishing, Cham, Switzerland. Nieminen, P., H. Lehtiniemi, A. Huusko, K. Vähäkangas, and A. Rautio. 2013: Polychlorinated biphenyls (PCBs) in relation to secondary sex ratio – A systematic review of published studies. Chemosphere 91:131–138. Orzack, S. H., and I. C. W. Hardy. 2021: Does the lack of heritability of human sex ratios require a rethink of sex ratio theory? No: a Comment on Zietsch et al. 2020. Proceedings of the Royal Society B 288:20202638. Orzack, S. H., J. W. Stubblefield, V. R. Akmaev, P. Colls, S. Munné, T. Scholl, D. Steinsaltz, and J. E. Zuckerman. 2015: The human sex ratio from conception to birth. Proceedings of the National Academy of Sciences 112:E2102–E2111. Terrell, M. L., K. P. Hartnett, and M. Marcus. 2011: Can environmental or occupational hazards alter the sex ratio at birth? A systematic review. Emerging Health Threats 4:7109. Terrell, M. L., A. K. Berzen, C. M. Small, L. L. Cameron, J. J. Wirth, and M. Marcus. 2009: A cohort study of the association between secondary sex ratio and parental exposure to polybrominated biphenyl (PBB) and polychlorinated biphenyl (PCB). Environmental Health 8:1–12. Vartiainen, T., L. Kartovaara, and J. Tuomisto. 1999: Environmental chemicals and changes in sex ratio: Analysis over 250 years in Finland. Environmental Health Perspectives 107:813–815. Weisskopf, M. G., H. A. Anderson1, L. Hanrahan, and The Great Lakes Consortium. 2003: Decreased sex ratio following maternal exposure to polychlorinated biphenyls from contaminated Great Lakes sport-caught fish: a retrospective cohort study. Environmental Health 2:1–14. Zietsch, B. P., H. Walum, P. Lichtenstein, K. J. H. Verweij, and R. Kuja-Halkola. 2020: No genetic contribution to variation in human offspring sex ratio: a total population study of 4.7 million births. Proceedings of the Royal Society B: Biological Sciences 287:20192849. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No: Could you please provide the R code used to analyse this data Reviewer #2: No: I did not see any information about the availability of the raw data. If correct, this information should be provided ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols References: Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. 10 Sep 2021 Submitted filename: SRB_PLoS_comp_bio_rebuttal.pdf Click here for additional data file. 23 Sep 2021 Dear Dr. Rzhetsky, Thank you very much for submitting your manuscript "Observable Variations in Human Sex Ratio at Birth" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations. Only minor issues remain, and these do not warrant another round of review. “In other words, if SRB is ever influenced by some factor(s) at least partially inheritable, then SRB itself would have to be heritable as well which the results from Zietsch et al. rule out.” >> “Inheritable” has a different meaning from “heritable” – heritable is the appropriate word here. “Thus, our results are better interpreted as supporting the intrinsic randomness of the SRB and/or the dependence of the SRB on non-adapative (e.g. socio-cultural [71, 77, Ch. 12]), causal factors, possibly including those common to both changes in the SRB and associated exogenous factors.” >> It doesn’t quite make sense to say that your results, which involve associations with other variables, support the intrinsic randomness of SRB. If SRB was truly random, then it wouldn’t be associated with other variables. Rephrase, distinguishing 1) the strong influence of random Mendelian randomisation from 2) the SRB itself, which is not quite random, as these results show. The authors should go through the text carefully checking for grammatical issues, especially the placement of commas. Often there were commas where there shouldn’t be. Also check spelling (e.g. non-adapative in the above quote). Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Brendan Zietsch Guest Editor PLOS Computational Biology Nina Fefferman Deputy Editor PLOS Computational Biology *********************** A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately: [LINK] Thank you for these responses and changes the the text. Only minor issues remain, and these do not warrant another round of review. “In other words, if SRB is ever influenced by some factor(s) at least partially inheritable, then SRB itself would have to be heritable as well which the results from Zietsch et al. rule out.” >> “Inheritable” has a different meaning from “heritable” – heritable is the appropriate word here. “Thus, our results are better interpreted as supporting the intrinsic randomness of the SRB and/or the dependence of the SRB on non-adapative (e.g. socio-cultural [71, 77, Ch. 12]), causal factors, possibly including those common to both changes in the SRB and associated exogenous factors.” >> It doesn’t quite make sense to say that your results, which involve associations with other variables, support the intrinsic randomness of SRB. If SRB was truly random, then it wouldn’t be associated with other variables. Rephrase, distinguishing 1) the strong influence of random Mendelian randomisation from 2) the SRB itself, which is not quite random, as these results show. The authors should go through the text carefully checking for grammatical issues, especially the placement of commas. Often there were commas where there shouldn’t be. Also check spelling (e.g. non-adapative in the above quote). Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols References: Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. 4 Oct 2021 Submitted filename: SRB_PLoS_comp_bio_response3.pdf Click here for additional data file. 25 Oct 2021 Dear Dr. Rzhetsky, We are pleased to inform you that your manuscript 'Observable Variations in Human Sex Ratio at Birth' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Brendan Zietsch Guest Editor PLOS Computational Biology Nina Fefferman Deputy Editor PLOS Computational Biology *********************************************************** Thanks to the authors for their attention to the comments. (And yes I meant Mendelian segregation, thank you.) I am pleased to accept the manuscript. 12 Nov 2021 PCOMPBIOL-D-21-00882R2 Observable Variations in Human Sex Ratio at Birth Dear Dr Rzhetsky, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Katalin Szabo PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

43 in total

1. No evidence of sexual selection in a repetition of Bateman's classic study of Drosophila melanogaster.

Authors: Patricia Adair Gowaty; Yong-Kyu Kim; Wyatt W Anderson
Journal: Proc Natl Acad Sci U S A Date: 2012-06-11 Impact factor: 11.205

Review 2. Could maternal testosterone levels govern mammalian sex ratio deviations?

Authors: Valerie J Grant
Journal: J Theor Biol Date: 2007-02-21 Impact factor: 2.691

3. Intra-sexual selection in Drosophila.

Authors: A J BATEMAN
Journal: Heredity (Edinb) Date: 1948-12 Impact factor: 3.821

4. No genetic contribution to variation in human offspring sex ratio: a total population study of 4.7 million births.

Authors: Brendan P Zietsch; Hasse Walum; Paul Lichtenstein; Karin J H Verweij; Ralf Kuja-Halkola
Journal: Proc Biol Sci Date: 2020-02-19 Impact factor: 5.349

Review 5. A review of the established and suspected causes of variations in human sex ratio at birth.

Authors: William H James; Victor Grech
Journal: Early Hum Dev Date: 2017-03-24 Impact factor: 2.079

Review 6. Can sex ratios at birth be used in the assessment of public health, and in the identification of causes of selected pathologies?

Authors: William H James; Victor Grech
Journal: Early Hum Dev Date: 2018-02-09 Impact factor: 2.079

Observable variations in human sex ratio at birth.

Introduction

Exogenous factors reported in the literature to have an impact on the SRB [6, 14].

Methods

Data

Cluster analysis

Regression analysis

Univariate time-series analysis

Correlation and causality

Results

Airborne health-related substances and their association with the SRB.

Test results for factors selected from the literature reports (Table 1).

Test results for additional factors with statistically significant effects.

County-level geographical septile distribution for the first 12 statistically significant factors with at least one statistically significant coefficient ranked by decreasing ΔIC.

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with seasonal ARIMA models.

Time series plots and out-of-sample forecasts for SRB data grouped into 7-day periods and fitted with state space models.

Discussion

Limitations

Additional results, figures and tables.

1. No evidence of sexual selection in a repetition of Bateman's classic study of Drosophila melanogaster.

Review 2. Could maternal testosterone levels govern mammalian sex ratio deviations?

3. Intra-sexual selection in Drosophila.

4. No genetic contribution to variation in human offspring sex ratio: a total population study of 4.7 million births.

Review 5. A review of the established and suspected causes of variations in human sex ratio at birth.

Review 6. Can sex ratios at birth be used in the assessment of public health, and in the identification of causes of selected pathologies?

7. Sex ratios in the two Germanies: a test of the economic stress hypothesis.

8. Ambient temperature predicts sex ratios and male longevity.

9. Environmental pollution is associated with increased risk of psychiatric disorders in the US and Denmark.

10. A test of oscillation in the human secondary sex ratio.

1. X Chromosome Inactivation Timing is Not eXACT: Implications for Autism Spectrum Disorders.